-
Virtual Classification: Modulating Domain-Specific Knowledge for Multidomain Crowd Counting
Authors:
Mingyue Guo,
Binghui Chen,
Zhaoyi Yan,
Yaowei Wang,
Qixiang Ye
Abstract:
Multidomain crowd counting aims to learn a general model for multiple diverse datasets. However, deep networks prefer modeling distributions of the dominant domains instead of all domains, which is known as domain bias. In this study, we propose a simple yet effective Modulating Domain-specific Knowledge Network (MDKNet) to handle the domain bias issue in multidomain crowd counting. MDKNet employs the idea of 'modulating', enabling a deep network to balance and model the different distributions of diverse datasets with little bias. Specifically, we propose an Instance-specific Batch Normalization (IsBN) module, which serves as a base modulator to refine the information flow so that it adapts to domain distributions. To precisely modulate the domain-specific information, the Domain-guided Virtual Classifier (DVC) is then introduced to learn a domain-separable latent space. This space is employed as input guidance for the IsBN modulator, such that the mixture distributions of multiple datasets can be well treated. Extensive experiments performed on popular benchmarks, including ShanghaiTech A/B, QNRF and NWPU, validate the superiority of MDKNet in tackling multidomain crowd counting and its effectiveness for multidomain learning. Code is available at https://github.com/csguomy/MDKNet.
Submitted 6 February, 2024;
originally announced February 2024.
-
Intrinsic nonlinear Hall effect in two-dimensional honeycomb topological antiferromagnets
Authors:
Zheng-Yang Zhuang,
Zhongbo Yan
Abstract:
Two-dimensional systems with a honeycomb lattice are known to be a paradigmatic platform for exploring various types of Hall effects, because the interplay of lattice geometry, spin-orbit coupling and magnetism can give rise to very rich features in the quantum geometry of wave functions. In this work, we consider honeycomb topological antiferromagnets that are effectively described by a $\mathcal{PT}$-symmetric antiferromagnetic Kane-Mele model, and explore the evolution of its nonlinear Hall response with respect to changes in lattice anisotropy, chemical potential, and the direction of the Néel vector. Due to the $\mathcal{PT}$-symmetry, the leading-order Hall effect of quantum geometric origin is the intrinsic nonlinear Hall effect, which is a second-order effect of electric fields and is independent of the scattering time. We investigate the behavior of the intrinsic nonlinear Hall conductivity tensor across topological phase transitions driven by the antiferromagnetic exchange field and lattice anisotropy and find that its components do not change sign, which is different from the extrinsic nonlinear Hall effect. In the weakly doped regime, we find that the intrinsic nonlinear Hall effect is valley-polarized. By varying the chemical potential, we find that the nonlinear Hall conductivity tensors exhibit kinks when the Fermi surface undergoes Lifshitz transitions. Furthermore, we find that spin-orbit coupling, which lifts the spin-rotation symmetry, is decisive for using the intrinsic nonlinear Hall effect to detect the direction of the Néel vector. Our work shows that two-dimensional honeycomb topological antiferromagnets are an ideal class of material systems with rich properties for the study of the intrinsic nonlinear Hall effect.
Submitted 4 February, 2024;
originally announced February 2024.
-
A self-induced mechanism of large-scale helical structures in compressible turbulent flows
Authors:
Zheng Yan,
Jianchun Wang,
Lifeng Wang,
Zhu Lei,
Junfeng Wu,
Junyi Duan,
Fulin Tong,
Xinliang Li,
Changping Yu
Abstract:
A novel self-sustaining mechanism is proposed for large-scale helical structures in compressible turbulent flows. The existence of two channels of subgrid-scale and viscosity terms for large-scale helicity evolution is confirmed for the first time, through selecting a physical definition of the large-scale helicity in compressible turbulence. Under the influence of the fluid element expansion, it is found that the helicity is generated at small scales via the second-channel viscosity, and the inverse cross-scale helicity transfers at inertial scales through the second-channel helicity flux. Together, they form a self-induced mechanism, which provides a physical insight into the long-period characteristic of large-scale helical structures in the evolution of compressible flow systems.
Submitted 2 February, 2024;
originally announced February 2024.
-
AssertLLM: Generating and Evaluating Hardware Verification Assertions from Design Specifications via Multi-LLMs
Authors:
Wenji Fang,
Mengming Li,
Min Li,
Zhiyuan Yan,
Shang Liu,
Hongce Zhang,
Zhiyao Xie
Abstract:
Assertion-based verification (ABV) is a critical method for ensuring that circuit designs comply with their architectural specifications, which are typically described in natural language. This process often requires significant interpretation by engineers to convert these specifications into functional verification assertions. Existing methods for generating assertions from natural language specifications are limited to sentences extracted by engineers, which hinders practical application. In this work, we present AssertLLM, an automatic assertion generation framework for complete specification files. AssertLLM breaks down the complex task into three phases, incorporating three customized Large Language Models (LLMs) for extracting structural specifications, mapping signal definitions, and generating assertions. Additionally, we provide an open-source benchmark for assessing assertion generation capabilities. Our evaluation of AssertLLM on a full design, encompassing 23 signals, demonstrates that 89% of the generated assertions are both syntactically and functionally accurate.
Submitted 1 February, 2024;
originally announced February 2024.
-
Multi-agent Path Finding for Cooperative Autonomous Driving
Authors:
Zhongxia Yan,
Han Zheng,
Cathy Wu
Abstract:
Anticipating possible future deployment of connected and automated vehicles (CAVs), cooperative autonomous driving at intersections has been studied by many works in control theory and intelligent transportation across decades. Simultaneously, recent parallel works in robotics have devised efficient algorithms for multi-agent path finding (MAPF), though often in environments with simplified kinematics. In this work, we hybridize insights and algorithms from MAPF with the structure and heuristics of optimizing the crossing order of CAVs at signal-free intersections. We devise an optimal and complete algorithm, Order-based Search with Kinematics Arrival Time Scheduling (OBS-KATS), which significantly outperforms existing algorithms, fixed heuristics, and prioritized planning with KATS. The performance is maintained under different vehicle arrival rates, lane lengths, crossing speeds, and control horizon. Through ablations and dissections, we offer insight on the contributing factors to OBS-KATS's performance. Our work is directly applicable to many similarly scaled traffic and multi-robot scenarios with directed lanes.
Submitted 31 January, 2024;
originally announced February 2024.
-
Formation Mechanism of Laser-Driven Magnetized "Pillars of Creation"
Authors:
Zhu Lei,
Lifeng Wang,
Jiwei Li,
Shiyang Zou,
Junfeng Wu,
Zhonghai Zhao,
Wei Sun,
Wenqiang Yuan,
Longxing Li,
Zheng Yan,
Jun Li,
Wenhua Ye,
Xiantu He,
Bin Qiao
Abstract:
The Pillars of Creation, one of the most recognized objects in the sky, are believed to be associated with the formation of young stars. However, the formation and maintenance mechanism of the pillars is still not fully understood due to the complexity of nonlinear radiation magneto-hydrodynamics (RMHD). Here, assuming laboratory laser-driven conditions, we study the self-consistent dynamics of pillar structures in magnetic fields by means of two-dimensional (2D) and three-dimensional (3D) RMHD simulations; these results also support our proposed experimental scheme. We find that only when the magnetic pressure and ablation pressure are comparable can the magnetic field significantly alter the plasma hydrodynamics. For medium magnetized cases ($β_{initial} \approx 3.5$), the initial magnetic fields undergo compression and amplification. This amplification results in the magnetic pressure inside the pillar becoming large enough to support the sides of the pillar against radial collapse due to pressure from the surrounding hot plasma. This effect is particularly pronounced for the parallel component ($B_y$), which is consistent with observational results. In contrast, a strong perpendicular ($B_x, B_z$) magnetic field ($β_{initial} < 1$) nearly retains its initial distribution and significantly suppresses the expansion of blow-off gas plasma, preventing the formation of pillar-like structures. The 3D simulations suggest that the bending at the head of 'Column I' in the Pillars of Creation may be due to non-parallel magnetic fields. After a similarity scaling transformation, our results can be applied to explain the formation and maintenance mechanism of the pillars, and can also provide useful information for future experimental designs.
Submitted 30 January, 2024;
originally announced January 2024.
-
Continuum excitations in a spin-supersolid on a triangular lattice
Authors:
M. Zhu,
V. Romerio,
N. Steiger,
S. D. Nabi,
N. Murai,
S. Ohira-Kawamura,
K. Yu. Povarov,
Y. Skourski,
R. Sibille,
L. Keller,
Z. Yan,
S. Gvasaliya,
A. Zheludev
Abstract:
Magnetic, thermodynamic, neutron diffraction and inelastic neutron scattering measurements are used to study spin correlations in the easy-axis XXZ triangular lattice magnet K2Co(SeO3)2. Despite the presence of quasi-2D "supersolid" magnetic order, the low-energy excitation spectrum contains no sharp modes and is instead a broad and structured multi-particle continuum. Applying a weak magnetic field drives the system into an m = 1/3 fractional magnetization plateau phase and restores sharp spin wave modes. To some extent, the behavior at zero field can be understood in terms of spin wave decay. However, the presence of clear excitation minima at the M-points of the Brillouin zone suggests that the spinon language may provide a more adequate description, and signals a possible proximity to a Dirac spin liquid state.
Submitted 26 April, 2024; v1 submitted 29 January, 2024;
originally announced January 2024.
-
Unitary and efficient spin squeezing in cavity optomechanics
Authors:
Lei Xie,
Zhiqi Yan,
Lingxia Wang,
Di Wang,
Jinfeng Liu,
Yiling Song,
Wei Xiong,
Mingfeng Wang
Abstract:
We propose an approach to produce spin squeezed states of a large number of nitrogen-vacancy centers in diamond nanostructures coupled to an optical cavity. Unlike the previous squeezing method proposed by Bennett et al. [Phys. Rev. Lett. 110, 156402 (2013)], which is limited by phonon number fluctuations due to the existence of phonon-spin entanglement, our proposal can completely erase the entanglement between the spins and the hybrid phonon-photon mode mediating the effective spin-spin interaction, and thus achieves unitary one-axis-twisting interactions between nitrogen-vacancy centers, yielding a squeezing scaling $J^{-2/3}$, where $J$ is the total angular momentum. We find that, under certain conditions, our method has the potential to enhance the spin-spin nonlinear interactions. We also propose a scheme that repeatedly applies the one-axis-twisting evolution to two orthogonal spin directions, which transforms the one-axis-twisting interactions into the two-axis-twisting type and therefore leads to spin squeezing with Heisenberg-limited scaling $J^{-1}$. Taking into account the noise effects of spin dephasing and relaxation, we find that the proposed approaches are robust against imperfections.
Submitted 27 January, 2024;
originally announced January 2024.
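To make the two squeezing scalings quoted in the abstract above concrete, the following minimal Python sketch compares $J^{-2/3}$ (one-axis twisting) with $J^{-1}$ (Heisenberg-limited, two-axis twisting) for a few values of $J$. It is only an illustration of the power laws: all prefactors are set to 1, which is an assumption, and the function names are hypothetical rather than taken from the paper.

```python
def one_axis_scaling(J):
    """Power law J**(-2/3) quoted for one-axis twisting (prefactor assumed to be 1)."""
    return J ** (-2.0 / 3.0)


def two_axis_scaling(J):
    """Heisenberg-limited power law J**(-1) quoted for two-axis twisting (prefactor assumed to be 1)."""
    return J ** (-1.0)


def scaling_ratio(J):
    """How much smaller the J**(-1) value is than the J**(-2/3) value; equals J**(1/3)."""
    return one_axis_scaling(J) / two_axis_scaling(J)


if __name__ == "__main__":
    for J in (10, 1000, 100000):
        print(J, one_axis_scaling(J), two_axis_scaling(J), scaling_ratio(J))
```

The ratio grows as $J^{1/3}$, so the gap between the two scalings widens slowly as the spin ensemble gets larger.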
-
Insight-HXMT observations of thermonuclear X-ray bursts in 4U 1636-53
Authors:
Zhe Yan,
Guobao Zhang,
Yu-Peng Chen,
Shu Zhang,
Mariano Méndez,
Jingqiang Peng,
Shuang-Nan Zhang,
Jinlu Qu,
Ming Lyu,
Jirong Mao,
Mingyu Ge,
Jiancheng Wang
Abstract:
We analyzed 45 bursts observed from 4U 1636$-$53 to investigate the mechanism behind the light curve profiles and the impact of thermonuclear X-ray bursts on the accretion environment in accreting neutron star low-mass X-ray binaries. This analysis employed both light curve and time-resolved spectroscopy methodologies, with data collected by the Insight-HXMT instrument. We found that 30 bursts exhibited similar light curve profiles and were predominantly in the hard state, and two photospheric radius expansion (PRE) bursts were in the soft state. The light curves of most bursts did not follow a single exponential decay but displayed a dual-exponential behavior. The initial exponential phase had a duration of approximately 6 s. We used both the standard method and the '$f_{\rm a}$' method to fit the burst spectra. The majority of the $f_{\rm a}$ values exceeded 1, indicating an enhancement of the persistent emission during the burst. Under the assumption of two Comptonization components, we suggest that the scattering of burst photons by the inner corona may be the main contributor to the persistent emission enhancement. We also observed an inverse correlation between the maximum $f_{\rm a}$ and the persistent emission flux in the non-PRE bursts. This anti-correlation suggests that when the accretion rate is lower, there is a greater enhancement of persistent emission during the burst peak. The prediction based on Poynting-Robertson drag (P-R drag) aligns with this observed anti-correlation.
Submitted 20 January, 2024;
originally announced January 2024.
-
Post-synthesis tuning of dielectric constant via ferroelectric domain wall engineering
Authors:
L. Zhou,
L. Puntigam,
P. Lunkenheimer,
E. Bourret,
Z. Yan,
I. Kézsmárki,
D. Meier,
S. Krohns,
J. Schultheiß,
D. M. Evans
Abstract:
A promising mechanism for achieving colossal dielectric constants is to use insulating internal barrier layers, which typically form during synthesis and then remain in the material. It has recently been shown that insulating domain walls in ferroelectrics can act as such barriers. One advantage domain walls have, in comparison to stationary interfaces, is that they can be moved, offering the potential of post-synthesis control of the dielectric constant. However, to date, direct imaging of how changes in domain wall pattern cause a change in dielectric constant within a single sample has not been realized. In this work, we demonstrate that changing the domain wall density allows the engineering of the dielectric constant in hexagonal-ErMnO3 single crystals. The changes of the domain wall density are quantified via microscopy techniques, while the dielectric constant is determined via macroscopic dielectric spectroscopy measurements. The observed changes in the dielectric constant are quantitatively consistent with the observed variation in domain wall density, implying that the insulating domain walls behave as 'ideal' capacitors connected in series. Our approach to engineer the domain wall density can be readily extended to other control methods, e.g., electric fields or mechanical stresses, providing a novel degree of flexibility to in-situ tune the dielectric constant.
Submitted 19 January, 2024;
originally announced January 2024.
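The abstract above describes the insulating domain walls as 'ideal' capacitors connected in series. As a reminder of what that textbook picture implies, here is a minimal Python sketch of the series-capacitor rule $1/C_{\rm total} = \sum_i 1/C_i$. It assumes every wall can be assigned a single lumped capacitance; the function names and numerical values are hypothetical and are not taken from the paper.

```python
def series_capacitance(capacitances):
    """Total capacitance of ideal capacitors in series: 1/C_total = sum(1/C_i)."""
    if not capacitances:
        raise ValueError("need at least one capacitor")
    return 1.0 / sum(1.0 / c for c in capacitances)


def identical_walls_capacitance(wall_capacitance, n_walls):
    """Special case of n identical walls in series: C_total = C_wall / n."""
    return series_capacitance([wall_capacitance] * n_walls)


if __name__ == "__main__":
    # Hypothetical numbers, in arbitrary units.
    for n in (1, 2, 4, 8):
        print(n, identical_walls_capacitance(4.0, n))
```

For identical walls the total scales as $1/n$, which is why, in this idealized picture, a change in the number of walls crossed by the measuring field translates directly into a change in the measured capacitance.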
-
Large-space and long-time asymptotic behaviors of $N_{\infty}$-soliton solutions (soliton gas) for the focusing Hirota equation
Authors:
Weifang Weng,
Zhenya Yan
Abstract:
The Hirota equation is one of the integrable higher-order extensions of the nonlinear Schrödinger equation, and can describe ultra-short optical pulse propagation in the form $iq_t+α(q_{xx}+ 2|q|^2q)+iβ(q_{xxx}+ 6|q|^2q_x)=0,\, (x,t)\in\mathbb{R}^2\, (α,\,β\in\mathbb{R})$. In this paper, we analytically explore the asymptotic behaviors of a soliton gas for the Hirota equation, including the complex modified KdV equation, in which the soliton gas is regarded as the limit $N\to \infty$ of $N$-soliton solutions and is characterized using the Riemann-Hilbert problem with discrete spectra restricted to the intervals $(ia, ib)\cup (-ib, -ia)\, (0<a<b)$. We find that this soliton gas tends slowly to the Jacobian elliptic wave solution with an error $\mathcal{O}(|x|^{-1})$ (to zero exponentially quickly) as $x\to -\infty$ ($x\to +\infty$). We also present the long-time asymptotics of the soliton gas under the different velocity conditions: $x/t>4βb^2,\, ξ_c<x/t<4βb^2,\, x/t<ξ_c$. Moreover, we analyze the property of the soliton gas for the case of the discrete spectra filling uniformly a quadrature domain.
Submitted 13 April, 2024; v1 submitted 16 January, 2024;
originally announced January 2024.
-
SRNI-CAR: A comprehensive dataset for analyzing the Chinese automotive market
Authors:
Ruixin Ding,
Bowei Chen,
James M. Wilson,
Zhi Yan,
Yufei Huang
Abstract:
The automotive industry plays a critical role in the global economy, and particularly important is the expanding Chinese automobile market due to its immense scale and influence. However, existing automotive sector datasets are limited in their coverage, failing to adequately consider the growing demand for more and diverse variables. This paper aims to bridge this data gap by introducing a comprehensive dataset spanning the years from 2016 to 2022, encompassing sales data, online reviews, and a wealth of information related to the Chinese automotive industry. This dataset serves as a valuable resource, significantly expanding the available data. Its impact extends to various dimensions, including improving forecasting accuracy, expanding the scope of business applications, informing policy development and regulation, and advancing academic research within the automotive sector. To illustrate the dataset's potential applications in both business and academic contexts, we present two application examples. Our developed dataset enhances our understanding of the Chinese automotive market and offers a valuable tool for researchers, policymakers, and industry stakeholders worldwide.
Submitted 19 December, 2023;
originally announced January 2024.
-
U-SWIM: Universal Selective Write-Verify for Computing-in-Memory Neural Accelerators
Authors:
Zheyu Yan,
Xiaobo Sharon Hu,
Yiyu Shi
Abstract:
Architectures that incorporate Computing-in-Memory (CiM) using emerging non-volatile memory (NVM) devices have become strong contenders for deep neural network (DNN) acceleration due to their impressive energy efficiency. Yet, a significant challenge arises when using these emerging devices: they can show substantial variations during the weight-mapping process. This can severely impact DNN accuracy if not mitigated. A widely accepted remedy for imperfect weight mapping is the iterative write-verify approach, which involves verifying conductance values and adjusting devices if needed. In all existing publications, this procedure is applied to every individual device, resulting in a significant programming time overhead. In our research, we illustrate that only a small fraction of weights need this write-verify treatment for the corresponding devices while the DNN accuracy is preserved, yielding a notable programming acceleration. Building on this, we introduce USWIM, a novel method based on the second derivative. It leverages a single iteration of forward and backpropagation to pinpoint the weights demanding write-verify. Through extensive tests on diverse DNN designs and datasets, USWIM achieves up to a 10x programming acceleration over the traditional exhaustive write-verify method, all while maintaining a similar accuracy level. Furthermore, compared to our earlier SWIM technique, USWIM shows a 7x speedup when dealing with devices exhibiting non-uniform variations.
Submitted 11 December, 2023;
originally announced January 2024.
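The selective write-verify idea in the abstract above (rank weights with a second-derivative criterion, then write-verify only a small fraction) can be sketched in a few lines of Python. This is a hedged illustration, not the authors' algorithm: the score `0.5 * h * var`, the per-weight variance input, and the function name `select_for_write_verify` are all assumptions introduced here, and the second derivatives are taken as given rather than computed from a network.

```python
def select_for_write_verify(second_derivs, variances, fraction):
    """Return the sorted indices of the weights picked for write-verify.

    second_derivs: per-weight second derivative of the loss (assumed given).
    variances:     per-weight device variation (assumed, may be non-uniform).
    fraction:      share of weights to write-verify, between 0 and 1.
    """
    if len(second_derivs) != len(variances):
        raise ValueError("inputs must have the same length")
    # Assumed sensitivity score: expected second-order loss increase
    # when a weight is perturbed with the given variance.
    scores = [0.5 * h * v for h, v in zip(second_derivs, variances)]
    k = int(round(fraction * len(scores)))
    ranked = sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)
    return sorted(ranked[:k])


if __name__ == "__main__":
    h = [0.1, 4.0, 0.5, 2.0]
    print(select_for_write_verify(h, [1.0, 1.0, 1.0, 1.0], 0.5))
    print(select_for_write_verify(h, [100.0, 1.0, 1.0, 1.0], 0.5))
```

With uniform variances the ranking depends on curvature alone; making one device much noisier moves its weight up the ranking, which is the kind of non-uniform case the abstract mentions.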
-
Narrowly avoided spin-nematic phase in BaCdVO(PO$_4$)$_2$: NMR evidence
Authors:
K. M. Ranjith,
K. Yu. Povarov,
Z. Yan,
A. Zheludev,
M. Horvatić
Abstract:
We present a $^{31}$P nuclear magnetic resonance (NMR) investigation of BaCdVO(PO$_4$)$_2$ focusing on the nearly saturated regime between $μ_0H_{c1}$ = 4.05 T and $μ_0H_{c2}$ = 6.5 T, which used to be considered a promising candidate for a spin-nematic phase. NMR spectra establish the absence of any dipolar order there, whereas the weak field dependence of the magnetization above $H_{c1}$ is accounted for by Dzyaloshinskii-Moriya interaction terms. The low-energy spin dynamics (fluctuations), measured by the nuclear spin-lattice relaxation rate $T_1^{-1}$, confirms the continuity of this phase and the absence of any low-temperature phase transition. Unexpectedly, the spin dynamics above $H_{c1}$ is largely dominated by two-magnon processes, which is expected above the saturation field of a spin-nematic phase, but not inside. This shows that BaCdVO(PO$_4$)$_2$ is indeed close to a spin-nematic instability; however, this phase is not stabilized. We thus confirm recent theoretical predictions that the spin-nematic phase can be stabilized, at most, in an extremely narrow field range close to saturation or is rather narrowly avoided [Jiang et al., Phys. Rev. Lett. 130, 116701 (2023)].
Submitted 25 April, 2024; v1 submitted 10 January, 2024;
originally announced January 2024.
-
Coexistence of multi-scale domains in ferroelectric polycrystals with non-uniform grain-size distributions
Authors:
K. Wolk,
R. S. Dragland,
E. Chavez Panduro,
L. Richarz,
Z. Yan,
E. Bourret,
K. A. Hunnestad,
Ch. Tzschaschel,
J. Schultheiß,
D. Meier
Abstract:
Engineering of ferroelectric domain structures enables direct control over the switching dynamics and is crucial for tuning the functional properties of ferroelectrics for various applications, ranging from capacitors to future nanoelectronics. Here, we investigate domain formation in poly- and single crystalline improper ferroelectric DyMnO3. We show that a non-uniform grain-size distribution in the polycrystals facilitates the coexistence of multi-scale domains, varying by up to one order of magnitude in size. This unusual domain structure originates from an inverted domain-size/grain-size dependence that is intrinsic to the hexagonal manganite polycrystals, expanding previous studies towards non-uniform grain-size distributions. Our results demonstrate that the micrometer-sized grains in DyMnO3 represent individual ferroelectric units with a characteristic domain structure, giving a new dimension to domain engineering in ferroelectric polycrystals with non-uniform microstructures.
Submitted 9 January, 2024;
originally announced January 2024.
-
A quasi-dynamic one-equation model with joint constraints of kinetic energy and helicity fluxes for large eddy simulation of rotating turbulence
Authors:
Depei Song,
Changping Yu,
Zheng Yan,
Xinliang Li
Abstract:
For settling the problem with rotating turbulence modelling, a quasi-dynamic one-equation subgrid-scale (SGS) model is proposed in this paper. Considering the key role of the joint cascade of kinetic energy and helicity in rotating turbulence, the new SGS model is constrained by the fluxes of kinetic energy and helicity. Specifically, the new theory of dual channels of helicity flux is taken into account. The modelling of the unclosed quantities is achieved by adopting a quasi-dynamic process that eliminates the need for test filtering compared to the classic dynamic process, and the model coefficients are dynamically obtained through the SGS kinetic energy transport equation and considering the joint constraints of kinetic energy and helicity fluxes. As a result, the model demonstrates a high correlation with DNS data in a priori tests. We refer to this new model as the quasi-dynamic joint-constraint model (QCM), which is introduced for both incompressible and compressible flows. To assess the effectiveness of the QCM, numerical experiments are conducted for three typical cases: incompressible streamwise rotating channel flow, transonic streamwise rotating annular pipe flow, and hypersonic transition flow at Mach 6 over a rotating cone. The results suggest that the QCM has the potential to significantly improve the prediction of rotational flows that are strongly influenced by helicity. Additionally, the new model demonstrates excellent capability in handling the transition process.
Submitted 3 January, 2024;
originally announced January 2024.
-
Boosting of Implicit Neural Representation-based Image Denoiser
Authors:
Zipei Yan,
Zhengji Liu,
Jizhou Li
Abstract:
Implicit Neural Representation (INR) has emerged as an effective method for unsupervised image denoising. However, INR models are typically overparameterized; consequently, these models are prone to overfitting during learning, resulting in suboptimal results, even noisy ones. To tackle this problem, we propose a general recipe for regularizing INR models in image denoising. In detail, we propose to iteratively substitute the supervision signal with the mean value derived from both the prediction and supervision signal during the learning process. We theoretically prove that such a simple iterative substitute can gradually enhance the signal-to-noise ratio of the supervision signal, thereby benefiting INR models during the learning process. Our experimental results demonstrate that INR models can be effectively regularized by the proposed approach, relieving overfitting and boosting image denoising performance.
Submitted 3 January, 2024;
originally announced January 2024.
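The substitution step described in this abstract can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: `fit_step` is a hypothetical callback standing in for one round of INR fitting, the image is treated as a flat list of pixel values, and the names `substitute_target`, `train_with_substitution`, and `rounds` are introduced here for illustration only.

```python
def substitute_target(prediction, target):
    """Return the element-wise mean of the prediction and the current supervision signal."""
    return [0.5 * (p + t) for p, t in zip(prediction, target)]


def train_with_substitution(fit_step, noisy_image, rounds=3):
    """Alternate between fitting and replacing the supervision signal.

    fit_step: hypothetical callable that fits a model to the current target
              and returns its prediction (same length as the target).
    noisy_image: flat sequence of pixel values used as the initial target.
    """
    target = [float(v) for v in noisy_image]
    prediction = list(target)
    for _ in range(rounds):
        # Fit to the current supervision signal (assumed to be done by fit_step).
        prediction = fit_step(target)
        # Replace the supervision signal with the mean of prediction and target.
        target = substitute_target(prediction, target)
    return prediction, target
```

In this sketch the target moves halfway toward the model's prediction after every round, so whatever smoothing the model applies accumulates in the supervision signal over time.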
-
Exposure Bracketing is All You Need for Unifying Image Restoration and Enhancement Tasks
Authors:
Zhilu Zhang,
Shuohao Zhang,
Renlong Wu,
Zifei Yan,
Wangmeng Zuo
Abstract:
It is highly desired but challenging to acquire high-quality photos with clear content in low-light environments. Although multi-image processing methods (using burst, dual-exposure, or multi-exposure images) have made significant progress in addressing this issue, they typically focus on specific restoration or enhancement problems, and do not fully explore the potential of utilizing multiple images. Motivated by the fact that multi-exposure images are complementary in denoising, deblurring, high dynamic range imaging, and super-resolution, we propose to utilize exposure bracketing photography to unify image restoration and enhancement tasks in this work. Due to the difficulty in collecting real-world pairs, we suggest a solution that first pre-trains the model with synthetic paired data and then adapts it to real-world unlabeled images. In particular, a temporally modulated recurrent network (TMRNet) and self-supervised adaptation method are proposed. Moreover, we construct a data simulation pipeline to synthesize pairs and collect real-world images from 200 nighttime scenarios. Experiments on both datasets show that our method performs favorably against the state-of-the-art multi-image processing ones. The dataset, code, and pre-trained models are available at https://github.com/cszhilu1998/BracketIRE.
Submitted 31 May, 2024; v1 submitted 1 January, 2024;
originally announced January 2024.
-
Engineering Plateau Phase Transition in Quantum Anomalous Hall Multilayers
Authors:
Deyi Zhuo,
Ling-Jie Zhou,
Yi-Fan Zhao,
Ruoxi Zhang,
Zi-Jie Yan,
Annie G. Wang,
Moses H. W. Chan,
Chao-Xing Liu,
Chui-Zhen Chen,
Cui-Zu Chang
Abstract:
The plateau phase transition in quantum anomalous Hall (QAH) insulators corresponds to a quantum state wherein a single magnetic domain gives way to multiple magnetic domains and then re-converges back to a single magnetic domain. The layer structure of the sample provides an external knob for adjusting the Chern number C of the QAH insulators. Here, we employ molecular beam epitaxy (MBE) to grow magnetic topological insulator (TI) multilayers with an asymmetric layer structure and realize the magnetic field-driven plateau phase transition between two QAH states with odd Chern number change ΔC. In multilayer structures with C=±1 and C=±2 QAH states, we find two characteristic power-law behaviors between temperature and the scaling variables on the magnetic field at transition points. The critical exponents extracted for the plateau phase transitions with ΔC=1 and ΔC=3 in QAH insulators are found to be nearly identical, specifically, k1~0.390±0.021 and k2~0.388±0.015, respectively. We construct a four-layer Chalker-Coddington network model to understand the consistent critical exponents for the plateau phase transitions with ΔC=1 and ΔC=3. This work will motivate further investigations into the critical behaviors of plateau phase transitions with different ΔC in QAH insulators and provide new opportunities for the development of QAH chiral edge current-based electronic and spintronic devices.
Submitted 14 January, 2024; v1 submitted 22 December, 2023;
originally announced December 2023.
-
Advancing VAD Systems Based on Multi-Task Learning with Improved Model Structures
Authors:
Lingyun Zuo,
Keyu An,
Shiliang Zhang,
Zhijie Yan
Abstract:
In a speech recognition system, voice activity detection (VAD) is a crucial frontend module. Addressing the issues of poor noise robustness in traditional binary VAD systems based on DFSMN, the paper further proposes semantic VAD based on multi-task learning with improved models for real-time and offline systems, to meet specific application requirements. Evaluations on internal datasets show that, compared to the real-time VAD system based on DFSMN, the real-time semantic VAD system based on RWKV achieves relative decreases in CER of 7.0%, DCF of 26.1% and relative improvement in NRR of 19.2%. Similarly, when compared to the offline VAD system based on DFSMN, the offline VAD system based on SAN-M demonstrates relative decreases in CER of 4.4%, DCF of 18.6% and relative improvement in NRR of 3.5%.
Submitted 19 December, 2023;
originally announced December 2023.
-
Exact approaches on the string worldsheet
Authors:
Saskia Demulder,
Sibylle Driezen,
Bob Knighton,
Gerben Oling,
Ana L. Retore,
Fiona K. Seibold,
Alessandro Sfondrini,
Ziqi Yan
Abstract:
We review different exact approaches to string theory. In the context of the Green-Schwarz superstring, we discuss the action in curved backgrounds and its supercoset formulation, with particular attention to superstring backgrounds of the $AdS_3$ type supported by both Ramond-Ramond and Neveu-Schwarz-Neveu-Schwarz fluxes. This is the basis for the discussion of classical integrability, of worldsheet-scattering factorisation in the uniform lightcone gauge, and eventually of the string spectrum through the mirror thermodynamic Bethe ansatz, which for $AdS_3$ backgrounds was only derived and analysed very recently. We then illustrate some aspects of the Ramond-Neveu-Schwarz string, and introduce the formalism of Berkovits-Vafa-Witten, which has seen very recent applications to $AdS_3$ physics, which we also briefly review. Finally, we present the relation between M-theory in the discrete lightcone quantisation and decoupling limits of string theory that exhibit non-relativistic behaviours, highlighting the connection with integrable $T\bar{T}$ deformations, as well as the relation between spin-matrix theory and Landau-Lifshitz models. This review is based on lectures given at the Young Researchers Integrability School and Workshop 2022 "Taming the string worldsheet" at NORDITA, Stockholm.
Submitted 28 January, 2024; v1 submitted 20 December, 2023;
originally announced December 2023.
-
FedA3I: Annotation Quality-Aware Aggregation for Federated Medical Image Segmentation against Heterogeneous Annotation Noise
Authors:
Nannan Wu,
Zhaobin Sun,
Zengqiang Yan,
Li Yu
Abstract:
Federated learning (FL) has emerged as a promising paradigm for training segmentation models on decentralized medical data, owing to its privacy-preserving property. However, existing research overlooks the prevalent annotation noise encountered in real-world medical datasets, which limits the performance ceilings of FL. In this paper, we, for the first time, identify and tackle this problem. For problem formulation, we propose a contour evolution for modeling non-independent and identically distributed (Non-IID) noise across pixels within each client and then extend it to the case of multi-source data to form a heterogeneous noise model (i.e., Non-IID annotation noise across clients). For robust learning from annotations with such two-level Non-IID noise, we emphasize the importance of data quality in model aggregation, allowing high-quality clients to have a greater impact on FL. To achieve this, we propose Federated learning with Annotation quAlity-aware AggregatIon, named FedA3I, by introducing a quality factor based on client-wise noise estimation. Specifically, noise estimation at each client is accomplished through the Gaussian mixture model and then incorporated into model aggregation in a layer-wise manner to up-weight high-quality clients. Extensive experiments on two real-world medical image segmentation datasets demonstrate the superior performance of FedA$^3$I against the state-of-the-art approaches in dealing with cross-client annotation noise. The code is available at https://github.com/wnn2000/FedAAAI.
Submitted 18 January, 2024; v1 submitted 20 December, 2023;
originally announced December 2023.
-
Gemini: A Family of Highly Capable Multimodal Models
Authors:
Gemini Team,
Rohan Anil,
Sebastian Borgeaud,
Jean-Baptiste Alayrac,
Jiahui Yu,
Radu Soricut,
Johan Schalkwyk,
Andrew M. Dai,
Anja Hauth,
Katie Millican,
David Silver,
Melvin Johnson,
Ioannis Antonoglou,
Julian Schrittwieser,
Amelia Glaese,
Jilin Chen,
Emily Pitler,
Timothy Lillicrap,
Angeliki Lazaridou,
Orhan Firat,
James Molloy,
Michael Isard,
Paul R. Barham,
Tom Hennigan,
Benjamin Lee
, et al. (1325 additional authors not shown)
Abstract:
This report introduces a new family of multimodal models, Gemini, that exhibit remarkable capabilities across image, audio, video, and text understanding. The Gemini family consists of Ultra, Pro, and Nano sizes, suitable for applications ranging from complex reasoning tasks to on-device memory-constrained use-cases. Evaluation on a broad range of benchmarks shows that our most-capable Gemini Ultra model advances the state of the art in 30 of 32 of these benchmarks - notably being the first model to achieve human-expert performance on the well-studied exam benchmark MMLU, and improving the state of the art in every one of the 20 multimodal benchmarks we examined. We believe that the new capabilities of the Gemini family in cross-modal reasoning and language understanding will enable a wide variety of use cases. We discuss our approach toward post-training and deploying Gemini models responsibly to users through services including Gemini, Gemini Advanced, Google AI Studio, and Cloud Vertex AI.
Submitted 17 June, 2024; v1 submitted 18 December, 2023;
originally announced December 2023.
-
MAC-SQL: A Multi-Agent Collaborative Framework for Text-to-SQL
Authors:
Bing Wang,
Changyu Ren,
Jian Yang,
Xinnian Liang,
Jiaqi Bai,
Linzheng Chai,
Zhao Yan,
Qian-Wen Zhang,
Di Yin,
Xing Sun,
Zhoujun Li
Abstract:
Recent LLM-based Text-to-SQL methods usually suffer from significant performance degradation on "huge" databases and complex user questions that require multi-step reasoning. Moreover, most existing methods neglect the crucial significance of LLMs utilizing external tools and model collaboration. To address these challenges, we introduce MAC-SQL, a novel LLM-based multi-agent collaborative framework. Our framework comprises a core decomposer agent for Text-to-SQL generation with few-shot chain-of-thought reasoning, accompanied by two auxiliary agents that utilize external tools or models to acquire smaller sub-databases and refine erroneous SQL queries. The decomposer agent collaborates with auxiliary agents, which are activated as needed and can be expanded to accommodate new features or tools for effective Text-to-SQL parsing. In our framework, we initially leverage GPT-4 as the strong backbone LLM for all agent tasks to determine the upper bound of our framework. We then fine-tune an open-sourced instruction-followed model, SQL-Llama, by leveraging Code Llama 7B, to accomplish all tasks as GPT-4 does. Experiments show that SQL-Llama achieves a comparable execution accuracy of 43.94, compared to the baseline accuracy of 46.35 for vanilla GPT-4. At the time of writing, MAC-SQL+GPT-4 achieves an execution accuracy of 59.59 when evaluated on the BIRD benchmark, establishing a new state-of-the-art (SOTA) on its holdout test set (https://github.com/wbbeyourself/MAC-SQL).
Submitted 16 June, 2024; v1 submitted 18 December, 2023;
originally announced December 2023.
-
FlowMur: A Stealthy and Practical Audio Backdoor Attack with Limited Knowledge
Authors:
Jiahe Lan,
Jie Wang,
Baochen Yan,
Zheng Yan,
Elisa Bertino
Abstract:
Speech recognition systems driven by DNNs have revolutionized human-computer interaction through voice interfaces, which significantly facilitate our daily lives. However, the growing popularity of these systems also raises special concerns on their security, particularly regarding backdoor attacks. A backdoor attack inserts one or more hidden backdoors into a DNN model during its training process, such that it does not affect the model's performance on benign inputs, but forces the model to produce an adversary-desired output if a specific trigger is present in the model input. Despite the initial success of current audio backdoor attacks, they suffer from the following limitations: (i) Most of them require sufficient knowledge, which limits their widespread adoption. (ii) They are not stealthy enough, thus easy to be detected by humans. (iii) Most of them cannot attack live speech, reducing their practicality. To address these problems, in this paper, we propose FlowMur, a stealthy and practical audio backdoor attack that can be launched with limited knowledge. FlowMur constructs an auxiliary dataset and a surrogate model to augment adversary knowledge. To achieve dynamicity, it formulates trigger generation as an optimization problem and optimizes the trigger over different attachment positions. To enhance stealthiness, we propose an adaptive data poisoning method according to Signal-to-Noise Ratio (SNR). Furthermore, ambient noise is incorporated into the process of trigger generation and data poisoning to make FlowMur robust to ambient noise and improve its practicality. Extensive experiments conducted on two datasets demonstrate that FlowMur achieves high attack performance in both digital and physical settings while remaining resilient to state-of-the-art defenses. In particular, a human study confirms that triggers generated by FlowMur are not easily detected by participants.
Submitted 15 December, 2023;
originally announced December 2023.
-
Accessing Excitation Spectrum of Many-body Systems via Single-Mode Approximation within Quantum Monte Carlo Simulations
Authors:
Yan Liu,
Kemeng Wu,
Yan-Cheng Wang,
Jie Lou,
Zheng Yan,
Yan Chen
Abstract:
We extend the Single Mode Approximation (SMA) into quantum Monte Carlo (QMC) simulations to provide an efficient and fast method to obtain the dynamical dispersion of quantum many-body systems. Based on Stochastic Series Expansion (SSE) and its projector algorithms, the SMA + SSE method can simply extract the dispersion of the dynamical spectrum in the long wave-length limit and the upper bound of the dispersion elsewhere, without external calculations and high technical barriers. By contrast, numerical analytic continuation methods require fine data of imaginary time correlations and complex programming. Therefore, our method can approach the excitation dispersion of large systems; as an example, we take the two-dimensional Heisenberg model on a $512 \times 512$ square lattice. We demonstrate the effectiveness and efficiency of our method with high precision via additional examples. We also demonstrate with numerical results that SMA combined with SSE goes beyond spin-wave theory.
Submitted 16 April, 2024; v1 submitted 13 December, 2023;
originally announced December 2023.
-
CartoMark: a benchmark dataset for map pattern recognition and map content retrieval with machine intelligence
Authors:
Xiran Zhou,
Yi Wen,
Honghao Li,
Kaiyuan Li,
Zhenfeng Shao,
Zhigang Yan,
Xiao Xie
Abstract:
Maps are a fundamental medium to visualize and represent the real world in a simple and philosophical way. The emergence of the 3rd wave information has made a proportion of maps available to be generated ubiquitously, which would significantly enrich the dimensions and perspectives for understanding the characteristics of the real world. However, a majority of map datasets have never been discovered, acquired, and effectively used, and the map data used in many applications might not completely fit the authentic demands of these applications. This challenge arises from the lack of numerous well-labelled benchmark datasets for applying deep learning approaches to identifying complicated map content. Thus, we develop a large-scale benchmark dataset that includes well-labelled datasets for map text annotation recognition, map scene classification, map super-resolution reconstruction, and map style transferring. Furthermore, these well-labelled datasets would facilitate the state-of-the-art machine intelligence technologies to conduct map feature detection, map pattern recognition and map content retrieval. We hope our efforts would be useful for AI-enhanced cartographical applications.
Submitted 13 December, 2023;
originally announced December 2023.
-
State-insensitive wavelengths for light shifts and photon scattering from Zeeman states
Authors:
Stuart J. Masson,
Zhenjie Yan,
Jacquelyn Ho,
Yue-Hui Lu,
Dan M. Stamper-Kurn,
Ana Asenjo-Garcia
Abstract:
Atoms are not two-level systems, and their rich internal structure often leads to complex phenomena in the presence of light. Here, we analyze off-resonant light scattering including the full hyperfine and magnetic structure. We find a set of frequency detunings where the induced atomic dipole is the same irrespective of the Zeeman state, and where two-photon transitions that alter the atomic state turn off. For alkali atoms and alkaline-earth ions, if the hyperfine splitting is dominated by the magnetic dipole moment contribution, these detunings approximately coincide. Therefore, at a given "magical" detuning, all Zeeman states in a hyperfine manifold behave almost identically, and can be traced out to good approximation. This feature prevents state decoherence due to light scattering, which impacts quantum optics experiments and quantum information applications.
Submitted 17 June, 2024; v1 submitted 13 December, 2023;
originally announced December 2023.
-
Compute-in-Memory based Neural Network Accelerators for Safety-Critical Systems: Worst-Case Scenarios and Protections
Authors:
Zheyu Yan,
Xiaobo Sharon Hu,
Yiyu Shi
Abstract:
Emerging non-volatile memory (NVM)-based Computing-in-Memory (CiM) architectures show substantial promise in accelerating deep neural networks (DNNs) due to their exceptional energy efficiency. However, NVM devices are prone to device variations. Consequently, the actual DNN weights mapped to NVM devices can differ considerably from their targeted values, inducing significant performance degradation. Many existing solutions aim to optimize average performance amidst device variations, which is a suitable strategy for general-purpose conditions. However, the worst-case performance that is crucial for safety-critical applications is largely overlooked in current research. In this study, we define the problem of pinpointing the worst-case performance of CiM DNN accelerators affected by device variations. Additionally, we introduce a strategy to identify a specific pattern of the device value deviations in the complex, high-dimensional value deviation space, responsible for this worst-case outcome. Our findings reveal that even subtle device variations can precipitate a dramatic decline in DNN accuracy, posing risks for CiM-based platforms in supporting safety-critical applications. Notably, we observe that prevailing techniques to bolster average DNN performance in CiM accelerators fall short in enhancing worst-case scenarios. In light of this issue, we propose a novel worst-case-aware training technique named A-TRICE that efficiently combines adversarial training and noise-injection training with right-censored Gaussian noise to improve the DNN accuracy in the worst-case scenarios. Our experimental results demonstrate that A-TRICE improves the worst-case accuracy under device variations by up to 33%.
Submitted 11 December, 2023;
originally announced December 2023.
-
Three Pulsars Discovered in Globular Cluster M15 (NGC 7078) with FAST
Authors:
Yuxiao Wu,
Zhichen Pan,
Lei Qian,
Scott Ransom,
BoJun Wang,
Zhen Yan,
**tao Luo,
Liyun Zhang,
Minghui Li,
Dejiang Yin,
Baoda Li,
Yifeng Li,
Yinfeng Dai,
Yaowei Li,
Xinnan Zhang,
Tong Liu,
Yu Pan
Abstract:
We present the discovery of three pulsars in Globular Cluster M15 (NGC 7078) by the Five-hundred-meter Aperture Spherical radio Telescope (FAST). Among the three pulsars, PSR J2129+1210J (M15J) is a millisecond pulsar with a spin period of 11.84 ms and a dispersion measure of 66.68 pc cm$^{-3}$. Both PSR J2129+1210K and L (M15K and L) are long-period pulsars with spin periods of 1928 ms and 3961 ms, respectively, and M15L is the GC pulsar with the longest spin period known to date. The discoveries of M15K and L support the theory that core-collapsed Globular Clusters may contain partially recycled long-period pulsars. With the same dataset, the timing solutions of M15A to H were updated, and the timing parameter P1 of M15F differs from the previous results: it is approximately $0.027 \times 10^{-18}$ s s$^{-1}$ from our work and $0.032 \times 10^{-18}$ s s$^{-1}$ from Anderson (1993). As predicted by Rodolfi et al., the luminosity of M15C kept decreasing, and the latest detection in our dataset is on December 20th, 2022. We also detected M15I one more time. The different barycentric spin periods indicate that this pulsar should be located in a binary system, manifesting itself as the exceptional one in such a core-collapsing GC.
Submitted 10 December, 2023;
originally announced December 2023.
-
Weakly Supervised Video Individual Counting
Authors:
Xinyan Liu,
Guorong Li,
Yuankai Qi,
Ziheng Yan,
Zhenjun Han,
Anton van den Hengel,
Ming-Hsuan Yang,
Qingming Huang
Abstract:
Video Individual Counting (VIC) aims to predict the number of unique individuals in a single video. Existing methods learn representations based on trajectory labels for individuals, which are annotation-expensive. To provide a more realistic reflection of the underlying practical challenge, we introduce a weakly supervised VIC task, wherein trajectory labels are not provided. Instead, two types of labels are provided to indicate traffic entering the field of view (inflow) and leaving the field of view (outflow). We also propose the first solution as a baseline that formulates the task as a weakly supervised contrastive learning problem under group-level matching. In doing so, we devise an end-to-end trainable soft contrastive loss to drive the network to distinguish inflow, outflow, and the remaining. To facilitate future study in this direction, we generate annotations from the existing VIC datasets SenseCrowd and CroHD and also build a new dataset, UAVVIC. Extensive results show that our baseline weakly supervised method outperforms supervised methods, and thus, little information is lost in the transition to the more practically relevant weakly supervised task. The code and trained model will be public at https://github.com/streamer-AP/CGNet (CGNet).
Submitted 10 December, 2023;
originally announced December 2023.
-
SGNet: Structure Guided Network via Gradient-Frequency Awareness for Depth Map Super-Resolution
Authors:
Zhengxue Wang,
Zhiqiang Yan,
Jian Yang
Abstract:
Depth super-resolution (DSR) aims to restore high-resolution (HR) depth from low-resolution (LR) one, where RGB image is often used to promote this task. Recent image guided DSR approaches mainly focus on spatial domain to rebuild depth structure. However, since the structure of LR depth is usually blurry, only considering spatial domain is not very sufficient to acquire satisfactory results. In this paper, we propose structure guided network (SGNet), a method that pays more attention to gradient and frequency domains, both of which have the inherent ability to capture high-frequency structure. Specifically, we first introduce the gradient calibration module (GCM), which employs the accurate gradient prior of RGB to sharpen the LR depth structure. Then we present the Frequency Awareness Module (FAM) that recursively conducts multiple spectrum differencing blocks (SDB), each of which propagates the precise high-frequency components of RGB into the LR depth. Extensive experimental results on both real and synthetic datasets demonstrate the superiority of our SGNet, reaching the state-of-the-art. Codes and pre-trained models are available at https://github.com/yanzq95/SGNet.
Submitted 13 December, 2023; v1 submitted 10 December, 2023;
originally announced December 2023.
-
Interface-Induced Superconductivity in Magnetic Topological Insulator-Iron Chalcogenide Heterostructures
Authors:
Hemian Yi,
Yi-Fan Zhao,
Ying-Ting Chan,
Jiaqi Cai,
Ruobing Mei,
Xianxin Wu,
Zi-Jie Yan,
Ling-Jie Zhou,
Ruoxi Zhang,
Zihao Wang,
Stephen Paolini,
Run Xiao,
Ke Wang,
Anthony R. Richardella,
John Singleton,
Laurel E. Winter,
Thomas Prokscha,
Zaher Salman,
Andreas Suter,
Purnima P. Balakrishnan,
Alexander J. Grutter,
Moses H. W. Chan,
Nitin Samarth,
Xiaodong Xu,
Weida Wu
, et al. (2 additional authors not shown)
Abstract:
When two different electronic materials are brought together, the resultant interface often shows unexpected quantum phenomena, including interfacial superconductivity and Fu-Kane topological superconductivity (TSC). Here, we use molecular beam epitaxy (MBE) to synthesize heterostructures formed by stacking together two magnetic materials, a ferromagnetic topological insulator (TI) and an antiferromagnetic iron chalcogenide (FeTe). We discover emergent interface-induced superconductivity in these heterostructures and demonstrate the trifecta occurrence of superconductivity, ferromagnetism, and topological band structure in the magnetic TI layer, the three essential ingredients of chiral TSC. The unusual coexistence of ferromagnetism and superconductivity can be attributed to the high upper critical magnetic field that exceeds the Pauli paramagnetic limit for conventional superconductors at low temperatures. The magnetic TI/FeTe heterostructures with robust superconductivity and atomically sharp interfaces provide an ideal wafer-scale platform for the exploration of chiral TSC and Majorana physics, constituting an important step toward scalable topological quantum computation.
Submitted 7 December, 2023;
originally announced December 2023.
-
Regressor-Segmenter Mutual Prompt Learning for Crowd Counting
Authors:
Mingyue Guo,
Li Yuan,
Zhaoyi Yan,
Binghui Chen,
Yaowei Wang,
Qixiang Ye
Abstract:
Crowd counting has achieved significant progress by training regressors to predict instance positions. In heavily crowded scenarios, however, regressors are challenged by uncontrollable annotation variance, which causes density map bias and context information inaccuracy. In this study, we propose mutual prompt learning (mPrompt), which leverages a regressor and a segmenter as guidance for each other, solving bias and inaccuracy caused by annotation variance while distinguishing foreground from background. Specifically, mPrompt leverages point annotations to tune the segmenter and predict pseudo head masks by way of point prompt learning. It then uses the predicted segmentation masks, which serve as a spatial constraint, to rectify biased point annotations as context prompt learning. mPrompt defines a way of mutual information maximization from prompt learning, mitigating the impact of annotation variance while improving model accuracy. Experiments show that mPrompt significantly reduces the Mean Absolute Error (MAE), demonstrating its potential as a general framework for downstream vision tasks.
Submitted 3 January, 2024; v1 submitted 4 December, 2023;
originally announced December 2023.
-
Three-Dimensional Quantum Anomalous Hall Effect in Magnetic Topological Insulator Trilayers of Hundred-Nanometer Thickness
Authors:
Yi-Fan Zhao,
Ruoxi Zhang,
Zi-Ting Sun,
Ling-Jie Zhou,
Deyi Zhuo,
Zi-Jie Yan,
Hemian Yi,
Ke Wang,
Moses H. W. Chan,
Chao-Xing Liu,
K. T. Law,
Cui-Zu Chang
Abstract:
Magnetic topological states refer to a class of exotic phases in magnetic materials with their non-trivial topological property determined by magnetic spin configurations. An example of such states is the quantum anomalous Hall (QAH) state, which is a zero magnetic field manifestation of the quantum Hall effect. Current research in this direction focuses on QAH insulators with a thickness of less than 10 nm. Thick QAH insulators in the three-dimensional (3D) regime are limited, largely due to inevitable bulk carriers being introduced in thick magnetic TI samples. Here, we employ molecular beam epitaxy (MBE) to synthesize magnetic TI trilayers with a thickness of up to ~106 nm. We find these samples exhibit well-quantized Hall resistance and vanishing longitudinal resistance at zero magnetic field. By varying magnetic dopants, gate voltages, temperature, and external magnetic fields, we examine the properties of these thick QAH insulators and demonstrate the robustness of the 3D QAH effect. The realization of the well-quantized 3D QAH effect indicates that the nonchiral side surface states of our thick magnetic TI trilayers are gapped and thus do not affect the QAH quantization. The 3D QAH insulators of hundred-nanometer thickness provide a promising platform for the exploration of fundamental physics, including axion physics and image magnetic monopole, and the advancement of electronic and spintronic devices to circumvent Moore's law.
Submitted 7 December, 2023; v1 submitted 3 December, 2023;
originally announced December 2023.
-
Observational Evidence of a Centi-parsec Supermassive Black Hole Binary Existing in the Nearby Galaxy M81
Authors:
Wu Jiang,
Zhiqiang Shen,
Ivan Martí-Vidal,
Zhen Yan,
Lei Huang,
Roman Gold,
Ya-** Li,
Fuguo Xie,
Noriyuki Kawaguchi
Abstract:
Studying a centi-parsec supermassive black hole binary (SMBHB) would allow us to explore a new parameter space in active galactic nuclei, and these objects are also potential sources of gravitational waves. We report evidence that an SMBHB with an orbital period of about 30 yr may be resident in the nearby galactic nucleus M81. This orbital period and the known mass of M81 imply an orbital separation of about 0.02 pc. The jet emanating from the primary black hole showed short-period wobbling with a period of about 16.7 yr, superposed on a long-term precession with a timescale of several hundred years. Periodic radio and X-ray outbursts were also found two times per orbital period, which could be explained by a double-peaked mass accretion rate variation per binary orbit. If confirmed, M81 would be one of the closest SMBHB candidates, providing a rare opportunity to study the final parsec problem.
Submitted 3 December, 2023;
originally announced December 2023.
-
Multi-Scale 3D Gaussian Splatting for Anti-Aliased Rendering
Authors:
Zhiwen Yan,
Weng Fei Low,
Yu Chen,
Gim Hee Lee
Abstract:
3D Gaussians have recently emerged as a highly efficient representation for 3D reconstruction and rendering. Despite their high rendering quality and speed at high resolutions, both deteriorate drastically when rendering at lower resolutions or from faraway camera positions. During low-resolution or faraway rendering, the pixel size of the image can fall below the Nyquist frequency compared to the screen size of each splatted 3D Gaussian, leading to aliasing effects. The rendering is also drastically slowed down by the sequential alpha blending of more splatted Gaussians per pixel. To address these issues, we propose a multi-scale 3D Gaussian splatting algorithm, which maintains Gaussians at different scales to represent the same scene. Higher-resolution images are rendered with more small Gaussians, and lower-resolution images are rendered with fewer larger Gaussians. With similar training time, our algorithm can achieve 13%-66% PSNR and 160%-2400% rendering speed improvement at 4$\times$-128$\times$ scale rendering on the Mip-NeRF360 dataset compared to single-scale 3D Gaussian splatting. Our code and more results are available on our project website https://jokeryan.github.io/projects/ms-gs/
Submitted 28 May, 2024; v1 submitted 27 November, 2023;
originally announced November 2023.
-
Animate124: Animating One Image to 4D Dynamic Scene
Authors:
Yuyang Zhao,
Zhiwen Yan,
Enze Xie,
Lanqing Hong,
Zhenguo Li,
Gim Hee Lee
Abstract:
We introduce Animate124 (Animate-one-image-to-4D), the first work to animate a single in-the-wild image into 3D video through textual motion descriptions, an underexplored problem with significant applications. Our 4D generation leverages an advanced 4D grid dynamic Neural Radiance Field (NeRF) model, optimized in three distinct stages using multiple diffusion priors. Initially, a static model is optimized using the reference image, guided by 2D and 3D diffusion priors, which serves as the initialization for the dynamic NeRF. Subsequently, a video diffusion model is employed to learn the motion specific to the subject. However, the object in the 3D videos tends to drift away from the reference image over time. This drift is mainly due to the misalignment between the text prompt and the reference image in the video diffusion model. In the final stage, a personalized diffusion prior is therefore utilized to address the semantic drift. As the pioneering image-text-to-4D generation framework, our method demonstrates significant advancements over existing baselines, evidenced by comprehensive quantitative and qualitative assessments.
Submitted 18 February, 2024; v1 submitted 24 November, 2023;
originally announced November 2023.
-
Cycle Invariant Positional Encoding for Graph Representation Learning
Authors:
Zuoyu Yan,
Tengfei Ma,
Liangcai Gao,
Zhi Tang,
Chao Chen,
Yusu Wang
Abstract:
Cycles are fundamental elements in graph-structured data and have demonstrated their effectiveness in enhancing graph learning models. To encode such information into a graph learning framework, prior works often extract a summary quantity, ranging from the number of cycles to the more sophisticated persistence diagram summaries. However, more detailed information, such as which edges are encoded in a cycle, has not yet been used in graph neural networks. In this paper, we take one step towards addressing this gap and propose a structure encoding module, called CycleNet, that encodes cycle information via edge structure encoding in a permutation-invariant manner. To efficiently encode the space of all cycles, we start with a cycle basis (i.e., a minimal set of cycles generating the cycle space), which we compute via the kernel of the 1-dimensional Hodge Laplacian of the input graph. To guarantee the encoding is invariant w.r.t. the choice of cycle basis, we encode the cycle information via the orthogonal projector of the cycle basis, which is inspired by BasisNet proposed by Lim et al. We also develop a more efficient variant, which, however, requires that the input graph has a unique shortest cycle basis. To demonstrate the effectiveness of the proposed module, we provide a theoretical analysis of its expressive power. Moreover, we show via a range of experiments that networks enhanced by our CycleNet module perform better in various benchmarks compared to several existing SOTA models.
Submitted 30 November, 2023; v1 submitted 24 November, 2023;
originally announced November 2023.
-
IEKM: A Model Incorporating External Keyword Matrices
Authors:
Cheng Luo,
Qin Li,
Zhao Yan,
Mengliang Rao,
Yunbo Cao
Abstract:
A customer service platform system with a core semantic textual similarity (STS) task faces two urgent challenges. First, one platform system needs to adapt to customers from different domains, i.e., different domains adaptation (DDA). Second, it is difficult for the model of the platform system to distinguish sentence pairs that are literally close but semantically different, i.e., hard negative samples. In this paper, we propose a model incorporating external keyword matrices (IEKM) to address these challenges. The model uses external tools or dictionaries to construct external matrices and fuses them into the self-attention layers of the Transformer structure through gating units, thus enabling flexible corrections to the model results. We evaluate the method on multiple datasets, and the results show that it improves performance on all of them. To demonstrate that our method can effectively address both challenges, we conduct a flexible correction experiment, which results in an increase in the F1 value from 56.61 to 73.53. Our code will be publicly available.
Submitted 20 November, 2023;
originally announced November 2023.
-
The first Ka-band (26.1-35 GHz) blind line survey towards Orion KL
Authors:
Xunchuan Liu,
Tie Liu,
Zhiqiang Shen,
Sheng-Li Qin,
Qiuyi Luo,
Yan Gong,
Yu Cheng,
Christian Henkel,
Qilao Gu,
Fengyao Zhu,
Tianwei Zhang,
Rongbing Zhao,
Yajun Wu,
Bin Li,
Juan Li,
Zhang Zhao,
**qing Wang,
Weiye Zhong,
Qinghui Liu,
Bo Xia,
Li Fu,
Zhen Yan,
Chao Zhang,
Lingling Wang,
Qian Ye
, et al. (9 additional authors not shown)
Abstract:
We conducted a Ka-band (26.1--35 GHz) line survey towards Orion KL using the TianMa 65-m Radio Telescope (TMRT). It is the first blind line survey in the Ka band, and achieves a sensitivity of mK level (1--3 mK at a spectral resolution of $\sim$1 km s$^{-1}$). In total, 592 Gaussian features are extracted. Among them, 257 radio recombination lines (RRLs) are identified. The maximum $Δn$ of RRLs of H, He and C are 20, 15, and 5, respectively. Through stacking, we have detected the $β$ lines of ion RRLs (RRLs of C$^+$ with possible contribution of other ions like O$^+$) for the first time, and a tentative signal of the $γ$ lines of ion RRLs can also be seen in the stacked spectrum. In addition, 318 other line features were assigned to 37 molecular species, and ten of these species were not detected in the Q-band survey of TMRT. The vibrationally excited states of nine species were also detected. Emission of most species can be modeled under LTE. A number of transitions of E-CH3OH ($J_2-J_1$) display maser effects, which are confirmed by our modeling, and besides the bumping peak at $J\sim 6$ there is another peak at $J\sim 13$. Methylcyanoacetylene (CH$_3$C$_3$N) is detected in Orion KL for the first time. This work emphasizes that the Ka band, which was long ignored for spectral line surveys, is very useful for surveying RRLs and molecular lines simultaneously.
Submitted 20 November, 2023;
originally announced November 2023.
-
Magnetic-field-induced nonlinear transport in HfTe5
Authors:
Cheng Zhang,
**shan Yang,
Zhongbo Yan,
Xiang Yuan,
Yanwen Liu,
Minhao Zhao,
Alexey Suslov,
**glei Zhang,
Li Pi,
Zhong Wang,
Faxian Xiu
Abstract:
The interplay of electron correlations and topological phases gives rise to various exotic phenomena including fractionalization, excitonic instability, and axionic excitation. Recently discovered transition-metal pentatellurides can reach the ultra-quantum limit in low magnetic fields and serve as good candidates for achieving such a combination. Here, we report evidence of a density wave and a metal-insulator transition in HfTe5 induced by intense magnetic fields. Using the nonlinear transport technique, we detect a distinct nonlinear conduction behavior in the longitudinal resistivity within the a-c plane, corresponding to the formation of a density wave induced by magnetic fields. In high fields, the onset of the nonlinear conduction in the Hall resistivity indicates an impurity-pinned magnetic freeze-out as the possible origin of the insulating behavior. These frozen electrons can be gradually re-activated into mobile states above a threshold electric field. This experimental evidence calls for further investigations into the underlying mechanism for the bulk quantum Hall effect and field-induced phase transitions in pentatellurides.
Submitted 19 November, 2023;
originally announced November 2023.
-
Transcending Forgery Specificity with Latent Space Augmentation for Generalizable Deepfake Detection
Authors:
Zhiyuan Yan,
Yuhao Luo,
Siwei Lyu,
Qingshan Liu,
Baoyuan Wu
Abstract:
Deepfake detection faces a critical generalization hurdle, with performance deteriorating when there is a mismatch between the distributions of training and testing data. A widely accepted explanation is the tendency of these detectors to overfit to forgery-specific artifacts, rather than learning features that are widely applicable across various forgeries. To address this issue, we propose a simple yet effective detector called LSDA (Latent Space Data Augmentation), which is based on a heuristic idea: representations with a wider variety of forgeries should be able to learn a more generalizable decision boundary, thereby mitigating the overfitting of method-specific features. Following this idea, we propose to enlarge the forgery space by constructing and simulating variations within and across forgery features in the latent space. This approach encompasses the acquisition of enriched, domain-specific features and the facilitation of smoother transitions between different forgery types, effectively bridging domain gaps. Our approach culminates in refining a binary classifier that leverages the distilled knowledge from the enhanced features, striving for a generalizable deepfake detector. Comprehensive experiments show that our proposed method is surprisingly effective and transcends state-of-the-art detectors across several widely used benchmarks.
Submitted 28 March, 2024; v1 submitted 19 November, 2023;
originally announced November 2023.
-
EdgeFM: Leveraging Foundation Model for Open-set Learning on the Edge
Authors:
Bufang Yang,
Lixing He,
Neiwen Ling,
Zhenyu Yan,
Guoliang Xing,
Xian Shuai,
Xiaozhe Ren,
Xin Jiang
Abstract:
Deep Learning (DL) models have been widely deployed on IoT devices with the help of advancements in DL algorithms and chips. However, the limited resources of edge devices make it hard for these on-device DL models to generalize to diverse environments and tasks. Although the recently emerged foundation models (FMs) show impressive generalization power, how to effectively leverage the rich knowledge of FMs on resource-limited edge devices remains unexplored. In this paper, we propose EdgeFM, a novel edge-cloud cooperative system with open-set recognition capability. EdgeFM selectively uploads unlabeled data to query the FM on the cloud and customizes the specific knowledge and architectures for edge models. Meanwhile, EdgeFM conducts dynamic model switching at run-time, taking into account both data uncertainty and dynamic network variations, which keeps the accuracy close to that of the original FM. We implement EdgeFM using two FMs on two edge platforms. We evaluate EdgeFM on three public datasets and two self-collected datasets. Results show that EdgeFM can reduce the end-to-end latency by up to 3.2x and achieve a 34.3% accuracy increase compared with the baseline.
Submitted 22 November, 2023; v1 submitted 18 November, 2023;
originally announced November 2023.
-
Worldsheet Formalism for Decoupling Limits in String Theory
Authors:
Joaquim Gomis,
Ziqi Yan
Abstract:
We study the bosonic sector of a decoupling limit of type IIA superstring theory, where a background Ramond-Ramond one-form is fine-tuned to its critical value, such that it cancels the associated background D0-brane tension. The light excitations in this critical limit are D0-branes, whose dynamics is described by the Banks-Fischler-Shenker-Susskind (BFSS) Matrix theory that corresponds to M-theory in the Discrete Light-Cone Quantization (DLCQ). We develop the worldsheet formalism for the fundamental string in the same critical limit of type IIA superstring theory. We show that the fundamental string develops singularities on its worldsheet, whose topology is described by nodal Riemann spheres as in ambitwistor string theory. We study the T-duality transformations of this string sigma model and provide a worldsheet derivation for the recently revived and expanded duality web that unifies a zoo of decoupling limits in type II superstring theories. By matching the string worldsheet actions, we demonstrate how some of these decoupling limits are related to tensionless (and ambitwistor) string theory, Carrollian string theory, the Spin Matrix limits of the AdS/CFT correspondence, and more.
Submitted 5 June, 2024; v1 submitted 17 November, 2023;
originally announced November 2023.
-
Unification of Decoupling Limits in String and M-theory
Authors:
Chris D. A. Blair,
Johannes Lahnsteiner,
Niels A. J. Obers,
Ziqi Yan
Abstract:
We study and extend the duality web unifying different decoupling limits of type II superstring theories and M-theory. We systematically build connections to different corners, such as Matrix theories, nonrelativistic string and M-theory, tensionless (and ambitwistor) string theory, Carrollian string theory, and Spin Matrix limits of AdS/CFT. We discuss target space, worldsheet, and worldvolume aspects of these limits in arbitrary curved backgrounds.
Submitted 21 April, 2024; v1 submitted 17 November, 2023;
originally announced November 2023.
-
Magnetic field-induced phases and spin Hamiltonian in Cs2CoBr4
Authors:
L. Facheris,
S. D. Nabi,
K. Yu. Povarov,
Z. Yan,
A. Glezer Moshe,
U. Nagel,
T. Rõõm,
A. Podlesnyak,
E. Ressouche,
K. Beauvois,
J. R. Stewart,
P. Manuel,
D. Khalyavin,
F. Orlandi,
A. Zheludev
Abstract:
Magnetic structures and spin excitations are studied across the phase diagram of the geometrically frustrated S = 3/2 quantum antiferromagnet Cs2CoBr4 in magnetic fields applied along the magnetic easy axis, using neutron diffraction, inelastic neutron scattering and THz absorption spectroscopy. The data are analyzed, where appropriate, using extended SU(4) linear spin wave theory. A minimal magnetic Hamiltonian is proposed based on measurements in the high field polarized state. It deviates considerably from the previously considered models. Additional dilatometry experiments highlight the importance of magnetoelastic coupling in this system.
Submitted 14 March, 2024; v1 submitted 17 November, 2023;
originally announced November 2023.
-
FRCSyn Challenge at WACV 2024: Face Recognition Challenge in the Era of Synthetic Data
Authors:
Pietro Melzi,
Ruben Tolosana,
Ruben Vera-Rodriguez,
Minchul Kim,
Christian Rathgeb,
Xiaoming Liu,
Ivan DeAndres-Tame,
Aythami Morales,
Julian Fierrez,
Javier Ortega-Garcia,
Weisong Zhao,
Xiangyu Zhu,
Zheyu Yan,
Xiao-Yu Zhang,
**lin Wu,
Zhen Lei,
Suvidha Tripathi,
Mahak Kothari,
Md Haider Zama,
Debayan Deb,
Bernardo Biesseck,
Pedro Vidal,
Roger Granada,
Guilherme Fickel,
Gustavo Führ
, et al. (22 additional authors not shown)
Abstract:
Despite the widespread adoption of face recognition technology around the world, and its remarkable performance on current benchmarks, there are still several challenges that must be covered in more detail. This paper offers an overview of the Face Recognition Challenge in the Era of Synthetic Data (FRCSyn) organized at WACV 2024. This is the first international challenge aiming to explore the use of synthetic data in face recognition to address existing limitations in the technology. Specifically, the FRCSyn Challenge targets concerns related to data privacy issues, demographic biases, generalization to unseen scenarios, and performance limitations in challenging scenarios, including significant age disparities between enrollment and testing, pose variations, and occlusions. The results achieved in the FRCSyn Challenge, together with the proposed benchmark, contribute significantly to the application of synthetic data to improve face recognition technology.
Submitted 17 November, 2023;
originally announced November 2023.
-
Relativistic hyperpolarizabilities for atomic H, Li, and Be$^+$ systems
Authors:
Shan-Shan Lu,
Hong-Yuan Zheng,
Zong-Chao Yan,
James F. Babb,
Li-Yan Tang
Abstract:
The hyperpolarizability of an atom is a property that describes the nonlinear interaction between an atom and an external electric field leading to a higher-order Stark shift. Accurate evaluations of these coefficients for various systems are crucial to improve experimental precision in advanced atom-based clocks. However, there is a dearth of reports on atomic hyperpolarizabilities, particularly regarding relativistic hyperpolarizabilities. Thus, in this paper, we use fourth-order perturbation theory to establish a universal formula for the hyperpolarizability and calculate the relativistic hyperpolarizabilities of low-lying states for the monovalent electronic atomic systems H, Li, and Be$^+$. The highly accurate results given here for the H atom could serve as benchmarks for other theoretical methods.
Submitted 17 November, 2023;
originally announced November 2023.
-
Introducing CHAD -- An ADM1 Solver for Direct Linking to Lagrangian CFD Software
Authors:
Prashant Kumar,
Zhenghao Yan,
Soroush Dabiri,
Nikolaus Rauch,
Wolfgang Rauch
Abstract:
Standard methods for modeling anaerobic digestion processes assume homogeneous conditions inside the tank and thus neglect hydrodynamics. In this work, we present the software toolbox Coupled Hydrodynamics and Anaerobic Digestion (CHAD), a novel parallelized solver that is capable of utilizing CFD results as the basis for Anaerobic Digestion Model No. 1 (ADMno1) simulations. CHAD uses the output of a particle-based Lagrangian CFD solver, i.e., DualSPHysics (DSPH), as input and provides a parallelized C++ implementation of the standard ADMno1. This paper demonstrates a conceptual and numerical verification of the toolbox and outlines the future pathway to enhance the approach.
Submitted 15 November, 2023;
originally announced November 2023.