Search | arXiv e-print repository

Decentralized Uncoded Storage Elastic Computing with Heterogeneous Computation Speeds

Authors: Wenbo Huang, Xudong You, Kai Wan, Robert Caiming Qiu, Mingyue Ji

Abstract: Elasticity plays an important role in modern cloud computing systems. Elastic computing allows virtual machines (i.e., computing nodes) to be preempted when high-priority jobs arise, and also allows new virtual machines to participate in the computation. In 2018, Yang et al. introduced Coded Storage Elastic Computing (CSEC) to address the elasticity using coding technology, with lower storage and… ▽ More Elasticity plays an important role in modern cloud computing systems. Elastic computing allows virtual machines (i.e., computing nodes) to be preempted when high-priority jobs arise, and also allows new virtual machines to participate in the computation. In 2018, Yang et al. introduced Coded Storage Elastic Computing (CSEC) to address the elasticity using coding technology, with lower storage and computation load requirements. However, CSEC is limited to certain types of computations (e.g., linear) due to the coded data storage based on linear coding. Then Centralized Uncoded Storage Elastic Computing (CUSEC) with heterogeneous computation speeds was proposed, which directly copies parts of data into the virtual machines. In all existing works in elastic computing, the storage assignment is centralized, meaning that the number and identity of all virtual machines possible used in the whole computation process are known during the storage assignment. In this paper, we consider Decentralized Uncoded Storage Elastic Computing (DUSEC) with heterogeneous computation speeds, where any available virtual machine can join the computation which is not predicted and thus coordination among different virtual machines' storage assignments is not allowed. Under a decentralized storage assignment originally proposed in coded caching by Maddah-Ali and Niesen, we propose a computing scheme with closed-form optimal computation time. We also run experiments over MNIST dataset with Softmax regression model through the Tencent cloud platform, and the experiment results demonstrate that the proposed DUSEC system approaches the state-of-art best storage assignment in the CUSEC system in computation time. △ Less

Submitted 1 March, 2024; originally announced March 2024.

Comments: 10 pages, 8 figures, submitted to ISIT2024

arXiv:2403.00491 [pdf, ps, other]

Analyzing Divergence for Nondeterministic Probabilistic Models

Authors: Hao Wu, Yuxi Fu, Huan Long, Xian Xu, Wenbo Zhang

Abstract: Branching and weak probabilistic bisimilarities are two well-known notions capturing behavioral equivalence between nondeterministic probabilistic systems. For probabilistic systems, divergence is of major concern. Recently several divergence-sensitive refinements of branching and weak probabilistic bisimilarities have been proposed in the literature. Both the definitions of these equivalences and… ▽ More Branching and weak probabilistic bisimilarities are two well-known notions capturing behavioral equivalence between nondeterministic probabilistic systems. For probabilistic systems, divergence is of major concern. Recently several divergence-sensitive refinements of branching and weak probabilistic bisimilarities have been proposed in the literature. Both the definitions of these equivalences and the techniques to investigate them differ significantly. This paper presents a comprehensive comparative study on divergence-sensitive behavioral equivalence relations that refine the branching and weak probabilistic bisimilarities. Additionally, these equivalence relations are shown to have efficient checking algorithms. The techniques of this paper might be of independent interest in a more general setting. △ Less

Submitted 1 March, 2024; originally announced March 2024.

arXiv:2402.19385 [pdf, other]

Towards Safe and Reliable Autonomous Driving: Dynamic Occupancy Set Prediction

Authors: Wenbo Shao, Jiahui Xu, Wenhao Yu, Jun Li, Hong Wang

Abstract: In the rapidly evolving field of autonomous driving, reliable prediction is pivotal for vehicular safety. However, trajectory predictions often deviate from actual paths, particularly in complex and challenging environments, leading to significant errors. To address this issue, our study introduces a novel method for Dynamic Occupancy Set (DOS) prediction, it effectively combines advanced trajecto… ▽ More In the rapidly evolving field of autonomous driving, reliable prediction is pivotal for vehicular safety. However, trajectory predictions often deviate from actual paths, particularly in complex and challenging environments, leading to significant errors. To address this issue, our study introduces a novel method for Dynamic Occupancy Set (DOS) prediction, it effectively combines advanced trajectory prediction networks with a DOS prediction module, overcoming the shortcomings of existing models. It provides a comprehensive and adaptable framework for predicting the potential occupancy sets of traffic participants. The innovative contributions of this study include the development of a novel DOS prediction model specifically tailored for navigating complex scenarios, the introduction of precise DOS mathematical representations, and the formulation of optimized loss functions that collectively advance the safety and efficiency of autonomous systems. Through rigorous validation, our method demonstrates marked improvements over traditional models, establishing a new benchmark for safety and operational efficiency in intelligent transportation systems. △ Less

Submitted 2 June, 2024; v1 submitted 29 February, 2024; originally announced February 2024.

Comments: Accepted by IEEE IV 2024

arXiv:2402.18816 [pdf, other]

Nano-Electromagnetic Super-dephasing in Collective Atom-Atom Interactions

Authors: Wenbo Sun, Adrian E. Rubio López, Zubin Jacob

Abstract: Pure dephasing and spontaneous emission are two non-unitary processes of atoms or spins interacting with fluctuating electromagnetic (EM) modes. Collective spontaneous emission (e.g., superradiance) originates from interactions with EM modes in resonance with atoms and has received considerable attention. Meanwhile, the analogous collective dephasing phenomena remain poorly understood. Here, we in… ▽ More Pure dephasing and spontaneous emission are two non-unitary processes of atoms or spins interacting with fluctuating electromagnetic (EM) modes. Collective spontaneous emission (e.g., superradiance) originates from interactions with EM modes in resonance with atoms and has received considerable attention. Meanwhile, the analogous collective dephasing phenomena remain poorly understood. Here, we introduce the nano-EM super-dephasing phenomenon arising in the photonic environment near lossy material interfaces. We show that this effect is enhanced by over 10 orders of magnitude compared to free space or photonic cavities due to the presence of long-range correlations in low-frequency evanescent EM fluctuations. We unravel the universality of nano-EM super-dephasing behaviors near ferrimagnets, metals, and superconductors and their dependence on low-frequency material properties. We demonstrate that the scaling of nano-EM super-dephasing is independent of EM modes' wavelengths and differs from the conventional $N^2$ scaling of superradiance by analyzing the decoherence of entangled states, including GHZ states. Finally, we show how to experimentally isolate and control super-dephasing to open interesting frontiers for scalable quantum systems. △ Less

Submitted 28 February, 2024; originally announced February 2024.

Comments: 6+17 pages, 3+3 figures

arXiv:2402.16866 [pdf, other]

Computation Rate Maximization for Wireless Powered Edge Computing With Multi-User Cooperation

Authors: Yang Li, Xing Zhang, Bo Lei, Qianying Zhao, Min Wei, Zheyan Qu, Wenbo Wang

Abstract: The combination of mobile edge computing (MEC) and radio frequency-based wireless power transfer (WPT) presents a promising technique for providing sustainable energy supply and computing services at the network edge. This study considers a wireless-powered mobile edge computing system that includes a hybrid access point (HAP) equipped with a computing unit and multiple Internet of Things (IoT) de… ▽ More The combination of mobile edge computing (MEC) and radio frequency-based wireless power transfer (WPT) presents a promising technique for providing sustainable energy supply and computing services at the network edge. This study considers a wireless-powered mobile edge computing system that includes a hybrid access point (HAP) equipped with a computing unit and multiple Internet of Things (IoT) devices. In particular, we propose a novel muti-user cooperation scheme to improve computation performance, where collaborative clusters are dynamically formed. Each collaborative cluster comprises a source device (SD) and an auxiliary device (AD), where the SD can partition the computation task into various segments for local processing, offloading to the HAP, and remote execution by the AD with the assistance of the HAP. Specifically, we aims to maximize the weighted sum computation rate (WSCR) of all the IoT devices in the network. This involves jointly optimizing collaboration, time and data allocation among multiple IoT devices and the HAP, while considering the energy causality property and the minimum data processing requirement of each device. Initially, an optimization algorithm based on the interior-point method is designed for time and data allocation. Subsequently, a priority-based iterative algorithm is developed to search for a near-optimal solution to the multi-user collaboration scheme. Finally, a deep learning-based approach is devised to further accelerate the algorithm's operation, building upon the initial two algorithms. Simulation results show that the performance of the proposed algorithms is comparable to that of the exhaustive search method, and the deep learning-based algorithm significantly reduces the execution time of the algorithm. △ Less

Submitted 22 January, 2024; originally announced February 2024.

Comments: Accepted to IEEE Open Journal of the Communications Society

arXiv:2402.14762 [pdf, other]

MT-Bench-101: A Fine-Grained Benchmark for Evaluating Large Language Models in Multi-Turn Dialogues

Authors: Ge Bai, Jie Liu, Xingyuan Bu, Yancheng He, Jiaheng Liu, Zhanhui Zhou, Zhuoran Lin, Wenbo Su, Tiezheng Ge, Bo Zheng, Wanli Ouyang

Abstract: The advent of Large Language Models (LLMs) has drastically enhanced dialogue systems. However, comprehensively evaluating the dialogue abilities of LLMs remains a challenge. Previous benchmarks have primarily focused on single-turn dialogues or provided coarse-grained and incomplete assessments of multi-turn dialogues, overlooking the complexity and fine-grained nuances of real-life dialogues. To… ▽ More The advent of Large Language Models (LLMs) has drastically enhanced dialogue systems. However, comprehensively evaluating the dialogue abilities of LLMs remains a challenge. Previous benchmarks have primarily focused on single-turn dialogues or provided coarse-grained and incomplete assessments of multi-turn dialogues, overlooking the complexity and fine-grained nuances of real-life dialogues. To address this issue, we introduce MT-Bench-101, specifically designed to evaluate the fine-grained abilities of LLMs in multi-turn dialogues. By conducting a detailed analysis of real multi-turn dialogue data, we construct a three-tier hierarchical ability taxonomy comprising 4208 turns across 1388 multi-turn dialogues in 13 distinct tasks. We then evaluate 21 popular LLMs based on MT-Bench-101, conducting comprehensive analyses from both ability and task perspectives and observing differing trends in LLMs performance across dialogue turns within various tasks. Further analysis indicates that neither utilizing common alignment techniques nor chat-specific designs has led to obvious enhancements in the multi-turn abilities of LLMs. Extensive case studies suggest that our designed tasks accurately assess the corresponding multi-turn abilities. The data and code are available at \url{https://github.com/mtbench101/mt-bench-101}. △ Less

Submitted 25 June, 2024; v1 submitted 22 February, 2024; originally announced February 2024.

Comments: [ACL 2024] The first three authors contribute equally, 34 pages, repo at https://github.com/mtbench101/mt-bench-101

arXiv:2402.14660 [pdf, other]

ConceptMath: A Bilingual Concept-wise Benchmark for Measuring Mathematical Reasoning of Large Language Models

Authors: Yanan Wu, Jie Liu, Xingyuan Bu, Jiaheng Liu, Zhanhui Zhou, Yuanxing Zhang, Chenchen Zhang, Zhiqi Bai, Haibin Chen, Tiezheng Ge, Wanli Ouyang, Wenbo Su, Bo Zheng

Abstract: This paper introduces ConceptMath, a bilingual (English and Chinese), fine-grained benchmark that evaluates concept-wise mathematical reasoning of Large Language Models (LLMs). Unlike traditional benchmarks that evaluate general mathematical reasoning with an average accuracy, ConceptMath systematically organizes math problems under a hierarchy of math concepts, so that mathematical reasoning can… ▽ More This paper introduces ConceptMath, a bilingual (English and Chinese), fine-grained benchmark that evaluates concept-wise mathematical reasoning of Large Language Models (LLMs). Unlike traditional benchmarks that evaluate general mathematical reasoning with an average accuracy, ConceptMath systematically organizes math problems under a hierarchy of math concepts, so that mathematical reasoning can be evaluated at different granularity with concept-wise accuracies. Based on our ConcepthMath, we evaluate a broad range of LLMs, and we observe existing LLMs, though achieving high average accuracies on traditional benchmarks, exhibit significant performance variations across different math concepts and may even fail catastrophically on the most basic ones. Besides, we also introduce an efficient fine-tuning strategy to enhance the weaknesses of existing LLMs. Finally, we hope ConceptMath could guide the developers to understand the fine-grained mathematical abilities of their models and facilitate the growth of foundation models. △ Less

Submitted 23 February, 2024; v1 submitted 22 February, 2024; originally announced February 2024.

Comments: The benchmark dataset will be released soon

arXiv:2402.11496 [pdf, other]

Point-Wise Vibration Pattern Production via a Sparse Actuator Array for Surface Tactile Feedback

Authors: Xiaosa Li, Runze Zhao, Chengyue Lu, Xiao Xiao, Wenbo Ding

Abstract: Surface vibration tactile feedback is capable of conveying various semantic information to humans via the handheld electronic devices, like smartphone, touch panel,and game controller. However, covering the whole device contacting surface with dense actuator arrangement can affect its normal use, how to produce desired vibration patterns at any contact point with only several sparse actuators depl… ▽ More Surface vibration tactile feedback is capable of conveying various semantic information to humans via the handheld electronic devices, like smartphone, touch panel,and game controller. However, covering the whole device contacting surface with dense actuator arrangement can affect its normal use, how to produce desired vibration patterns at any contact point with only several sparse actuators deployed on the handled device surface remains a significant challenge. In this work, we develop a tactile feedback board with only five actuators in the size of a smartphone, and achieve the precise vibration pattern production that can focus at any desired position all over the board. Specifically, we investigate the vibration characteristics of single passive coil actuator, and construct its vibration pattern model at any position on the feedback board surface. Optimal phase and amplitude modulation, found with the simulated annealing algorithm, is employed with five actuators in a sparse array. And all actuators' vibration patterns are superimposed linearly to synthetically generate different onboard vibration energy distribution for tactile sensing. Experiments demonstrated that for point-wise vibration pattern production on our tactile board achieved an average level of about 0.9 in the Structural Similarity Index Measure (SSIM) evaluation, when compared to the ideal single-point-focused target vibration pattern. The sparse actuator array can be easily embedded into usual handheld electronic devices, which shows a good significant implication for enriching their haptic interaction functionalities. △ Less

Submitted 18 February, 2024; originally announced February 2024.

arXiv:2402.09179 [pdf, other]

Instruction Backdoor Attacks Against Customized LLMs

Authors: Rui Zhang, Hongwei Li, Rui Wen, Wenbo Jiang, Yuan Zhang, Michael Backes, Yun Shen, Yang Zhang

Abstract: The increasing demand for customized Large Language Models (LLMs) has led to the development of solutions like GPTs. These solutions facilitate tailored LLM creation via natural language prompts without coding. However, the trustworthiness of third-party custom versions of LLMs remains an essential concern. In this paper, we propose the first instruction backdoor attacks against applications integ… ▽ More The increasing demand for customized Large Language Models (LLMs) has led to the development of solutions like GPTs. These solutions facilitate tailored LLM creation via natural language prompts without coding. However, the trustworthiness of third-party custom versions of LLMs remains an essential concern. In this paper, we propose the first instruction backdoor attacks against applications integrated with untrusted customized LLMs (e.g., GPTs). Specifically, these attacks embed the backdoor into the custom version of LLMs by designing prompts with backdoor instructions, outputting the attacker's desired result when inputs contain the pre-defined triggers. Our attack includes 3 levels of attacks: word-level, syntax-level, and semantic-level, which adopt different types of triggers with progressive stealthiness. We stress that our attacks do not require fine-tuning or any modification to the backend LLMs, adhering strictly to GPTs development guidelines. We conduct extensive experiments on 6 prominent LLMs and 5 benchmark text classification datasets. The results show that our instruction backdoor attacks achieve the desired attack performance without compromising utility. Additionally, we propose two defense strategies and demonstrate their effectiveness in reducing such attacks. Our findings highlight the vulnerability and the potential risks of LLM customization such as GPTs. △ Less

Submitted 28 May, 2024; v1 submitted 14 February, 2024; originally announced February 2024.

arXiv:2402.07960 [pdf]

doi 10.4236/ojbm.2023.113061

An Analysis of the Recovery Path of the Consumer Sector in the Post-Pandemic Era

Authors: Wenbo Lyu, Jiayi Zhu, Yunan Ding, Keming Zhang

Abstract: This paper proposes a referencable pattern of the recovery of the consumption sector, a new dimension to observe and evaluate the intrinsic value of the consumption sector, and proposes the concept of sensory-based consumption and the ranking of the weights of different categories;creates the concept of digital consumption index, coupled with digital RMB index and China-style digital economy index… ▽ More This paper proposes a referencable pattern of the recovery of the consumption sector, a new dimension to observe and evaluate the intrinsic value of the consumption sector, and proposes the concept of sensory-based consumption and the ranking of the weights of different categories;creates the concept of digital consumption index, coupled with digital RMB index and China-style digital economy index. Finally we explain the internal logic of digital consumption as a consumption upgrade tool and a higher valuation target in the context of China's economic performance in 2022 and the Chinese government's policy in 2023, leading to the investment strategy of roller conduction effect. △ Less

Submitted 11 February, 2024; originally announced February 2024.

arXiv:2402.07648 [pdf, other]

DeformNet: Latent Space Modeling and Dynamics Prediction for Deformable Object Manipulation

Authors: Chenchang Li, Zihao Ai, Tong Wu, Xiaosa Li, Wenbo Ding, Huazhe Xu

Abstract: Manipulating deformable objects is a ubiquitous task in household environments, demanding adequate representation and accurate dynamics prediction due to the objects' infinite degrees of freedom. This work proposes DeformNet, which utilizes latent space modeling with a learned 3D representation model to tackle these challenges effectively. The proposed representation model combines a PointNet enco… ▽ More Manipulating deformable objects is a ubiquitous task in household environments, demanding adequate representation and accurate dynamics prediction due to the objects' infinite degrees of freedom. This work proposes DeformNet, which utilizes latent space modeling with a learned 3D representation model to tackle these challenges effectively. The proposed representation model combines a PointNet encoder and a conditional neural radiance field (NeRF), facilitating a thorough acquisition of object deformations and variations in lighting conditions. To model the complex dynamics, we employ a recurrent state-space model (RSSM) that accurately predicts the transformation of the latent representation over time. Extensive simulation experiments with diverse objectives demonstrate the generalization capabilities of DeformNet for various deformable object manipulation tasks, even in the presence of previously unseen goals. Finally, we deploy DeformNet on an actual UR5 robotic arm to demonstrate its capability in real-world scenarios. △ Less

Submitted 12 February, 2024; originally announced February 2024.

Comments: 7 pages, Submitted to 2024 IEEE International Conference on Robotics and Automation (ICRA), Japan, Yokohama

arXiv:2402.07170 [pdf]

Research on the multi-stage impact of digital economy on rural revitalization in Hainan Province based on GPM model

Authors: Wenbo Lyu

Abstract: The rapid development of the digital economy has had a profound impact on the implementation of the rural revitalization strategy. Based on this, this study takes Hainan Province as the research object to deeply explore the impact of digital economic development on rural revitalization. The study collected panel data from 2003 to 2022 to construct an evaluation index system for the digital economy… ▽ More The rapid development of the digital economy has had a profound impact on the implementation of the rural revitalization strategy. Based on this, this study takes Hainan Province as the research object to deeply explore the impact of digital economic development on rural revitalization. The study collected panel data from 2003 to 2022 to construct an evaluation index system for the digital economy and rural revitalization and used panel regression analysis and other methods to explore the promotion effect of the digital economy on rural revitalization. Research results show that the digital economy has a significant positive impact on rural revitalization, and this impact increases as the level of fiscal expenditure increases. The issuance of digital RMB has further exerted a regulatory effect and promoted the development of the digital economy and the process of rural revitalization. At the same time, the establishment of the Hainan Free Trade Port has also played a positive role in promoting the development of the digital economy and rural revitalization. In the prediction of the optimal strategy for rural revitalization based on the development levels of the primary, secondary, and tertiary industries (Rate1, Rate2, and Rate3), it was found that rate1 can encourage Hainan Province to implement digital economic innovation, encourage rate3 to implement promotion behaviors, and increase rate2 can At the level of sustainable development when rate3 promotes rate2's digital economic innovation behavior, it can standardize rate2's production behavior to the greatest extent, accelerate the faster application of the digital economy to the rural revitalization industry, and promote the technological advancement of enterprises. △ Less

Submitted 11 February, 2024; originally announced February 2024.

arXiv:2402.06665 [pdf, other]

The Essential Role of Causality in Foundation World Models for Embodied AI

Authors: Tarun Gupta, Wenbo Gong, Chao Ma, Nick Pawlowski, Agrin Hilmkil, Meyer Scetbon, Marc Rigter, Ade Famoti, Ashley Juan Llorens, Jianfeng Gao, Stefan Bauer, Danica Kragic, Bernhard Schölkopf, Cheng Zhang

Abstract: Recent advances in foundation models, especially in large multi-modal models and conversational agents, have ignited interest in the potential of generally capable embodied agents. Such agents will require the ability to perform new tasks in many different real-world environments. However, current foundation models fail to accurately model physical interactions and are therefore insufficient for E… ▽ More Recent advances in foundation models, especially in large multi-modal models and conversational agents, have ignited interest in the potential of generally capable embodied agents. Such agents will require the ability to perform new tasks in many different real-world environments. However, current foundation models fail to accurately model physical interactions and are therefore insufficient for Embodied AI. The study of causality lends itself to the construction of veridical world models, which are crucial for accurately predicting the outcomes of possible interactions. This paper focuses on the prospects of building foundation world models for the upcoming generation of embodied agents and presents a novel viewpoint on the significance of causality within these. We posit that integrating causal considerations is vital to facilitating meaningful physical interactions with the world. Finally, we demystify misconceptions about causality in this context and present our outlook for future research. △ Less

Submitted 29 April, 2024; v1 submitted 6 February, 2024; originally announced February 2024.

arXiv:2402.05725 [pdf, other]

Dual-modal Tactile E-skin: Enabling Bidirectional Human-Robot Interaction via Integrated Tactile Perception and Feedback

Authors: Shilong Mu, Runze Zhao, Zenan Lin, Yan Huang, Shoujie Li, Chenchang Li, Xiao-** Zhang, Wenbo Ding

Abstract: To foster an immersive and natural human-robot interaction, the implementation of tactile perception and feedback becomes imperative, effectively bridging the conventional sensory gap. In this paper, we propose a dual-modal electronic skin (e-skin) that integrates magnetic tactile sensing and vibration feedback for enhanced human-robot interaction. The dual-modal tactile e-skin offers multi-functi… ▽ More To foster an immersive and natural human-robot interaction, the implementation of tactile perception and feedback becomes imperative, effectively bridging the conventional sensory gap. In this paper, we propose a dual-modal electronic skin (e-skin) that integrates magnetic tactile sensing and vibration feedback for enhanced human-robot interaction. The dual-modal tactile e-skin offers multi-functional tactile sensing and programmable haptic feedback, underpinned by a layered structure comprised of flexible magnetic films, soft silicone, a Hall sensor and actuator array, and a microcontroller unit. The e-skin captures the magnetic field changes caused by subtle deformations through Hall sensors, employing deep learning for accurate tactile perception. Simultaneously, the actuator array generates mechanical vibrations to facilitate haptic feedback, delivering diverse mechanical stimuli. Notably, the dual-modal e-skin is capable of transmitting tactile information bidirectionally, enabling object recognition and fine-weighing operations. This bidirectional tactile interaction framework will enhance the immersion and efficiency of interactions between humans and robots. △ Less

Submitted 8 February, 2024; originally announced February 2024.

Comments: 7 pages, 8 figures. Submitted to 2024 IEEE International Conference on Robotics and Automation (ICRA), Japan, Yokohama

arXiv:2402.04013 [pdf, other]

Privacy Leakage on DNNs: A Survey of Model Inversion Attacks and Defenses

Authors: Hao Fang, Yixiang Qiu, Hongyao Yu, Wenbo Yu, Jiawei Kong, Baoli Chong, Bin Chen, Xuan Wang, Shu-Tao Xia

Abstract: Model Inversion (MI) attacks aim to disclose private information about the training data by abusing access to the pre-trained models. These attacks enable adversaries to reconstruct high-fidelity data that closely aligns with the private training data, which has raised significant privacy concerns. Despite the rapid advances in the field, we lack a comprehensive overview of existing MI attacks and… ▽ More Model Inversion (MI) attacks aim to disclose private information about the training data by abusing access to the pre-trained models. These attacks enable adversaries to reconstruct high-fidelity data that closely aligns with the private training data, which has raised significant privacy concerns. Despite the rapid advances in the field, we lack a comprehensive overview of existing MI attacks and defenses. To fill this gap, this paper thoroughly investigates this field and presents a holistic survey. Firstly, our work briefly reviews the traditional MI on machine learning scenarios. We then elaborately analyze and compare numerous recent attacks and defenses on \textbf{D}eep \textbf{N}eural \textbf{N}etworks (DNNs) across multiple modalities and learning tasks. △ Less

Submitted 6 February, 2024; originally announced February 2024.

arXiv:2402.03596 [pdf, other]

PandaX-xT: a Multi-ten-tonne Liquid Xenon Observatory at the China **** Underground Laboratory

Authors: PandaX Collaboration, Abdusalam Abdukerim, Zihao Bo, Wei Chen, Xun Chen, Chen Cheng, Zhaokan Cheng, Xiangyi Cui, Yingjie Fan, Deqing Fang, Lisheng Geng, Karl Giboni, Linhui Gu, Xunan Guo, Xuyuan Guo, Zhichao Guo, Chencheng Han, Ke Han, Changda He, **rong He, Di Huang, Junting Huang, Zhou Huang, Ruquan Hou, Yu Hou , et al. (68 additional authors not shown)

Abstract: We propose a major upgrade to the existing PandaX-4T experiment in the China **** Underground Laboratory. The new experiment, PandaX-xT, will be a multi-ten-tonne liquid xenon, ultra-low background, and general-purpose observatory. The full-scaled PandaX-xT contains a 43-tonne liquid xenon active target. Such an experiment will significantly advance our fundamental understanding of particle phy… ▽ More We propose a major upgrade to the existing PandaX-4T experiment in the China **** Underground Laboratory. The new experiment, PandaX-xT, will be a multi-ten-tonne liquid xenon, ultra-low background, and general-purpose observatory. The full-scaled PandaX-xT contains a 43-tonne liquid xenon active target. Such an experiment will significantly advance our fundamental understanding of particle physics and astrophysics. The sensitivity of dark matter direct detection will be improved by nearly two orders of magnitude compared to the current best limits, approaching the so-called "neutrino floor" for a dark matter mass above 10 GeV/$c^2$, providing a decisive test to the Weakly Interacting Massive Particle paradigm. By searching for the neutrinoless double beta decay of $^{136}$Xe isotope in the detector, the effective Majorana neutrino mass can be measured to a [10 -- 41] meV/$c^2$ sensitivity, providing a key test to the Dirac/Majorana nature of neutrino s. Astrophysical neutrinos and other ultra-rare interactions can also be measured and searched for with an unprecedented background level, opening up new windows of discovery. Depending on the findings, PandaX-xT will seek the next stage upgrade utilizing isotopic separation on natural xenon. △ Less

Submitted 5 February, 2024; originally announced February 2024.

arXiv:2402.00585 [pdf, other]

SATac: A Thermoluminescence Enabled Tactile Sensor for Concurrent Perception of Temperature, Pressure, and Shear

Authors: Ziwu Song, Ran Yu, Xuan Zhang, Kit Wa Sou, Shilong Mu, Dengfeng Peng, Xiao-** Zhang, Wenbo Ding

Abstract: Most vision-based tactile sensors use elastomer deformation to infer tactile information, which can not sense some modalities, like temperature. As an important part of human tactile perception, temperature sensing can help robots better interact with the environment. In this work, we propose a novel multimodal vision-based tactile sensor, SATac, which can simultaneously perceive information of te… ▽ More Most vision-based tactile sensors use elastomer deformation to infer tactile information, which can not sense some modalities, like temperature. As an important part of human tactile perception, temperature sensing can help robots better interact with the environment. In this work, we propose a novel multimodal vision-based tactile sensor, SATac, which can simultaneously perceive information of temperature, pressure, and shear. SATac utilizes thermoluminescence of strontium aluminate (SA) to sense a wide range of temperatures with exceptional resolution. Additionally, the pressure and shear can also be perceived by analyzing Voronoi diagram. A series of experiments are conducted to verify the performance of our proposed sensor. We also discuss the possible application scenarios and demonstrate how SATac could benefit robot perception capabilities. △ Less

Submitted 1 February, 2024; originally announced February 2024.

arXiv:2401.17618 [pdf, other]

Beyond Control: Exploring Novel File System Objects for Data-Only Attacks on Linux Systems

Authors: **meng Zhou, Jiayi Hu, Ziyue Pan, Jiaxun Zhu, Guoren Li, Wenbo Shen, Yulei Sui, Zhiyun Qian

Abstract: The widespread deployment of control-flow integrity has propelled non-control data attacks into the mainstream. In the domain of OS kernel exploits, by corrupting critical non-control data, local attackers can directly gain root access or privilege escalation without hijacking the control flow. As a result, OS kernels have been restricting the availability of such non-control data. This forces att… ▽ More The widespread deployment of control-flow integrity has propelled non-control data attacks into the mainstream. In the domain of OS kernel exploits, by corrupting critical non-control data, local attackers can directly gain root access or privilege escalation without hijacking the control flow. As a result, OS kernels have been restricting the availability of such non-control data. This forces attackers to continue to search for more exploitable non-control data in OS kernels. However, discovering unknown non-control data can be daunting because they are often tied heavily to semantics and lack universal patterns. We make two contributions in this paper: (1) discover critical non-control objects in the file subsystem and (2) analyze their exploitability. This work represents the first study, with minimal domain knowledge, to semi-automatically discover and evaluate exploitable non-control data within the file subsystem of the Linux kernel. Our solution utilizes a custom analysis and testing framework that statically and dynamically identifies promising candidate objects. Furthermore, we categorize these discovered objects into types that are suitable for various exploit strategies, including a novel strategy necessary to overcome the defense that isolates many of these objects. These objects have the advantage of being exploitable without requiring KASLR, thus making the exploits simpler and more reliable. We use 18 real-world CVEs to evaluate the exploitability of the file system objects using various exploit strategies. We develop 10 end-to-end exploits using a subset of CVEs against the kernel with all state-of-the-art mitigations enabled. △ Less

Submitted 31 January, 2024; originally announced January 2024.

Comments: 14 pages, in submission of the 31th ACM Conference on Computer and Communications Security (CCS), 2024

arXiv:2401.17606 [pdf, other]

doi 10.1109/TDSC.2023.3253572

Ambush from All Sides: Understanding Security Threats in Open-Source Software CI/CD Pipelines

Authors: Ziyue Pan, Wenbo Shen, Xingkai Wang, Yutian Yang, Rui Chang, Yao Liu, Chengwei Liu, Yang Liu, Kui Ren

Abstract: The continuous integration and continuous deployment (CI/CD) pipelines are widely adopted on Internet hosting platforms, such as GitHub. With the popularity, the CI/CD pipeline faces various security threats. However, current CI/CD pipelines suffer from malicious code and severe vulnerabilities. Even worse, people have not been fully aware of its attack surfaces and the corresponding impacts. Th… ▽ More The continuous integration and continuous deployment (CI/CD) pipelines are widely adopted on Internet hosting platforms, such as GitHub. With the popularity, the CI/CD pipeline faces various security threats. However, current CI/CD pipelines suffer from malicious code and severe vulnerabilities. Even worse, people have not been fully aware of its attack surfaces and the corresponding impacts. Therefore, in this paper, we conduct a large-scale measurement and a systematic analysis to reveal the attack surfaces of the CI/CD pipeline and quantify their security impacts. Specifically, for the measurement, we collect a data set of 320,000+ CI/CD pipeline-configured GitHub repositories and build an analysis tool to parse the CI/CD pipelines and extract security-critical usages. Besides, current CI/CD ecosystem heavily relies on several core scripts, which may lead to a single point of failure. While the CI/CD pipelines contain sensitive information/operations, making them the attacker's favorite targets. Inspired by the measurement findings, we abstract the threat model and the attack approach toward CI/CD pipelines, followed by a systematic analysis of attack surfaces, attack strategies, and the corresponding impacts. We further launch case studies on five attacks in real-world CI/CD environments to validate the revealed attack surfaces. Finally, we give suggestions on mitigating attacks on CI/CD scripts, including securing CI/CD configurations, securing CI/CD scripts, and improving CI/CD infrastructure. △ Less

Submitted 31 January, 2024; originally announced January 2024.

Journal ref: IEEE Transactions on Dependable and Secure Computing (Volume: 21, Issue: 1, Jan.-Feb. 2024)

arXiv:2401.15576 [pdf, other]

Global polarization and spin alignment in heavy-ion collisions: past, present and future

Authors: Wen-Bo Dong, Xin-Li Sheng, Yi-Liang Yin, Qun Wang

Abstract: We give a brief overview on global polarization and spin alignment in heavy ion collisions. The current theoretical understandings on global polarization of hyperons and the global spin alignment of vector mesons are summarized. We give a brief overview on global polarization and spin alignment in heavy ion collisions. The current theoretical understandings on global polarization of hyperons and the global spin alignment of vector mesons are summarized. △ Less

Submitted 28 January, 2024; originally announced January 2024.

Comments: 10 pages, 2 figures, Proceeding for SPIN 2023

arXiv:2401.13670 [pdf]

"The Roller Conduction Effect" from the A-share Data Evidence

Authors: Wenbo Lyu

Abstract: In the post-epidemic era, consumption recovery has obvious time and space transmission laws, and there are different valuation criteria for consumption segments. Using the A-share data of the consumption recovery stage from January to April 2022, this paper quantitatively compares the rotation effect between different consumption sectors when the valuation returns to the reasonable range. Accordin… ▽ More In the post-epidemic era, consumption recovery has obvious time and space transmission laws, and there are different valuation criteria for consumption segments. Using the A-share data of the consumption recovery stage from January to April 2022, this paper quantitatively compares the rotation effect between different consumption sectors when the valuation returns to the reasonable range. According to the new classification of "sensory-based consumption", it interprets the internal logic of digital consumption as A consumption upgrade tool and a higher valuation target, and expounds the "the roller conduction effect". The law of consumption recovery and valuation return period is explained from the perspective of time and space conduction. The study found that in the early stage of consumption recovery, the recovery of consumer confidence was slow. In this period, A-shares were mainly dominated by the stock capital game, and there was an obvious plate rotation law in the game. Being familiar with this law has strong significance, which not only helps policy makers to adjust the direction of policy guidance, but also helps financial investors to make better investment strategies. The disadvantage of this paper is that it has not yet studied the roller conduction effect of the global financial market, and more rigorous mathematical models are still needed to support the definition of stock funds, which is also the main direction of the author's future research. △ Less

Submitted 15 October, 2023; originally announced January 2024.

Comments: 11 pages

arXiv:2401.13462 [pdf, other]

Growing from Exploration: A self-exploring framework for robots based on foundation models

Authors: Shoujie Li, Ran Yu, Tong Wu, JunWen Zhong, Xiao-** Zhang, Wenbo Ding

Abstract: Intelligent robot is the ultimate goal in the robotics field. Existing works leverage learning-based or optimization-based methods to accomplish human-defined tasks. However, the challenge of enabling robots to explore various environments autonomously remains unresolved. In this work, we propose a framework named GExp, which enables robots to explore and learn autonomously without human intervent… ▽ More Intelligent robot is the ultimate goal in the robotics field. Existing works leverage learning-based or optimization-based methods to accomplish human-defined tasks. However, the challenge of enabling robots to explore various environments autonomously remains unresolved. In this work, we propose a framework named GExp, which enables robots to explore and learn autonomously without human intervention. To achieve this goal, we devise modules including self-exploration, knowledge-base-building, and close-loop feedback based on foundation models. Inspired by the way that infants interact with the world, GExp encourages robots to understand and explore the environment with a series of self-generated tasks. During the process of exploration, the robot will acquire skills from beneficial experiences that are useful in the future. GExp provides robots with the ability to solve complex tasks through self-exploration. GExp work is independent of prior interactive knowledge and human intervention, allowing it to adapt directly to different scenarios, unlike previous studies that provided in-context examples as few-shot learning. In addition, we propose a workflow of deploying the real-world robot system with self-learned skills as an embodied assistant. △ Less

Submitted 24 January, 2024; originally announced January 2024.

Comments: 19 pages

arXiv:2401.09563 [pdf, other]

doi 10.1088/1367-2630/ad3fe1

Giant Enhancement of Vacuum Friction in Spinning YIG Nanospheres

Authors: Farhad Khosravi, Wenbo Sun, Chinmay Khandekar, Tongcang Li, Zubin Jacob

Abstract: Experimental observations of vacuum radiation and vacuum frictional torque are challenging due to their vanishingly small effects in practical systems. For example, a rotating nanosphere in free space slows down due to friction from vacuum fluctuations with a stop** time around the age of the universe. Here, we show that a spinning yttrium iron garnet (YIG) nanosphere near aluminum or YIG slabs… ▽ More Experimental observations of vacuum radiation and vacuum frictional torque are challenging due to their vanishingly small effects in practical systems. For example, a rotating nanosphere in free space slows down due to friction from vacuum fluctuations with a stop** time around the age of the universe. Here, we show that a spinning yttrium iron garnet (YIG) nanosphere near aluminum or YIG slabs exhibits vacuum radiation eight orders of magnitude larger than other metallic or dielectric spinning nanospheres. We achieve this giant enhancement by exploiting the large near-field magnetic local density of states in YIG systems, which occurs in the low-frequency GHz regime comparable to the rotation frequency. Furthermore, we propose a realistic experimental setup for observing the effects of this large vacuum radiation and frictional torque under experimentally accessible conditions. △ Less

Submitted 17 January, 2024; originally announced January 2024.

Comments: main text (5 pages, 3 figures) + appendices (11 pages, 1 figure) + supplementary material (11 pages, 3 figures)

Journal ref: New J. Phys. 26 053006 (2024)

arXiv:2401.08705 [pdf]

doi 10.1063/5.0188665

A solver for subsonic flow around airfoils based on physics-informed neural networks and mesh transformation

Authors: Wenbo Cao, Jiahao Song, Weiwei Zhang

Abstract: Physics-informed neural networks (PINNs) have recently become a new popular method for solving forward and inverse problems governed by partial differential equations (PDEs). However, in the flow around airfoils, the fluid is greatly accelerated near the leading edge, resulting in a local sharper transition, which is difficult to capture by PINNs. Therefore, PINNs are still rarely used to solve th… ▽ More Physics-informed neural networks (PINNs) have recently become a new popular method for solving forward and inverse problems governed by partial differential equations (PDEs). However, in the flow around airfoils, the fluid is greatly accelerated near the leading edge, resulting in a local sharper transition, which is difficult to capture by PINNs. Therefore, PINNs are still rarely used to solve the flow around airfoils. In this study, we combine physical-informed neural networks with mesh transformation, using neural network to learn the flow in the uniform computational space instead of physical space. Mesh transformation avoids the network from capturing the local sharper transition and learning flow with internal boundary (wall boundary). We successfully solve inviscid flow and provide an open-source subsonic flow solver for arbitrary airfoils. Our results show that the solver exhibits higher-order attributes, achieving nearly an order of magnitude error reduction over second-order finite volume methods (FVM) on very sparse meshes. Limited by the learning ability and optimization difficulties of neural network, the accuracy of this solver will not improve significantly with mesh refinement. Nevertheless, it achieves comparable accuracy and efficiency to second-order FVM on fine meshes. Finally, we highlight the significant advantage of the solver in solving parametric problems, as it can efficiently obtain solutions in the continuous parameter space about the angle of attack. △ Less

Submitted 15 January, 2024; originally announced January 2024.

Comments: arXiv admin note: text overlap with arXiv:2401.07203

Journal ref: Physics of Fluids 36, 027134 (2024)

arXiv:2401.07203 [pdf]

A complete state-space solution model for inviscid flow around airfoils based on physics-informed neural networks

Authors: Wenbo Cao, Jiahao Song, Weiwei Zhang

Abstract: Engineering problems often involve solving partial differential equations (PDEs) over a range of similar problem setups with various state parameters. In traditional numerical methods, each problem is solved independently, resulting in many repetitive tasks and expensive computational costs. Data-driven modeling has alleviated these issues, enabling fast solution prediction. Nevertheless, it still… ▽ More Engineering problems often involve solving partial differential equations (PDEs) over a range of similar problem setups with various state parameters. In traditional numerical methods, each problem is solved independently, resulting in many repetitive tasks and expensive computational costs. Data-driven modeling has alleviated these issues, enabling fast solution prediction. Nevertheless, it still requires expensive labeled data and faces limitations in modeling accuracy, generalization, and uncertainty. The recently developed methods for solving PDEs through neural network optimization, such as physics-informed neural networks (PINN), enable the simultaneous solution of a series of similar problems. However, these methods still face challenges in achieving stable training and obtaining correct results in many engineering problems. In prior research, we combined PINN with mesh transformation, using neural network to learn the solution of PDEs in the computational space instead of physical space. This approach proved successful in solving inviscid flow around airfoils. In this study, we expand the input dimensions of the model to include shape parameters and flow conditions, forming an input encompassing the complete state-space (i.e., all parameters determining the solution are included in the input). Our results show that the model has significant advantages in solving high-dimensional parametric problems, achieving continuous solutions in a broad state-space in only about 18.8 hours. This is a task that traditional numerical methods struggle to accomplish. Once established, the model can efficiently complete airfoil flow simulation and shape inverse design tasks in approximately 1 second. Furthermore, we introduce a pretraining-finetuning method, enabling the fine-tuning of the model for the task of interest and quickly achieving accuracy comparable to the finite volume method. △ Less

Submitted 13 January, 2024; originally announced January 2024.

arXiv:2401.07045 [pdf, other]

Measurement of Solar $pp$ Neutrino Flux using Electron Recoil Data from PandaX-4T Commissioning Run

Authors: PandaX Collaboration, Xiaoying Lu, Abdusalam Abdukerim, Zihao Bo, Wei Chen, Xun Chen, Yunhua Chen, Chen Cheng, Zhaokan Cheng, Xiangyi Cui, Yingjie Fan, Deqing Fang, Lisheng Geng, Karl Giboni, Xuyuan Guo, Chencheng Han, Ke Han, Changda He, **rong He, Di Huang, Junting Huang, Zhou Huang, Ruquan Hou, Yu Hou, Xiangdong Ji , et al. (67 additional authors not shown)

Abstract: The proton-proton ($pp$) fusion chain dominates the neutrino production from the Sun. The uncertainty of the predicted $pp$ neutrino flux is at the sub-percent level, whereas that of the best measurement is $\mathcal{O}(10\%)$. In this paper, we present the first result to measure the solar $pp$ neutrinos in the electron recoil energy range from 24 to 144 keV, using the PandaX-4T commissioning dat… ▽ More The proton-proton ($pp$) fusion chain dominates the neutrino production from the Sun. The uncertainty of the predicted $pp$ neutrino flux is at the sub-percent level, whereas that of the best measurement is $\mathcal{O}(10\%)$. In this paper, we present the first result to measure the solar $pp$ neutrinos in the electron recoil energy range from 24 to 144 keV, using the PandaX-4T commissioning data with 0.63 tonne$\times$year exposure. The $pp$ neutrino flux is determined to be $(8.0 \pm 3.9 \,{\rm{(stat)}} \pm 10.0 \,{\rm{(syst)}} )\times 10^{10}\, $$\rm{s}^{-1} \rm{cm}^{-2}$, consistent with Standard Solar Model and existing measurements, corresponding to a flux upper limit of $23.3\times 10^{10}\, $$\rm{s}^{-1} \rm{cm}^{-2}$ at 90\% C.L.. △ Less

Submitted 2 July, 2024; v1 submitted 13 January, 2024; originally announced January 2024.

Comments: 6 pages, 5 figures

arXiv:2401.06951 [pdf, other]

E^2-LLM: Efficient and Extreme Length Extension of Large Language Models

Authors: Jiaheng Liu, Zhiqi Bai, Yuanxing Zhang, Chenchen Zhang, Yu Zhang, Ge Zhang, Jiakai Wang, Haoran Que, Yukang Chen, Wenbo Su, Tiezheng Ge, Jie Fu, Wenhu Chen, Bo Zheng

Abstract: Typically, training LLMs with long context sizes is computationally expensive, requiring extensive training hours and GPU resources. Existing long-context extension methods usually need additional training procedures to support corresponding long-context windows, where the long-context training data (e.g., 32k) is needed, and high GPU training costs are assumed. To address the aforementioned issue… ▽ More Typically, training LLMs with long context sizes is computationally expensive, requiring extensive training hours and GPU resources. Existing long-context extension methods usually need additional training procedures to support corresponding long-context windows, where the long-context training data (e.g., 32k) is needed, and high GPU training costs are assumed. To address the aforementioned issues, we propose an Efficient and Extreme length extension method for Large Language Models, called E 2 -LLM, with only one training procedure and dramatically reduced computation cost, which also removes the need to collect long-context data. Concretely, first, the training data of our E 2 -LLM only requires a short length (e.g., 4k), which reduces the tuning cost greatly. Second, the training procedure on the short training context window is performed only once time, and we can support different evaluation context windows at inference. Third, in E 2 - LLM, based on RoPE position embeddings, we introduce two different augmentation methods on the scale and position index parameters for different samples in training. It aims to make the model more robust to the different relative differences when directly interpolating the arbitrary context length at inference. Comprehensive experimental results on multiple benchmark datasets demonstrate the effectiveness of our E 2 -LLM on challenging long-context tasks. △ Less

Submitted 22 February, 2024; v1 submitted 12 January, 2024; originally announced January 2024.

arXiv:2401.06196 [pdf]

VW-PINNs: A volume weighting method for PDE residuals in physics-informed neural networks

Authors: Jiahao Song, Wenbo Cao, Fei Liao, Weiwei Zhang

Abstract: Physics-informed neural networks (PINNs) have shown remarkable prospects in the solving the forward and inverse problems involving partial differential equations (PDEs). The method embeds PDEs into the neural network by calculating PDE loss at a series of collocation points, providing advantages such as meshfree and more convenient adaptive sampling. However, when solving PDEs using nonuniform col… ▽ More Physics-informed neural networks (PINNs) have shown remarkable prospects in the solving the forward and inverse problems involving partial differential equations (PDEs). The method embeds PDEs into the neural network by calculating PDE loss at a series of collocation points, providing advantages such as meshfree and more convenient adaptive sampling. However, when solving PDEs using nonuniform collocation points, PINNs still face challenge regarding inefficient convergence of PDE residuals or even failure. In this work, we first analyze the ill-conditioning of the PDE loss in PINNs under nonuniform collocation points. To address the issue, we define volume-weighted residual and propose volume-weighted physics-informed neural networks (VW-PINNs). Through weighting the PDE residuals by the volume that the collocation points occupy within the computational domain, we embed explicitly the spatial distribution characteristics of collocation points in the residual evaluation. The fast and sufficient convergence of the PDE residuals for the problems involving nonuniform collocation points is guaranteed. Considering the meshfree characteristics of VW-PINNs, we also develop a volume approximation algorithm based on kernel density estimation to calculate the volume of the collocation points. We verify the universality of VW-PINNs by solving the forward problems involving flow over a circular cylinder and flow over the NACA0012 airfoil under different inflow conditions, where conventional PINNs fail; By solving the Burgers' equation, we verify that VW-PINNs can enhance the efficiency of existing the adaptive sampling method in solving the forward problem by 3 times, and can reduce the relative error of conventional PINNs in solving the inverse problem by more than one order of magnitude. △ Less

Submitted 11 January, 2024; originally announced January 2024.

arXiv:2401.05362 [pdf, other]

DualTeacher: Bridging Coexistence of Unlabelled Classes for Semi-supervised Incremental Object Detection

Authors: Ziqi Yuan, Liyuan Wang, Wenbo Ding, Xingxing Zhang, Jiachen Zhong, Jianyong Ai, Jianmin Li, Jun Zhu

Abstract: In real-world applications, an object detector often encounters object instances from new classes and needs to accommodate them effectively. Previous work formulated this critical problem as incremental object detection (IOD), which assumes the object instances of new classes to be fully annotated in incremental data. However, as supervisory signals are usually rare and expensive, the supervised I… ▽ More In real-world applications, an object detector often encounters object instances from new classes and needs to accommodate them effectively. Previous work formulated this critical problem as incremental object detection (IOD), which assumes the object instances of new classes to be fully annotated in incremental data. However, as supervisory signals are usually rare and expensive, the supervised IOD may not be practical for implementation. In this work, we consider a more realistic setting named semi-supervised IOD (SSIOD), where the object detector needs to learn new classes incrementally from a few labelled data and massive unlabelled data without catastrophic forgetting of old classes. A commonly-used strategy for supervised IOD is to encourage the current model (as a student) to mimic the behavior of the old model (as a teacher), but it generally fails in SSIOD because a dominant number of object instances from old and new classes are coexisting and unlabelled, with the teacher only recognizing a fraction of them. Observing that learning only the classes of interest tends to preclude detection of other classes, we propose to bridge the coexistence of unlabelled classes by constructing two teacher models respectively for old and new classes, and using the concatenation of their predictions to instruct the student. This approach is referred to as DualTeacher, which can serve as a strong baseline for SSIOD with limited resource overhead and no extra hyperparameters. We build various benchmarks for SSIOD and perform extensive experiments to demonstrate the superiority of our approach (e.g., the performance lead is up to 18.28 AP on MS-COCO). Our code is available at \url{https://github.com/chuxiuhong/DualTeacher}. △ Less

Submitted 13 December, 2023; originally announced January 2024.

arXiv:2401.03856 [pdf, other]

doi 10.1109/ICASSP48485.2024.10446951

DJCM: A Deep Joint Cascade Model for Singing Voice Separation and Vocal Pitch Estimation

Authors: Haojie Wei, Xueke Cao, Wenbo Xu, Tangpeng Dan, Yueguo Chen

Abstract: Singing voice separation and vocal pitch estimation are pivotal tasks in music information retrieval. Existing methods for simultaneous extraction of clean vocals and vocal pitches can be classified into two categories: pipeline methods and naive joint learning methods. However, the efficacy of these methods is limited by the following problems: On the one hand, pipeline methods train models for e… ▽ More Singing voice separation and vocal pitch estimation are pivotal tasks in music information retrieval. Existing methods for simultaneous extraction of clean vocals and vocal pitches can be classified into two categories: pipeline methods and naive joint learning methods. However, the efficacy of these methods is limited by the following problems: On the one hand, pipeline methods train models for each task independently, resulting a mismatch between the data distributions at the training and testing time. On the other hand, naive joint learning methods simply add the losses of both tasks, possibly leading to a misalignment between the distinct objectives of each task. To solve these problems, we propose a Deep Joint Cascade Model (DJCM) for singing voice separation and vocal pitch estimation. DJCM employs a novel joint cascade model structure to concurrently train both tasks. Moreover, task-specific weights are used to align different objectives of both tasks. Experimental results show that DJCM achieves state-of-the-art performance on both tasks, with great improvements of 0.45 in terms of Signal-to-Distortion Ratio (SDR) for singing voice separation and 2.86% in terms of Overall Accuracy (OA) for vocal pitch estimation. Furthermore, extensive ablation studies validate the effectiveness of each design of our proposed model. The code of DJCM is available at https://github.com/Dream-High/DJCM . △ Less

Submitted 8 January, 2024; originally announced January 2024.

Comments: This paper has been accepted by ICASSP 2024

arXiv:2401.03692 [pdf, other]

Boosting Column Generation with Graph Neural Networks for Joint Rider Trip Planning and Crew Shift Scheduling

Authors: Jiawei Lu, Tinghan Ye, Wenbo Chen, Pascal Van Hentenryck

Abstract: Optimizing service schedules is pivotal to the reliable, efficient, and inclusive on-demand mobility. This pressing challenge is further exacerbated by the increasing needs of an aging population, the over-subscription of existing services, and the lack of effective solution methods. This study addresses the intricacies of service scheduling, by jointly optimizing rider trip planning and crew sche… ▽ More Optimizing service schedules is pivotal to the reliable, efficient, and inclusive on-demand mobility. This pressing challenge is further exacerbated by the increasing needs of an aging population, the over-subscription of existing services, and the lack of effective solution methods. This study addresses the intricacies of service scheduling, by jointly optimizing rider trip planning and crew scheduling for a complex dynamic mobility service. The resulting optimization problems are extremely challenging computationally for state-of-the-art methods. To address this fundamental gap, this paper introduces the Joint Rider Trip Planning and Crew Shift Scheduling Problem (JRTPCSSP) and a novel solution method, called AGGNNI-CG (Attention and Gated GNN- Informed Column Generation), that hybridizes column generation and machine learning to obtain near-optimal solutions to the JRTPCSSP with the real-time constraints of the application. The key idea of the machine-learning component is to dramatically reduce the number of paths to explore in the pricing component, accelerating the most time-consuming component of the column generation. The machine learning component is a graph neural network with an attention mechanism and a gated architecture, that is particularly suited to cater for the different input sizes coming from daily operations. AGGNNI-CG has been applied to a challenging, real-world dataset from the Paratransit system of Chatham County in Georgia. It produces dramatic improvements compared to the baseline column generation approach, which typically cannot produce feasible solutions in reasonable time on both medium-sized and large-scale complex instances. AGGNNI-CG also produces significant improvements in service compared to the existing system. △ Less

Submitted 8 January, 2024; originally announced January 2024.

arXiv:2401.02090 [pdf, other]

doi 10.1145/3597503.3639221

ModuleGuard:Understanding and Detecting Module Conflicts in Python Ecosystem

Authors: Ruofan Zhu, Xingyu Wang, Chengwei Liu, Zhengzi Xu, Wenbo Shen, Rui Chang, Yang Liu

Abstract: Python has become one of the most popular programming languages for software development due to its simplicity, readability, and versatility. As the Python ecosystem grows, developers face increasing challenges in avoiding module conflicts, which occur when different packages have the same namespace modules. Unfortunately, existing work has neither investigated the module conflict comprehensively… ▽ More Python has become one of the most popular programming languages for software development due to its simplicity, readability, and versatility. As the Python ecosystem grows, developers face increasing challenges in avoiding module conflicts, which occur when different packages have the same namespace modules. Unfortunately, existing work has neither investigated the module conflict comprehensively nor provided tools to detect the conflict. Therefore, this paper systematically investigates the module conflict problem and its impact on the Python ecosystem. We propose a novel technique called InstSimulator, which leverages semantics and installation simulation to achieve accurate and efficient module extraction. Based on this, we implement a tool called ModuleGuard to detect module conflicts for the Python ecosystem. For the study, we first collect 97 MC issues, classify the characteristics and causes of these MC issues, summarize three different conflict patterns, and analyze their potential threats. Then, we conducted a large-scale analysis of the whole PyPI ecosystem (4.2 million packages) and GitHub popular projects (3,711 projects) to detect each MC pattern and analyze their potential impact. We discovered that module conflicts still impact numerous TPLs and GitHub projects. This is primarily due to developers' lack of understanding of the modules within their direct dependencies, not to mention the modules of the transitive dependencies. Our work reveals Python's shortcomings in handling naming conflicts and provides a tool and guidelines for developers to detect conflicts. △ Less

Submitted 4 January, 2024; originally announced January 2024.

Comments: The paper was accepted by ICSE24

MSC Class: 65-04 ACM Class: D.2; K.6.3

arXiv:2401.01638 [pdf, other]

Radon Removal Commissioning of the PandaX-4T Cryogenic Distillation System

Authors: Xiangyi Cui, Zhou Wang, Jiafu Li, Shuaijie Li, Lin Si, Yonglin Ju, Wenbo Ma, Jianglai Liu, Li Zhao, Xiangdong Ji, Rui Yan, Haidong Sha, Peiyao Huang, Xiuli Wang, Huaxuan Liu

Abstract: The PandaX-4T distillation system, designed for the removal of krypton and radon from xenon, is evaluated for its radon removal efficiency using a $^{222}$Rn source during the online distillation process. The PandaX-4T dark matter detector is employed to monitor the temporal evolution of radon activity. To determine the radon reduction factor, the experimental data of radon atoms introduced into a… ▽ More The PandaX-4T distillation system, designed for the removal of krypton and radon from xenon, is evaluated for its radon removal efficiency using a $^{222}$Rn source during the online distillation process. The PandaX-4T dark matter detector is employed to monitor the temporal evolution of radon activity. To determine the radon reduction factor, the experimental data of radon atoms introduced into and bypassed the distillation system is compared. The results indicate that the PandaX-4T distillation system achieves a radon reduction factor exceeding 190 at the flow rate of 10 slpm and the reflux ratio of 1.44. Gas-only online distillation process of a flow rate of 20 slpm is also conducted without observing significant reduction of radon levels in the detector. This observation suggests that the migration flow of radon atoms from the liquid phase to the gas phase is limited, and the flow rate of gas circulation and duration of the process are insignificant compared to the total xenon mass of 5.6 tons in the detector. This study provides the experimental data to support the efficient removal of radon at $\sim$Bq level using the PandaX-4T distillation system, which is the prerequisite of the radon background control in the detector. The further operation with higher flow rate will be applied for the upcoming science run in PandaX-4T. △ Less

Submitted 19 April, 2024; v1 submitted 3 January, 2024; originally announced January 2024.

Comments: 14 pages, 9 figures

arXiv:2401.01503 [pdf, other]

Specific Emitter Identification Based on Joint Variational Mode Decomposition

Authors: Xiaofang Chen, Wenbo Xu, Yue Wang, Yan Huang

Abstract: Specific emitter identification (SEI) technology is significant in device administration scenarios, such as self-organized networking and spectrum management, owing to its high security. For nonlinear and non-stationary electromagnetic signals, SEI often employs variational modal decomposition (VMD) to decompose the signal in order to effectively characterize the distinct device fingerprint. Howev… ▽ More Specific emitter identification (SEI) technology is significant in device administration scenarios, such as self-organized networking and spectrum management, owing to its high security. For nonlinear and non-stationary electromagnetic signals, SEI often employs variational modal decomposition (VMD) to decompose the signal in order to effectively characterize the distinct device fingerprint. However, the trade-off of VMD between the robustness to noise and the ability to preserve signal information has not been investigated in the current literature. Moreover, the existing VMD algorithm does not utilize the stability of the intrinsic distortion of emitters within a certain temporal span, consequently constraining its practical applicability in SEI. In this paper, we propose a joint variational modal decomposition (JVMD) algorithm, which is an improved version of VMD by simultaneously implementing modal decomposition on multi-frame signals. The consistency of multi-frame signals in terms of the central frequencies and the inherent modal functions (IMFs) is exploited, which effectively highlights the distinctive characteristics among emitters and reduces noise. Additionally, the complexity of JVMD is analyzed, which is proven to be more computational-friendly than VMD. Simulations of both modal decomposition and SEI that involve real-world datasets are presented to illustrate that when compared with VMD, the JVMD algorithm improves the accuracy of device classification and the robustness towards noise. △ Less

Submitted 2 January, 2024; originally announced January 2024.

arXiv:2312.15632 [pdf, other]

doi 10.1103/PhysRevLett.132.152502

Searching for Two-Neutrino and Neutrinoless Double Beta Decay of $^{134}$Xe with the PandaX-4T Experiment

Authors: PandaX Collaboration, Xiyu Yan, Zhaokan Cheng, Abdusalam Abdukerim, Zihao Bo, Wei Chen, Xun Chen, Chen Cheng, Xiangyi Cui, Yingjie Fan, Deqing Fang, Changbo Fu, Mengting Fu, Lisheng Geng, Karl Giboni, Linhui Gu, Xuyuan Guo, Chencheng Han, Ke Han, Changda He, **rong He, Di Huang, Yanlin Huang, Junting Huang, Zhou Huang , et al. (72 additional authors not shown)

Abstract: $^{134}$Xe is a candidate isotope for neutrinoless double beta decay~($0νββ$) search. In addition, the two-neutrino case ($2νββ$) allowed by the Standard Model of particle physics has not yet been observed. Utilizing the 10.4% of $^{134}$Xe in the natural xenon in the PandaX-4T detector and its first 94.9-day exposure, we have established the most stringent constraints on $2νββ$ and $0νββ$ of $^{1… ▽ More $^{134}$Xe is a candidate isotope for neutrinoless double beta decay~($0νββ$) search. In addition, the two-neutrino case ($2νββ$) allowed by the Standard Model of particle physics has not yet been observed. Utilizing the 10.4% of $^{134}$Xe in the natural xenon in the PandaX-4T detector and its first 94.9-day exposure, we have established the most stringent constraints on $2νββ$ and $0νββ$ of $^{134}$Xe half-lives, with limits of $2.8\times10^{22}$ yr and $3.0\times10^{23}$ yr at 90% confidence level, respectively. The $2νββ$ ($0νββ$) limit surpasses the previously reported best result by a factor of 32 (2.7), highlighting the potential of large monolithic natural xenon detectors. △ Less

Submitted 28 April, 2024; v1 submitted 25 December, 2023; originally announced December 2023.

Journal ref: Phys.Rev.Lett. 132 (2024) 15, 152502

arXiv:2312.15028 [pdf, other]

Enhanced Ferromagnetism in Monolayer Cr2Te3 via Topological Insulator Coupling

Authors: Yunbo Ou, Murod Mirzhalilov, Norbert M. Nemes, Jose L. Martinez, Mirko Rocci, Austin Akey, Wenbo Ge, Dhavala Suri, Yi** Wang, Haile Ambaye, Jong Keum, Mohit Randeria, Nandini Trivedi, Kenneth S. Burch, David C. Bell, Weida Wu, Don Heiman, Valeria Lauter, Jagadeesh S. Moodera, Hang Chi

Abstract: Exchange-coupled interfaces are pivotal in exploiting two-dimensional (2D) ferromagnetism. Due to the extraordinary correlations among charge, spin, orbital and lattice degrees of freedom, layered magnetic transition metal chalcogenides (TMCs) bode well for exotic topological phenomena. Here we report the realization of wafer-scale Cr2Te3 down to monolayer (ML) on insulating SrTiO3(111) substrates… ▽ More Exchange-coupled interfaces are pivotal in exploiting two-dimensional (2D) ferromagnetism. Due to the extraordinary correlations among charge, spin, orbital and lattice degrees of freedom, layered magnetic transition metal chalcogenides (TMCs) bode well for exotic topological phenomena. Here we report the realization of wafer-scale Cr2Te3 down to monolayer (ML) on insulating SrTiO3(111) substrates using molecular beam epitaxy. Robust ferromagnetism emerges in 2D Cr2Te3 ML with a Curie temperature TC = 17 K. Moreover, when Cr2Te3 is proximitized with topological insulator (TI) (Bi,Sb)2Te3, the magnetism becomes stronger -- for 1 ML, TC increases to 30 K, while for 2 ML it boosts from 65 K to 82 K. Our experiments and theory strongly indicate that the Bloembergen-Rowland interaction is likely a universal aspect of TC enhancement in TI-coupled magnetic heterostructures. The topological-surface-enhanced magnetism in 2D TMC enables further exchange coupling physics and quantum hybrid studies, including paving the way to realize interface-modulated topological electronics. △ Less

Submitted 22 December, 2023; originally announced December 2023.

Comments: Main: 9 pages, 5 figures; SI: 3 pages, 5 figures

arXiv:2312.11072 [pdf, other]

doi 10.1088/1674-1137/ad380f

Waveform Simulation in PandaX-4T

Authors: Jiafu Li, Abdusalam Abdukerim, Chen Cheng, Zihao Bo, Wei Chen, Xun Chen, Yunhua Chen, Zhaokan Cheng, Xiangyi Cui, Yingjie Fan, Deqing Fang, Changbo Fu, Mengting Fu, Lisheng Geng, Karl Giboni, Linhui Gu, Xuyuan Guo, Chencheng Han, Ke Han, Changda He, **rong He, Di Huang, Yanlin Huang, Zhou Huang, Ruquan Hou , et al. (66 additional authors not shown)

Abstract: Signal reconstruction through software processing is a crucial component of the background and signal models in the PandaX-4T experiment, which is a multi-tonne dark matter direct search experiment. The accuracy of signal reconstruction is influenced by various detector artifacts, including noise, dark count of photomultiplier, impurity photoionization in the detector, and other relevant considera… ▽ More Signal reconstruction through software processing is a crucial component of the background and signal models in the PandaX-4T experiment, which is a multi-tonne dark matter direct search experiment. The accuracy of signal reconstruction is influenced by various detector artifacts, including noise, dark count of photomultiplier, impurity photoionization in the detector, and other relevant considerations. In this study, we present a detailed description of a semi-data-driven approach designed to simulate the signal waveform. This work provides a reliable model for the efficiency and bias of the signal reconstruction in the data analysis of PandaX-4T. By comparing critical variables which relate to the temporal shape and hit pattern of the signals, we demonstrate a good agreement between the simulation and data. △ Less

Submitted 21 May, 2024; v1 submitted 18 December, 2023; originally announced December 2023.

Journal ref: Chin. Phys. C 48, no.7,073001 (2024)

arXiv:2312.07336 [pdf]

doi 10.1088/0256-307X/39/6/067101

Giant domain wall anomalous Hall effect in an antiferromagnet

Authors: Wei Xia, Bo Bai, Xuejiao Chen, Yichen Yang, Yang Zhang, Jian Yuan, Qiang Li, Kunya Yang, Xiangqi Liu, Yang Shi, Haiyang Ma, Huali Yang, Mingquan He, Lei Li, Chuanying Xi, Li Pi, Xiaodong Lv, Xia Wang, Xuerong Liu, Shiyan Li, Xiaodong Zhou, Jianpeng Liu, Yulin Chen, Jian Shen, Dawei Shen , et al. (3 additional authors not shown)

Abstract: The Hall effect plays a crucial role in establishment of band theory of solids and discovery of emergent new phases of interacting electrons such as the topological phases of matter. Generally, the dissipationless Hall effect requires time-reversal symmetry breaking (TRSB), where TRSB induced by external magnetic field results in ordinary Hall effect, while TRSB caused by spontaneous magnetization… ▽ More The Hall effect plays a crucial role in establishment of band theory of solids and discovery of emergent new phases of interacting electrons such as the topological phases of matter. Generally, the dissipationless Hall effect requires time-reversal symmetry breaking (TRSB), where TRSB induced by external magnetic field results in ordinary Hall effect, while TRSB caused by spontaneous magnetization gives rise to anomalous Hall effect (AHE) which scales with the net magnetization. The AHE is therefore not expected in antiferromagnets with vanishing small magnetization. However, large AHE was recently observed in certain antiferromagnets with noncolinear spin structure and nonvanishing Berry curvature, thus opening a new area for exploration of large AHE in antiferromagnets. Here, we report another origin of AHE in a layered antiferromagnet, namely the domain wall (DW) skew scattering with Weyl points near the Fermi level, in experiments for the first time. Interestingly, the DWs form a unique periodic stripe structure with controllable periodicity by external magnetic field, which decreases nearly monotonically from 975 nm at 0 T to 232 nm at 4 T. Electrons incident on DW with topological bound states experience strong asymmetric scattering, leading to giant extrinsic AHE, with the DW Hall conductivity (DWHC) at 2 K and 1.2 T even reaching a record value of about 1.51*104 S cm-1 among bulk systems, which is two orders of magnitude larger than the intrinsic anomalous Hall conductivity. The observation of giant DWHC and controllable stripe DW structure in an antiferromagnet not only sets a new paradigm for exploration of large extrinsic anomalous Hall effect, but also provides potential applications in spintronic devices. △ Less

Submitted 12 December, 2023; originally announced December 2023.

Comments: 19 pages Main Text, 5 main figures

Journal ref: Chinese Physics Letters 2022, 39: 067101

arXiv:2312.06651 [pdf, ps, other]

Spherical higher order Fourier analysis over finite fields I: equidistribution for nilsequences

Authors: Wenbo Sun

Abstract: This paper is the first part of the series "Spherical higher order Fourier analysis over finite fields", aiming to develop the higher order Fourier analysis method along spheres over finite fields, and to solve the geometric Ramsey conjecture in the finite field setting. In this paper, we prove a quantitative equidistribution theorem for polynomial sequences in a nilmanifold, where the average i… ▽ More This paper is the first part of the series "Spherical higher order Fourier analysis over finite fields", aiming to develop the higher order Fourier analysis method along spheres over finite fields, and to solve the geometric Ramsey conjecture in the finite field setting. In this paper, we prove a quantitative equidistribution theorem for polynomial sequences in a nilmanifold, where the average is taken along spheres instead of cubes. To be more precise, let $Ω\subseteq\mathbb{F}_{p}^{d}$ be a sphere. We showed that if a polynomial sequence $(g(n)Γ)_{n\inΩ}$ which is $p$-periodic along $Ω$ is not equidistributed on a nilmanifold $G/Γ$, then there exists a nontrivial horizontal character $η$ of $G/Γ$ such that $η\circ g \mod \mathbb{Z}$ vanishes on $Ω$. This result will serve as a fundamental tool in later parts of the series to proof the spherical Gowers inverse theorem and the geometric Ramsey conjecture. △ Less

Submitted 11 December, 2023; v1 submitted 11 December, 2023; originally announced December 2023.

Comments: 127 pages, comments are welcome

MSC Class: 11T99; 37A99

arXiv:2312.06650 [pdf, ps, other]

Spherical higher order Fourier analysis over finite fields II: additive combinatorics for shifted ideals

Authors: Wenbo Sun

Abstract: This paper is the second part of the series "Spherical higher order Fourier analysis over finite fields", aiming to develop the higher order Fourier analysis method along spheres over finite fields, and to solve the geometric Ramsey conjecture in the finite field setting. In this paper, we study additive combinatorial properties for shifted ideals, i.e. the structure of sets of the form… ▽ More This paper is the second part of the series "Spherical higher order Fourier analysis over finite fields", aiming to develop the higher order Fourier analysis method along spheres over finite fields, and to solve the geometric Ramsey conjecture in the finite field setting. In this paper, we study additive combinatorial properties for shifted ideals, i.e. the structure of sets of the form $E\pm E$, where $E$ is a collection of shifted ideals of the polynomial ring $\mathbb{F}_{p}[x_{1},\dots,x_{d}]$ and we identify two ideals if their difference contains the zero polynomial. We show that under appropriate definitions, the set $E\pm E$ enjoys properties similar to the conventional setting where $E$ is a subset of an abelian group. In particular, among other results, we prove the Balog-Gowers-Szemerédi theorem, the Rusza's quasi triangle inequality and a weak form of the Plünnecke-Rusza theorem in the setting of shifted ideals. We also show that for a special class of maps $ξ$ from $\mathbb{F}_{p}^{d}$ to the collection of all shifted ideals of $\mathbb{F}_{p}[x_{1},\dots,x_{d}]$, if the set $ξ(\mathbb{F}_{p}^{d})+ξ(\mathbb{F}_{p}^{d})$ has large additive energy, then $ξ$ is an almost linear Freiman homomorphism. This result is the crucial additive combinatorial input we need to prove the spherical Gowers inverse theorem in later parts of the series. △ Less

Submitted 11 December, 2023; v1 submitted 11 December, 2023; originally announced December 2023.

Comments: 80 pages, comments are welcome

MSC Class: 05C99; 05D99

arXiv:2312.06649 [pdf, ps, other]

Spherical higher order Fourier analysis over finite fields IV: an application to the Geometric Ramsey Conjecture

Authors: Wenbo Sun

Abstract: This paper is the fourth and the last part of the series "Spherical higher order Fourier analysis over finite fields", aiming to develop the higher order Fourier analysis method along spheres over finite fields, and to solve the Geometric Ramsey Conjecture in the finite field setting. In this paper, we proof a conjecture of Graham on the Remsey properties for spherical configurations in the fini… ▽ More This paper is the fourth and the last part of the series "Spherical higher order Fourier analysis over finite fields", aiming to develop the higher order Fourier analysis method along spheres over finite fields, and to solve the Geometric Ramsey Conjecture in the finite field setting. In this paper, we proof a conjecture of Graham on the Remsey properties for spherical configurations in the finite field setting. To be more precise, we show that for any spherical configuration $X$ of $\mathbb{F}_{p}^{d}$ of complexity at most $C$ with $d$ being sufficiently large with respect to $C$ and $\vert X\vert$, and for some prime $p$ being sufficiently large with respect to $C$, $\vert X\vert$ and $ε>0$, any set $E\subseteq \mathbb{F}_{p}^{d}$ with $\vert E\vert>εp^{d}$ contains at least $\gg_{C,ε,\vert X\vert}p^{(k+1)d-(k+1)k/2}$ congruent copies of $X$, where $k$ is the dimension of $\text{span}_{\mathbb{F}_{p}}(X-X)$. The novelty of our approach is that we avoid the use of harmonic analysis, and replace it by the theory of spherical higher order Fourier analysis developed in previous parts of the series. △ Less

Submitted 11 December, 2023; v1 submitted 11 December, 2023; originally announced December 2023.

Comments: 61 pages, comments are welcome. arXiv admin note: text overlap with arXiv:2312.06636

MSC Class: 05D10; 37A99

arXiv:2312.06636 [pdf, ps, other]

Spherical higher order Fourier analysis over finite fields III: a spherical Gowers inverse theorem

Authors: Wenbo Sun

Abstract: This paper is the third part of the series "Spherical higher order Fourier analysis over finite fields", aiming to develop the higher order Fourier analysis method along spheres over finite fields, and to solve the geometric Ramsey conjecture in the finite field setting. In this paper, we prove an inverse theorem over finite field for spherical Gowers norms, i.e. a local Gowers norm supported on… ▽ More This paper is the third part of the series "Spherical higher order Fourier analysis over finite fields", aiming to develop the higher order Fourier analysis method along spheres over finite fields, and to solve the geometric Ramsey conjecture in the finite field setting. In this paper, we prove an inverse theorem over finite field for spherical Gowers norms, i.e. a local Gowers norm supported on a sphere. We show that if the $(s+1)$-th spherical Gowers norm of a 1-bounded function $f\colon\mathbb{F}_{p}^{d}\to \mathbb{C}$ is at least $ε$ and if $d$ is sufficiently large depending only on $s$, then $f$ correlates on the sphere with a $p$-periodic $s$-step nilsequence, where the bounds for the complexity and correlation depend only on $d$ and $ε$. This result will be used in later parts of the series to prove the geometric Ramsey conjecture in the finite field setting. △ Less

Submitted 11 December, 2023; v1 submitted 11 December, 2023; originally announced December 2023.

Comments: 104 pages, comments are welcome

MSC Class: 11T99; 37A99

arXiv:2312.06632 [pdf, other]

Control Risk for Potential Misuse of Artificial Intelligence in Science

Authors: Jiyan He, Weitao Feng, Yaosen Min, **gwei Yi, Kunsheng Tang, Shuai Li, Jie Zhang, Kejiang Chen, Wenbo Zhou, Xing Xie, Weiming Zhang, Nenghai Yu, Shuxin Zheng

Abstract: The expanding application of Artificial Intelligence (AI) in scientific fields presents unprecedented opportunities for discovery and innovation. However, this growth is not without risks. AI models in science, if misused, can amplify risks like creation of harmful substances, or circumvention of established regulations. In this study, we aim to raise awareness of the dangers of AI misuse in scien… ▽ More The expanding application of Artificial Intelligence (AI) in scientific fields presents unprecedented opportunities for discovery and innovation. However, this growth is not without risks. AI models in science, if misused, can amplify risks like creation of harmful substances, or circumvention of established regulations. In this study, we aim to raise awareness of the dangers of AI misuse in science, and call for responsible AI development and use in this domain. We first itemize the risks posed by AI in scientific contexts, then demonstrate the risks by highlighting real-world examples of misuse in chemical science. These instances underscore the need for effective risk management strategies. In response, we propose a system called SciGuard to control misuse risks for AI models in science. We also propose a red-teaming benchmark SciMT-Safety to assess the safety of different systems. Our proposed SciGuard shows the least harmful impact in the assessment without compromising performance in benign tests. Finally, we highlight the need for a multidisciplinary and collaborative effort to ensure the safe and ethical use of AI models in science. We hope that our study can spark productive discussions on using AI ethically in science among researchers, practitioners, policymakers, and the public, to maximize benefits and minimize the risks of misuse. △ Less

Submitted 11 December, 2023; originally announced December 2023.

arXiv:2312.05557 [pdf, ps, other]

Long-Term Rate-Fairness-Aware Beamforming Based Massive MIMO Systems

Authors: W. Zhu, H. D. Tuan, E. Dutkiewicz, Y. Fang, H. V. Poor, L. Hanzo

Abstract: This is the first treatise on multi-user (MU) beamforming designed for achieving long-term rate-fairness in fulldimensional MU massive multi-input multi-output (m-MIMO) systems. Explicitly, based on the channel covariances, which can be assumed to be known beforehand, we address this problem by optimizing the following objective functions: the users' signal-toleakage-noise ratios (SLNRs) using SLN… ▽ More This is the first treatise on multi-user (MU) beamforming designed for achieving long-term rate-fairness in fulldimensional MU massive multi-input multi-output (m-MIMO) systems. Explicitly, based on the channel covariances, which can be assumed to be known beforehand, we address this problem by optimizing the following objective functions: the users' signal-toleakage-noise ratios (SLNRs) using SLNR max-min optimization, geometric mean of SLNRs (GM-SLNR) based optimization, and SLNR soft max-min optimization. We develop a convex-solver based algorithm, which invokes a convex subproblem of cubic time-complexity at each iteration for solving the SLNR maxmin problem. We then develop closed-form expression based algorithms of scalable complexity for the solution of the GMSLNR and of the SLNR soft max-min problem. The simulations provided confirm the users' improved-fairness ergodic rate distributions. △ Less

Submitted 9 December, 2023; originally announced December 2023.

arXiv:2312.01751 [pdf, other]

Joint Task Partitioning and Parallel Scheduling in Device-Assisted Mobile Edge Networks

Authors: Yang Li, Xinlei Ge, Bo Lei, Xing Zhang, Wenbo Wang

Abstract: With the development of the Internet of Things (IoT), certain IoT devices have the capability to not only accomplish their own tasks but also simultaneously assist other resource-constrained devices. Therefore, this paper considers a device-assisted mobile edge computing system that leverages auxiliary IoT devices to alleviate the computational burden on the edge computing server and enhance the o… ▽ More With the development of the Internet of Things (IoT), certain IoT devices have the capability to not only accomplish their own tasks but also simultaneously assist other resource-constrained devices. Therefore, this paper considers a device-assisted mobile edge computing system that leverages auxiliary IoT devices to alleviate the computational burden on the edge computing server and enhance the overall system performance. In this study, computationally intensive tasks are decomposed into multiple partitions, and each task partition can be processed in parallel on an IoT device or the edge server. The objective of this research is to develop an efficient online algorithm that addresses the joint optimization of task partitioning and parallel scheduling under time-varying system states, posing challenges to conventional numerical optimization methods. To address these challenges, a framework called online task partitioning action and parallel scheduling policy generation (OTPPS) is proposed, which is based on deep reinforcement learning (DRL). Specifically, the framework leverages a deep neural network (DNN) to learn the optimal partitioning action for each task by map** input states. Furthermore, it is demonstrated that the remaining parallel scheduling problem exhibits NP-hard complexity when considering a specific task partitioning action. To address this subproblem, a fair and delay-minimized task scheduling (FDMTS) algorithm is designed. Extensive evaluation results demonstrate that OTPPS achieves near-optimal average delay performance and consistently high fairness levels in various environmental states compared to other baseline schemes. △ Less

Submitted 4 December, 2023; originally announced December 2023.

Comments: Accepted to IEEE Internet of Things Journal

arXiv:2312.01294 [pdf, other]

Deep Ensembles Meets Quantile Regression: Uncertainty-aware Imputation for Time Series

Authors: Ying Liu, Peng Cui, Wenbo Hu, Richang Hong

Abstract: Multivariate time series are everywhere. Nevertheless, real-world time series data often exhibit numerous missing values, which is the time series imputation task. Although previous deep learning methods have been shown to be effective for time series imputation, they are shown to produce overconfident imputations, which might be a potentially overlooked threat to the reliability of the intelligen… ▽ More Multivariate time series are everywhere. Nevertheless, real-world time series data often exhibit numerous missing values, which is the time series imputation task. Although previous deep learning methods have been shown to be effective for time series imputation, they are shown to produce overconfident imputations, which might be a potentially overlooked threat to the reliability of the intelligence system. Score-based diffusion method(i.e., CSDI) is effective for the time series imputation task but computationally expensive due to the nature of the generative diffusion model framework. In this paper, we propose a non-generative time series imputation method that produces accurate imputations with inherent uncertainty and meanwhile is computationally efficient. Specifically, we incorporate deep ensembles into quantile regression with a shared model backbone and a series of quantile discrimination functions.This framework combines the merits of accurate uncertainty estimation of deep ensembles and quantile regression and above all, the shared model backbone tremendously reduces most of the computation overhead of the multiple ensembles. We examine the performance of the proposed method on two real-world datasets: air quality and health-care datasets and conduct extensive experiments to show that our method excels at making deterministic and probabilistic predictions. Compared with the score-based diffusion method: CSDI, we can obtain comparable forecasting results and is better when more data is missing. Furthermore, as a non-generative model compared with CSDI, the proposed method consumes a much smaller computation overhead, yielding much faster training speed and fewer model parameters. △ Less

Submitted 3 December, 2023; originally announced December 2023.

arXiv:2312.01292 [pdf, ps, other]

Joint Beam Scheduling and Power Optimization for Beam Hop** LEO Satellite Systems

Authors: Shuang Zheng, Xing Zhang, Peng Wang, Wenbo Wang

Abstract: Low earth orbit (LEO) satellite communications can provide ubiquitous and reliable services, making it an essential part of the Internet of Everything network. Beam hop** (BH) is an emerging technology for effectively addressing the issue of low resource utilization caused by the non-uniform spatio-temporal distribution of traffic demands. However, how to allocate multi-dimensional resources in… ▽ More Low earth orbit (LEO) satellite communications can provide ubiquitous and reliable services, making it an essential part of the Internet of Everything network. Beam hop** (BH) is an emerging technology for effectively addressing the issue of low resource utilization caused by the non-uniform spatio-temporal distribution of traffic demands. However, how to allocate multi-dimensional resources in a timely and efficient way for the highly dynamic LEO satellite systems remains a challenge. This paper proposes a joint beam scheduling and power optimization beam hop** (JBSPO-BH) algorithm considering the differences in the geographic distribution of sink nodes. The JBSPO-BH algorithm decouples the original problem into two sub-problems. The beam scheduling problem is modelled as a potential game, and the Nash equilibrium (NE) point is obtained as the beam scheduling strategy. Moreover, the penalty function interior point method is applied to optimize the power allocation. Simulation results show that the JBSPO-BH algorithm has low time complexity and fast convergence and achieves better performance both in throughput and fairness. Compared with greedy-based BH, greedy-based BH with the power optimization, round-robin BH, Max-SINR BH and satellite resource allocation algorithm, the throughput of the proposed algorithm is improved by 44.99%, 20.79%, 156.06%, 15.39% and 8.17%, respectively. △ Less

Submitted 3 December, 2023; originally announced December 2023.

arXiv:2311.18400 [pdf, other]

Linear response theory for spin alignment of vector mesons in thermal media

Authors: Wen-Bo Dong, Yi-Liang Yin, Xin-Li Sheng, Shi-Zheng Yang, Qun Wang

Abstract: We present a calculation of the spin alignment for unflavored vector mesons in thermalized quark-gluon plasma based on the Kubo formula in linear response theory. This is achieved by expanding the system to the first order of the coupling constant and the spatial gradient. The effect strongly relies on the vector meson's spectral functions which are determined by the interaction and medium propert… ▽ More We present a calculation of the spin alignment for unflavored vector mesons in thermalized quark-gluon plasma based on the Kubo formula in linear response theory. This is achieved by expanding the system to the first order of the coupling constant and the spatial gradient. The effect strongly relies on the vector meson's spectral functions which are determined by the interaction and medium properties. The spectral functions are calculated for the one-quark-loop self-energy with meson-quark interaction. The numerical results show that the correction to the spin alignment from the thermal shear tensor is of the order $10^{-4}\sim10^{-5}$ for the chosen values of quark-meson coupling constant, if the magnitude of thermal shear tensor is $10^{-2}$. △ Less

Submitted 5 March, 2024; v1 submitted 30 November, 2023; originally announced November 2023.

Comments: 22 pages, 5 figures

arXiv:2311.16628 [pdf, ps, other]

Symmetry-regularized neural ordinary differential equations

Authors: Wenbo Hao

Abstract: Neural Ordinary Differential Equations (Neural ODEs) is a class of deep neural network models that interpret the hidden state dynamics of neural networks as an ordinary differential equation, thereby capable of capturing system dynamics in a continuous time framework. In this work, I integrate symmetry regularization into Neural ODEs. In particular, I use continuous Lie symmetry of ODEs and PDEs a… ▽ More Neural Ordinary Differential Equations (Neural ODEs) is a class of deep neural network models that interpret the hidden state dynamics of neural networks as an ordinary differential equation, thereby capable of capturing system dynamics in a continuous time framework. In this work, I integrate symmetry regularization into Neural ODEs. In particular, I use continuous Lie symmetry of ODEs and PDEs associated with the model to derive conservation laws and add them to the loss function, making it physics-informed. This incorporation of inherent structural properties into the loss function could significantly improve robustness and stability of the model during training. To illustrate this method, I employ a toy model that utilizes a cosine rate of change in the hidden state, showcasing the process of identifying Lie symmetries, deriving conservation laws, and constructing a new loss function. △ Less

Submitted 28 November, 2023; originally announced November 2023.

arXiv:2311.16531 [pdf]

Channel Modeling for Terahertz Communications in Rain

Authors: Peian Li, Wenbo Liu, Jiacheng Liu, Da Li, Guohao Liu, Yuanshuai Lei, Jiabiao Zhao, Xiaopeng Wang, Houjun Sun, Jianjun Ma, John F. Federici

Abstract: Terahertz (THz) communication channels, integral to outdoor applications, are critically influenced by natural factors like rainfall. Our research focused on the nuanced effects of rain on these channels, employing an advanced rainfall emulation system. By analyzing key parameters such as rain rate, altitude based variations in rainfall, and diverse raindrop sizes, we identified the paramount sign… ▽ More Terahertz (THz) communication channels, integral to outdoor applications, are critically influenced by natural factors like rainfall. Our research focused on the nuanced effects of rain on these channels, employing an advanced rainfall emulation system. By analyzing key parameters such as rain rate, altitude based variations in rainfall, and diverse raindrop sizes, we identified the paramount significance of the number of raindrops in the THz channel, particularly in scenarios with constant rain rates but varying drop sizes. Central to our findings is a novel model grounded in Mie scattering theory, which adeptly incorporates the variability of raindrop size distributions at different altitudes. This model has displayed strong congruence with our experimental results. In essence, our study underscores the inadequacy of solely depending on a fixed ground-based rain rate and emphasizes the imperative of calibrating distribution metrics to cater to specific environmental and operational contexts. △ Less

Submitted 28 November, 2023; originally announced November 2023.

Comments: submitted to IEEE Transactions on Antennas and Propagation

Showing 101–150 of 757 results for author: Wenbo