Search | arXiv e-print repository

doi 10.1103/PhysRevA.107.033507

Nonreciprocal slow or fast light in anti-$\mathcal{PT}$-symmetric optomechanics

Authors: Meiyu Peng, Huilai Zhang, Qian Zhang, Tian-Xiang Lu, Imran M. Mirza, Hui **g

Abstract: Non-Hermitian systems with anti-parity-time ($\mathcal{APT}$) symmetry have revealed rich physics beyond conventional systems. Here, we study optomechanics in an $\mathcal{APT}$-symmetric spinning resonator and show that, by tuning the rotating speed to approach the exceptional point (EP) or the non-Hermitian spectral degeneracy, nonreciprocal light transmission with a high isolation ratio can be… ▽ More Non-Hermitian systems with anti-parity-time ($\mathcal{APT}$) symmetry have revealed rich physics beyond conventional systems. Here, we study optomechanics in an $\mathcal{APT}$-symmetric spinning resonator and show that, by tuning the rotating speed to approach the exceptional point (EP) or the non-Hermitian spectral degeneracy, nonreciprocal light transmission with a high isolation ratio can be realized. Accompanying this process, nonreciprocal group delay or advance is also identified in the vicinity of EP. Our work sheds new light on manipulating laser propagation with optomechanical EP devices and, in a broader view, can be extended to explore a wide range of $\mathcal{APT}$-symmetric effects, such as $\mathcal{APT}$-symmetric phonon lasers, $\mathcal{APT}$-symmetric topological effects, and $\mathcal{APT}$-symmetric force sensing or accelerator. △ Less

Submitted 27 February, 2023; originally announced February 2023.

Comments: 9 pages, 4 figures. It has been accepted for publication as a Regular Article in Physical Review A

arXiv:2302.09096 [pdf, ps, other]

Using the Sun and the Moon as Source masses and the Earth's Rotation as a Modulation to Search for Exotic Spin-Dependent Interactions at Astronomical Distances

Authors: L. Y. Wu, K. Y. Zhang, M. Peng, J. Gong, H. Yan

Abstract: Exotic spin-dependent interactions mediated by new light particles led to solutions to several important questions in modern physics. Such interactions involving a scalar coupling $g_S^N$ at one vertex and a pseudo-scalar coupling $g_P^n$ at the polarized neutron vertex can be induced by the exchange of spin-0 bosons, or a vector/axial-vector coupling $g_V^N$/$g_A^N$ at one vertex and an axial-vec… ▽ More Exotic spin-dependent interactions mediated by new light particles led to solutions to several important questions in modern physics. Such interactions involving a scalar coupling $g_S^N$ at one vertex and a pseudo-scalar coupling $g_P^n$ at the polarized neutron vertex can be induced by the exchange of spin-0 bosons, or a vector/axial-vector coupling $g_V^N$/$g_A^N$ at one vertex and an axial-vector coupling $g_A^n$ at the polarized neutron vertex can be induced by the exchange of spin-1 bosons. If such new interactions exist, the Sun and the Moon can induce sidereal variations of effective fields along the direction perpendicular to the Earth's rotation axis. We derived new experimental upper limits on such exotic spin-dependent interactions at astronomical interaction ranges by analyzing existing data from laboratory measurements on the Lorentz and CPT violation. We set the most stringent experimental limits on $g_S^Ng_P^n$ ranging from $\sim 2\times 10^{10}$m to $\sim 10^{14}$m. Previously, the best limit on $g_S^Ng_P^n$ at this range is from astrophysics. The result is the first time laboratory limits surpass the astrophysical ones on the scalar-pseudoscalar type interaction, to our best knowledge. We report new constraints on vector-axial-vector and axial-axial-vector type interaction at the range of astronomical scales. The new limits on vector-axial-vector are improved by as much as $\sim$12 orders of magnitude. We also apply the analysis to the Hari-Dass interactions and obtain corresponding new constraints on the interactions. We discuss the possibilities of using the beam method to further search the interaction involving other particles, such as electrons, muons, etc., based on the same idea. △ Less

Submitted 15 June, 2023; v1 submitted 15 February, 2023; originally announced February 2023.

arXiv:2302.06730 [pdf, other]

Multi-Carrier NOMA-Empowered Wireless Federated Learning with Optimal Power and Bandwidth Allocation

Authors: Weicai Li, Tiejun Lv, Yashuai Cao, Wei Ni, Mugen Peng

Abstract: Wireless federated learning (WFL) undergoes a communication bottleneck in uplink, limiting the number of users that can upload their local models in each global aggregation round. This paper presents a new multi-carrier non-orthogonal multiple-access (MC-NOMA)-empowered WFL system under an adaptive learning setting of Flexible Aggregation. Since a WFL round accommodates both local model training a… ▽ More Wireless federated learning (WFL) undergoes a communication bottleneck in uplink, limiting the number of users that can upload their local models in each global aggregation round. This paper presents a new multi-carrier non-orthogonal multiple-access (MC-NOMA)-empowered WFL system under an adaptive learning setting of Flexible Aggregation. Since a WFL round accommodates both local model training and uploading for each user, the use of Flexible Aggregation allows the users to train different numbers of iterations per round, adapting to their channel conditions and computing resources. The key idea is to use MC-NOMA to concurrently upload the local models of the users, thereby extending the local model training times of the users and increasing participating users. A new metric, namely, Weighted Global Proportion of Trained Mini-batches (WGPTM), is analytically established to measure the convergence of the new system. Another important aspect is that we maximize the WGPTM to harness the convergence of the new system by jointly optimizing the transmit powers and subchannel bandwidths. This nonconvex problem is converted equivalently to a tractable convex problem and solved efficiently using variable substitution and Cauchy's inequality. As corroborated experimentally using a convolutional neural network and an 18-layer residential network, the proposed MC-NOMA WFL can efficiently reduce communication delay, increase local model training times, and accelerate the convergence by over 40%, compared to its existing alternative. △ Less

Submitted 13 February, 2023; originally announced February 2023.

Comments: 33 pages, 16 figures

arXiv:2302.02136 [pdf, other]

Efficient End-to-End Video Question Answering with Pyramidal Multimodal Transformer

Authors: Min Peng, Chongyang Wang, Yu Shi, Xiang-Dong Zhou

Abstract: This paper presents a new method for end-to-end Video Question Answering (VideoQA), aside from the current popularity of using large-scale pre-training with huge feature extractors. We achieve this with a pyramidal multimodal transformer (PMT) model, which simply incorporates a learnable word embedding layer, a few convolutional and transformer layers. We use the anisotropic pyramid to fulfill vid… ▽ More This paper presents a new method for end-to-end Video Question Answering (VideoQA), aside from the current popularity of using large-scale pre-training with huge feature extractors. We achieve this with a pyramidal multimodal transformer (PMT) model, which simply incorporates a learnable word embedding layer, a few convolutional and transformer layers. We use the anisotropic pyramid to fulfill video-language interactions across different spatio-temporal scales. In addition to the canonical pyramid, which includes both bottom-up and top-down pathways with lateral connections, novel strategies are proposed to decompose the visual feature stream into spatial and temporal sub-streams at different scales and implement their interactions with the linguistic semantics while preserving the integrity of local and global semantics. We demonstrate better or on-par performances with high computational efficiency against state-of-the-art methods on five VideoQA benchmarks. Our ablation study shows the scalability of our model that achieves competitive results for text-to-video retrieval by leveraging feature extractors with reusable pre-trained weights, and also the effectiveness of the pyramid. △ Less

Submitted 5 March, 2023; v1 submitted 4 February, 2023; originally announced February 2023.

Comments: Accepted by AAAI 2023

arXiv:2301.11586 [pdf, other]

Khaos: The Impact of Inter-procedural Code Obfuscation on Binary Diffing Techniques

Authors: Peihua Zhang, Chenggang Wu, Mingfan Peng, Kai Zeng, Ding Yu, Yuanming Lai, Yan Kang, Wei Wang, Zhe Wang

Abstract: Software obfuscation techniques can prevent binary diffing techniques from locating vulnerable code by obfuscating the third-party code, to achieve the purpose of protecting embedded device software. With the rapid development of binary diffing techniques, they can achieve more and more accurate function matching and identification by extracting the features within the function. This makes existin… ▽ More Software obfuscation techniques can prevent binary diffing techniques from locating vulnerable code by obfuscating the third-party code, to achieve the purpose of protecting embedded device software. With the rapid development of binary diffing techniques, they can achieve more and more accurate function matching and identification by extracting the features within the function. This makes existing software obfuscation techniques, which mainly focus on the intra-procedural code obfuscation, no longer effective. In this paper, we propose a new inter-procedural code obfuscation mechanism Khaos, which moves the code across functions to obfuscate the function by using compilation optimizations. Two obfuscation primitives are proposed to separate and aggregate the function, which are called fission and fusion respectively. A prototype of Khaos is implemented based on the LLVM compiler and evaluated on a large number of real-world programs including SPEC CPU 2006 & 2017, CoreUtils, JavaScript engines, etc. Experimental results show that Khaos outperforms existing code obfuscations and can significantly reduce the accuracy rates of five state-of-the-art binary diffing techniques (less than 19%) with lower runtime overhead (less than 7%). △ Less

Submitted 27 January, 2023; originally announced January 2023.

arXiv:2301.10724 [pdf, other]

doi 10.1145/3580305.3599951

Select and Trade: Towards Unified Pair Trading with Hierarchical Reinforcement Learning

Authors: Weiguang Han, Boyi Zhang, Qianqian Xie, Min Peng, Yanzhao Lai, Jimin Huang

Abstract: Pair trading is one of the most effective statistical arbitrage strategies which seeks a neutral profit by hedging a pair of selected assets. Existing methods generally decompose the task into two separate steps: pair selection and trading. However, the decoupling of two closely related subtasks can block information propagation and lead to limited overall performance. For pair selection, ignoring… ▽ More Pair trading is one of the most effective statistical arbitrage strategies which seeks a neutral profit by hedging a pair of selected assets. Existing methods generally decompose the task into two separate steps: pair selection and trading. However, the decoupling of two closely related subtasks can block information propagation and lead to limited overall performance. For pair selection, ignoring the trading performance results in the wrong assets being selected with irrelevant price movements, while the agent trained for trading can overfit to the selected assets without any historical information of other assets. To address it, in this paper, we propose a paradigm for automatic pair trading as a unified task rather than a two-step pipeline. We design a hierarchical reinforcement learning framework to jointly learn and optimize two subtasks. A high-level policy would select two assets from all possible combinations and a low-level policy would then perform a series of trading actions. Experimental results on real-world stock data demonstrate the effectiveness of our method on pair trading compared with both existing pair selection and trading methods. △ Less

Submitted 5 February, 2023; v1 submitted 25 January, 2023; originally announced January 2023.

Comments: 10 pages, 6 figures

arXiv:2212.08894 [pdf, other]

Performance assessment of helicon wave heating and current drive in EXL-50 spherical torus plasmas

Authors: G. J. Qiao, D. Luo, S. D. Song, J. Q. Dong, Y. J. Shi, J. C. Li, D. Du, Y. K. Martin Peng, M. S. Liu, EXL-50 team

Abstract: Analysis of helicon wave heating and current drive capability in EXL-50 spherical torus plasmas has been conducted. It is found that the driven current increases with the launched parallel refractive index $n_{||}$ and peaks around $n_{||} = 4.0$ when the frequency of the helicon wave is between 300~MHz and 380~MHz. The helicon wave current drive efficiency shows a relatively stable upward trend w… ▽ More Analysis of helicon wave heating and current drive capability in EXL-50 spherical torus plasmas has been conducted. It is found that the driven current increases with the launched parallel refractive index $n_{||}$ and peaks around $n_{||} = 4.0$ when the frequency of the helicon wave is between 300~MHz and 380~MHz. The helicon wave current drive efficiency shows a relatively stable upward trend with increasing plasma temperature. Moreover, the driven current decreases as the plasma density increases. We also analyzed the current drive with helicon waves of 150~MHz and 170~MHz and found that the driven current at a lower frequency was lower than that at a higher frequency. A positive proportional relationship exists between the driven current and $n_{||}$. Besides, as $n_{||}$ increases, the profile of the driven current becomes wider. Finally, the effect of the scrape-off layer (SOL) region on the helicon wave current drive was also investigated. △ Less

Submitted 17 December, 2022; originally announced December 2022.

arXiv:2210.08471 [pdf, other]

Improving Semantic Matching through Dependency-Enhanced Pre-trained Model with Adaptive Fusion

Authors: Jian Song, Di Liang, Rumei Li, Yuntao Li, Sirui Wang, Minlong Peng, Wei Wu, Yongxin Yu

Abstract: Transformer-based pre-trained models like BERT have achieved great progress on Semantic Sentence Matching. Meanwhile, dependency prior knowledge has also shown general benefits in multiple NLP tasks. However, how to efficiently integrate dependency prior structure into pre-trained models to better model complex semantic matching relations is still unsettled. In this paper, we propose the \textbf{D… ▽ More Transformer-based pre-trained models like BERT have achieved great progress on Semantic Sentence Matching. Meanwhile, dependency prior knowledge has also shown general benefits in multiple NLP tasks. However, how to efficiently integrate dependency prior structure into pre-trained models to better model complex semantic matching relations is still unsettled. In this paper, we propose the \textbf{D}ependency-Enhanced \textbf{A}daptive \textbf{F}usion \textbf{A}ttention (\textbf{DAFA}), which explicitly introduces dependency structure into pre-trained models and adaptively fuses it with semantic information. Specifically, \textbf{\emph{(i)}} DAFA first proposes a structure-sensitive paradigm to construct a dependency matrix for calibrating attention weights. It adopts an adaptive fusion module to integrate the obtained dependency information and the original semantic signals. Moreover, DAFA reconstructs the attention calculation flow and provides better interpretability. By applying it on BERT, our method achieves state-of-the-art or competitive performance on 10 public datasets, demonstrating the benefits of adaptively fusing dependency structure in semantic matching task. △ Less

Submitted 24 August, 2023; v1 submitted 16 October, 2022; originally announced October 2022.

Comments: Accepted by Findings of EMNLP 2022

arXiv:2210.04870 [pdf, other]

SMiLE: Schema-augmented Multi-level Contrastive Learning for Knowledge Graph Link Prediction

Authors: Miao Peng, Ben Liu, Qianqian Xie, Wenjie Xu, Hua Wang, Min Peng

Abstract: Link prediction is the task of inferring missing links between entities in knowledge graphs. Embedding-based methods have shown effectiveness in addressing this problem by modeling relational patterns in triples. However, the link prediction task often requires contextual information in entity neighborhoods, while most existing embedding-based methods fail to capture it. Additionally, little atten… ▽ More Link prediction is the task of inferring missing links between entities in knowledge graphs. Embedding-based methods have shown effectiveness in addressing this problem by modeling relational patterns in triples. However, the link prediction task often requires contextual information in entity neighborhoods, while most existing embedding-based methods fail to capture it. Additionally, little attention is paid to the diversity of entity representations in different contexts, which often leads to false prediction results. In this situation, we consider that the schema of knowledge graph contains the specific contextual information, and it is beneficial for preserving the consistency of entities across contexts. In this paper, we propose a novel Schema-augmented Multi-level contrastive LEarning framework (SMiLE) to conduct knowledge graph link prediction. Specifically, we first exploit network schema as the prior constraint to sample negatives and pre-train our model by employing a multi-level contrastive learning method to yield both prior schema and contextual information. Then we fine-tune our model under the supervision of individual triples to learn subtler representations for link prediction. Extensive experimental results on four knowledge graph datasets with thorough analysis of each component demonstrate the effectiveness of our proposed framework against state-of-the-art baselines. The implementation of SMiLE is available at https://github.com/GKNL/SMiLE. △ Less

Submitted 3 March, 2024; v1 submitted 10 October, 2022; originally announced October 2022.

Comments: Findings of EMNLP 2022

arXiv:2208.14256 [pdf, other]

doi 10.1109/TCI.2023.3282042

Denoising Particle Beam Micrographs with Plug-and-Play Methods

Authors: Minxu Peng, Ruangrawee Kitichotkul, Sheila W. Seidel, Christopher Yu, Vivek K Goyal

Abstract: In a particle beam microscope, a raster-scanned focused beam of particles interacts with a sample to generate a secondary electron (SE) signal pixel by pixel. Conventionally formed micrographs are noisy because of limitations on acquisition time and dose. Recent work has shown that estimation methods applicable to a time-resolved measurement paradigm can greatly reduce noise, but these methods app… ▽ More In a particle beam microscope, a raster-scanned focused beam of particles interacts with a sample to generate a secondary electron (SE) signal pixel by pixel. Conventionally formed micrographs are noisy because of limitations on acquisition time and dose. Recent work has shown that estimation methods applicable to a time-resolved measurement paradigm can greatly reduce noise, but these methods apply pixel by pixel without exploiting image structure. Raw SE count data can be modeled with a compound Poisson (Neyman Type A) likelihood, which implies data variance that is signal-dependent and greater than the variation in the underlying particle-sample interaction. These statistical properties make methods that assume additive white Gaussian noise ineffective. This paper introduces methods for particle beam micrograph denoising that use the plug-and-play framework to exploit image structure while being applicable to the unusual data likelihoods of this modality. Approximations of the data likelihood that vary in accuracy and computational complexity are combined with denoising by total variation regularization, BM3D, and DnCNN. Methods are provided for both conventional and time-resolved measurements, assuming SE counts are available. In simulations representative of helium ion microscopy and scanning electron microscopy, significant improvements in root mean-squared error (RMSE), structural similarity index measure (SSIM), and qualitative appearance are obtained. Average reductions in RMSE are by factors ranging from 2.24 to 4.11. △ Less

Submitted 3 May, 2023; v1 submitted 30 August, 2022; originally announced August 2022.

Journal ref: IEEE Transactions on Computational Imaging, vol. 9, pp. 581--593, 13 June 2023

arXiv:2208.13361 [pdf, other]

NL2GDPR: Automatically Develop GDPR Compliant Android Application Features from Natural Language

Authors: Faysal Hossain Shezan, Yingjie Lao, Minlong Peng, Xin Wang, Mingming Sun, ** Li

Abstract: The recent privacy leakage incidences and the more strict policy regulations demand a much higher standard of compliance for companies and mobile apps. However, such obligations also impose significant challenges on app developers for complying with these regulations that contain various perspectives, activities, and roles, especially for small companies and developers who are less experienced in… ▽ More The recent privacy leakage incidences and the more strict policy regulations demand a much higher standard of compliance for companies and mobile apps. However, such obligations also impose significant challenges on app developers for complying with these regulations that contain various perspectives, activities, and roles, especially for small companies and developers who are less experienced in this matter or with limited resources. To address these hurdles, we develop an automatic tool, NL2GDPR, which can generate policies from natural language descriptions from the developer while also ensuring the app's functionalities are compliant with General Data Protection Regulation (GDPR). NL2GDPR is developed by leveraging an information extraction tool, OIA (Open Information Annotation), developed by Baidu Cognitive Computing Lab. At the core, NL2GDPR is a privacy-centric information extraction model, appended with a GDPR policy finder and a policy generator. We perform a comprehensive study to grasp the challenges in extracting privacy-centric information and generating privacy policies, while exploiting optimizations for this specific task. With NL2GDPR, we can achieve 92.9%, 95.2%, and 98.4% accuracy in correctly identifying GDPR policies related to personal data storage, process, and share types, respectively. To the best of our knowledge, NL2GDPR is the first tool that allows a developer to automatically generate GDPR compliant policies, with only the need of entering the natural language for describing the app features. Note that other non-GDPR-related features might be integrated with the generated features to build a complex app. △ Less

Submitted 29 August, 2022; originally announced August 2022.

Comments: 37 pages

arXiv:2208.05479 [pdf, other]

IRS-Based Integrated Location Sensing and Communication for mmWave SIMO Systems

Authors: Xiaoling Hu, Chenxi Liu, Mugen Peng, Caijun Zhong

Abstract: In this paper, we establish an integrated sensing and communication (ISAC) system based on a distributed semi-passive intelligent reflecting surface (IRS), which allows location sensing and data transmission to be carried out simultaneously, sharing the same frequency and time resources. The detailed working process of the proposed IRS-based ISAC system is designed, including the transmission prot… ▽ More In this paper, we establish an integrated sensing and communication (ISAC) system based on a distributed semi-passive intelligent reflecting surface (IRS), which allows location sensing and data transmission to be carried out simultaneously, sharing the same frequency and time resources. The detailed working process of the proposed IRS-based ISAC system is designed, including the transmission protocol, location sensing and beamforming optimization. Specifically, each coherence block consists of two periods, the ISAC period with two time blocks and the pure communication (PC) period. During each time block of the ISAC period, data transmission and user positioning are carried out simultaneously. The estimated user location in the first time block will be used for beamforming design in the second time block. During the PC period, only data transmission is conducted, by invoking the user location estimated in the second time block of the ISAC period for beamforming design. {\color{black}Simulation results show that a millimeter-level positioning accuracy can be achieved by the proposed location sensing scheme, demonstrating the advantage of the proposed IRS-based ISAC framework. Besides, the proposed two beamforming schemes based on the estimated location information achieve similar performance to the benchmark schemes assuming perfect channel state information (CSI), which verifies the effectiveness of beamforming design using sensed location information. △ Less

Submitted 10 August, 2022; originally announced August 2022.

Comments: arXiv admin note: text overlap with arXiv:2208.05300

arXiv:2208.05324 [pdf, ps, other]

IRS-Aided Non-Orthogonal ISAC Systems: Performance Analysis and Beamforming Design

Authors: Zhouyuan Yu, Xiaoling Hu, Chenxi Liu, Mugen Peng, Caijun Zhong

Abstract: Intelligent reflecting surface (IRS) has shown its effectiveness in facilitating orthogonal time-division integrated sensing and communications (TD-ISAC), in which the sensing task and the communication task occupy orthogonal time-frequency resources, while the role of IRS in the more interesting scenarios of non-orthogonal ISAC (NO-ISAC) systems has so far remained unclear. In this paper, we cons… ▽ More Intelligent reflecting surface (IRS) has shown its effectiveness in facilitating orthogonal time-division integrated sensing and communications (TD-ISAC), in which the sensing task and the communication task occupy orthogonal time-frequency resources, while the role of IRS in the more interesting scenarios of non-orthogonal ISAC (NO-ISAC) systems has so far remained unclear. In this paper, we consider an IRS-aided NO-ISAC system, where a distributed IRS is deployed to assist concurrent communication and location sensing for a blind-zone user, occupying non-orthogonal/overlapped time-frequency resources. We first propose a modified Cramer-Rao lower bound (CRLB) to characterize the performances of both communication and location sensing in a unified manner. We further derive the closed-form expressions of the modified CRLB in our considered NO-ISAC system, enabling us to identify the fundamental trade-off between the communication and location sensing performances. In addition, by exploiting the modified CRLB, we propose a joint active and passive beamforming design algorithm that achieves a good communication and location sensing trade-off. Through numerical results, we demonstrate the superiority of the IRS-aided NO-ISAC systems over the IRS-aided TD-ISAC systems, in terms of both communication and localization performances. Besides, it is shown that the IRS-aided NO-ISAC system with random communication signals can achieve comparable localization performance to the IRS-aided localization system with dedicated positioning reference signals. Moreover, we investigate the trade-off between communication performance and localization performance and show how the performance of the NO-ISAC system can be significantly boosted by increasing the number of the IRS elements. △ Less

Submitted 10 August, 2022; originally announced August 2022.

arXiv:2208.05300 [pdf, ps, other]

doi 10.1109/TSP.2022.3217353

Location Sensing and Beamforming Design for IRS-Enabled Multi-User ISAC Systems

Authors: Zhouyuan Yu, Xiaoling Hu, Chenxi Liu, Mugen Peng, Caijun Zhong

Abstract: This paper explores the potential of the intelligent reflecting surface (IRS) in realizing multi-user concurrent communication and localization, using the same time-frequency resources. Specifically, we propose an IRS-enabled multi-user integrated sensing and communication (ISAC) framework, where a distributed semi-passive IRS assists the uplink data transmission from multiple users to the base st… ▽ More This paper explores the potential of the intelligent reflecting surface (IRS) in realizing multi-user concurrent communication and localization, using the same time-frequency resources. Specifically, we propose an IRS-enabled multi-user integrated sensing and communication (ISAC) framework, where a distributed semi-passive IRS assists the uplink data transmission from multiple users to the base station (BS) and conducts multi-user localization, simultaneously. We first design an ISAC transmission protocol, where the whole transmission period consists of two periods, i.e., the ISAC period for simultaneous uplink communication and multi-user localization, and the pure communication (PC) period for only uplink data transmission. For the ISAC period, we propose a multi-user location sensing algorithm, which utilizes the uplink communication signals unknown to the IRS, thus removing the requirement of dedicated positioning reference signals in conventional location sensing methods. Based on the sensed users' locations, we propose two novel beamforming algorithms for the ISAC period and PC period, respectively, which can work with discrete phase shifts and require no channel state information (CSI) acquisition. Numerical results show that the proposed multi-user location sensing algorithm can achieve up to millimeter-level positioning accuracy, indicating the advantage of the IRS-enabled ISAC framework. Moreover, the proposed beamforming algorithms with sensed location information and discrete phase shifts can achieve comparable performance to the benchmark considering perfect CSI acquisition and continuous phase shifts, demonstrating how the location information can ensure the communication performance. △ Less

Submitted 10 August, 2022; originally announced August 2022.

arXiv:2205.15397 [pdf, other]

Minimax Optimal Online Imitation Learning via Replay Estimation

Authors: Gokul Swamy, Nived Rajaraman, Matthew Peng, Sanjiban Choudhury, J. Andrew Bagnell, Zhiwei Steven Wu, Jiantao Jiao, Kannan Ramchandran

Abstract: Online imitation learning is the problem of how best to mimic expert demonstrations, given access to the environment or an accurate simulator. Prior work has shown that in the infinite sample regime, exact moment matching achieves value equivalence to the expert policy. However, in the finite sample regime, even if one has no optimization error, empirical variance can lead to a performance gap tha… ▽ More Online imitation learning is the problem of how best to mimic expert demonstrations, given access to the environment or an accurate simulator. Prior work has shown that in the infinite sample regime, exact moment matching achieves value equivalence to the expert policy. However, in the finite sample regime, even if one has no optimization error, empirical variance can lead to a performance gap that scales with $H^2 / N$ for behavioral cloning and $H / \sqrt{N}$ for online moment matching, where $H$ is the horizon and $N$ is the size of the expert dataset. We introduce the technique of replay estimation to reduce this empirical variance: by repeatedly executing cached expert actions in a stochastic simulator, we compute a smoother expert visitation distribution estimate to match. In the presence of general function approximation, we prove a meta theorem reducing the performance gap of our approach to the parameter estimation error for offline classification (i.e. learning the expert policy). In the tabular setting or with linear function approximation, our meta theorem shows that the performance gap incurred by our approach achieves the optimal $\widetilde{O} \left( \min({H^{3/2}} / {N}, {H} / {\sqrt{N}} \right)$ dependency, under significantly weaker assumptions compared to prior work. We implement multiple instantiations of our approach on several continuous control tasks and find that we are able to significantly improve policy performance across a variety of dataset sizes. △ Less

Submitted 14 January, 2023; v1 submitted 30 May, 2022; originally announced May 2022.

arXiv:2205.07078 [pdf]

doi 10.1063/5.0099072

Electromagnetic Composites: from Effective Medium Theories to Metamaterials

Authors: Faxiang Qin, Mengyue Peng, Diana Estevez, Christian Brosseau

Abstract: Electromagnetic (EM) composites have stimulated tremendous fundamental and practical interests owing to their flexible electromagnetic properties and extensive potential engineering applications. Hence, it is necessary to systematically understand the physical mechanisms and design principles controlling EM composites. In this tutorial, we first provide an overview of the basic theory of electroma… ▽ More Electromagnetic (EM) composites have stimulated tremendous fundamental and practical interests owing to their flexible electromagnetic properties and extensive potential engineering applications. Hence, it is necessary to systematically understand the physical mechanisms and design principles controlling EM composites. In this tutorial, we first provide an overview of the basic theory of electromagnetism about electromagnetic constitutive parameters that can represent the electromagnetic properties of materials. We show how this corpus allows a consistent construction of effective medium theories and allows for numerical simulation of EM composites to deal with structure-property relationships. We then discuss the influence of spatial dispersion of shaped inclusions in the material medium on the EM properties of composites, which has not been systematically illustrated in the context of this interdisciplinary topic. Next, artificial composites or metamaterials with peculiar properties not readily available in nature are highlighted with particular emphasis on the control of the EM interaction with composites. We conclude by discussing appropriate methods of electromagnetic measurement and practical aspects for implementing composites for specific applications are described. Overall, this tutorial will serve the purpose of introducing the basics and applications of electromagnetic composites to newcomers in this field. It is also anticipated that researchers from different backgrounds including materials science, optics, and electrical engineering can communicate to each other with the same language when dealing with this interdisciplinary subject and further push forward this advancement from fundamental science to technological applications. △ Less

Submitted 14 May, 2022; originally announced May 2022.

Comments: 63 pages, 20 figures

arXiv:2205.04061 [pdf, other]

Multilevel Hierarchical Network with Multiscale Sampling for Video Question Answering

Authors: Min Peng, Chongyang Wang, Yuan Gao, Yu Shi, Xiang-Dong Zhou

Abstract: Video question answering (VideoQA) is challenging given its multimodal combination of visual understanding and natural language processing. While most existing approaches ignore the visual appearance-motion information at different temporal scales, it is unknown how to incorporate the multilevel processing capacity of a deep learning model with such multiscale information. Targeting these issues,… ▽ More Video question answering (VideoQA) is challenging given its multimodal combination of visual understanding and natural language processing. While most existing approaches ignore the visual appearance-motion information at different temporal scales, it is unknown how to incorporate the multilevel processing capacity of a deep learning model with such multiscale information. Targeting these issues, this paper proposes a novel Multilevel Hierarchical Network (MHN) with multiscale sampling for VideoQA. MHN comprises two modules, namely Recurrent Multimodal Interaction (RMI) and Parallel Visual Reasoning (PVR). With a multiscale sampling, RMI iterates the interaction of appearance-motion information at each scale and the question embeddings to build the multilevel question-guided visual representations. Thereon, with a shared transformer encoder, PVR infers the visual cues at each level in parallel to fit with answering different question types that may rely on the visual information at relevant levels. Through extensive experiments on three VideoQA datasets, we demonstrate improved performances than previous state-of-the-arts and justify the effectiveness of each part of our method. △ Less

Submitted 9 May, 2022; originally announced May 2022.

Comments: Accepted by IJCAI 2022. arXiv admin note: text overlap with arXiv:2109.04735

arXiv:2204.05995 [pdf, ps, other]

Communication and Computation Assisted Sensing Information Freshness Performance Analysis in Vehicular Networks

Authors: Ning Jiang, Shi Yan, Zhuohan Liu, Chun**g Hu, Mugen Peng

Abstract: The timely sharing of raw sensing information in the vehicular networks (VNETs) is essential to safety. In order to improve the freshness of sensing information, joint scheduling of multi-dimensional resources such as communication and computation is required. However, the complex relevance among multi-dimensional resources is still unclear, and it is difficult to achieve efficient resource utiliz… ▽ More The timely sharing of raw sensing information in the vehicular networks (VNETs) is essential to safety. In order to improve the freshness of sensing information, joint scheduling of multi-dimensional resources such as communication and computation is required. However, the complex relevance among multi-dimensional resources is still unclear, and it is difficult to achieve efficient resource utilization. In this paper, we present a theoretical analysis for a novel metric Age of Information (AoI) on a communication and computation assisted spatial-temporal model. An uplink VNETs scenario where Road Side Units (RSUs) are deployed with computational resource is considered. The transmission and computation process is unified into a two-stage tandem queue and the expression of the average AoI is derived. The network interference is analyzed by modeling the VNETs as Cox Poisson Point Process based on stochastic geometry and the closed-form solution of the coverage probability and the expected data rate performance under the constraints of transmission resources is obtained. The simulation results reveal the basic relationship between communication and computation capacity and show that communication and computation should reach a tradeoff to improve resource utilization while ensuring real-time information requirement. △ Less

Submitted 12 April, 2022; originally announced April 2022.

arXiv:2203.16101 [pdf]

All-optical determination of one or two emitters using quantum polarization with nitrogen-vacancy centers in diamond

Authors: Davin Yue Ming Peng, Josef G. Worboys, Qiang Sun, Shuo Li, Marco Capelli, Shinobu Onoda, Takeshi Ohshima, Philipp Reineck, Brant C. Gibson, Andrew D. Greentree

Abstract: Qubit technologies using nitrogen-vacancy color centers in diamonds require precise knowledge of the centers, including the number of emitters within a diffraction-limited spot and their orientations. However, the number of emitters is challenging to determine when there is finite background, which affects the precision of resulting quantum protocols. Here we show the photoluminescence (PL) intens… ▽ More Qubit technologies using nitrogen-vacancy color centers in diamonds require precise knowledge of the centers, including the number of emitters within a diffraction-limited spot and their orientations. However, the number of emitters is challenging to determine when there is finite background, which affects the precision of resulting quantum protocols. Here we show the photoluminescence (PL) intensity and quantum correlation (Hanbury Brown and Twiss) measurements as a function of polarization for one- and two-emitter systems. The sample was made by implanting low concentrations of adenine (C5H5N5) into a low nitrogen chemical vapor deposition diamond. This approach yielded well-spaced regions with few nitrogen-vacancy centers. By map** the PL intensity and quantum correlation as a function of polarization, we can distinguish two emitter systems from single emitters with background, providing a method to quantify the background signal at implanted sites, which might be different from off-site background levels. This approach also provides a valuable new all-optical mechanism for the determination of one or two emitter systems useful for quantum sensing, communication, and computation tasks. △ Less

Submitted 5 June, 2023; v1 submitted 30 March, 2022; originally announced March 2022.

arXiv:2202.11203 [pdf, other]

Label-Smoothed Backdoor Attack

Authors: Minlong Peng, Zidi Xiong, Mingming Sun, ** Li

Abstract: By injecting a small number of poisoned samples into the training set, backdoor attacks aim to make the victim model produce designed outputs on any input injected with pre-designed backdoors. In order to achieve a high attack success rate using as few poisoned training samples as possible, most existing attack methods change the labels of the poisoned samples to the target class. This practice of… ▽ More By injecting a small number of poisoned samples into the training set, backdoor attacks aim to make the victim model produce designed outputs on any input injected with pre-designed backdoors. In order to achieve a high attack success rate using as few poisoned training samples as possible, most existing attack methods change the labels of the poisoned samples to the target class. This practice often results in severe over-fitting of the victim model over the backdoors, making the attack quite effective in output control but easier to be identified by human inspection or automatic defense algorithms. In this work, we proposed a label-smoothing strategy to overcome the over-fitting problem of these attack methods, obtaining a \textit{Label-Smoothed Backdoor Attack} (LSBA). In the LSBA, the label of the poisoned sample $\bm{x}$ will be changed to the target class with a probability of $p_n(\bm{x})$ instead of 100\%, and the value of $p_n(\bm{x})$ is specifically designed to make the prediction probability the target class be only slightly greater than those of the other classes. Empirical studies on several existing backdoor attacks show that our strategy can considerably improve the stealthiness of these attacks and, at the same time, achieve a high attack success rate. In addition, our strategy makes it able to manually control the prediction probability of the design output through manipulating the applied and activated number of LSBAs\footnote{Source code will be published at \url{https://github.com/v-mipeng/LabelSmoothedAttack.git}}. △ Less

Submitted 18 February, 2022; originally announced February 2022.

Comments: Backdoor Attack

arXiv:2112.10315 [pdf]

doi 10.1088/1361-6587/ac5a08

Experimental study on edge energetic electrons in EXL-50 spherical torus

Authors: Dong Guo, Yuejiang Shi, Wenjun Liu, Yunyang Song, Tiantian Sun, Bing Liu, Yingying Li, Xiaorang Tian, Guosong Zhang, Huasheng Xie, Y. K. Martin Peng, Minsheng Liu

Abstract: A significant number of confined energetic electrons have been observed outside the Last Closed Flux Surface (LCFS) of the solenoid-free, ECRH sustained plasmas in the EXL-50 spherical torus. Several diagnostics have been applied, for the first time, to investigate the key characters of energetic electrons. Experiments reveal the existence of high-temperature low density electrons, which can carry… ▽ More A significant number of confined energetic electrons have been observed outside the Last Closed Flux Surface (LCFS) of the solenoid-free, ECRH sustained plasmas in the EXL-50 spherical torus. Several diagnostics have been applied, for the first time, to investigate the key characters of energetic electrons. Experiments reveal the existence of high-temperature low density electrons, which can carry relatively a large amount of the stored energy. The boundary between the thermal plasma and the energetic electron fluid appears to be clearly separated and the distance between the two boundaries can reach tens of centimeters (around the size of the minor radius of the thermal plasma). This implies that the Grad-Shafranov equilibrium is not suitable to describe the equilibrium of the EXL-50 plasma and a multi-fluid model is required. Particle dynamics simulations of full orbits show that energetic electrons can be well confined outside the LCFS. This is consistent with the experimental observations. △ Less

Submitted 19 December, 2021; originally announced December 2021.

Journal ref: Plasma Phys. Control. Fusion 64 (2022) 055009 (7pp)

arXiv:2112.06224 [pdf, ps, other]

doi 10.1109/WCSP52459.2021.9613157

Joint Sensing, Communication, and Computation Resource Allocation for Cooperative Perception in Fog-Based Vehicular Networks

Authors: Xinran Zhang, Zhimin He, Yaohua Sun, Shuo Yuan, Mugen Peng

Abstract: To enlarge the perception range and reliability of individual autonomous vehicles, cooperative perception has been received much attention. However, considering the high volume of shared messages, limited bandwidth and computation resources in vehicular networks become bottlenecks. In this paper, we investigate how to balance the volume of shared messages and constrained resources in fog-based veh… ▽ More To enlarge the perception range and reliability of individual autonomous vehicles, cooperative perception has been received much attention. However, considering the high volume of shared messages, limited bandwidth and computation resources in vehicular networks become bottlenecks. In this paper, we investigate how to balance the volume of shared messages and constrained resources in fog-based vehicular networks. To this end, we first characterize sum satisfaction of cooperative perception taking account of its spatial-temporal value and latency performance. Next, the sensing block message, communication resource block, and computation resource are jointly allocated to maximize the sum satisfaction of cooperative perception, while satisfying the maximum latency and sojourn time constraints of vehicles. Owing to its non-convexity, we decouple the original problem into two separate sub-problems and devise corresponding solutions. Simulation results demonstrate that our proposed scheme can effectively boost the sum satisfaction of cooperative perception compared with existing baselines. △ Less

Submitted 12 December, 2021; originally announced December 2021.

Comments: Accepted by WCSP 2021

arXiv:2112.03272 [pdf]

doi 10.1063/5.0075019

Clarification of Basic Concepts for Electromagnetic Interference Shielding Effectiveness

Authors: Mengyue Peng, Faxiang Qin

Abstract: There exists serious miscomprehension in the open literature about the electromagnetic interference shielding effectiveness (EMI SE) as a critical index to evaluate the shielding performance, which is misleading to the graduates and newcomers embarking on the field of electromagnetic shielding materials. EMI SE is defined as the sum of three terms including reflection loss, absorption loss and mul… ▽ More There exists serious miscomprehension in the open literature about the electromagnetic interference shielding effectiveness (EMI SE) as a critical index to evaluate the shielding performance, which is misleading to the graduates and newcomers embarking on the field of electromagnetic shielding materials. EMI SE is defined as the sum of three terms including reflection loss, absorption loss and multiple reflection loss in the classical Schelkunoff theory, while it is decomposed into two terms named reflection loss and absorption loss in practice, which is called Calculation theory here. In this paper, we elucidate the widely-seen misconceptions connected with EMI SE via theoretical derivation and instance analysis. Firstly, the terms in Calculation theory are often mistakenly regarded as the approximation of the terms with the same names in Schelkunoff theory when multiple reflection loss is negligible. Secondly, it is insufficient and unreasonable to determine the absorption-dominant shielding performance in the case that absorption loss is higher than reflection loss since reflection loss and absorption loss cannot represent the actual levels of reflected and absorbed power. Power coefficients are recommended to compare the contribution of reflection and absorption to shielding performance. Thirdly, multiple reflection effect is included in the definitions of reflection loss and absorption loss in Calculation theory, and the effect of multiple reflections on shielding property is clarified as against the commonly wrong understandings. These clarifications offer correct comprehension about the shielding mechanism and assessment of reflection and absorption contribution to the total shielding. △ Less

Submitted 5 December, 2021; originally announced December 2021.

Comments: 25 pages, 9 figures

arXiv:2111.10611 [pdf, other]

Online Beam Current Estimation in Particle Beam Microscopy

Authors: Sheila W. Seidel, Luisa Watkins, Minxu Peng, Akshay Agarwal, Christopher Yu, Vivek K Goyal

Abstract: In conventional particle beam microscopy, knowledge of the beam current is essential for accurate micrograph formation and sample milling. This generally necessitates offline calibration of the instrument. In this work, we establish that beam current can be estimated online, from the same secondary electron count data that is used to form micrographs. Our methods depend on the recently introduced… ▽ More In conventional particle beam microscopy, knowledge of the beam current is essential for accurate micrograph formation and sample milling. This generally necessitates offline calibration of the instrument. In this work, we establish that beam current can be estimated online, from the same secondary electron count data that is used to form micrographs. Our methods depend on the recently introduced time-resolved measurement concept, which combines multiple short measurements at a single pixel and has previously been shown to partially mitigate the effect of beam current variation on micrograph accuracy. We analyze the problem of jointly estimating beam current and secondary electron yield using the Cramer-Rao bound. Joint estimators operating at a single pixel and estimators that exploit models for inter-pixel correlation and Markov beam current variation are proposed and tested on synthetic microscopy data. Our estimates of secondary electron yield that incorporate explicit beam current estimation beat state-of-the-art methods, resulting in micrograph accuracy nearly indistinguishable from what is obtained with perfect beam current knowledge. Our novel beam current estimation could help improve milling outcomes, prevent sample damage, and enable online instrument diagnostics. △ Less

Submitted 20 November, 2021; originally announced November 2021.

arXiv:2109.13847 [pdf, ps, other]

doi 10.1103/PhysRevLett.129.051802

New Experimental Limits on Exotic Spin- and Velocity-dependent Interactions Using Rotationally Modulated Source-masses and an Atomic-magnetometer Array

Authors: K. Y. Wu, S. Y. Chen, G. A. Sun, S. M. Peng, M. Peng, H. Yan

Abstract: We conducted laboratory searching for the exotic spin- and velocity-dependent new interactions according to the previously proposed experimental scheme. Two $\sim$6Kg heavy source masses are rotationally modulated at a frequency of 20Hz. Four identical atomic magnetometers are used in an array form to increase the statistics and cancel the common-mode noise. Data processing method based on high pr… ▽ More We conducted laboratory searching for the exotic spin- and velocity-dependent new interactions according to the previously proposed experimental scheme. Two $\sim$6Kg heavy source masses are rotationally modulated at a frequency of 20Hz. Four identical atomic magnetometers are used in an array form to increase the statistics and cancel the common-mode noise. Data processing method based on high precision numerical integration is applied for the four harmonic frequencies of the signal. The rotation direction of the source masses was reversed to flip the signal. Thus the [1,-3,3,-1] weighting method can be applied to remove possible slow drifting further. The experiment method has noise reduction features, and new constraints for Vector-Axial and Axial-Axial were obtained. The new constraints on VA improved by as much as more than four orders, on AA by as much as two orders in the corresponding force range, respectively. △ Less

Submitted 28 September, 2021; originally announced September 2021.

arXiv:2109.10783 [pdf, other]

doi 10.1021/acsanm.1c02969

Dynamic Behaviors and Training Effects in TiN/Ti/HfO$_x$/TiN Nanolayered Memristors with Controllable Quantized Conductance States: Implications for Quantum and Neuromorphic Computing Devices

Authors: Min-Hsuan Peng, Ching-Yang Pan, Hao-Xuan Zheng, Ting-Chang Chang, Pei-hsun Jiang

Abstract: Controllable quantized conductance states of TiN/Ti/HfO$_x$/TiN memristors are realized with great precision through a pulse-mode reset procedure, assisted with analytical differentiation of the condition of the set procedure, which involves critical monitoring of the measured bias voltage. An intriguing training effect that leads to faster switching of the states is also observed during the opera… ▽ More Controllable quantized conductance states of TiN/Ti/HfO$_x$/TiN memristors are realized with great precision through a pulse-mode reset procedure, assisted with analytical differentiation of the condition of the set procedure, which involves critical monitoring of the measured bias voltage. An intriguing training effect that leads to faster switching of the states is also observed during the operation. Detailed analyses on the low- and high-resistance states under different compliance currents reveal a complete picture of the structural evolution and dynamic behaviors of the conductive filament in the HfO$_x$ layer. This study provides a closer inspection on the quantum-level manipulation of nanoscale atomic configurations in the memristors, which helps to develop essential knowledge about the design and fabrication of the future memristor-based quantum devices and neuromorphic computing devices. △ Less

Submitted 22 September, 2021; originally announced September 2021.

Comments: Accepted by ACS Applied Nano Materials

arXiv:2109.08869 [pdf, other]

doi 10.1088/1674-1056/ac3988

Anti-$\mathcal{PT}$-symmetric Kerr gyroscope

Authors: Huilai Zhang, Meiyu Peng, Xun-Wei Xu, Hui **g

Abstract: Non-Hermitian systems can exhibit unconventional spectral singularities called exceptional points (EPs). Various EP sensors have been fabricated in recent years, showing strong spectral responses to external signals. Here we propose how to achieve a nonlinear anti-parity-time ($\mathcal{APT}$) gyroscope by spinning an optical resonator. We show that, in the absence of any nonlinearity, the sensiti… ▽ More Non-Hermitian systems can exhibit unconventional spectral singularities called exceptional points (EPs). Various EP sensors have been fabricated in recent years, showing strong spectral responses to external signals. Here we propose how to achieve a nonlinear anti-parity-time ($\mathcal{APT}$) gyroscope by spinning an optical resonator. We show that, in the absence of any nonlinearity, the sensitivity or optical mode splitting of the linear device can be magnified up to 3 orders than that of the conventional device without EPs. Remarkably, the $\mathcal{APT}$ symmetry can be broken when including the Kerr nonlinearity of the materials and, as the result, the detection threshold can be significantly lowered, i.e., much weaker rotations which are well beyond the ability of a linear gyroscope can now be detected with the nonlinear device. Our work shows the powerful ability of $\mathcal{APT}$ gyroscopes in practice to achieve ultrasensitive rotation measurement. △ Less

Submitted 18 September, 2021; originally announced September 2021.

Journal ref: Chin. Phys. B 31, 014215 (2022)

arXiv:2109.06866 [pdf, ps, other]

doi 10.1103/PhysRevD.105.055020

Searching for Exotic Spin-Dependent Interactions Using Rotationally Modulated Source Masses and an Atomic Magnetometer Array

Authors: K. Y. Wu, S. Y. Chen, J. Gong, M. Peng, H. Yan

Abstract: We describe a proposed experimental search for exotic spin-dependent interactions using rotationally modulated source masses and an atomic magnetometer array. Rather than further improving the magnetometer sensitivity, noise reduction can be another way to reach higher measurement precision. In this work, we propose to use modulating techniques of the source masses to reduce the noise of the exper… ▽ More We describe a proposed experimental search for exotic spin-dependent interactions using rotationally modulated source masses and an atomic magnetometer array. Rather than further improving the magnetometer sensitivity, noise reduction can be another way to reach higher measurement precision. In this work, we propose to use modulating techniques of the source masses to reduce the noise of the experiment. Better precision can be achieved if the fundamental frequency and harmonics of the rotating source masses are used to detect the new interactions. Furthermore, if an array of magnetometers are applied, the statistic precision can be improved, and some common noises can be canceled. Our analysis and simulations indicate that the proposed experiment scheme can improve the detection precisions of three types of spin-dependent interactions by as much as $\sim$5 orders in the force range of $\sim$cm to $\sim$10m. △ Less

Submitted 8 March, 2022; v1 submitted 6 September, 2021; originally announced September 2021.

arXiv:2109.04735 [pdf, other]

Temporal Pyramid Transformer with Multimodal Interaction for Video Question Answering

Authors: Min Peng, Chongyang Wang, Yuan Gao, Yu Shi, Xiang-Dong Zhou

Abstract: Video question answering (VideoQA) is challenging given its multimodal combination of visual understanding and natural language understanding. While existing approaches seldom leverage the appearance-motion information in the video at multiple temporal scales, the interaction between the question and the visual information for textual semantics extraction is frequently ignored. Targeting these iss… ▽ More Video question answering (VideoQA) is challenging given its multimodal combination of visual understanding and natural language understanding. While existing approaches seldom leverage the appearance-motion information in the video at multiple temporal scales, the interaction between the question and the visual information for textual semantics extraction is frequently ignored. Targeting these issues, this paper proposes a novel Temporal Pyramid Transformer (TPT) model with multimodal interaction for VideoQA. The TPT model comprises two modules, namely Question-specific Transformer (QT) and Visual Inference (VI). Given the temporal pyramid constructed from a video, QT builds the question semantics from the coarse-to-fine multimodal co-occurrence between each word and the visual content. Under the guidance of such question-specific semantics, VI infers the visual clues from the local-to-global multi-level interactions between the question and the video. Within each module, we introduce a multimodal attention mechanism to aid the extraction of question-video interactions, with residual connections adopted for the information passing across different levels. Through extensive experiments on three VideoQA datasets, we demonstrate better performances of the proposed method in comparison with the state-of-the-arts. △ Less

Submitted 10 September, 2021; originally announced September 2021.

Comments: Submitted to AAAI'22

arXiv:2109.04161 [pdf, other]

Investigation of the effectiveness of non-inductive `multi-harmonic' electron cyclotron current drive through modeling multi-pass absorptions in the EXL-50 spherical tokamak

Authors: D. Banerjee, S. D. Song, H. S. Xie, B. Liu, M. Y. Wang, W. J. Liu, B. Chen, L. Han, D. Luo, Y. Y. Song, Yu. V. Petrov, X. M. Song, M. S. Liu, R. W. Harvey, Y. J. Shi, Y. K. M. Peng, the EXL50 team

Abstract: The effectiveness of multiple electron cyclotron resonance (ECR) harmonics has been thoroughly investigated in context of high current drive efficiency, generally observed in fully non-inductive operation of the low aspect ratio EXL-50 spherical tokamak (ST) powered by electron cyclotron (EC) waves. The Fokker-Plank equation is numerically solved to obtain electron distribution function, under ste… ▽ More The effectiveness of multiple electron cyclotron resonance (ECR) harmonics has been thoroughly investigated in context of high current drive efficiency, generally observed in fully non-inductive operation of the low aspect ratio EXL-50 spherical tokamak (ST) powered by electron cyclotron (EC) waves. The Fokker-Plank equation is numerically solved to obtain electron distribution function, under steady state of the relativistic nonlinear Coulomb collision and quasi-linear diffusion operators, for calculating plasma current driven by the injected EC wave. For the extra-ordinary EC wave, simulation results unfold a mechanism by which electrons moving around the cold second harmonic ECR layer strongly resonate with higher harmonics via the relativistic Doppler shifted resonance condition. This feature is in fact evident above a certain value of input EC wave power in simulation, indicating it to be a non-linear phenomenon. Similar to the experimental observation, high efficiency in current drive (over 1 A/W) has indeed been found in simulation for a typical low density ($\sim 1\times10^{18}~m^{-3}$), low temperature ($\lesssim 100$ eV) plasma of EXL-50 by taking into account multi-pass absorptions in our simulation model. However, such characteristic is not found in the ordinary EC-wave study for both single-pass and multi-pass simulations, suggesting it as inefficient in driving current on our ST device. △ Less

Submitted 9 September, 2021; originally announced September 2021.

Comments: 22 pages, 15 figures. Submitted to the Nuclear Fusion journal

arXiv:2108.09697

Observation of a strong correlation between the positive floating potential near the edge and plasma current on EXL-50 ECW plasma

Authors: Mingyuan Wang, Dong Guo, Xin Zhao, Yunyang Song, Wenjun Liu, Hongfei Du, Shaodong Song, Bing Liu, Yuejiang Shi, Tiantian Sun, Songjian Li, Debabrata Banerjee, Xiaomin Tian, Yingying Li, Y. -K Martin Peng

Abstract: Fully non-inductive plasma current start-up without the central solenoid in ECW plasma was used on EXL-50 Spherical Torus with a weak external vertical field (Bv). Generally, the number of electrons leaving to the vessel wall by the gradient Bt is larger than ions, and the positive potential was built up in plasma. The relationship between floating potential and the plasma current was studied usin… ▽ More Fully non-inductive plasma current start-up without the central solenoid in ECW plasma was used on EXL-50 Spherical Torus with a weak external vertical field (Bv). Generally, the number of electrons leaving to the vessel wall by the gradient Bt is larger than ions, and the positive potential was built up in plasma. The relationship between floating potential and the plasma current was studied using the Langmuir probes near the boundary. The results show that the floating potential is positive (about 200V) and has a strong correlation with plasma current. In open magnetic field, the plasma current is driven by the high energy electrons in preferential confinement, the plasma current and potential approximately positively correlated with total electron density. After forming the closed flux surface, the plasma current consists mainly of the ECW driven current, and potential is negatively correlated with plasma current. By actively adjusting the Bv, it demonstrated that the positive voltage is approximately inversely correlated with the Bv and plasma current (Ip). Considering that the plasma temperature near the boundary is quite low (~eV), the positive voltage near the boundary caused by the high-energy electron loss. Therefore, the measurements of the boundary potential are important for the study of high-energy electron confinement performance, noninductive plasma current start-up and current driven. △ Less

Submitted 1 September, 2021; v1 submitted 22 August, 2021; originally announced August 2021.

Comments: The content of the article is controversial and needs to be reconfirmed with all authors before submission

arXiv:2108.09648

Non-inductive plasma current sustainment with stochastic electron cyclotron in EXL-50 spherical torus

Authors: Mingyuan Wang, Shikui Cheng, Bing Liu, Shaodong Song, Guo Dong, Yunyang Song, Wenjun Liu, Debabrata Banerjee, Songjian Li, Tiantian Sun, Yingying Li, Yuejiang Shi, Y. -K Martin Peng, ADi Liu

Abstract: The start-up and sustainment of a stochastic wave non-inductive current on a spherical torus was experimentally demonstrated for the first time using only electron cyclotron waves. The plasma current is insensitive to the injection angle of ECWs and approximately linearly correlated with the slope of the X-ray spectrum. Its direction is determined by the vertical magnetic field (BV). The temporal… ▽ More The start-up and sustainment of a stochastic wave non-inductive current on a spherical torus was experimentally demonstrated for the first time using only electron cyclotron waves. The plasma current is insensitive to the injection angle of ECWs and approximately linearly correlated with the slope of the X-ray spectrum. Its direction is determined by the vertical magnetic field (BV). The temporal development in the number of X-ray bremsstrahlung photons with a specified energy is consistent with the stochastic heating model. Moreover, the ratio of Amps to Watts of the ECW is generally >1 kA/kW under normal conditions (maximum plasma current: 150 kA, ECW: 140 kW). The experimental results are explained using the stochastic heating model of the asymmetric electron velocity distribution in stochastic electromagnetic waves. △ Less

Submitted 1 September, 2021; v1 submitted 22 August, 2021; originally announced August 2021.

Comments: The content of the article is controversial and needs to be reconfirmed with all authors before submission

arXiv:2108.04678 [pdf]

doi 10.1088/1361-648X/ac431e

Material-structure integrated design for ultra-broadband microwave metamaterial absorber

Authors: Mengyue Peng, Faxiang Qina, Li** Zhou, Huijie Wei, Zihao Zhu, Xiaopeng Shen

Abstract: We propose herein a method of material-structure integrated design for broadband absorption of dielectric metamaterial, which is achieved by combination of genetic algorithm and simulation platform. A multi-layered metamaterial absorber with an ultra-broadband absorption from 5.3 to 18 GHz (a relative bandwidth of as high as 109%) is realized numerically and experimentally. In addition, simulated… ▽ More We propose herein a method of material-structure integrated design for broadband absorption of dielectric metamaterial, which is achieved by combination of genetic algorithm and simulation platform. A multi-layered metamaterial absorber with an ultra-broadband absorption from 5.3 to 18 GHz (a relative bandwidth of as high as 109%) is realized numerically and experimentally. In addition, simulated results demonstrate the proposed metamaterial exhibits good incident angle and polarization tolerance, which also are significant criteria for practical applications. By investigating the working principle with theoretical calculation and numerical simulation, it can be found that merging of multiple resonance modes encompassing quarter-wavelength interference cancellation, spoof surface plasmon polariton mode, dielectric resonance mode and grating mode is responsible for a remarkable ultra-broadband absorption. Analysis of respective contribution of material and structure indicates that either of them plays an indispensable role in activating different resonance modes, and symphony of material and structure is essential to afford desirable target performance. The material-structure integrated design philosophy highlights the superiority of coupling material and structure and provides an effective comprehensive optimization strategy for dielectric metamaterials. △ Less

Submitted 19 July, 2021; originally announced August 2021.

Comments: 26 pages, 8 figures

arXiv:2107.11053 [pdf, other]

An Adaptive State Aggregation Algorithm for Markov Decision Processes

Authors: Guanting Chen, Johann Demetrio Gaebler, Matt Peng, Chunlin Sun, Yinyu Ye

Abstract: Value iteration is a well-known method of solving Markov Decision Processes (MDPs) that is simple to implement and boasts strong theoretical convergence guarantees. However, the computational cost of value iteration quickly becomes infeasible as the size of the state space increases. Various methods have been proposed to overcome this issue for value iteration in large state and action space MDPs,… ▽ More Value iteration is a well-known method of solving Markov Decision Processes (MDPs) that is simple to implement and boasts strong theoretical convergence guarantees. However, the computational cost of value iteration quickly becomes infeasible as the size of the state space increases. Various methods have been proposed to overcome this issue for value iteration in large state and action space MDPs, often at the price, however, of generalizability and algorithmic simplicity. In this paper, we propose an intuitive algorithm for solving MDPs that reduces the cost of value iteration updates by dynamically grou** together states with similar cost-to-go values. We also prove that our algorithm converges almost surely to within $2\varepsilon / (1 - γ)$ of the true optimal value in the $\ell^\infty$ norm, where $γ$ is the discount factor and aggregated states differ by at most $\varepsilon$. Numerical experiments on a variety of simulated environments confirm the robustness of our algorithm and its ability to solve MDPs with much cheaper updates especially as the scale of the MDP problem increases. △ Less

Submitted 23 July, 2021; originally announced July 2021.

arXiv:2105.03572 [pdf, other]

Blockchain Systems, Technologies and Applications: A Methodology Perspective

Authors: Bin Cao, Zixin Wang, Long Zhang, Daquan Feng, Mugen Peng, Lei Zhang

Abstract: In the past decade, blockchain has shown a promising vision greatly to build the trust without any powerful third party in a secure, decentralized and salable manner. However, due to the wide application and future development from cryptocurrency to Internet of Things, blockchain is an extremely complex system enabling integration with mathematics, finance, computer science, communication and netw… ▽ More In the past decade, blockchain has shown a promising vision greatly to build the trust without any powerful third party in a secure, decentralized and salable manner. However, due to the wide application and future development from cryptocurrency to Internet of Things, blockchain is an extremely complex system enabling integration with mathematics, finance, computer science, communication and network engineering, etc. As a result, it is a challenge for engineer, expert and researcher to fully understand the blockchain process in a systematic view from top to down. First, this article introduces how blockchain works, the research activity and challenge, and illustrates the roadmap involving the classic methodology with typical blockchain use cases and topics. Second, in blockchain system, how to adopt stochastic process, game theory, optimization, machine learning and cryptography to study blockchain running process and design blockchain protocol/algorithm are discussed in details. Moreover, the advantage and limitation using these methods are also summarized as the guide of future work to further considered. Finally, some remaining problems from technical, commercial and political views are discussed as the open issues. The main findings of this article will provide an overview in a methodology perspective to study theoretical model for blockchain fundamentals understanding, design network service for blockchain-based mechanisms and algorithms, as well as apply blockchain for Internet of Things, etc. △ Less

Submitted 7 May, 2021; originally announced May 2021.

arXiv:2104.14844 [pdf]

doi 10.1088/1741-4326/ac71b6

Solenoid-free current drive via ECRH in EXL-50 spherical torus plasmas

Authors: Yuejiang Shi, Bing Liu, Shaodong Song, Yunyang Song, Xianming Song, Bowei Tong, Shikui Cheng, Wenjun Liu, Minyuan Wang, Tiantian Sun, Dong Guo, Songjian Li, Yingying Li, Bin Chen, Xiang Gu, Jianqing Cai, Di Luo, Debabrata Banerjee, Xin Zhao, Yuanming Yang, Wenwu Luo, Peihai Zhou, Yu Wang, A. Ishida, T. Maekawa , et al. (3 additional authors not shown)

Abstract: As a new spherical tokamak (ST) designed to simplify engineering requirements of a possible future fusion power source, the EXL-50 experiment features a low aspect ratio (A) vacuum vessel (VV), encircling a central post assembly containing the toroidal field coil conductors without a central solenoid. Multiple electron cyclotron resonance heating (ECRH) resonances are located within the VV to impr… ▽ More As a new spherical tokamak (ST) designed to simplify engineering requirements of a possible future fusion power source, the EXL-50 experiment features a low aspect ratio (A) vacuum vessel (VV), encircling a central post assembly containing the toroidal field coil conductors without a central solenoid. Multiple electron cyclotron resonance heating (ECRH) resonances are located within the VV to improve current drive effectiveness. Copious energetic electrons are produced and measured with hard X-ray detectors, carry the bulk of the plasma current ranging from 50kA to 150kA, which is maintained for more than 1s duration. It is observed that over one Ampere current can be maintained per Watt of ECRH power issued from the 28-GHz gyrotrons. The plasma current reaches Ip>80kA for high density (>5e18me-2) discharge with 150kW ECHR heating. An analysis was carried out combining reconstructed multi-fluid equilibrium, guiding-center orbits of energetic electrons, and resonant heating mechanisms. It is verified that in EXL-50 a broadly distributed current of energetic electrons creates smaller closed magnetic-flux surfaces of low aspect ratio that in turn confine the thermal plasma electrons and ions and participate in maintaining the equilibrium force-balance. △ Less

Submitted 30 March, 2022; v1 submitted 30 April, 2021; originally announced April 2021.

Journal ref: Nuclear Fusion 2022

arXiv:2104.13130 [pdf, ps, other]

doi 10.1109/TNSE.2024.3361458

Secure and Efficient Federated Learning Through Layering and Sharding Blockchain

Authors: Shuo Yuan, Bin Cao, Yao Sun, Zhiguo Wan, Mugen Peng

Abstract: Introducing blockchain into Federated Learning (FL) to build a trusted edge computing environment for transmission and learning has attracted widespread attention as a new decentralized learning pattern. However, traditional consensus mechanisms and architectures of blockchain systems face significant challenges in handling large-scale FL tasks, especially on Internet of Things (IoT) devices, due… ▽ More Introducing blockchain into Federated Learning (FL) to build a trusted edge computing environment for transmission and learning has attracted widespread attention as a new decentralized learning pattern. However, traditional consensus mechanisms and architectures of blockchain systems face significant challenges in handling large-scale FL tasks, especially on Internet of Things (IoT) devices, due to their substantial resource consumption, limited transaction throughput, and complex communication requirements. To address these challenges, this paper proposes ChainFL, a novel two-layer blockchain-driven FL system. It splits the IoT network into multiple shards within the subchain layer, effectively reducing the scale of information exchange, and employs a Direct Acyclic Graph (DAG)-based mainchain as the mainchain layer, enabling parallel and asynchronous cross-shard validation. Furthermore, the FL procedure is customized to integrate deeply with blockchain technology, and a modified DAG consensus mechanism is designed to mitigate distortion caused by abnormal models. To provide a proof-of-concept implementation and evaluation, multiple subchains based on Hyperledger Fabric and a self-developed DAG-based mainchain are deployed. Extensive experiments demonstrate that ChainFL significantly surpasses conventional FL systems, showing up to a 14% improvement in training efficiency and a threefold increase in robustness. △ Less

Submitted 31 January, 2024; v1 submitted 27 April, 2021; originally announced April 2021.

Comments: Accepted by IEEE Transactions on Network Science and Engineering

arXiv:2103.11441 [pdf, other]

TextFlint: Unified Multilingual Robustness Evaluation Toolkit for Natural Language Processing

Authors: Tao Gui, Xiao Wang, Qi Zhang, Qin Liu, Yicheng Zou, Xin Zhou, Rui Zheng, Chong Zhang, Qinzhuo Wu, Jiacheng Ye, Zexiong Pang, Yongxin Zhang, Zhengyan Li, Ruotian Ma, Zichu Fei, Ruijian Cai, Jun Zhao, Xingwu Hu, Zhiheng Yan, Yiding Tan, Yuan Hu, Qiyuan Bian, Zhihua Liu, Bolin Zhu, Shan Qin , et al. (9 additional authors not shown)

Abstract: Various robustness evaluation methodologies from different perspectives have been proposed for different natural language processing (NLP) tasks. These methods have often focused on either universal or task-specific generalization capabilities. In this work, we propose a multilingual robustness evaluation platform for NLP tasks (TextFlint) that incorporates universal text transformation, task-spec… ▽ More Various robustness evaluation methodologies from different perspectives have been proposed for different natural language processing (NLP) tasks. These methods have often focused on either universal or task-specific generalization capabilities. In this work, we propose a multilingual robustness evaluation platform for NLP tasks (TextFlint) that incorporates universal text transformation, task-specific transformation, adversarial attack, subpopulation, and their combinations to provide comprehensive robustness analysis. TextFlint enables practitioners to automatically evaluate their models from all aspects or to customize their evaluations as desired with just a few lines of code. To guarantee user acceptability, all the text transformations are linguistically based, and we provide a human evaluation for each one. TextFlint generates complete analytical reports as well as targeted augmented data to address the shortcomings of the model's robustness. To validate TextFlint's utility, we performed large-scale empirical evaluations (over 67,000 evaluations) on state-of-the-art deep learning models, classic supervised methods, and real-world systems. Almost all models showed significant performance degradation, including a decline of more than 50% of BERT's prediction accuracy on tasks such as aspect-level sentiment classification, named entity recognition, and natural language inference. Therefore, we call for the robustness to be included in the model evaluation, so as to promote the healthy development of NLP technology. △ Less

Submitted 5 May, 2021; v1 submitted 21 March, 2021; originally announced March 2021.

arXiv:2103.08200 [pdf, other]

Mention-centered Graph Neural Network for Document-level Relation Extraction

Authors: Jiaxin Pan, Min Peng, Yiyan Zhang

Abstract: Document-level relation extraction aims to discover relations between entities across a whole document. How to build the dependency of entities from different sentences in a document remains to be a great challenge. Current approaches either leverage syntactic trees to construct document-level graphs or aggregate inference information from different sentences. In this paper, we build cross-sentenc… ▽ More Document-level relation extraction aims to discover relations between entities across a whole document. How to build the dependency of entities from different sentences in a document remains to be a great challenge. Current approaches either leverage syntactic trees to construct document-level graphs or aggregate inference information from different sentences. In this paper, we build cross-sentence dependencies by inferring compositional relations between inter-sentence mentions. Adopting aggressive linking strategy, intermediate relations are reasoned on the document-level graphs by mention convolution. We further notice the generalization problem of NA instances, which is caused by incomplete annotation and worsened by fully-connected mention pairs. An improved ranking loss is proposed to attend this problem. Experiments show the connections between different mentions are crucial to document-level relation extraction, which enables the model to extract more meaningful higher-level compositional relations. △ Less

Submitted 15 March, 2021; originally announced March 2021.

arXiv:2101.04750 [pdf, other]

Linear Representation Meta-Reinforcement Learning for Instant Adaptation

Authors: Matt Peng, Banghua Zhu, Jiantao Jiao

Abstract: This paper introduces Fast Linearized Adaptive Policy (FLAP), a new meta-reinforcement learning (meta-RL) method that is able to extrapolate well to out-of-distribution tasks without the need to reuse data from training, and adapt almost instantaneously with the need of only a few samples during testing. FLAP builds upon the idea of learning a shared linear representation of the policy so that whe… ▽ More This paper introduces Fast Linearized Adaptive Policy (FLAP), a new meta-reinforcement learning (meta-RL) method that is able to extrapolate well to out-of-distribution tasks without the need to reuse data from training, and adapt almost instantaneously with the need of only a few samples during testing. FLAP builds upon the idea of learning a shared linear representation of the policy so that when adapting to a new task, it suffices to predict a set of linear weights. A separate adapter network is trained simultaneously with the policy such that during adaptation, we can directly use the adapter network to predict these linear weights instead of updating a meta-policy via gradient descent, such as in prior meta-RL methods like MAML, to obtain the new policy. The application of the separate feed-forward network not only speeds up the adaptation run-time significantly, but also generalizes extremely well to very different tasks that prior Meta-RL methods fail to generalize to. Experiments on standard continuous-control meta-RL benchmarks show FLAP presenting significantly stronger performance on out-of-distribution tasks with up to double the average return and up to 8X faster adaptation run-time speeds when compared to prior methods. △ Less

Submitted 12 January, 2021; originally announced January 2021.

arXiv:2012.07311 [pdf, other]

Topic-Oriented Spoken Dialogue Summarization for Customer Service with Saliency-Aware Topic Modeling

Authors: Yicheng Zou, Lujun Zhao, Yangyang Kang, Jun Lin, Minlong Peng, Zhuoren Jiang, Changlong Sun, Qi Zhang, Xuan**g Huang, Xiaozhong Liu

Abstract: In a customer service system, dialogue summarization can boost service efficiency by automatically creating summaries for long spoken dialogues in which customers and agents try to address issues about specific topics. In this work, we focus on topic-oriented dialogue summarization, which generates highly abstractive summaries that preserve the main ideas from dialogues. In spoken dialogues, abund… ▽ More In a customer service system, dialogue summarization can boost service efficiency by automatically creating summaries for long spoken dialogues in which customers and agents try to address issues about specific topics. In this work, we focus on topic-oriented dialogue summarization, which generates highly abstractive summaries that preserve the main ideas from dialogues. In spoken dialogues, abundant dialogue noise and common semantics could obscure the underlying informative content, making the general topic modeling approaches difficult to apply. In addition, for customer service, role-specific information matters and is an indispensable part of a summary. To effectively perform topic modeling on dialogues and capture multi-role information, in this work we propose a novel topic-augmented two-stage dialogue summarizer (TDS) jointly with a saliency-aware neural topic model (SATM) for topic-oriented summarization of customer service dialogues. Comprehensive studies on a real-world Chinese customer service dataset demonstrated the superiority of our method against several strong baselines. △ Less

Submitted 25 June, 2021; v1 submitted 14 December, 2020; originally announced December 2020.

Comments: Accepted by AAAI 2021, 9 pages

arXiv:2011.08402 [pdf, other]

doi 10.1109/TCI.2021.3076887

Time-Resolved Focused Ion Beam Microscopy: Modeling, Estimation Methods, and Analyses

Authors: Minxu Peng, John Murray-Bruce, Vivek K Goyal

Abstract: In a focused ion beam (FIB) microscope, source particles interact with a small volume of a sample to generate secondary electrons that are detected, pixel by pixel, to produce a micrograph. Randomness of the number of incident particles causes excess variation in the micrograph, beyond the variation in the underlying particle-sample interaction. We recently demonstrated that joint processing of mu… ▽ More In a focused ion beam (FIB) microscope, source particles interact with a small volume of a sample to generate secondary electrons that are detected, pixel by pixel, to produce a micrograph. Randomness of the number of incident particles causes excess variation in the micrograph, beyond the variation in the underlying particle-sample interaction. We recently demonstrated that joint processing of multiple time-resolved measurements from a single pixel can mitigate this effect of source shot noise in helium ion microscopy. This paper is focused on establishing a rigorous framework for understanding the potential for this approach. It introduces idealized continuous- and discrete-time abstractions of FIB microscopy with direct electron detection and estimation-theoretic limits of imaging performance under these measurement models. Novel estimators for use with continuous-time measurements are introduced and analyzed, and estimators for use with discrete-time measurements are analyzed and shown to approach their continuous-time counterparts as time resolution is increased. Simulated FIB microscopy results are consistent with theoretical analyses and demonstrate that substantial improvements over conventional FIB microscopy image formation are made possible by time-resolved measurement. △ Less

Submitted 18 February, 2022; v1 submitted 16 November, 2020; originally announced November 2020.

Comments: Accepted at IEEE Transactions on Computational Imaging (2021), Volume 7, Pages 547-561

Journal ref: IEEE Transactions on Computational Imaging, vol. 7, pp. 547-561, 2021

arXiv:2010.13624 [pdf]

Wind Power Transmission System Integration -- a Case Study of China Wind Power Base

Authors: Jianxue Wang, Shutang You, Xingzhong Bai, Mingqiao Peng

Abstract: Due to a series of supporting policies in recent years, China wind power has developed rapidly through a large-scale and centralized mode. This paper analyzes the two major concerns faced by wind power development in China: wind generation reliability and wind energy balancing. More specifically, wind farm trip**-off-grid incidents and wind power curtailment issues, which caused huge economical… ▽ More Due to a series of supporting policies in recent years, China wind power has developed rapidly through a large-scale and centralized mode. This paper analyzes the two major concerns faced by wind power development in China: wind generation reliability and wind energy balancing. More specifically, wind farm trip**-off-grid incidents and wind power curtailment issues, which caused huge economical loss, are investigated in details. Based on operation experience of large wind power bases, technical recommendations and economic incentives are proposed to improve wind power integration and power grid reliability. As a summary and outlook of wind power development in China, this paper provides a reference on future wind power development for other countries. △ Less

Submitted 10 April, 2021; v1 submitted 22 October, 2020; originally announced October 2020.

Comments: 21 pages, 6 figures

arXiv:2010.08116 [pdf]

doi 10.1063/5.0027718

Four-Fluid Axisymmetric Plasma Equilibrium Model Including Relativistic Electrons and Computational Method and Results

Authors: Akio Ishida, Y. -K. Martin Peng, Wenjun Liu

Abstract: A non-relativistic multi-fluid plasma axisymmetric equilibrium model was developed recently to account for the presence of an energetic electron fluid in addition to thermal electron and ion fluids. The equilibrium formulation of a multi-fluid plasma with relativistic energetic electrons is developed and reported in this paper. Relativistic effects in a fluid model approximation can appear in two… ▽ More A non-relativistic multi-fluid plasma axisymmetric equilibrium model was developed recently to account for the presence of an energetic electron fluid in addition to thermal electron and ion fluids. The equilibrium formulation of a multi-fluid plasma with relativistic energetic electrons is developed and reported in this paper. Relativistic effects in a fluid model approximation can appear in two ways: due to a large macroscopic fluid velocity comparable to the speed of light and large particle's microscopic random motion which becomes significant if the temperature becomes comparable to or larger than the electron rest mass-energy. It is found that the axial component of relativistic generalized angular momentum can be used to describe relativistic axisymmetric equilibrium. The formulation is applied to a four-fluid plasma composed of a relativistic energetic electron fluid, a thermal electron fluid, and fluids of two thermal ion species (e.g. proton and boron ions). The four-fluid density expression which is consistent with the electrostatic potential is obtained and applied in the computation. An example equilibrium approximating a four-fluid plasma recently observed in a solenoid-free ECRH sustained spherical torus plasma is calculated and presented. A second equilibrium that extends the energetic electron temperature of the first example to 679keV is calculated revealing significant relativistic effects. △ Less

Submitted 19 February, 2021; v1 submitted 15 October, 2020; originally announced October 2020.

Comments: 31pages,13figures,3 tables

arXiv:2009.14722 [pdf, other]

RDSGAN: Rank-based Distant Supervision Relation Extraction with Generative Adversarial Framework

Authors: Guoqing Luo, Jiaxin Pan, Min Peng

Abstract: Distant supervision has been widely used for relation extraction but suffers from noise labeling problem. Neural network models are proposed to denoise with attention mechanism but cannot eliminate noisy data due to its non-zero weights. Hard decision is proposed to remove wrongly-labeled instances from the positive set though causes loss of useful information contained in removed instances. In th… ▽ More Distant supervision has been widely used for relation extraction but suffers from noise labeling problem. Neural network models are proposed to denoise with attention mechanism but cannot eliminate noisy data due to its non-zero weights. Hard decision is proposed to remove wrongly-labeled instances from the positive set though causes loss of useful information contained in removed instances. In this paper, we propose a novel generative neural framework named RDSGAN (Rank-based Distant Supervision GAN) which automatically generates valid instances for distant supervision relation extraction. Our framework combines soft attention and hard decision to learn the distribution of true positive instances via adversarial training and selects valid instances conforming to the distribution via rank-based distant supervision, which addresses the false positive problem. Experimental results show the superiority of our framework over strong baselines. △ Less

Submitted 30 September, 2020; originally announced September 2020.

arXiv:2009.09179 [pdf]

Recognizing Micro-Expression in Video Clip with Adaptive Key-Frame Mining

Authors: Min Peng, Chongyang Wang, Yuan Gao, Tao Bi, Tong Chen, Yu Shi, Xiang-Dong Zhou

Abstract: As a spontaneous expression of emotion on face, micro-expression reveals the underlying emotion that cannot be controlled by human. In micro-expression, facial movement is transient and sparsely localized through time. However, the existing representation based on various deep learning techniques learned from a full video clip is usually redundant. In addition, methods utilizing the single apex fr… ▽ More As a spontaneous expression of emotion on face, micro-expression reveals the underlying emotion that cannot be controlled by human. In micro-expression, facial movement is transient and sparsely localized through time. However, the existing representation based on various deep learning techniques learned from a full video clip is usually redundant. In addition, methods utilizing the single apex frame of each video clip require expert annotations and sacrifice the temporal dynamics. To simultaneously localize and recognize such fleeting facial movements, we propose a novel end-to-end deep learning architecture, referred to as adaptive key-frame mining network (AKMNet). Operating on the video clip of micro-expression, AKMNet is able to learn discriminative spatio-temporal representation by combining spatial features of self-learned local key frames and their global-temporal dynamics. Theoretical analysis and empirical evaluation show that the proposed approach improved recognition accuracy in comparison with state-of-the-art methods on multiple benchmark datasets. △ Less

Submitted 15 March, 2021; v1 submitted 19 September, 2020; originally announced September 2020.

Comments: Submitted for Review in IEEE Transactions on Multimedia

arXiv:2009.06397 [pdf, other]

Optimal Resource Allocation for Delay Minimization in NOMA-MEC Networks

Authors: Fang Fang, Yanqing Xu, Zhiguo Ding, Chao Shen, Mugen Peng, George K. Karagiannidis

Abstract: Multi-access edge computing (MEC) can enhance the computing capability of mobile devices, while non-orthogonal multiple access (NOMA) can provide high data rates. Combining these two strategies can effectively benefit the network with spectrum and energy efficiency. In this paper, we investigate the task delay minimization in multi-user NOMA-MEC networks, where multiple users can offload their tas… ▽ More Multi-access edge computing (MEC) can enhance the computing capability of mobile devices, while non-orthogonal multiple access (NOMA) can provide high data rates. Combining these two strategies can effectively benefit the network with spectrum and energy efficiency. In this paper, we investigate the task delay minimization in multi-user NOMA-MEC networks, where multiple users can offload their tasks simultaneously through the same frequency band. We adopt the partial offloading policy, in which each user can partition its computation task into offloading and locally computing parts. We aim to minimize the task delay among users by optimizing their tasks partition ratios and offloading transmit power. The delay minimization problem is first formulated, and it is shown that it is a nonconvex one. By carefully investigating its structure, we transform the original problem into an equivalent quasi-convex. In this way, a bisection search iterative algorithm is proposed in order to achieve the minimum task delay. To reduce the complexity of the proposed algorithm and evaluate its optimality, we further derive closed-form expressions for the optimal task partition ratio and offloading power for the case of two-user NOMA-MEC networks. Simulations demonstrate the convergence and optimality of the proposed algorithm and the effectiveness of the closed-form analysis. △ Less

Submitted 11 September, 2020; originally announced September 2020.

Comments: Accepted by IEEE Transactions on Communications 2020. arXiv admin note: substantial text overlap with arXiv:1904.12389

arXiv:2002.08419 [pdf, ps, other]

Mode Selection and Resource Allocation in Sliced Fog Radio Access Networks: A Reinforcement Learning Approach

Authors: Hongyu Xiang, Mugen Peng, Yaohua Sun, Shi Yan

Abstract: The mode selection and resource allocation in fog radio access networks (F-RANs) have been advocated as key techniques to improve spectral and energy efficiency. In this paper, we investigate the joint optimization of mode selection and resource allocation in uplink F-RANs, where both of the traditional user equipments (UEs) and fog UEs are served by constructed network slice instances. The concer… ▽ More The mode selection and resource allocation in fog radio access networks (F-RANs) have been advocated as key techniques to improve spectral and energy efficiency. In this paper, we investigate the joint optimization of mode selection and resource allocation in uplink F-RANs, where both of the traditional user equipments (UEs) and fog UEs are served by constructed network slice instances. The concerned optimization is formulated as a mixed-integer programming problem, and both the orthogonal and multiplexed subchannel allocation strategies are proposed to guarantee the slice isolation. Motivated by the development of machine learning, two reinforcement learning based algorithms are developed to solve the original high complexity problem under traditional and fog UEs' specific performance requirements. The basic idea of the proposals is to generate a good mode selection policy according to the immediate reward fed back by an environment. Simulation results validate the benefits of our proposed algorithms and show that a tradeoff between system power consumption and queue delay can be achieved. △ Less

Submitted 13 February, 2020; originally announced February 2020.

arXiv:2002.05485 [pdf, ps, other]

Deep Reinforcement Learning Based Mode Selection and Resource Allocation for Cellular V2X Communications

Authors: Xinran Zhang, Mugen Peng, Shi Yan, Yaohua Sun

Abstract: Cellular vehicle-to-everything (V2X) communication is crucial to support future diverse vehicular applications. However, for safety-critical applications, unstable vehicle-to-vehicle (V2V) links and high signalling overhead of centralized resource allocation approaches become bottlenecks. In this paper, we investigate a joint optimization problem of transmission mode selection and resource allocat… ▽ More Cellular vehicle-to-everything (V2X) communication is crucial to support future diverse vehicular applications. However, for safety-critical applications, unstable vehicle-to-vehicle (V2V) links and high signalling overhead of centralized resource allocation approaches become bottlenecks. In this paper, we investigate a joint optimization problem of transmission mode selection and resource allocation for cellular V2X communications. In particular, the problem is formulated as a Markov decision process, and a deep reinforcement learning (DRL) based decentralized algorithm is proposed to maximize the sum capacity of vehicle-to-infrastructure users while meeting the latency and reliability requirements of V2V pairs. Moreover, considering training limitation of local DRL models, a two-timescale federated DRL algorithm is developed to help obtain robust model. Wherein, the graph theory based vehicle clustering algorithm is executed on a large timescale and in turn the federated learning algorithm is conducted on a small timescale. Simulation results show that the proposed DRL-based algorithm outperforms other decentralized baselines, and validate the superiority of the two-timescale federated DRL algorithm for newly activated V2V pairs. △ Less

Submitted 13 February, 2020; originally announced February 2020.

Comments: 12 pages, 11 figures, accepted by IEEE IoT Journal

arXiv:2002.05437 [pdf, ps, other]

Tradeoff between Ergodic Rate and Delivery Latency in Fog Radio Access Networks

Authors: Bonan Yin, Mugen Peng, Shi Yan, Chun**g Hu

Abstract: Wireless content caching has recently been considered as an efficient way in fog radio access networks (FRANs) to alleviate the heavy burden on capacity-limited fronthaul links and reduce delivery latency. In this paper, an advanced minimal delay association policy is proposed to minimize latency while guaranteeing spectral efficiency in F-RANs. By utilizing stochastic geometry and queueing theory… ▽ More Wireless content caching has recently been considered as an efficient way in fog radio access networks (FRANs) to alleviate the heavy burden on capacity-limited fronthaul links and reduce delivery latency. In this paper, an advanced minimal delay association policy is proposed to minimize latency while guaranteeing spectral efficiency in F-RANs. By utilizing stochastic geometry and queueing theory, closed-form expressions of successful delivery probability, average ergodic rate, and average delivery latency are derived, where both the traditional association policy based on accessing the base station with maximal received power and the proposed minimal delay association policy are concerned. Impacts of key operating parameters on the aforementioned performance metrics are exploited. It is shown that the proposed association policy has a better delivery latency than the traditional association policy. Increasing the cache size of fog-computing based access points (F-APs) can more significantly reduce average delivery latency, compared with increasing the density of F-APs. Meanwhile, the latter comes at the expense of decreasing average ergodic rate. This implies the deployment of large cache size at F-APs rather than high density of F-APs can promote performance effectively in F-RANs. △ Less

Submitted 13 February, 2020; originally announced February 2020.

Showing 51–100 of 175 results for author: Peng, M