-
Nonreciprocal slow or fast light in anti-$\mathcal{PT}$-symmetric optomechanics
Authors:
Meiyu Peng,
Huilai Zhang,
Qian Zhang,
Tian-Xiang Lu,
Imran M. Mirza,
Hui **g
Abstract:
Non-Hermitian systems with anti-parity-time ($\mathcal{APT}$) symmetry have revealed rich physics beyond conventional systems. Here, we study optomechanics in an $\mathcal{APT}$-symmetric spinning resonator and show that, by tuning the rotating speed to approach the exceptional point (EP) or the non-Hermitian spectral degeneracy, nonreciprocal light transmission with a high isolation ratio can be…
▽ More
Non-Hermitian systems with anti-parity-time ($\mathcal{APT}$) symmetry have revealed rich physics beyond conventional systems. Here, we study optomechanics in an $\mathcal{APT}$-symmetric spinning resonator and show that, by tuning the rotating speed to approach the exceptional point (EP) or the non-Hermitian spectral degeneracy, nonreciprocal light transmission with a high isolation ratio can be realized. Accompanying this process, nonreciprocal group delay or advance is also identified in the vicinity of EP. Our work sheds new light on manipulating laser propagation with optomechanical EP devices and, in a broader view, can be extended to explore a wide range of $\mathcal{APT}$-symmetric effects, such as $\mathcal{APT}$-symmetric phonon lasers, $\mathcal{APT}$-symmetric topological effects, and $\mathcal{APT}$-symmetric force sensing or accelerator.
△ Less
Submitted 27 February, 2023;
originally announced February 2023.
-
Using the Sun and the Moon as Source masses and the Earth's Rotation as a Modulation to Search for Exotic Spin-Dependent Interactions at Astronomical Distances
Authors:
L. Y. Wu,
K. Y. Zhang,
M. Peng,
J. Gong,
H. Yan
Abstract:
Exotic spin-dependent interactions mediated by new light particles led to solutions to several important questions in modern physics. Such interactions involving a scalar coupling $g_S^N$ at one vertex and a pseudo-scalar coupling $g_P^n$ at the polarized neutron vertex can be induced by the exchange of spin-0 bosons, or a vector/axial-vector coupling $g_V^N$/$g_A^N$ at one vertex and an axial-vec…
▽ More
Exotic spin-dependent interactions mediated by new light particles led to solutions to several important questions in modern physics. Such interactions involving a scalar coupling $g_S^N$ at one vertex and a pseudo-scalar coupling $g_P^n$ at the polarized neutron vertex can be induced by the exchange of spin-0 bosons, or a vector/axial-vector coupling $g_V^N$/$g_A^N$ at one vertex and an axial-vector coupling $g_A^n$ at the polarized neutron vertex can be induced by the exchange of spin-1 bosons. If such new interactions exist, the Sun and the Moon can induce sidereal variations of effective fields along the direction perpendicular to the Earth's rotation axis.
We derived new experimental upper limits on such exotic spin-dependent interactions at astronomical interaction ranges by analyzing existing data from laboratory measurements on the Lorentz and CPT violation. We set the most stringent experimental limits on $g_S^Ng_P^n$ ranging from $\sim 2\times 10^{10}$m to $\sim 10^{14}$m. Previously, the best limit on $g_S^Ng_P^n$ at this range is from astrophysics. The result is the first time laboratory limits surpass the astrophysical ones on the scalar-pseudoscalar type interaction, to our best knowledge. We report new constraints on vector-axial-vector and axial-axial-vector type interaction at the range of astronomical scales. The new limits on vector-axial-vector are improved by as much as $\sim$12 orders of magnitude.
We also apply the analysis to the Hari-Dass interactions and obtain corresponding new constraints on the interactions. We discuss the possibilities of using the beam method to further search the interaction involving other particles, such as electrons, muons, etc., based on the same idea.
△ Less
Submitted 15 June, 2023; v1 submitted 15 February, 2023;
originally announced February 2023.
-
Multi-Carrier NOMA-Empowered Wireless Federated Learning with Optimal Power and Bandwidth Allocation
Authors:
Weicai Li,
Tiejun Lv,
Yashuai Cao,
Wei Ni,
Mugen Peng
Abstract:
Wireless federated learning (WFL) undergoes a communication bottleneck in uplink, limiting the number of users that can upload their local models in each global aggregation round. This paper presents a new multi-carrier non-orthogonal multiple-access (MC-NOMA)-empowered WFL system under an adaptive learning setting of Flexible Aggregation. Since a WFL round accommodates both local model training a…
▽ More
Wireless federated learning (WFL) undergoes a communication bottleneck in uplink, limiting the number of users that can upload their local models in each global aggregation round. This paper presents a new multi-carrier non-orthogonal multiple-access (MC-NOMA)-empowered WFL system under an adaptive learning setting of Flexible Aggregation. Since a WFL round accommodates both local model training and uploading for each user, the use of Flexible Aggregation allows the users to train different numbers of iterations per round, adapting to their channel conditions and computing resources. The key idea is to use MC-NOMA to concurrently upload the local models of the users, thereby extending the local model training times of the users and increasing participating users. A new metric, namely, Weighted Global Proportion of Trained Mini-batches (WGPTM), is analytically established to measure the convergence of the new system. Another important aspect is that we maximize the WGPTM to harness the convergence of the new system by jointly optimizing the transmit powers and subchannel bandwidths. This nonconvex problem is converted equivalently to a tractable convex problem and solved efficiently using variable substitution and Cauchy's inequality. As corroborated experimentally using a convolutional neural network and an 18-layer residential network, the proposed MC-NOMA WFL can efficiently reduce communication delay, increase local model training times, and accelerate the convergence by over 40%, compared to its existing alternative.
△ Less
Submitted 13 February, 2023;
originally announced February 2023.
-
Efficient End-to-End Video Question Answering with Pyramidal Multimodal Transformer
Authors:
Min Peng,
Chongyang Wang,
Yu Shi,
Xiang-Dong Zhou
Abstract:
This paper presents a new method for end-to-end Video Question Answering (VideoQA), aside from the current popularity of using large-scale pre-training with huge feature extractors. We achieve this with a pyramidal multimodal transformer (PMT) model, which simply incorporates a learnable word embedding layer, a few convolutional and transformer layers. We use the anisotropic pyramid to fulfill vid…
▽ More
This paper presents a new method for end-to-end Video Question Answering (VideoQA), aside from the current popularity of using large-scale pre-training with huge feature extractors. We achieve this with a pyramidal multimodal transformer (PMT) model, which simply incorporates a learnable word embedding layer, a few convolutional and transformer layers. We use the anisotropic pyramid to fulfill video-language interactions across different spatio-temporal scales. In addition to the canonical pyramid, which includes both bottom-up and top-down pathways with lateral connections, novel strategies are proposed to decompose the visual feature stream into spatial and temporal sub-streams at different scales and implement their interactions with the linguistic semantics while preserving the integrity of local and global semantics. We demonstrate better or on-par performances with high computational efficiency against state-of-the-art methods on five VideoQA benchmarks. Our ablation study shows the scalability of our model that achieves competitive results for text-to-video retrieval by leveraging feature extractors with reusable pre-trained weights, and also the effectiveness of the pyramid.
△ Less
Submitted 5 March, 2023; v1 submitted 4 February, 2023;
originally announced February 2023.
-
Khaos: The Impact of Inter-procedural Code Obfuscation on Binary Diffing Techniques
Authors:
Peihua Zhang,
Chenggang Wu,
Mingfan Peng,
Kai Zeng,
Ding Yu,
Yuanming Lai,
Yan Kang,
Wei Wang,
Zhe Wang
Abstract:
Software obfuscation techniques can prevent binary diffing techniques from locating vulnerable code by obfuscating the third-party code, to achieve the purpose of protecting embedded device software. With the rapid development of binary diffing techniques, they can achieve more and more accurate function matching and identification by extracting the features within the function. This makes existin…
▽ More
Software obfuscation techniques can prevent binary diffing techniques from locating vulnerable code by obfuscating the third-party code, to achieve the purpose of protecting embedded device software. With the rapid development of binary diffing techniques, they can achieve more and more accurate function matching and identification by extracting the features within the function. This makes existing software obfuscation techniques, which mainly focus on the intra-procedural code obfuscation, no longer effective.
In this paper, we propose a new inter-procedural code obfuscation mechanism Khaos, which moves the code across functions to obfuscate the function by using compilation optimizations. Two obfuscation primitives are proposed to separate and aggregate the function, which are called fission and fusion respectively. A prototype of Khaos is implemented based on the LLVM compiler and evaluated on a large number of real-world programs including SPEC CPU 2006 & 2017, CoreUtils, JavaScript engines, etc. Experimental results show that Khaos outperforms existing code obfuscations and can significantly reduce the accuracy rates of five state-of-the-art binary diffing techniques (less than 19%) with lower runtime overhead (less than 7%).
△ Less
Submitted 27 January, 2023;
originally announced January 2023.
-
Select and Trade: Towards Unified Pair Trading with Hierarchical Reinforcement Learning
Authors:
Weiguang Han,
Boyi Zhang,
Qianqian Xie,
Min Peng,
Yanzhao Lai,
Jimin Huang
Abstract:
Pair trading is one of the most effective statistical arbitrage strategies which seeks a neutral profit by hedging a pair of selected assets. Existing methods generally decompose the task into two separate steps: pair selection and trading. However, the decoupling of two closely related subtasks can block information propagation and lead to limited overall performance. For pair selection, ignoring…
▽ More
Pair trading is one of the most effective statistical arbitrage strategies which seeks a neutral profit by hedging a pair of selected assets. Existing methods generally decompose the task into two separate steps: pair selection and trading. However, the decoupling of two closely related subtasks can block information propagation and lead to limited overall performance. For pair selection, ignoring the trading performance results in the wrong assets being selected with irrelevant price movements, while the agent trained for trading can overfit to the selected assets without any historical information of other assets. To address it, in this paper, we propose a paradigm for automatic pair trading as a unified task rather than a two-step pipeline. We design a hierarchical reinforcement learning framework to jointly learn and optimize two subtasks. A high-level policy would select two assets from all possible combinations and a low-level policy would then perform a series of trading actions. Experimental results on real-world stock data demonstrate the effectiveness of our method on pair trading compared with both existing pair selection and trading methods.
△ Less
Submitted 5 February, 2023; v1 submitted 25 January, 2023;
originally announced January 2023.
-
Performance assessment of helicon wave heating and current drive in EXL-50 spherical torus plasmas
Authors:
G. J. Qiao,
D. Luo,
S. D. Song,
J. Q. Dong,
Y. J. Shi,
J. C. Li,
D. Du,
Y. K. Martin Peng,
M. S. Liu,
EXL-50 team
Abstract:
Analysis of helicon wave heating and current drive capability in EXL-50 spherical torus plasmas has been conducted. It is found that the driven current increases with the launched parallel refractive index $n_{||}$ and peaks around $n_{||} = 4.0$ when the frequency of the helicon wave is between 300~MHz and 380~MHz. The helicon wave current drive efficiency shows a relatively stable upward trend w…
▽ More
Analysis of helicon wave heating and current drive capability in EXL-50 spherical torus plasmas has been conducted. It is found that the driven current increases with the launched parallel refractive index $n_{||}$ and peaks around $n_{||} = 4.0$ when the frequency of the helicon wave is between 300~MHz and 380~MHz. The helicon wave current drive efficiency shows a relatively stable upward trend with increasing plasma temperature. Moreover, the driven current decreases as the plasma density increases. We also analyzed the current drive with helicon waves of 150~MHz and 170~MHz and found that the driven current at a lower frequency was lower than that at a higher frequency. A positive proportional relationship exists between the driven current and $n_{||}$. Besides, as $n_{||}$ increases, the profile of the driven current becomes wider. Finally, the effect of the scrape-off layer (SOL) region on the helicon wave current drive was also investigated.
△ Less
Submitted 17 December, 2022;
originally announced December 2022.
-
Improving Semantic Matching through Dependency-Enhanced Pre-trained Model with Adaptive Fusion
Authors:
Jian Song,
Di Liang,
Rumei Li,
Yuntao Li,
Sirui Wang,
Minlong Peng,
Wei Wu,
Yongxin Yu
Abstract:
Transformer-based pre-trained models like BERT have achieved great progress on Semantic Sentence Matching. Meanwhile, dependency prior knowledge has also shown general benefits in multiple NLP tasks. However, how to efficiently integrate dependency prior structure into pre-trained models to better model complex semantic matching relations is still unsettled. In this paper, we propose the \textbf{D…
▽ More
Transformer-based pre-trained models like BERT have achieved great progress on Semantic Sentence Matching. Meanwhile, dependency prior knowledge has also shown general benefits in multiple NLP tasks. However, how to efficiently integrate dependency prior structure into pre-trained models to better model complex semantic matching relations is still unsettled. In this paper, we propose the \textbf{D}ependency-Enhanced \textbf{A}daptive \textbf{F}usion \textbf{A}ttention (\textbf{DAFA}), which explicitly introduces dependency structure into pre-trained models and adaptively fuses it with semantic information. Specifically, \textbf{\emph{(i)}} DAFA first proposes a structure-sensitive paradigm to construct a dependency matrix for calibrating attention weights. It adopts an adaptive fusion module to integrate the obtained dependency information and the original semantic signals. Moreover, DAFA reconstructs the attention calculation flow and provides better interpretability. By applying it on BERT, our method achieves state-of-the-art or competitive performance on 10 public datasets, demonstrating the benefits of adaptively fusing dependency structure in semantic matching task.
△ Less
Submitted 24 August, 2023; v1 submitted 16 October, 2022;
originally announced October 2022.
-
SMiLE: Schema-augmented Multi-level Contrastive Learning for Knowledge Graph Link Prediction
Authors:
Miao Peng,
Ben Liu,
Qianqian Xie,
Wenjie Xu,
Hua Wang,
Min Peng
Abstract:
Link prediction is the task of inferring missing links between entities in knowledge graphs. Embedding-based methods have shown effectiveness in addressing this problem by modeling relational patterns in triples. However, the link prediction task often requires contextual information in entity neighborhoods, while most existing embedding-based methods fail to capture it. Additionally, little atten…
▽ More
Link prediction is the task of inferring missing links between entities in knowledge graphs. Embedding-based methods have shown effectiveness in addressing this problem by modeling relational patterns in triples. However, the link prediction task often requires contextual information in entity neighborhoods, while most existing embedding-based methods fail to capture it. Additionally, little attention is paid to the diversity of entity representations in different contexts, which often leads to false prediction results. In this situation, we consider that the schema of knowledge graph contains the specific contextual information, and it is beneficial for preserving the consistency of entities across contexts. In this paper, we propose a novel Schema-augmented Multi-level contrastive LEarning framework (SMiLE) to conduct knowledge graph link prediction. Specifically, we first exploit network schema as the prior constraint to sample negatives and pre-train our model by employing a multi-level contrastive learning method to yield both prior schema and contextual information. Then we fine-tune our model under the supervision of individual triples to learn subtler representations for link prediction. Extensive experimental results on four knowledge graph datasets with thorough analysis of each component demonstrate the effectiveness of our proposed framework against state-of-the-art baselines. The implementation of SMiLE is available at https://github.com/GKNL/SMiLE.
△ Less
Submitted 3 March, 2024; v1 submitted 10 October, 2022;
originally announced October 2022.
-
Denoising Particle Beam Micrographs with Plug-and-Play Methods
Authors:
Minxu Peng,
Ruangrawee Kitichotkul,
Sheila W. Seidel,
Christopher Yu,
Vivek K Goyal
Abstract:
In a particle beam microscope, a raster-scanned focused beam of particles interacts with a sample to generate a secondary electron (SE) signal pixel by pixel. Conventionally formed micrographs are noisy because of limitations on acquisition time and dose. Recent work has shown that estimation methods applicable to a time-resolved measurement paradigm can greatly reduce noise, but these methods app…
▽ More
In a particle beam microscope, a raster-scanned focused beam of particles interacts with a sample to generate a secondary electron (SE) signal pixel by pixel. Conventionally formed micrographs are noisy because of limitations on acquisition time and dose. Recent work has shown that estimation methods applicable to a time-resolved measurement paradigm can greatly reduce noise, but these methods apply pixel by pixel without exploiting image structure. Raw SE count data can be modeled with a compound Poisson (Neyman Type A) likelihood, which implies data variance that is signal-dependent and greater than the variation in the underlying particle-sample interaction. These statistical properties make methods that assume additive white Gaussian noise ineffective. This paper introduces methods for particle beam micrograph denoising that use the plug-and-play framework to exploit image structure while being applicable to the unusual data likelihoods of this modality. Approximations of the data likelihood that vary in accuracy and computational complexity are combined with denoising by total variation regularization, BM3D, and DnCNN. Methods are provided for both conventional and time-resolved measurements, assuming SE counts are available. In simulations representative of helium ion microscopy and scanning electron microscopy, significant improvements in root mean-squared error (RMSE), structural similarity index measure (SSIM), and qualitative appearance are obtained. Average reductions in RMSE are by factors ranging from 2.24 to 4.11.
△ Less
Submitted 3 May, 2023; v1 submitted 30 August, 2022;
originally announced August 2022.
-
NL2GDPR: Automatically Develop GDPR Compliant Android Application Features from Natural Language
Authors:
Faysal Hossain Shezan,
Yingjie Lao,
Minlong Peng,
Xin Wang,
Mingming Sun,
** Li
Abstract:
The recent privacy leakage incidences and the more strict policy regulations demand a much higher standard of compliance for companies and mobile apps. However, such obligations also impose significant challenges on app developers for complying with these regulations that contain various perspectives, activities, and roles, especially for small companies and developers who are less experienced in…
▽ More
The recent privacy leakage incidences and the more strict policy regulations demand a much higher standard of compliance for companies and mobile apps. However, such obligations also impose significant challenges on app developers for complying with these regulations that contain various perspectives, activities, and roles, especially for small companies and developers who are less experienced in this matter or with limited resources. To address these hurdles, we develop an automatic tool, NL2GDPR, which can generate policies from natural language descriptions from the developer while also ensuring the app's functionalities are compliant with General Data Protection Regulation (GDPR). NL2GDPR is developed by leveraging an information extraction tool, OIA (Open Information Annotation), developed by Baidu Cognitive Computing Lab.
At the core, NL2GDPR is a privacy-centric information extraction model, appended with a GDPR policy finder and a policy generator. We perform a comprehensive study to grasp the challenges in extracting privacy-centric information and generating privacy policies, while exploiting optimizations for this specific task. With NL2GDPR, we can achieve 92.9%, 95.2%, and 98.4% accuracy in correctly identifying GDPR policies related to personal data storage, process, and share types, respectively. To the best of our knowledge, NL2GDPR is the first tool that allows a developer to automatically generate GDPR compliant policies, with only the need of entering the natural language for describing the app features. Note that other non-GDPR-related features might be integrated with the generated features to build a complex app.
△ Less
Submitted 29 August, 2022;
originally announced August 2022.
-
IRS-Based Integrated Location Sensing and Communication for mmWave SIMO Systems
Authors:
Xiaoling Hu,
Chenxi Liu,
Mugen Peng,
Caijun Zhong
Abstract:
In this paper, we establish an integrated sensing and communication (ISAC) system based on a distributed semi-passive intelligent reflecting surface (IRS), which allows location sensing and data transmission to be carried out simultaneously, sharing the same frequency and time resources. The detailed working process of the proposed IRS-based ISAC system is designed, including the transmission prot…
▽ More
In this paper, we establish an integrated sensing and communication (ISAC) system based on a distributed semi-passive intelligent reflecting surface (IRS), which allows location sensing and data transmission to be carried out simultaneously, sharing the same frequency and time resources. The detailed working process of the proposed IRS-based ISAC system is designed, including the transmission protocol, location sensing and beamforming optimization. Specifically, each coherence block consists of two periods, the ISAC period with two time blocks and the pure communication (PC) period. During each time block of the ISAC period, data transmission and user positioning are carried out simultaneously. The estimated user location in the first time block will be used for beamforming design in the second time block. During the PC period, only data transmission is conducted, by invoking the user location estimated in the second time block of the ISAC period for beamforming design. {\color{black}Simulation results show that a millimeter-level positioning accuracy can be achieved by the proposed location sensing scheme, demonstrating the advantage of the proposed IRS-based ISAC framework. Besides, the proposed two beamforming schemes based on the estimated location information achieve similar performance to the benchmark schemes assuming perfect channel state information (CSI), which verifies the effectiveness of beamforming design using sensed location information.
△ Less
Submitted 10 August, 2022;
originally announced August 2022.
-
IRS-Aided Non-Orthogonal ISAC Systems: Performance Analysis and Beamforming Design
Authors:
Zhouyuan Yu,
Xiaoling Hu,
Chenxi Liu,
Mugen Peng,
Caijun Zhong
Abstract:
Intelligent reflecting surface (IRS) has shown its effectiveness in facilitating orthogonal time-division integrated sensing and communications (TD-ISAC), in which the sensing task and the communication task occupy orthogonal time-frequency resources, while the role of IRS in the more interesting scenarios of non-orthogonal ISAC (NO-ISAC) systems has so far remained unclear. In this paper, we cons…
▽ More
Intelligent reflecting surface (IRS) has shown its effectiveness in facilitating orthogonal time-division integrated sensing and communications (TD-ISAC), in which the sensing task and the communication task occupy orthogonal time-frequency resources, while the role of IRS in the more interesting scenarios of non-orthogonal ISAC (NO-ISAC) systems has so far remained unclear. In this paper, we consider an IRS-aided NO-ISAC system, where a distributed IRS is deployed to assist concurrent communication and location sensing for a blind-zone user, occupying non-orthogonal/overlapped time-frequency resources. We first propose a modified Cramer-Rao lower bound (CRLB) to characterize the performances of both communication and location sensing in a unified manner. We further derive the closed-form expressions of the modified CRLB in our considered NO-ISAC system, enabling us to identify the fundamental trade-off between the communication and location sensing performances. In addition, by exploiting the modified CRLB, we propose a joint active and passive beamforming design algorithm that achieves a good communication and location sensing trade-off. Through numerical results, we demonstrate the superiority of the IRS-aided NO-ISAC systems over the IRS-aided TD-ISAC systems, in terms of both communication and localization performances. Besides, it is shown that the IRS-aided NO-ISAC system with random communication signals can achieve comparable localization performance to the IRS-aided localization system with dedicated positioning reference signals. Moreover, we investigate the trade-off between communication performance and localization performance and show how the performance of the NO-ISAC system can be significantly boosted by increasing the number of the IRS elements.
△ Less
Submitted 10 August, 2022;
originally announced August 2022.
-
Location Sensing and Beamforming Design for IRS-Enabled Multi-User ISAC Systems
Authors:
Zhouyuan Yu,
Xiaoling Hu,
Chenxi Liu,
Mugen Peng,
Caijun Zhong
Abstract:
This paper explores the potential of the intelligent reflecting surface (IRS) in realizing multi-user concurrent communication and localization, using the same time-frequency resources. Specifically, we propose an IRS-enabled multi-user integrated sensing and communication (ISAC) framework, where a distributed semi-passive IRS assists the uplink data transmission from multiple users to the base st…
▽ More
This paper explores the potential of the intelligent reflecting surface (IRS) in realizing multi-user concurrent communication and localization, using the same time-frequency resources. Specifically, we propose an IRS-enabled multi-user integrated sensing and communication (ISAC) framework, where a distributed semi-passive IRS assists the uplink data transmission from multiple users to the base station (BS) and conducts multi-user localization, simultaneously. We first design an ISAC transmission protocol, where the whole transmission period consists of two periods, i.e., the ISAC period for simultaneous uplink communication and multi-user localization, and the pure communication (PC) period for only uplink data transmission. For the ISAC period, we propose a multi-user location sensing algorithm, which utilizes the uplink communication signals unknown to the IRS, thus removing the requirement of dedicated positioning reference signals in conventional location sensing methods. Based on the sensed users' locations, we propose two novel beamforming algorithms for the ISAC period and PC period, respectively, which can work with discrete phase shifts and require no channel state information (CSI) acquisition. Numerical results show that the proposed multi-user location sensing algorithm can achieve up to millimeter-level positioning accuracy, indicating the advantage of the IRS-enabled ISAC framework. Moreover, the proposed beamforming algorithms with sensed location information and discrete phase shifts can achieve comparable performance to the benchmark considering perfect CSI acquisition and continuous phase shifts, demonstrating how the location information can ensure the communication performance.
△ Less
Submitted 10 August, 2022;
originally announced August 2022.
-
Minimax Optimal Online Imitation Learning via Replay Estimation
Authors:
Gokul Swamy,
Nived Rajaraman,
Matthew Peng,
Sanjiban Choudhury,
J. Andrew Bagnell,
Zhiwei Steven Wu,
Jiantao Jiao,
Kannan Ramchandran
Abstract:
Online imitation learning is the problem of how best to mimic expert demonstrations, given access to the environment or an accurate simulator. Prior work has shown that in the infinite sample regime, exact moment matching achieves value equivalence to the expert policy. However, in the finite sample regime, even if one has no optimization error, empirical variance can lead to a performance gap tha…
▽ More
Online imitation learning is the problem of how best to mimic expert demonstrations, given access to the environment or an accurate simulator. Prior work has shown that in the infinite sample regime, exact moment matching achieves value equivalence to the expert policy. However, in the finite sample regime, even if one has no optimization error, empirical variance can lead to a performance gap that scales with $H^2 / N$ for behavioral cloning and $H / \sqrt{N}$ for online moment matching, where $H$ is the horizon and $N$ is the size of the expert dataset. We introduce the technique of replay estimation to reduce this empirical variance: by repeatedly executing cached expert actions in a stochastic simulator, we compute a smoother expert visitation distribution estimate to match. In the presence of general function approximation, we prove a meta theorem reducing the performance gap of our approach to the parameter estimation error for offline classification (i.e. learning the expert policy). In the tabular setting or with linear function approximation, our meta theorem shows that the performance gap incurred by our approach achieves the optimal $\widetilde{O} \left( \min({H^{3/2}} / {N}, {H} / {\sqrt{N}} \right)$ dependency, under significantly weaker assumptions compared to prior work. We implement multiple instantiations of our approach on several continuous control tasks and find that we are able to significantly improve policy performance across a variety of dataset sizes.
△ Less
Submitted 14 January, 2023; v1 submitted 30 May, 2022;
originally announced May 2022.
-
Electromagnetic Composites: from Effective Medium Theories to Metamaterials
Authors:
Faxiang Qin,
Mengyue Peng,
Diana Estevez,
Christian Brosseau
Abstract:
Electromagnetic (EM) composites have stimulated tremendous fundamental and practical interests owing to their flexible electromagnetic properties and extensive potential engineering applications. Hence, it is necessary to systematically understand the physical mechanisms and design principles controlling EM composites. In this tutorial, we first provide an overview of the basic theory of electroma…
▽ More
Electromagnetic (EM) composites have stimulated tremendous fundamental and practical interests owing to their flexible electromagnetic properties and extensive potential engineering applications. Hence, it is necessary to systematically understand the physical mechanisms and design principles controlling EM composites. In this tutorial, we first provide an overview of the basic theory of electromagnetism about electromagnetic constitutive parameters that can represent the electromagnetic properties of materials. We show how this corpus allows a consistent construction of effective medium theories and allows for numerical simulation of EM composites to deal with structure-property relationships. We then discuss the influence of spatial dispersion of shaped inclusions in the material medium on the EM properties of composites, which has not been systematically illustrated in the context of this interdisciplinary topic. Next, artificial composites or metamaterials with peculiar properties not readily available in nature are highlighted with particular emphasis on the control of the EM interaction with composites. We conclude by discussing appropriate methods of electromagnetic measurement and practical aspects for implementing composites for specific applications are described. Overall, this tutorial will serve the purpose of introducing the basics and applications of electromagnetic composites to newcomers in this field. It is also anticipated that researchers from different backgrounds including materials science, optics, and electrical engineering can communicate to each other with the same language when dealing with this interdisciplinary subject and further push forward this advancement from fundamental science to technological applications.
△ Less
Submitted 14 May, 2022;
originally announced May 2022.
-
Multilevel Hierarchical Network with Multiscale Sampling for Video Question Answering
Authors:
Min Peng,
Chongyang Wang,
Yuan Gao,
Yu Shi,
Xiang-Dong Zhou
Abstract:
Video question answering (VideoQA) is challenging given its multimodal combination of visual understanding and natural language processing. While most existing approaches ignore the visual appearance-motion information at different temporal scales, it is unknown how to incorporate the multilevel processing capacity of a deep learning model with such multiscale information. Targeting these issues,…
▽ More
Video question answering (VideoQA) is challenging given its multimodal combination of visual understanding and natural language processing. While most existing approaches ignore the visual appearance-motion information at different temporal scales, it is unknown how to incorporate the multilevel processing capacity of a deep learning model with such multiscale information. Targeting these issues, this paper proposes a novel Multilevel Hierarchical Network (MHN) with multiscale sampling for VideoQA. MHN comprises two modules, namely Recurrent Multimodal Interaction (RMI) and Parallel Visual Reasoning (PVR). With a multiscale sampling, RMI iterates the interaction of appearance-motion information at each scale and the question embeddings to build the multilevel question-guided visual representations. Thereon, with a shared transformer encoder, PVR infers the visual cues at each level in parallel to fit with answering different question types that may rely on the visual information at relevant levels. Through extensive experiments on three VideoQA datasets, we demonstrate improved performances than previous state-of-the-arts and justify the effectiveness of each part of our method.
△ Less
Submitted 9 May, 2022;
originally announced May 2022.
-
Communication and Computation Assisted Sensing Information Freshness Performance Analysis in Vehicular Networks
Authors:
Ning Jiang,
Shi Yan,
Zhuohan Liu,
Chun**g Hu,
Mugen Peng
Abstract:
The timely sharing of raw sensing information in the vehicular networks (VNETs) is essential to safety. In order to improve the freshness of sensing information, joint scheduling of multi-dimensional resources such as communication and computation is required. However, the complex relevance among multi-dimensional resources is still unclear, and it is difficult to achieve efficient resource utiliz…
▽ More
The timely sharing of raw sensing information in the vehicular networks (VNETs) is essential to safety. In order to improve the freshness of sensing information, joint scheduling of multi-dimensional resources such as communication and computation is required. However, the complex relevance among multi-dimensional resources is still unclear, and it is difficult to achieve efficient resource utilization. In this paper, we present a theoretical analysis for a novel metric Age of Information (AoI) on a communication and computation assisted spatial-temporal model. An uplink VNETs scenario where Road Side Units (RSUs) are deployed with computational resource is considered. The transmission and computation process is unified into a two-stage tandem queue and the expression of the average AoI is derived. The network interference is analyzed by modeling the VNETs as Cox Poisson Point Process based on stochastic geometry and the closed-form solution of the coverage probability and the expected data rate performance under the constraints of transmission resources is obtained. The simulation results reveal the basic relationship between communication and computation capacity and show that communication and computation should reach a tradeoff to improve resource utilization while ensuring real-time information requirement.
△ Less
Submitted 12 April, 2022;
originally announced April 2022.
-
All-optical determination of one or two emitters using quantum polarization with nitrogen-vacancy centers in diamond
Authors:
Davin Yue Ming Peng,
Josef G. Worboys,
Qiang Sun,
Shuo Li,
Marco Capelli,
Shinobu Onoda,
Takeshi Ohshima,
Philipp Reineck,
Brant C. Gibson,
Andrew D. Greentree
Abstract:
Qubit technologies using nitrogen-vacancy color centers in diamonds require precise knowledge of the centers, including the number of emitters within a diffraction-limited spot and their orientations. However, the number of emitters is challenging to determine when there is finite background, which affects the precision of resulting quantum protocols. Here we show the photoluminescence (PL) intens…
▽ More
Qubit technologies using nitrogen-vacancy color centers in diamonds require precise knowledge of the centers, including the number of emitters within a diffraction-limited spot and their orientations. However, the number of emitters is challenging to determine when there is finite background, which affects the precision of resulting quantum protocols. Here we show the photoluminescence (PL) intensity and quantum correlation (Hanbury Brown and Twiss) measurements as a function of polarization for one- and two-emitter systems. The sample was made by implanting low concentrations of adenine (C5H5N5) into a low nitrogen chemical vapor deposition diamond. This approach yielded well-spaced regions with few nitrogen-vacancy centers. By map** the PL intensity and quantum correlation as a function of polarization, we can distinguish two emitter systems from single emitters with background, providing a method to quantify the background signal at implanted sites, which might be different from off-site background levels. This approach also provides a valuable new all-optical mechanism for the determination of one or two emitter systems useful for quantum sensing, communication, and computation tasks.
△ Less
Submitted 5 June, 2023; v1 submitted 30 March, 2022;
originally announced March 2022.
-
Label-Smoothed Backdoor Attack
Authors:
Minlong Peng,
Zidi Xiong,
Mingming Sun,
** Li
Abstract:
By injecting a small number of poisoned samples into the training set, backdoor attacks aim to make the victim model produce designed outputs on any input injected with pre-designed backdoors. In order to achieve a high attack success rate using as few poisoned training samples as possible, most existing attack methods change the labels of the poisoned samples to the target class. This practice of…
▽ More
By injecting a small number of poisoned samples into the training set, backdoor attacks aim to make the victim model produce designed outputs on any input injected with pre-designed backdoors. In order to achieve a high attack success rate using as few poisoned training samples as possible, most existing attack methods change the labels of the poisoned samples to the target class. This practice often results in severe over-fitting of the victim model over the backdoors, making the attack quite effective in output control but easier to be identified by human inspection or automatic defense algorithms.
In this work, we proposed a label-smoothing strategy to overcome the over-fitting problem of these attack methods, obtaining a \textit{Label-Smoothed Backdoor Attack} (LSBA). In the LSBA, the label of the poisoned sample $\bm{x}$ will be changed to the target class with a probability of $p_n(\bm{x})$ instead of 100\%, and the value of $p_n(\bm{x})$ is specifically designed to make the prediction probability the target class be only slightly greater than those of the other classes. Empirical studies on several existing backdoor attacks show that our strategy can considerably improve the stealthiness of these attacks and, at the same time, achieve a high attack success rate. In addition, our strategy makes it able to manually control the prediction probability of the design output through manipulating the applied and activated number of LSBAs\footnote{Source code will be published at \url{https://github.com/v-mipeng/LabelSmoothedAttack.git}}.
△ Less
Submitted 18 February, 2022;
originally announced February 2022.
-
Experimental study on edge energetic electrons in EXL-50 spherical torus
Authors:
Dong Guo,
Yuejiang Shi,
Wenjun Liu,
Yunyang Song,
Tiantian Sun,
Bing Liu,
Yingying Li,
Xiaorang Tian,
Guosong Zhang,
Huasheng Xie,
Y. K. Martin Peng,
Minsheng Liu
Abstract:
A significant number of confined energetic electrons have been observed outside the Last Closed Flux Surface (LCFS) of the solenoid-free, ECRH sustained plasmas in the EXL-50 spherical torus. Several diagnostics have been applied, for the first time, to investigate the key characters of energetic electrons. Experiments reveal the existence of high-temperature low density electrons, which can carry…
▽ More
A significant number of confined energetic electrons have been observed outside the Last Closed Flux Surface (LCFS) of the solenoid-free, ECRH sustained plasmas in the EXL-50 spherical torus. Several diagnostics have been applied, for the first time, to investigate the key characters of energetic electrons. Experiments reveal the existence of high-temperature low density electrons, which can carry relatively a large amount of the stored energy. The boundary between the thermal plasma and the energetic electron fluid appears to be clearly separated and the distance between the two boundaries can reach tens of centimeters (around the size of the minor radius of the thermal plasma). This implies that the Grad-Shafranov equilibrium is not suitable to describe the equilibrium of the EXL-50 plasma and a multi-fluid model is required. Particle dynamics simulations of full orbits show that energetic electrons can be well confined outside the LCFS. This is consistent with the experimental observations.
△ Less
Submitted 19 December, 2021;
originally announced December 2021.
-
Joint Sensing, Communication, and Computation Resource Allocation for Cooperative Perception in Fog-Based Vehicular Networks
Authors:
Xinran Zhang,
Zhimin He,
Yaohua Sun,
Shuo Yuan,
Mugen Peng
Abstract:
To enlarge the perception range and reliability of individual autonomous vehicles, cooperative perception has been received much attention. However, considering the high volume of shared messages, limited bandwidth and computation resources in vehicular networks become bottlenecks. In this paper, we investigate how to balance the volume of shared messages and constrained resources in fog-based veh…
▽ More
To enlarge the perception range and reliability of individual autonomous vehicles, cooperative perception has been received much attention. However, considering the high volume of shared messages, limited bandwidth and computation resources in vehicular networks become bottlenecks. In this paper, we investigate how to balance the volume of shared messages and constrained resources in fog-based vehicular networks. To this end, we first characterize sum satisfaction of cooperative perception taking account of its spatial-temporal value and latency performance. Next, the sensing block message, communication resource block, and computation resource are jointly allocated to maximize the sum satisfaction of cooperative perception, while satisfying the maximum latency and sojourn time constraints of vehicles. Owing to its non-convexity, we decouple the original problem into two separate sub-problems and devise corresponding solutions. Simulation results demonstrate that our proposed scheme can effectively boost the sum satisfaction of cooperative perception compared with existing baselines.
△ Less
Submitted 12 December, 2021;
originally announced December 2021.
-
Clarification of Basic Concepts for Electromagnetic Interference Shielding Effectiveness
Authors:
Mengyue Peng,
Faxiang Qin
Abstract:
There exists serious miscomprehension in the open literature about the electromagnetic interference shielding effectiveness (EMI SE) as a critical index to evaluate the shielding performance, which is misleading to the graduates and newcomers embarking on the field of electromagnetic shielding materials. EMI SE is defined as the sum of three terms including reflection loss, absorption loss and mul…
▽ More
There exists serious miscomprehension in the open literature about the electromagnetic interference shielding effectiveness (EMI SE) as a critical index to evaluate the shielding performance, which is misleading to the graduates and newcomers embarking on the field of electromagnetic shielding materials. EMI SE is defined as the sum of three terms including reflection loss, absorption loss and multiple reflection loss in the classical Schelkunoff theory, while it is decomposed into two terms named reflection loss and absorption loss in practice, which is called Calculation theory here. In this paper, we elucidate the widely-seen misconceptions connected with EMI SE via theoretical derivation and instance analysis. Firstly, the terms in Calculation theory are often mistakenly regarded as the approximation of the terms with the same names in Schelkunoff theory when multiple reflection loss is negligible. Secondly, it is insufficient and unreasonable to determine the absorption-dominant shielding performance in the case that absorption loss is higher than reflection loss since reflection loss and absorption loss cannot represent the actual levels of reflected and absorbed power. Power coefficients are recommended to compare the contribution of reflection and absorption to shielding performance. Thirdly, multiple reflection effect is included in the definitions of reflection loss and absorption loss in Calculation theory, and the effect of multiple reflections on shielding property is clarified as against the commonly wrong understandings. These clarifications offer correct comprehension about the shielding mechanism and assessment of reflection and absorption contribution to the total shielding.
△ Less
Submitted 5 December, 2021;
originally announced December 2021.
-
Online Beam Current Estimation in Particle Beam Microscopy
Authors:
Sheila W. Seidel,
Luisa Watkins,
Minxu Peng,
Akshay Agarwal,
Christopher Yu,
Vivek K Goyal
Abstract:
In conventional particle beam microscopy, knowledge of the beam current is essential for accurate micrograph formation and sample milling. This generally necessitates offline calibration of the instrument. In this work, we establish that beam current can be estimated online, from the same secondary electron count data that is used to form micrographs. Our methods depend on the recently introduced…
▽ More
In conventional particle beam microscopy, knowledge of the beam current is essential for accurate micrograph formation and sample milling. This generally necessitates offline calibration of the instrument. In this work, we establish that beam current can be estimated online, from the same secondary electron count data that is used to form micrographs. Our methods depend on the recently introduced time-resolved measurement concept, which combines multiple short measurements at a single pixel and has previously been shown to partially mitigate the effect of beam current variation on micrograph accuracy. We analyze the problem of jointly estimating beam current and secondary electron yield using the Cramer-Rao bound. Joint estimators operating at a single pixel and estimators that exploit models for inter-pixel correlation and Markov beam current variation are proposed and tested on synthetic microscopy data. Our estimates of secondary electron yield that incorporate explicit beam current estimation beat state-of-the-art methods, resulting in micrograph accuracy nearly indistinguishable from what is obtained with perfect beam current knowledge. Our novel beam current estimation could help improve milling outcomes, prevent sample damage, and enable online instrument diagnostics.
△ Less
Submitted 20 November, 2021;
originally announced November 2021.
-
New Experimental Limits on Exotic Spin- and Velocity-dependent Interactions Using Rotationally Modulated Source-masses and an Atomic-magnetometer Array
Authors:
K. Y. Wu,
S. Y. Chen,
G. A. Sun,
S. M. Peng,
M. Peng,
H. Yan
Abstract:
We conducted laboratory searching for the exotic spin- and velocity-dependent new interactions according to the previously proposed experimental scheme. Two $\sim$6Kg heavy source masses are rotationally modulated at a frequency of 20Hz. Four identical atomic magnetometers are used in an array form to increase the statistics and cancel the common-mode noise. Data processing method based on high pr…
▽ More
We conducted laboratory searching for the exotic spin- and velocity-dependent new interactions according to the previously proposed experimental scheme. Two $\sim$6Kg heavy source masses are rotationally modulated at a frequency of 20Hz. Four identical atomic magnetometers are used in an array form to increase the statistics and cancel the common-mode noise. Data processing method based on high precision numerical integration is applied for the four harmonic frequencies of the signal. The rotation direction of the source masses was reversed to flip the signal. Thus the [1,-3,3,-1] weighting method can be applied to remove possible slow drifting further. The experiment method has noise reduction features, and new constraints for Vector-Axial and Axial-Axial were obtained. The new constraints on VA improved by as much as more than four orders, on AA by as much as two orders in the corresponding force range, respectively.
△ Less
Submitted 28 September, 2021;
originally announced September 2021.
-
Dynamic Behaviors and Training Effects in TiN/Ti/HfO$_x$/TiN Nanolayered Memristors with Controllable Quantized Conductance States: Implications for Quantum and Neuromorphic Computing Devices
Authors:
Min-Hsuan Peng,
Ching-Yang Pan,
Hao-Xuan Zheng,
Ting-Chang Chang,
Pei-hsun Jiang
Abstract:
Controllable quantized conductance states of TiN/Ti/HfO$_x$/TiN memristors are realized with great precision through a pulse-mode reset procedure, assisted with analytical differentiation of the condition of the set procedure, which involves critical monitoring of the measured bias voltage. An intriguing training effect that leads to faster switching of the states is also observed during the opera…
▽ More
Controllable quantized conductance states of TiN/Ti/HfO$_x$/TiN memristors are realized with great precision through a pulse-mode reset procedure, assisted with analytical differentiation of the condition of the set procedure, which involves critical monitoring of the measured bias voltage. An intriguing training effect that leads to faster switching of the states is also observed during the operation. Detailed analyses on the low- and high-resistance states under different compliance currents reveal a complete picture of the structural evolution and dynamic behaviors of the conductive filament in the HfO$_x$ layer. This study provides a closer inspection on the quantum-level manipulation of nanoscale atomic configurations in the memristors, which helps to develop essential knowledge about the design and fabrication of the future memristor-based quantum devices and neuromorphic computing devices.
△ Less
Submitted 22 September, 2021;
originally announced September 2021.
-
Anti-$\mathcal{PT}$-symmetric Kerr gyroscope
Authors:
Huilai Zhang,
Meiyu Peng,
Xun-Wei Xu,
Hui **g
Abstract:
Non-Hermitian systems can exhibit unconventional spectral singularities called exceptional points (EPs). Various EP sensors have been fabricated in recent years, showing strong spectral responses to external signals. Here we propose how to achieve a nonlinear anti-parity-time ($\mathcal{APT}$) gyroscope by spinning an optical resonator. We show that, in the absence of any nonlinearity, the sensiti…
▽ More
Non-Hermitian systems can exhibit unconventional spectral singularities called exceptional points (EPs). Various EP sensors have been fabricated in recent years, showing strong spectral responses to external signals. Here we propose how to achieve a nonlinear anti-parity-time ($\mathcal{APT}$) gyroscope by spinning an optical resonator. We show that, in the absence of any nonlinearity, the sensitivity or optical mode splitting of the linear device can be magnified up to 3 orders than that of the conventional device without EPs. Remarkably, the $\mathcal{APT}$ symmetry can be broken when including the Kerr nonlinearity of the materials and, as the result, the detection threshold can be significantly lowered, i.e., much weaker rotations which are well beyond the ability of a linear gyroscope can now be detected with the nonlinear device. Our work shows the powerful ability of $\mathcal{APT}$ gyroscopes in practice to achieve ultrasensitive rotation measurement.
△ Less
Submitted 18 September, 2021;
originally announced September 2021.
-
Searching for Exotic Spin-Dependent Interactions Using Rotationally Modulated Source Masses and an Atomic Magnetometer Array
Authors:
K. Y. Wu,
S. Y. Chen,
J. Gong,
M. Peng,
H. Yan
Abstract:
We describe a proposed experimental search for exotic spin-dependent interactions using rotationally modulated source masses and an atomic magnetometer array. Rather than further improving the magnetometer sensitivity, noise reduction can be another way to reach higher measurement precision. In this work, we propose to use modulating techniques of the source masses to reduce the noise of the exper…
▽ More
We describe a proposed experimental search for exotic spin-dependent interactions using rotationally modulated source masses and an atomic magnetometer array. Rather than further improving the magnetometer sensitivity, noise reduction can be another way to reach higher measurement precision. In this work, we propose to use modulating techniques of the source masses to reduce the noise of the experiment. Better precision can be achieved if the fundamental frequency and harmonics of the rotating source masses are used to detect the new interactions. Furthermore, if an array of magnetometers are applied, the statistic precision can be improved, and some common noises can be canceled. Our analysis and simulations indicate that the proposed experiment scheme can improve the detection precisions of three types of spin-dependent interactions by as much as $\sim$5 orders in the force range of $\sim$cm to $\sim$10m.
△ Less
Submitted 8 March, 2022; v1 submitted 6 September, 2021;
originally announced September 2021.
-
Temporal Pyramid Transformer with Multimodal Interaction for Video Question Answering
Authors:
Min Peng,
Chongyang Wang,
Yuan Gao,
Yu Shi,
Xiang-Dong Zhou
Abstract:
Video question answering (VideoQA) is challenging given its multimodal combination of visual understanding and natural language understanding. While existing approaches seldom leverage the appearance-motion information in the video at multiple temporal scales, the interaction between the question and the visual information for textual semantics extraction is frequently ignored. Targeting these iss…
▽ More
Video question answering (VideoQA) is challenging given its multimodal combination of visual understanding and natural language understanding. While existing approaches seldom leverage the appearance-motion information in the video at multiple temporal scales, the interaction between the question and the visual information for textual semantics extraction is frequently ignored. Targeting these issues, this paper proposes a novel Temporal Pyramid Transformer (TPT) model with multimodal interaction for VideoQA. The TPT model comprises two modules, namely Question-specific Transformer (QT) and Visual Inference (VI). Given the temporal pyramid constructed from a video, QT builds the question semantics from the coarse-to-fine multimodal co-occurrence between each word and the visual content. Under the guidance of such question-specific semantics, VI infers the visual clues from the local-to-global multi-level interactions between the question and the video. Within each module, we introduce a multimodal attention mechanism to aid the extraction of question-video interactions, with residual connections adopted for the information passing across different levels. Through extensive experiments on three VideoQA datasets, we demonstrate better performances of the proposed method in comparison with the state-of-the-arts.
△ Less
Submitted 10 September, 2021;
originally announced September 2021.
-
Investigation of the effectiveness of non-inductive `multi-harmonic' electron cyclotron current drive through modeling multi-pass absorptions in the EXL-50 spherical tokamak
Authors:
D. Banerjee,
S. D. Song,
H. S. Xie,
B. Liu,
M. Y. Wang,
W. J. Liu,
B. Chen,
L. Han,
D. Luo,
Y. Y. Song,
Yu. V. Petrov,
X. M. Song,
M. S. Liu,
R. W. Harvey,
Y. J. Shi,
Y. K. M. Peng,
the EXL50 team
Abstract:
The effectiveness of multiple electron cyclotron resonance (ECR) harmonics has been thoroughly investigated in context of high current drive efficiency, generally observed in fully non-inductive operation of the low aspect ratio EXL-50 spherical tokamak (ST) powered by electron cyclotron (EC) waves. The Fokker-Plank equation is numerically solved to obtain electron distribution function, under ste…
▽ More
The effectiveness of multiple electron cyclotron resonance (ECR) harmonics has been thoroughly investigated in context of high current drive efficiency, generally observed in fully non-inductive operation of the low aspect ratio EXL-50 spherical tokamak (ST) powered by electron cyclotron (EC) waves. The Fokker-Plank equation is numerically solved to obtain electron distribution function, under steady state of the relativistic nonlinear Coulomb collision and quasi-linear diffusion operators, for calculating plasma current driven by the injected EC wave. For the extra-ordinary EC wave, simulation results unfold a mechanism by which electrons moving around the cold second harmonic ECR layer strongly resonate with higher harmonics via the relativistic Doppler shifted resonance condition. This feature is in fact evident above a certain value of input EC wave power in simulation, indicating it to be a non-linear phenomenon. Similar to the experimental observation, high efficiency in current drive (over 1 A/W) has indeed been found in simulation for a typical low density ($\sim 1\times10^{18}~m^{-3}$), low temperature ($\lesssim 100$ eV) plasma of EXL-50 by taking into account multi-pass absorptions in our simulation model. However, such characteristic is not found in the ordinary EC-wave study for both single-pass and multi-pass simulations, suggesting it as inefficient in driving current on our ST device.
△ Less
Submitted 9 September, 2021;
originally announced September 2021.
-
Observation of a strong correlation between the positive floating potential near the edge and plasma current on EXL-50 ECW plasma
Authors:
Mingyuan Wang,
Dong Guo,
Xin Zhao,
Yunyang Song,
Wenjun Liu,
Hongfei Du,
Shaodong Song,
Bing Liu,
Yuejiang Shi,
Tiantian Sun,
Songjian Li,
Debabrata Banerjee,
Xiaomin Tian,
Yingying Li,
Y. -K Martin Peng
Abstract:
Fully non-inductive plasma current start-up without the central solenoid in ECW plasma was used on EXL-50 Spherical Torus with a weak external vertical field (Bv). Generally, the number of electrons leaving to the vessel wall by the gradient Bt is larger than ions, and the positive potential was built up in plasma. The relationship between floating potential and the plasma current was studied usin…
▽ More
Fully non-inductive plasma current start-up without the central solenoid in ECW plasma was used on EXL-50 Spherical Torus with a weak external vertical field (Bv). Generally, the number of electrons leaving to the vessel wall by the gradient Bt is larger than ions, and the positive potential was built up in plasma. The relationship between floating potential and the plasma current was studied using the Langmuir probes near the boundary. The results show that the floating potential is positive (about 200V) and has a strong correlation with plasma current. In open magnetic field, the plasma current is driven by the high energy electrons in preferential confinement, the plasma current and potential approximately positively correlated with total electron density. After forming the closed flux surface, the plasma current consists mainly of the ECW driven current, and potential is negatively correlated with plasma current. By actively adjusting the Bv, it demonstrated that the positive voltage is approximately inversely correlated with the Bv and plasma current (Ip). Considering that the plasma temperature near the boundary is quite low (~eV), the positive voltage near the boundary caused by the high-energy electron loss. Therefore, the measurements of the boundary potential are important for the study of high-energy electron confinement performance, noninductive plasma current start-up and current driven.
△ Less
Submitted 1 September, 2021; v1 submitted 22 August, 2021;
originally announced August 2021.
-
Non-inductive plasma current sustainment with stochastic electron cyclotron in EXL-50 spherical torus
Authors:
Mingyuan Wang,
Shikui Cheng,
Bing Liu,
Shaodong Song,
Guo Dong,
Yunyang Song,
Wenjun Liu,
Debabrata Banerjee,
Songjian Li,
Tiantian Sun,
Yingying Li,
Yuejiang Shi,
Y. -K Martin Peng,
ADi Liu
Abstract:
The start-up and sustainment of a stochastic wave non-inductive current on a spherical torus was experimentally demonstrated for the first time using only electron cyclotron waves. The plasma current is insensitive to the injection angle of ECWs and approximately linearly correlated with the slope of the X-ray spectrum. Its direction is determined by the vertical magnetic field (BV). The temporal…
▽ More
The start-up and sustainment of a stochastic wave non-inductive current on a spherical torus was experimentally demonstrated for the first time using only electron cyclotron waves. The plasma current is insensitive to the injection angle of ECWs and approximately linearly correlated with the slope of the X-ray spectrum. Its direction is determined by the vertical magnetic field (BV). The temporal development in the number of X-ray bremsstrahlung photons with a specified energy is consistent with the stochastic heating model. Moreover, the ratio of Amps to Watts of the ECW is generally >1 kA/kW under normal conditions (maximum plasma current: 150 kA, ECW: 140 kW). The experimental results are explained using the stochastic heating model of the asymmetric electron velocity distribution in stochastic electromagnetic waves.
△ Less
Submitted 1 September, 2021; v1 submitted 22 August, 2021;
originally announced August 2021.
-
Material-structure integrated design for ultra-broadband microwave metamaterial absorber
Authors:
Mengyue Peng,
Faxiang Qina,
Li** Zhou,
Huijie Wei,
Zihao Zhu,
Xiaopeng Shen
Abstract:
We propose herein a method of material-structure integrated design for broadband absorption of dielectric metamaterial, which is achieved by combination of genetic algorithm and simulation platform. A multi-layered metamaterial absorber with an ultra-broadband absorption from 5.3 to 18 GHz (a relative bandwidth of as high as 109%) is realized numerically and experimentally. In addition, simulated…
▽ More
We propose herein a method of material-structure integrated design for broadband absorption of dielectric metamaterial, which is achieved by combination of genetic algorithm and simulation platform. A multi-layered metamaterial absorber with an ultra-broadband absorption from 5.3 to 18 GHz (a relative bandwidth of as high as 109%) is realized numerically and experimentally. In addition, simulated results demonstrate the proposed metamaterial exhibits good incident angle and polarization tolerance, which also are significant criteria for practical applications. By investigating the working principle with theoretical calculation and numerical simulation, it can be found that merging of multiple resonance modes encompassing quarter-wavelength interference cancellation, spoof surface plasmon polariton mode, dielectric resonance mode and grating mode is responsible for a remarkable ultra-broadband absorption. Analysis of respective contribution of material and structure indicates that either of them plays an indispensable role in activating different resonance modes, and symphony of material and structure is essential to afford desirable target performance. The material-structure integrated design philosophy highlights the superiority of coupling material and structure and provides an effective comprehensive optimization strategy for dielectric metamaterials.
△ Less
Submitted 19 July, 2021;
originally announced August 2021.
-
An Adaptive State Aggregation Algorithm for Markov Decision Processes
Authors:
Guanting Chen,
Johann Demetrio Gaebler,
Matt Peng,
Chunlin Sun,
Yinyu Ye
Abstract:
Value iteration is a well-known method of solving Markov Decision Processes (MDPs) that is simple to implement and boasts strong theoretical convergence guarantees. However, the computational cost of value iteration quickly becomes infeasible as the size of the state space increases. Various methods have been proposed to overcome this issue for value iteration in large state and action space MDPs,…
▽ More
Value iteration is a well-known method of solving Markov Decision Processes (MDPs) that is simple to implement and boasts strong theoretical convergence guarantees. However, the computational cost of value iteration quickly becomes infeasible as the size of the state space increases. Various methods have been proposed to overcome this issue for value iteration in large state and action space MDPs, often at the price, however, of generalizability and algorithmic simplicity. In this paper, we propose an intuitive algorithm for solving MDPs that reduces the cost of value iteration updates by dynamically grou** together states with similar cost-to-go values. We also prove that our algorithm converges almost surely to within \(2\varepsilon / (1 - γ)\) of the true optimal value in the \(\ell^\infty\) norm, where \(γ\) is the discount factor and aggregated states differ by at most \(\varepsilon\). Numerical experiments on a variety of simulated environments confirm the robustness of our algorithm and its ability to solve MDPs with much cheaper updates especially as the scale of the MDP problem increases.
△ Less
Submitted 23 July, 2021;
originally announced July 2021.
-
Blockchain Systems, Technologies and Applications: A Methodology Perspective
Authors:
Bin Cao,
Zixin Wang,
Long Zhang,
Daquan Feng,
Mugen Peng,
Lei Zhang
Abstract:
In the past decade, blockchain has shown a promising vision greatly to build the trust without any powerful third party in a secure, decentralized and salable manner. However, due to the wide application and future development from cryptocurrency to Internet of Things, blockchain is an extremely complex system enabling integration with mathematics, finance, computer science, communication and netw…
▽ More
In the past decade, blockchain has shown a promising vision greatly to build the trust without any powerful third party in a secure, decentralized and salable manner. However, due to the wide application and future development from cryptocurrency to Internet of Things, blockchain is an extremely complex system enabling integration with mathematics, finance, computer science, communication and network engineering, etc. As a result, it is a challenge for engineer, expert and researcher to fully understand the blockchain process in a systematic view from top to down. First, this article introduces how blockchain works, the research activity and challenge, and illustrates the roadmap involving the classic methodology with typical blockchain use cases and topics. Second, in blockchain system, how to adopt stochastic process, game theory, optimization, machine learning and cryptography to study blockchain running process and design blockchain protocol/algorithm are discussed in details. Moreover, the advantage and limitation using these methods are also summarized as the guide of future work to further considered. Finally, some remaining problems from technical, commercial and political views are discussed as the open issues. The main findings of this article will provide an overview in a methodology perspective to study theoretical model for blockchain fundamentals understanding, design network service for blockchain-based mechanisms and algorithms, as well as apply blockchain for Internet of Things, etc.
△ Less
Submitted 7 May, 2021;
originally announced May 2021.
-
Solenoid-free current drive via ECRH in EXL-50 spherical torus plasmas
Authors:
Yuejiang Shi,
Bing Liu,
Shaodong Song,
Yunyang Song,
Xianming Song,
Bowei Tong,
Shikui Cheng,
Wenjun Liu,
Minyuan Wang,
Tiantian Sun,
Dong Guo,
Songjian Li,
Yingying Li,
Bin Chen,
Xiang Gu,
Jianqing Cai,
Di Luo,
Debabrata Banerjee,
Xin Zhao,
Yuanming Yang,
Wenwu Luo,
Peihai Zhou,
Yu Wang,
A. Ishida,
T. Maekawa
, et al. (3 additional authors not shown)
Abstract:
As a new spherical tokamak (ST) designed to simplify engineering requirements of a possible future fusion power source, the EXL-50 experiment features a low aspect ratio (A) vacuum vessel (VV), encircling a central post assembly containing the toroidal field coil conductors without a central solenoid. Multiple electron cyclotron resonance heating (ECRH) resonances are located within the VV to impr…
▽ More
As a new spherical tokamak (ST) designed to simplify engineering requirements of a possible future fusion power source, the EXL-50 experiment features a low aspect ratio (A) vacuum vessel (VV), encircling a central post assembly containing the toroidal field coil conductors without a central solenoid. Multiple electron cyclotron resonance heating (ECRH) resonances are located within the VV to improve current drive effectiveness. Copious energetic electrons are produced and measured with hard X-ray detectors, carry the bulk of the plasma current ranging from 50kA to 150kA, which is maintained for more than 1s duration. It is observed that over one Ampere current can be maintained per Watt of ECRH power issued from the 28-GHz gyrotrons. The plasma current reaches Ip>80kA for high density (>5e18me-2) discharge with 150kW ECHR heating. An analysis was carried out combining reconstructed multi-fluid equilibrium, guiding-center orbits of energetic electrons, and resonant heating mechanisms. It is verified that in EXL-50 a broadly distributed current of energetic electrons creates smaller closed magnetic-flux surfaces of low aspect ratio that in turn confine the thermal plasma electrons and ions and participate in maintaining the equilibrium force-balance.
△ Less
Submitted 30 March, 2022; v1 submitted 30 April, 2021;
originally announced April 2021.
-
Secure and Efficient Federated Learning Through Layering and Sharding Blockchain
Authors:
Shuo Yuan,
Bin Cao,
Yao Sun,
Zhiguo Wan,
Mugen Peng
Abstract:
Introducing blockchain into Federated Learning (FL) to build a trusted edge computing environment for transmission and learning has attracted widespread attention as a new decentralized learning pattern. However, traditional consensus mechanisms and architectures of blockchain systems face significant challenges in handling large-scale FL tasks, especially on Internet of Things (IoT) devices, due…
▽ More
Introducing blockchain into Federated Learning (FL) to build a trusted edge computing environment for transmission and learning has attracted widespread attention as a new decentralized learning pattern. However, traditional consensus mechanisms and architectures of blockchain systems face significant challenges in handling large-scale FL tasks, especially on Internet of Things (IoT) devices, due to their substantial resource consumption, limited transaction throughput, and complex communication requirements. To address these challenges, this paper proposes ChainFL, a novel two-layer blockchain-driven FL system. It splits the IoT network into multiple shards within the subchain layer, effectively reducing the scale of information exchange, and employs a Direct Acyclic Graph (DAG)-based mainchain as the mainchain layer, enabling parallel and asynchronous cross-shard validation. Furthermore, the FL procedure is customized to integrate deeply with blockchain technology, and a modified DAG consensus mechanism is designed to mitigate distortion caused by abnormal models. To provide a proof-of-concept implementation and evaluation, multiple subchains based on Hyperledger Fabric and a self-developed DAG-based mainchain are deployed. Extensive experiments demonstrate that ChainFL significantly surpasses conventional FL systems, showing up to a 14% improvement in training efficiency and a threefold increase in robustness.
△ Less
Submitted 31 January, 2024; v1 submitted 27 April, 2021;
originally announced April 2021.
-
TextFlint: Unified Multilingual Robustness Evaluation Toolkit for Natural Language Processing
Authors:
Tao Gui,
Xiao Wang,
Qi Zhang,
Qin Liu,
Yicheng Zou,
Xin Zhou,
Rui Zheng,
Chong Zhang,
Qinzhuo Wu,
Jiacheng Ye,
Zexiong Pang,
Yongxin Zhang,
Zhengyan Li,
Ruotian Ma,
Zichu Fei,
Ruijian Cai,
Jun Zhao,
Xingwu Hu,
Zhiheng Yan,
Yiding Tan,
Yuan Hu,
Qiyuan Bian,
Zhihua Liu,
Bolin Zhu,
Shan Qin
, et al. (9 additional authors not shown)
Abstract:
Various robustness evaluation methodologies from different perspectives have been proposed for different natural language processing (NLP) tasks. These methods have often focused on either universal or task-specific generalization capabilities. In this work, we propose a multilingual robustness evaluation platform for NLP tasks (TextFlint) that incorporates universal text transformation, task-spec…
▽ More
Various robustness evaluation methodologies from different perspectives have been proposed for different natural language processing (NLP) tasks. These methods have often focused on either universal or task-specific generalization capabilities. In this work, we propose a multilingual robustness evaluation platform for NLP tasks (TextFlint) that incorporates universal text transformation, task-specific transformation, adversarial attack, subpopulation, and their combinations to provide comprehensive robustness analysis. TextFlint enables practitioners to automatically evaluate their models from all aspects or to customize their evaluations as desired with just a few lines of code. To guarantee user acceptability, all the text transformations are linguistically based, and we provide a human evaluation for each one. TextFlint generates complete analytical reports as well as targeted augmented data to address the shortcomings of the model's robustness. To validate TextFlint's utility, we performed large-scale empirical evaluations (over 67,000 evaluations) on state-of-the-art deep learning models, classic supervised methods, and real-world systems. Almost all models showed significant performance degradation, including a decline of more than 50% of BERT's prediction accuracy on tasks such as aspect-level sentiment classification, named entity recognition, and natural language inference. Therefore, we call for the robustness to be included in the model evaluation, so as to promote the healthy development of NLP technology.
△ Less
Submitted 5 May, 2021; v1 submitted 21 March, 2021;
originally announced March 2021.
-
Mention-centered Graph Neural Network for Document-level Relation Extraction
Authors:
Jiaxin Pan,
Min Peng,
Yiyan Zhang
Abstract:
Document-level relation extraction aims to discover relations between entities across a whole document. How to build the dependency of entities from different sentences in a document remains to be a great challenge. Current approaches either leverage syntactic trees to construct document-level graphs or aggregate inference information from different sentences. In this paper, we build cross-sentenc…
▽ More
Document-level relation extraction aims to discover relations between entities across a whole document. How to build the dependency of entities from different sentences in a document remains to be a great challenge. Current approaches either leverage syntactic trees to construct document-level graphs or aggregate inference information from different sentences. In this paper, we build cross-sentence dependencies by inferring compositional relations between inter-sentence mentions. Adopting aggressive linking strategy, intermediate relations are reasoned on the document-level graphs by mention convolution. We further notice the generalization problem of NA instances, which is caused by incomplete annotation and worsened by fully-connected mention pairs. An improved ranking loss is proposed to attend this problem. Experiments show the connections between different mentions are crucial to document-level relation extraction, which enables the model to extract more meaningful higher-level compositional relations.
△ Less
Submitted 15 March, 2021;
originally announced March 2021.
-
Linear Representation Meta-Reinforcement Learning for Instant Adaptation
Authors:
Matt Peng,
Banghua Zhu,
Jiantao Jiao
Abstract:
This paper introduces Fast Linearized Adaptive Policy (FLAP), a new meta-reinforcement learning (meta-RL) method that is able to extrapolate well to out-of-distribution tasks without the need to reuse data from training, and adapt almost instantaneously with the need of only a few samples during testing. FLAP builds upon the idea of learning a shared linear representation of the policy so that whe…
▽ More
This paper introduces Fast Linearized Adaptive Policy (FLAP), a new meta-reinforcement learning (meta-RL) method that is able to extrapolate well to out-of-distribution tasks without the need to reuse data from training, and adapt almost instantaneously with the need of only a few samples during testing. FLAP builds upon the idea of learning a shared linear representation of the policy so that when adapting to a new task, it suffices to predict a set of linear weights. A separate adapter network is trained simultaneously with the policy such that during adaptation, we can directly use the adapter network to predict these linear weights instead of updating a meta-policy via gradient descent, such as in prior meta-RL methods like MAML, to obtain the new policy. The application of the separate feed-forward network not only speeds up the adaptation run-time significantly, but also generalizes extremely well to very different tasks that prior Meta-RL methods fail to generalize to. Experiments on standard continuous-control meta-RL benchmarks show FLAP presenting significantly stronger performance on out-of-distribution tasks with up to double the average return and up to 8X faster adaptation run-time speeds when compared to prior methods.
△ Less
Submitted 12 January, 2021;
originally announced January 2021.
-
Topic-Oriented Spoken Dialogue Summarization for Customer Service with Saliency-Aware Topic Modeling
Authors:
Yicheng Zou,
Lujun Zhao,
Yangyang Kang,
Jun Lin,
Minlong Peng,
Zhuoren Jiang,
Changlong Sun,
Qi Zhang,
Xuan**g Huang,
Xiaozhong Liu
Abstract:
In a customer service system, dialogue summarization can boost service efficiency by automatically creating summaries for long spoken dialogues in which customers and agents try to address issues about specific topics. In this work, we focus on topic-oriented dialogue summarization, which generates highly abstractive summaries that preserve the main ideas from dialogues. In spoken dialogues, abund…
▽ More
In a customer service system, dialogue summarization can boost service efficiency by automatically creating summaries for long spoken dialogues in which customers and agents try to address issues about specific topics. In this work, we focus on topic-oriented dialogue summarization, which generates highly abstractive summaries that preserve the main ideas from dialogues. In spoken dialogues, abundant dialogue noise and common semantics could obscure the underlying informative content, making the general topic modeling approaches difficult to apply. In addition, for customer service, role-specific information matters and is an indispensable part of a summary. To effectively perform topic modeling on dialogues and capture multi-role information, in this work we propose a novel topic-augmented two-stage dialogue summarizer (TDS) jointly with a saliency-aware neural topic model (SATM) for topic-oriented summarization of customer service dialogues. Comprehensive studies on a real-world Chinese customer service dataset demonstrated the superiority of our method against several strong baselines.
△ Less
Submitted 25 June, 2021; v1 submitted 14 December, 2020;
originally announced December 2020.
-
Time-Resolved Focused Ion Beam Microscopy: Modeling, Estimation Methods, and Analyses
Authors:
Minxu Peng,
John Murray-Bruce,
Vivek K Goyal
Abstract:
In a focused ion beam (FIB) microscope, source particles interact with a small volume of a sample to generate secondary electrons that are detected, pixel by pixel, to produce a micrograph. Randomness of the number of incident particles causes excess variation in the micrograph, beyond the variation in the underlying particle-sample interaction. We recently demonstrated that joint processing of mu…
▽ More
In a focused ion beam (FIB) microscope, source particles interact with a small volume of a sample to generate secondary electrons that are detected, pixel by pixel, to produce a micrograph. Randomness of the number of incident particles causes excess variation in the micrograph, beyond the variation in the underlying particle-sample interaction. We recently demonstrated that joint processing of multiple time-resolved measurements from a single pixel can mitigate this effect of source shot noise in helium ion microscopy. This paper is focused on establishing a rigorous framework for understanding the potential for this approach. It introduces idealized continuous- and discrete-time abstractions of FIB microscopy with direct electron detection and estimation-theoretic limits of imaging performance under these measurement models. Novel estimators for use with continuous-time measurements are introduced and analyzed, and estimators for use with discrete-time measurements are analyzed and shown to approach their continuous-time counterparts as time resolution is increased. Simulated FIB microscopy results are consistent with theoretical analyses and demonstrate that substantial improvements over conventional FIB microscopy image formation are made possible by time-resolved measurement.
△ Less
Submitted 18 February, 2022; v1 submitted 16 November, 2020;
originally announced November 2020.
-
Wind Power Transmission System Integration -- a Case Study of China Wind Power Base
Authors:
Jianxue Wang,
Shutang You,
Xingzhong Bai,
Mingqiao Peng
Abstract:
Due to a series of supporting policies in recent years, China wind power has developed rapidly through a large-scale and centralized mode. This paper analyzes the two major concerns faced by wind power development in China: wind generation reliability and wind energy balancing. More specifically, wind farm trip**-off-grid incidents and wind power curtailment issues, which caused huge economical…
▽ More
Due to a series of supporting policies in recent years, China wind power has developed rapidly through a large-scale and centralized mode. This paper analyzes the two major concerns faced by wind power development in China: wind generation reliability and wind energy balancing. More specifically, wind farm trip**-off-grid incidents and wind power curtailment issues, which caused huge economical loss, are investigated in details. Based on operation experience of large wind power bases, technical recommendations and economic incentives are proposed to improve wind power integration and power grid reliability. As a summary and outlook of wind power development in China, this paper provides a reference on future wind power development for other countries.
△ Less
Submitted 10 April, 2021; v1 submitted 22 October, 2020;
originally announced October 2020.
-
Four-Fluid Axisymmetric Plasma Equilibrium Model Including Relativistic Electrons and Computational Method and Results
Authors:
Akio Ishida,
Y. -K. Martin Peng,
Wenjun Liu
Abstract:
A non-relativistic multi-fluid plasma axisymmetric equilibrium model was developed recently to account for the presence of an energetic electron fluid in addition to thermal electron and ion fluids. The equilibrium formulation of a multi-fluid plasma with relativistic energetic electrons is developed and reported in this paper. Relativistic effects in a fluid model approximation can appear in two…
▽ More
A non-relativistic multi-fluid plasma axisymmetric equilibrium model was developed recently to account for the presence of an energetic electron fluid in addition to thermal electron and ion fluids. The equilibrium formulation of a multi-fluid plasma with relativistic energetic electrons is developed and reported in this paper. Relativistic effects in a fluid model approximation can appear in two ways: due to a large macroscopic fluid velocity comparable to the speed of light and large particle's microscopic random motion which becomes significant if the temperature becomes comparable to or larger than the electron rest mass-energy. It is found that the axial component of relativistic generalized angular momentum can be used to describe relativistic axisymmetric equilibrium. The formulation is applied to a four-fluid plasma composed of a relativistic energetic electron fluid, a thermal electron fluid, and fluids of two thermal ion species (e.g. proton and boron ions). The four-fluid density expression which is consistent with the electrostatic potential is obtained and applied in the computation. An example equilibrium approximating a four-fluid plasma recently observed in a solenoid-free ECRH sustained spherical torus plasma is calculated and presented. A second equilibrium that extends the energetic electron temperature of the first example to 679keV is calculated revealing significant relativistic effects.
△ Less
Submitted 19 February, 2021; v1 submitted 15 October, 2020;
originally announced October 2020.
-
RDSGAN: Rank-based Distant Supervision Relation Extraction with Generative Adversarial Framework
Authors:
Guoqing Luo,
Jiaxin Pan,
Min Peng
Abstract:
Distant supervision has been widely used for relation extraction but suffers from noise labeling problem. Neural network models are proposed to denoise with attention mechanism but cannot eliminate noisy data due to its non-zero weights. Hard decision is proposed to remove wrongly-labeled instances from the positive set though causes loss of useful information contained in removed instances. In th…
▽ More
Distant supervision has been widely used for relation extraction but suffers from noise labeling problem. Neural network models are proposed to denoise with attention mechanism but cannot eliminate noisy data due to its non-zero weights. Hard decision is proposed to remove wrongly-labeled instances from the positive set though causes loss of useful information contained in removed instances. In this paper, we propose a novel generative neural framework named RDSGAN (Rank-based Distant Supervision GAN) which automatically generates valid instances for distant supervision relation extraction. Our framework combines soft attention and hard decision to learn the distribution of true positive instances via adversarial training and selects valid instances conforming to the distribution via rank-based distant supervision, which addresses the false positive problem. Experimental results show the superiority of our framework over strong baselines.
△ Less
Submitted 30 September, 2020;
originally announced September 2020.
-
Recognizing Micro-Expression in Video Clip with Adaptive Key-Frame Mining
Authors:
Min Peng,
Chongyang Wang,
Yuan Gao,
Tao Bi,
Tong Chen,
Yu Shi,
Xiang-Dong Zhou
Abstract:
As a spontaneous expression of emotion on face, micro-expression reveals the underlying emotion that cannot be controlled by human. In micro-expression, facial movement is transient and sparsely localized through time. However, the existing representation based on various deep learning techniques learned from a full video clip is usually redundant. In addition, methods utilizing the single apex fr…
▽ More
As a spontaneous expression of emotion on face, micro-expression reveals the underlying emotion that cannot be controlled by human. In micro-expression, facial movement is transient and sparsely localized through time. However, the existing representation based on various deep learning techniques learned from a full video clip is usually redundant. In addition, methods utilizing the single apex frame of each video clip require expert annotations and sacrifice the temporal dynamics. To simultaneously localize and recognize such fleeting facial movements, we propose a novel end-to-end deep learning architecture, referred to as adaptive key-frame mining network (AKMNet). Operating on the video clip of micro-expression, AKMNet is able to learn discriminative spatio-temporal representation by combining spatial features of self-learned local key frames and their global-temporal dynamics. Theoretical analysis and empirical evaluation show that the proposed approach improved recognition accuracy in comparison with state-of-the-art methods on multiple benchmark datasets.
△ Less
Submitted 15 March, 2021; v1 submitted 19 September, 2020;
originally announced September 2020.
-
Optimal Resource Allocation for Delay Minimization in NOMA-MEC Networks
Authors:
Fang Fang,
Yanqing Xu,
Zhiguo Ding,
Chao Shen,
Mugen Peng,
George K. Karagiannidis
Abstract:
Multi-access edge computing (MEC) can enhance the computing capability of mobile devices, while non-orthogonal multiple access (NOMA) can provide high data rates. Combining these two strategies can effectively benefit the network with spectrum and energy efficiency. In this paper, we investigate the task delay minimization in multi-user NOMA-MEC networks, where multiple users can offload their tas…
▽ More
Multi-access edge computing (MEC) can enhance the computing capability of mobile devices, while non-orthogonal multiple access (NOMA) can provide high data rates. Combining these two strategies can effectively benefit the network with spectrum and energy efficiency. In this paper, we investigate the task delay minimization in multi-user NOMA-MEC networks, where multiple users can offload their tasks simultaneously through the same frequency band. We adopt the partial offloading policy, in which each user can partition its computation task into offloading and locally computing parts. We aim to minimize the task delay among users by optimizing their tasks partition ratios and offloading transmit power. The delay minimization problem is first formulated, and it is shown that it is a nonconvex one. By carefully investigating its structure, we transform the original problem into an equivalent quasi-convex. In this way, a bisection search iterative algorithm is proposed in order to achieve the minimum task delay. To reduce the complexity of the proposed algorithm and evaluate its optimality, we further derive closed-form expressions for the optimal task partition ratio and offloading power for the case of two-user NOMA-MEC networks. Simulations demonstrate the convergence and optimality of the proposed algorithm and the effectiveness of the closed-form analysis.
△ Less
Submitted 11 September, 2020;
originally announced September 2020.
-
Mode Selection and Resource Allocation in Sliced Fog Radio Access Networks: A Reinforcement Learning Approach
Authors:
Hongyu Xiang,
Mugen Peng,
Yaohua Sun,
Shi Yan
Abstract:
The mode selection and resource allocation in fog radio access networks (F-RANs) have been advocated as key techniques to improve spectral and energy efficiency. In this paper, we investigate the joint optimization of mode selection and resource allocation in uplink F-RANs, where both of the traditional user equipments (UEs) and fog UEs are served by constructed network slice instances. The concer…
▽ More
The mode selection and resource allocation in fog radio access networks (F-RANs) have been advocated as key techniques to improve spectral and energy efficiency. In this paper, we investigate the joint optimization of mode selection and resource allocation in uplink F-RANs, where both of the traditional user equipments (UEs) and fog UEs are served by constructed network slice instances. The concerned optimization is formulated as a mixed-integer programming problem, and both the orthogonal and multiplexed subchannel allocation strategies are proposed to guarantee the slice isolation. Motivated by the development of machine learning, two reinforcement learning based algorithms are developed to solve the original high complexity problem under traditional and fog UEs' specific performance requirements. The basic idea of the proposals is to generate a good mode selection policy according to the immediate reward fed back by an environment. Simulation results validate the benefits of our proposed algorithms and show that a tradeoff between system power consumption and queue delay can be achieved.
△ Less
Submitted 13 February, 2020;
originally announced February 2020.
-
Deep Reinforcement Learning Based Mode Selection and Resource Allocation for Cellular V2X Communications
Authors:
Xinran Zhang,
Mugen Peng,
Shi Yan,
Yaohua Sun
Abstract:
Cellular vehicle-to-everything (V2X) communication is crucial to support future diverse vehicular applications. However, for safety-critical applications, unstable vehicle-to-vehicle (V2V) links and high signalling overhead of centralized resource allocation approaches become bottlenecks. In this paper, we investigate a joint optimization problem of transmission mode selection and resource allocat…
▽ More
Cellular vehicle-to-everything (V2X) communication is crucial to support future diverse vehicular applications. However, for safety-critical applications, unstable vehicle-to-vehicle (V2V) links and high signalling overhead of centralized resource allocation approaches become bottlenecks. In this paper, we investigate a joint optimization problem of transmission mode selection and resource allocation for cellular V2X communications. In particular, the problem is formulated as a Markov decision process, and a deep reinforcement learning (DRL) based decentralized algorithm is proposed to maximize the sum capacity of vehicle-to-infrastructure users while meeting the latency and reliability requirements of V2V pairs. Moreover, considering training limitation of local DRL models, a two-timescale federated DRL algorithm is developed to help obtain robust model. Wherein, the graph theory based vehicle clustering algorithm is executed on a large timescale and in turn the federated learning algorithm is conducted on a small timescale. Simulation results show that the proposed DRL-based algorithm outperforms other decentralized baselines, and validate the superiority of the two-timescale federated DRL algorithm for newly activated V2V pairs.
△ Less
Submitted 13 February, 2020;
originally announced February 2020.
-
Tradeoff between Ergodic Rate and Delivery Latency in Fog Radio Access Networks
Authors:
Bonan Yin,
Mugen Peng,
Shi Yan,
Chun**g Hu
Abstract:
Wireless content caching has recently been considered as an efficient way in fog radio access networks (FRANs) to alleviate the heavy burden on capacity-limited fronthaul links and reduce delivery latency. In this paper, an advanced minimal delay association policy is proposed to minimize latency while guaranteeing spectral efficiency in F-RANs. By utilizing stochastic geometry and queueing theory…
▽ More
Wireless content caching has recently been considered as an efficient way in fog radio access networks (FRANs) to alleviate the heavy burden on capacity-limited fronthaul links and reduce delivery latency. In this paper, an advanced minimal delay association policy is proposed to minimize latency while guaranteeing spectral efficiency in F-RANs. By utilizing stochastic geometry and queueing theory, closed-form expressions of successful delivery probability, average ergodic rate, and average delivery latency are derived, where both the traditional association policy based on accessing the base station with maximal received power and the proposed minimal delay association policy are concerned. Impacts of key operating parameters on the aforementioned performance metrics are exploited. It is shown that the proposed association policy has a better delivery latency than the traditional association policy. Increasing the cache size of fog-computing based access points (F-APs) can more significantly reduce average delivery latency, compared with increasing the density of F-APs. Meanwhile, the latter comes at the expense of decreasing average ergodic rate. This implies the deployment of large cache size at F-APs rather than high density of F-APs can promote performance effectively in F-RANs.
△ Less
Submitted 13 February, 2020;
originally announced February 2020.