Search | arXiv e-print repository

Topological Phases, Local Magnetic Moments, and Spin Polarization Triggered by C558-Line Defects in Graphene

Authors: Ning-**g Yang, Wen-Ti Guo, Hai Yang, Zhigao Huang, Jian-Min Zhang

Abstract: We study the electronic properties of a novel topological defect structure for graphene interspersed with C558-line defects along the Armchair boundary. This system has the topological property of being topologically three-periodic and the type-II Dirac-fermionic character of the embedded topological phase. At the same time, we show computationally that the topological properties of the system are… ▽ More We study the electronic properties of a novel topological defect structure for graphene interspersed with C558-line defects along the Armchair boundary. This system has the topological property of being topologically three-periodic and the type-II Dirac-fermionic character of the embedded topological phase. At the same time, we show computationally that the topological properties of the system are overly dependent on the coupling of this line defect. Using strain engineering to regulate the magnitude of hop** at the defect, the position of the energy level can be easily changed to achieve a topological phase transition. We also discuss the local magnetic moment and the ferromagnetic ground state in the context of line defects, which is the conclusion after considering additional Coulomb interactions. This leads to spin polarization of the whole system. Finally, by modulating the local magnetic moment at the position of the line defect, we achieve a tunable spin quantum conductance in a one-dimensional nanoribbon. Near the Fermi energy level, it also has the property of complete spin polarization. Consequently, spin filtering can be achieved by varying the incident energy of the electrons. △ Less

Submitted 14 August, 2023; originally announced August 2023.

Comments: 8 pages, 6 figures

arXiv:2308.06716 [pdf, other]

Novel magnetic topological insulator FeBi$_2$Te$_4$ with controllable topological quantum phase

Authors: Wen-Ti Guo, Ning**g Yang, Zhigao Huang, Jian-Min Zhang

Abstract: Here, we report a new intrinsic magnetic topological insulator FeBi$_2$Te$_4$ based on first-principles calculations and it can achieve a rich topological phase under pressure modulation. Without pressure, we predict that both FeBi$_2$Te$_4$ ferromagnetic and antiferromagnetic orders are non-trivial topological insulators. Furthermore, FeBi$_2$Te$_4$ of FM-z order will undergo a series of phase tr… ▽ More Here, we report a new intrinsic magnetic topological insulator FeBi$_2$Te$_4$ based on first-principles calculations and it can achieve a rich topological phase under pressure modulation. Without pressure, we predict that both FeBi$_2$Te$_4$ ferromagnetic and antiferromagnetic orders are non-trivial topological insulators. Furthermore, FeBi$_2$Te$_4$ of FM-z order will undergo a series of phase transitions from topological insulator to semimetals and then to trivial insulator under pressure. Finally, we further clarify and verify topological phase transitions with low-energy effective model calculations. This topological phase transition process is attributed to the synergy of the magnetic moment and the spin-orbit coupling. The unique topological properties of FeBi$_2$Te$_4$ will be of great interest in driving the development of quantum effects. △ Less

Submitted 13 August, 2023; originally announced August 2023.

arXiv:2308.03032 [pdf, other]

doi 10.1088/1674-4527/ad0498

Understanding the predication mechanism of deep learning through error propagation among parameters in strong lensing case

Authors: Xilong Fan, Peizheng Wang, ** Li, Nan Yang

Abstract: The error propagation among estimated parameters reflects the correlation among the parameters. We study the capability of machine learning of "learning" the correlation of estimated parameters. We show that machine learning can recover the relation between the uncertainties of different parameters, especially, as predicted by the error propagation formula. Gravitational lensing can be used to pro… ▽ More The error propagation among estimated parameters reflects the correlation among the parameters. We study the capability of machine learning of "learning" the correlation of estimated parameters. We show that machine learning can recover the relation between the uncertainties of different parameters, especially, as predicted by the error propagation formula. Gravitational lensing can be used to probe both astrophysics and cosmology. As a practical application, we show that the machine learning is able to intelligently find the error propagation among the gravitational lens parameters (effective lens mass $M_{L}$ and Einstein radius $θ_{E}$) in accordance with the theoretical formula for the singular isothermal ellipse (SIE) lens model. The relation of errors of lens mass and Einstein radius, (e.g. the ratio of standard deviations $\mathcal{F}=σ_{\hat{ M_{L}}}/ σ_{\hat{θ_{E}}}$) predicted by the deep convolution neural network are consistent with the error propagation formula of SIE lens model. As a proof-of-principle test, a toy model of linear relation with Gaussian noise is presented. We found that the predictions obtained by machine learning indeed indicate the information about the law of error propagation and the distribution of noise. Error propagation plays a crucial role in identifying the physical relation among parameters, rather than a coincidence relation, therefore we anticipate our case study on the error propagation of machine learning predictions could extend to other physical systems on searching the correlation among parameters. △ Less

Submitted 9 January, 2024; v1 submitted 6 August, 2023; originally announced August 2023.

Journal ref: Research in Astronomy and Astrophysics 23.12 (2023): 125022

arXiv:2308.02164 [pdf]

Using Targeted Phonon Excitation to Modulate Thermal Conductivity of Boron Nitride

Authors: Dongkai Pan, Xiao Wan, Zhicheng Zong, Yangjun Qin, Nuo Yang

Abstract: Modulation of thermal conductivity has become a hotspot in the field of heat conduction. A novel strategy based on targeted phonon excitation has been recently proposed for efficient and reversible modulation of thermal conductivity. In this article, the effectiveness of that strategy is further evaluated on hexagonal boron nitride through ab initio methods. Results indicate that thermal conductiv… ▽ More Modulation of thermal conductivity has become a hotspot in the field of heat conduction. A novel strategy based on targeted phonon excitation has been recently proposed for efficient and reversible modulation of thermal conductivity. In this article, the effectiveness of that strategy is further evaluated on hexagonal boron nitride through ab initio methods. Results indicate that thermal conductivity can be increased from 885 W m-1 K-1 to 1151 W m-1 K-1 or decreased to 356 W m-1 K-1, thereby broadening the scope of applicability of this strategy. △ Less

Submitted 4 August, 2023; originally announced August 2023.

Comments: 12 pages, 3 figures

arXiv:2307.14346 [pdf, other]

Multi-objective Deep Reinforcement Learning for Mobile Edge Computing

Authors: Ning Yang, Junrui Wen, Meng Zhang, Ming Tang

Abstract: Mobile edge computing (MEC) is essential for next-generation mobile network applications that prioritize various performance metrics, including delays and energy consumption. However, conventional single-objective scheduling solutions cannot be directly applied to practical systems in which the preferences of these applications (i.e., the weights of different objectives) are often unknown or chall… ▽ More Mobile edge computing (MEC) is essential for next-generation mobile network applications that prioritize various performance metrics, including delays and energy consumption. However, conventional single-objective scheduling solutions cannot be directly applied to practical systems in which the preferences of these applications (i.e., the weights of different objectives) are often unknown or challenging to specify in advance. In this study, we address this issue by formulating a multi-objective offloading problem for MEC with multiple edges to minimize expected long-term energy consumption and transmission delay while considering unknown preferences as parameters. To address the challenge of unknown preferences, we design a multi-objective (deep) reinforcement learning (MORL)-based resource scheduling scheme with proximal policy optimization (PPO). In addition, we introduce a well-designed state encoding method for constructing features for multiple edges in MEC systems, a sophisticated reward function for accurately computing the utilities of delay and energy consumption. Simulation results demonstrate that our proposed MORL scheme enhances the hypervolume of the Pareto front by up to 233.1% compared to benchmarks. Our full framework is available at https://github.com/gracefulning/mec_morl_multipolicy. △ Less

Submitted 5 July, 2023; originally announced July 2023.

Comments: Received by IEEE WiOpt 2023

arXiv:2307.14202 [pdf, ps, other]

Heterogeneous Receptors - Based Molecule Harvesting in MC: Analysis for ISI Mitigation and Energy Efficiency

Authors: Xinyu Huang, Yu Huang, Miaowen Wen, Nan Yang, Robert Schober

Abstract: This paper investigates a spherical transmitter (TX) with a membrane covered by heterogeneous receptors of varying sizes and arbitrary locations for molecular communication (MC), where molecules are encapsulated within vesicles and released from the TX through membrane fusion. Assuming continuous vesicle generation at the TX and a transparent receiver (RX), we calculate the molecule release rate,… ▽ More This paper investigates a spherical transmitter (TX) with a membrane covered by heterogeneous receptors of varying sizes and arbitrary locations for molecular communication (MC), where molecules are encapsulated within vesicles and released from the TX through membrane fusion. Assuming continuous vesicle generation at the TX and a transparent receiver (RX), we calculate the molecule release rate, the fraction of absorbed molecules at the TX, and the received signal at the RX. All obtained analytical expressions are functions of all receptors locations and sizes, and are validated by particle-based simulations. Our numerical results indicate that evenly distributed receptors on the TX membrane can absorb more molecules than randomly distributed receptors or a single receptor. Furthermore, inspired by the autoreceptor functionality in synaptic communication, we incorporate a negative feedback mechanism (NFM) at the TX, such that molecule release stops after a certain period. We then derive the fraction of molecules that can be reused for the subsequent emissions when considering both NFM and molecule harvesting. Our numerical results demonstrate that incorporating NFM can reduce inter-symbol interference (ISI) while maintaining the same peak received signal as the case without NFM. Additionally, our results show that TXs incorporating both molecule harvesting and NFM can achieve a higher energy efficiency and lower error probability than TXs employing only molecule harvesting or neither functionality. △ Less

Submitted 26 July, 2023; originally announced July 2023.

Comments: 30 pages, 9 figures, Submitted to IEEE journals for possible publication. arXiv admin note: substantial text overlap with arXiv:2211.14603

arXiv:2307.12594 [pdf]

Optimized data collection and analysis process for studying solar-thermal desalination by machine learning

Authors: Guilong Peng, Senshan Sun, Yangjun Qin, Zhenwei Xu, Juxin Du, Swellam W. sharshir, A. W. Kandel, A. E. Kabeel, Nuo Yang

Abstract: An effective interdisciplinary study between machine learning and solar-thermal desalination requires a sufficiently large and well-analyzed experimental datasets. This study develops a modified dataset collection and analysis process for studying solar-thermal desalination by machine learning. Based on the optimized water condensation and collection process, the proposed experimental method colle… ▽ More An effective interdisciplinary study between machine learning and solar-thermal desalination requires a sufficiently large and well-analyzed experimental datasets. This study develops a modified dataset collection and analysis process for studying solar-thermal desalination by machine learning. Based on the optimized water condensation and collection process, the proposed experimental method collects over one thousand datasets, which is ten times more than the average number of datasets in previous works, by accelerating data collection and reducing the time by 83.3%. On the other hand, the effects of dataset features are investigated by using three different algorithms, including artificial neural networks, multiple linear regressions, and random forests. The investigation focuses on the effects of dataset size and range on prediction accuracy, factor importance ranking, and the model's generalization ability. The results demonstrate that a larger dataset can significantly improve prediction accuracy when using artificial neural networks and random forests. Additionally, the study highlights the significant impact of dataset size and range on ranking the importance of influence factors. Furthermore, the study reveals that the extrapolation data range significantly affects the extrapolation accuracy of artificial neural networks. Based on the results, massive dataset collection and analysis of dataset feature effects are important steps in an effective and consistent machine learning process flow for solar-thermal desalination, which can promote machine learning as a more general tool in the field of solar-thermal desalination. △ Less

Submitted 24 July, 2023; originally announced July 2023.

arXiv:2307.09707 [pdf, other]

Improved Label Design for Timing Synchronization in OFDM Systems against Multi-path Uncertainty

Authors: Chao** Qing, Shuhai Tang, Na Yang, Chuangui Rao, Jiafan Wang

Abstract: Timing synchronization (TS) is vital for orthogonal frequency division multiplexing (OFDM) systems, which makes the discrete Fourier transform (DFT) window start at the inter-symbol-interference (ISI)-free region. However, the multi-path uncertainty in wireless communication scenarios degrades the TS correctness. To alleviate this degradation, we propose a learning-based TS method enhanced by impr… ▽ More Timing synchronization (TS) is vital for orthogonal frequency division multiplexing (OFDM) systems, which makes the discrete Fourier transform (DFT) window start at the inter-symbol-interference (ISI)-free region. However, the multi-path uncertainty in wireless communication scenarios degrades the TS correctness. To alleviate this degradation, we propose a learning-based TS method enhanced by improving the design of training label. In the proposed method, the classic cross-correlator extracts the initial TS feature for benefiting the following machine learning. Wherein, the network architecture unfolds one classic cross-correlation process. Against the multi-path uncertainty, a novel training label is designed by representing the ISI-free region and especially highlighting its approximate midpoint. Therein, the closer to the region boundary of ISI-free the smaller label values are set, expecting to locate the maximum network output in ISI-free region with a high probability. Then, to guarantee the correctness of labeling, we exploit the priori information of line-of-sight (LOS) to form a LOS-aided labeling. Numerical results confirm that, the proposed training label effectively enhances the correctness of the proposed TS learner against the multi-path uncertainty. △ Less

Submitted 18 July, 2023; originally announced July 2023.

Comments: 5 pages, 5 figures

arXiv:2307.07227 [pdf, ps, other]

doi 10.1109/TWC.2023.3344802

Secure Short-Packet Communications via UAV-Enabled Mobile Relaying: Joint Resource Optimization and 3D Trajectory Design

Authors: Milad Tatar Mamaghani, Xiangyun Zhou, Nan Yang, A. Lee Swindlehurst

Abstract: Short-packet communication (SPC) and unmanned aerial vehicles (UAVs) are anticipated to play crucial roles in the development of 5G-and-beyond wireless networks and the Internet of Things (IoT). In this paper, we propose a secure SPC system, where a UAV serves as a mobile decode-and-forward (DF) relay, periodically receiving and relaying small data packets from a remote IoT device to its receiver… ▽ More Short-packet communication (SPC) and unmanned aerial vehicles (UAVs) are anticipated to play crucial roles in the development of 5G-and-beyond wireless networks and the Internet of Things (IoT). In this paper, we propose a secure SPC system, where a UAV serves as a mobile decode-and-forward (DF) relay, periodically receiving and relaying small data packets from a remote IoT device to its receiver in two hops with strict latency requirements, in the presence of an eavesdropper. This system requires careful optimization of important design parameters, such as the coding blocklengths of both hops, transmit powers, and the UAV's trajectory. While the overall optimization problem is nonconvex, we tackle it by applying a block successive convex approximation (BSCA) approach to divide the original problem into three subproblems and solve them separately. Then, an overall iterative algorithm is proposed to obtain the final design with guaranteed convergence. Our proposed low-complexity algorithm incorporates robust trajectory design and resource management to optimize the effective average secrecy throughput of the communication system over the course of the UAV-relay's mission. Simulation results demonstrate significant performance improvements compared to various benchmark schemes and provide useful design insights on the coding blocklengths and transmit powers along the trajectory of the UAV. △ Less

Submitted 29 December, 2023; v1 submitted 14 July, 2023; originally announced July 2023.

Comments: 14 double-column pages, 8 figures. To appear in IEEE Transactions on Wireless Communications. This is an extended version of our work presented at the 2023 IEEE GlobeCom arXiv:2310.05142

arXiv:2307.07164 [pdf, other]

Learning to Retrieve In-Context Examples for Large Language Models

Authors: Liang Wang, Nan Yang, Furu Wei

Abstract: Large language models (LLMs) have demonstrated their ability to learn in-context, allowing them to perform various tasks based on a few input-output examples. However, the effectiveness of in-context learning is heavily reliant on the quality of the selected examples. In this paper, we propose a novel framework to iteratively train dense retrievers that can identify high-quality in-context example… ▽ More Large language models (LLMs) have demonstrated their ability to learn in-context, allowing them to perform various tasks based on a few input-output examples. However, the effectiveness of in-context learning is heavily reliant on the quality of the selected examples. In this paper, we propose a novel framework to iteratively train dense retrievers that can identify high-quality in-context examples for LLMs. Our framework initially trains a reward model based on LLM feedback to evaluate the quality of candidate examples, followed by knowledge distillation to train a bi-encoder based dense retriever. Our experiments on a suite of $30$ tasks demonstrate that our framework significantly enhances in-context learning performance. Furthermore, we show the generalization ability of our framework to unseen tasks during training. An in-depth analysis reveals that our model improves performance by retrieving examples with similar patterns, and the gains are consistent across LLMs of varying sizes. The code and data are available at https://github.com/microsoft/LMOps/tree/main/llm_retriever . △ Less

Submitted 26 January, 2024; v1 submitted 14 July, 2023; originally announced July 2023.

Comments: Accepted by EACL 2024

arXiv:2307.03424 [pdf, ps, other]

On Tate Milnor-Witt Motives

Authors: Jean Fasel, Nanjun Yang

Abstract: Over Euclidean fields, we prove that extensions and direct summands of MW-motives $\mathbb{Z}(i)[2i]$ are direct sums of $\mathbb{Z}(i)[2i]$, $\mathbb{Z}/2^rη(i)[2i]$ and $\mathbb{Z}/\textbf{l}[i]$, where $l$ is odd and $\textbf{l}=\sum_{i=0}^{l-1}ε^i$. Over Euclidean fields, we prove that extensions and direct summands of MW-motives $\mathbb{Z}(i)[2i]$ are direct sums of $\mathbb{Z}(i)[2i]$, $\mathbb{Z}/2^rη(i)[2i]$ and $\mathbb{Z}/\textbf{l}[i]$, where $l$ is odd and $\textbf{l}=\sum_{i=0}^{l-1}ε^i$. △ Less

Submitted 6 December, 2023; v1 submitted 7 July, 2023; originally announced July 2023.

Comments: 21 pages, comments welcome

arXiv:2307.01366 [pdf, other]

Minimizing Age of Information for Mobile Edge Computing Systems: A Nested Index Approach

Authors: Shuo Chen, Ning Yang, Meng Zhang, Jun Wang

Abstract: Exploiting the computational heterogeneity of mobile devices and edge nodes, mobile edge computation (MEC) provides an efficient approach to achieving real-time applications that are sensitive to information freshness, by offloading tasks from mobile devices to edge nodes. We use the metric Age-of-Information (AoI) to evaluate information freshness. An efficient solution to minimize the AoI for th… ▽ More Exploiting the computational heterogeneity of mobile devices and edge nodes, mobile edge computation (MEC) provides an efficient approach to achieving real-time applications that are sensitive to information freshness, by offloading tasks from mobile devices to edge nodes. We use the metric Age-of-Information (AoI) to evaluate information freshness. An efficient solution to minimize the AoI for the MEC system with multiple users is non-trivial to obtain due to the random computing time. In this paper, we consider multiple users offloading tasks to heterogeneous edge servers in a MEC system. We first reformulate the problem as a Restless Multi-Arm-Bandit (RMAB) problem and establish a hierarchical Markov Decision Process (MDP) to characterize the updating of AoI for the MEC system. Based on the hierarchical MDP, we propose a nested index framework and design a nested index policy with provably asymptotic optimality. Finally, the closed form of the nested index is obtained, which enables the performance tradeoffs between computation complexity and accuracy. Our algorithm leads to an optimality gap reduction of up to 40%, compared to benchmarks. Our algorithm asymptotically approximates the lower bound as the system scalar gets large enough. △ Less

Submitted 3 July, 2023; originally announced July 2023.

arXiv:2307.00217 [pdf, other]

Metric Learning-Based Timing Synchronization by Using Lightweight Neural Network

Authors: Chao** Qing, Na Yang, Shuhai Tang, Chuangui Rao, Jiafan Wang, Hui Lin

Abstract: Timing synchronization (TS) is one of the key tasks in orthogonal frequency division multiplexing (OFDM) systems. However, multi-path uncertainty corrupts the TS correctness, making OFDM systems suffer from a severe inter-symbol-interference (ISI). To tackle this issue, we propose a timing-metric learning-based TS method assisted by a lightweight one-dimensional convolutional neural network (1-D C… ▽ More Timing synchronization (TS) is one of the key tasks in orthogonal frequency division multiplexing (OFDM) systems. However, multi-path uncertainty corrupts the TS correctness, making OFDM systems suffer from a severe inter-symbol-interference (ISI). To tackle this issue, we propose a timing-metric learning-based TS method assisted by a lightweight one-dimensional convolutional neural network (1-D CNN). Specifically, the receptive field of 1-D CNN is specifically designed to extract the metric features from the classic synchronizer. Then, to combat the multi-path uncertainty, we employ the varying delays and gains of multi-path (the characteristics of multi-path uncertainty) to design the timing-metric objective, and thus form the training labels. This is typically different from the existing timing-metric objectives with respect to the timing synchronization point. Our method substantively increases the completeness of training data against the multi-path uncertainty due to the complete preservation of metric information. By this mean, the TS correctness is improved against the multi-path uncertainty. Numerical results demonstrate the effectiveness and generalization of the proposed TS method against the multi-path uncertainty. △ Less

Submitted 1 July, 2023; originally announced July 2023.

Comments: 4 pages, 3 figures

arXiv:2306.17570 [pdf, other]

ELM-based Timing Synchronization for OFDM Systems by Exploiting Computer-aided Training Strategy

Authors: Mintao Zhang, Shuhai Tang, Chao** Qing, Na Yang, Xi Cai, Jiafan Wang

Abstract: Due to the implementation bottleneck of training data collection in realistic wireless communications systems, supervised learning-based timing synchronization (TS) is challenged by the incompleteness of training data. To tackle this bottleneck, we extend the computer-aided approach, with which the local device can generate the training data instead of generating learning labels from the received… ▽ More Due to the implementation bottleneck of training data collection in realistic wireless communications systems, supervised learning-based timing synchronization (TS) is challenged by the incompleteness of training data. To tackle this bottleneck, we extend the computer-aided approach, with which the local device can generate the training data instead of generating learning labels from the received samples collected in realistic systems, and then construct an extreme learning machine (ELM)-based TS network in orthogonal frequency division multiplexing (OFDM) systems. Specifically, by leveraging the rough information of channel impulse responses (CIRs), i.e., root-mean-square (r.m.s) delay, we propose the loose constraint-based and flexible constraint-based training strategies for the learning-label design against the maximum multi-path delay. The underlying mechanism is to improve the completeness of multi-path delays that may appear in the realistic wireless channels and thus increase the statistical efficiency of the designed TS learner. By this means, the proposed ELM-based TS network can alleviate the degradation of generalization performance. Numerical results reveal the robustness and generalization of the proposed scheme against varying parameters. △ Less

Submitted 30 June, 2023; originally announced June 2023.

Comments: 12 pages, 7 figures,

arXiv:2306.15222 [pdf, other]

Learning to Rank in Generative Retrieval

Authors: Yongqi Li, Nan Yang, Liang Wang, Furu Wei, Wenjie Li

Abstract: Generative retrieval stands out as a promising new paradigm in text retrieval that aims to generate identifier strings of relevant passages as the retrieval target. This generative paradigm taps into powerful generative language models, distinct from traditional sparse or dense retrieval methods. However, only learning to generate is insufficient for generative retrieval. Generative retrieval lear… ▽ More Generative retrieval stands out as a promising new paradigm in text retrieval that aims to generate identifier strings of relevant passages as the retrieval target. This generative paradigm taps into powerful generative language models, distinct from traditional sparse or dense retrieval methods. However, only learning to generate is insufficient for generative retrieval. Generative retrieval learns to generate identifiers of relevant passages as an intermediate goal and then converts predicted identifiers into the final passage rank list. The disconnect between the learning objective of autoregressive models and the desired passage ranking target leads to a learning gap. To bridge this gap, we propose a learning-to-rank framework for generative retrieval, dubbed LTRGR. LTRGR enables generative retrieval to learn to rank passages directly, optimizing the autoregressive model toward the final passage ranking target via a rank loss. This framework only requires an additional learning-to-rank training phase to enhance current generative retrieval systems and does not add any burden to the inference stage. We conducted experiments on three public benchmarks, and the results demonstrate that LTRGR achieves state-of-the-art performance among generative retrieval methods. The code and checkpoints are released at https://github.com/liyongqi67/LTRGR. △ Less

Submitted 16 December, 2023; v1 submitted 27 June, 2023; originally announced June 2023.

Comments: AAAI 2024

arXiv:2305.19877 [pdf]

Enhancing interfacial thermal conductance of Si/PVDF by strengthening atomic couplings

Authors: Zhicheng Zong, Shichen Deng, Yangjun Qin, Xiao Wan, Jiahong Zhan, Dengke Ma, Nuo Yang

Abstract: The thermal transport across inorganic/organic interfaces attracts interest for both academic and industry due to its widely applications in flexible electronics etc. Here, the interfacial thermal conductance of inorganic/organic interfaces consisting of silicon and polyvinylidene fluoride is systematically investigated by molecular dynamics simulations. Interestingly, it is demonstrated that a mo… ▽ More The thermal transport across inorganic/organic interfaces attracts interest for both academic and industry due to its widely applications in flexible electronics etc. Here, the interfacial thermal conductance of inorganic/organic interfaces consisting of silicon and polyvinylidene fluoride is systematically investigated by molecular dynamics simulations. Interestingly, it is demonstrated that a modified silicon surface with hydroxyl groups can drastically enhance the conductance by 698%. These results are elucidated based on interfacial couplings and lattice dynamics insights. This study not only provides feasible strategies to effectively modulate the interfacial thermal conductance of inorganic/organic interfaces but also deepens the understanding of the fundamental physics underlying phonon transport across interfaces. △ Less

Submitted 10 June, 2023; v1 submitted 31 May, 2023; originally announced May 2023.

arXiv:2305.16675 [pdf, other]

Multiview Identifiers Enhanced Generative Retrieval

Authors: Yongqi Li, Nan Yang, Liang Wang, Furu Wei, Wenjie Li

Abstract: Instead of simply matching a query to pre-existing passages, generative retrieval generates identifier strings of passages as the retrieval target. At a cost, the identifier must be distinctive enough to represent a passage. Current approaches use either a numeric ID or a text piece (such as a title or substrings) as the identifier. However, these identifiers cannot cover a passage's content well.… ▽ More Instead of simply matching a query to pre-existing passages, generative retrieval generates identifier strings of passages as the retrieval target. At a cost, the identifier must be distinctive enough to represent a passage. Current approaches use either a numeric ID or a text piece (such as a title or substrings) as the identifier. However, these identifiers cannot cover a passage's content well. As such, we are motivated to propose a new type of identifier, synthetic identifiers, that are generated based on the content of a passage and could integrate contextualized information that text pieces lack. Furthermore, we simultaneously consider multiview identifiers, including synthetic identifiers, titles, and substrings. These views of identifiers complement each other and facilitate the holistic ranking of passages from multiple perspectives. We conduct a series of experiments on three public datasets, and the results indicate that our proposed approach performs the best in generative retrieval, demonstrating its effectiveness and robustness. △ Less

Submitted 26 May, 2023; originally announced May 2023.

Comments: ACL 2023 Main Conference

arXiv:2305.14918 [pdf, other]

doi 10.1109/LRA.2023.3273509

Incremental Dense Reconstruction from Monocular Video with Guided Sparse Feature Volume Fusion

Authors: Xingxing Zuo, Nan Yang, Nathaniel Merrill, Binbin Xu, Stefan Leutenegger

Abstract: Incrementally recovering 3D dense structures from monocular videos is of paramount importance since it enables various robotics and AR applications. Feature volumes have recently been shown to enable efficient and accurate incremental dense reconstruction without the need to first estimate depth, but they are not able to achieve as high of a resolution as depth-based methods due to the large memor… ▽ More Incrementally recovering 3D dense structures from monocular videos is of paramount importance since it enables various robotics and AR applications. Feature volumes have recently been shown to enable efficient and accurate incremental dense reconstruction without the need to first estimate depth, but they are not able to achieve as high of a resolution as depth-based methods due to the large memory consumption of high-resolution feature volumes. This letter proposes a real-time feature volume-based dense reconstruction method that predicts TSDF (Truncated Signed Distance Function) values from a novel sparsified deep feature volume, which is able to achieve higher resolutions than previous feature volume-based methods, and is favorable in large-scale outdoor scenarios where the majority of voxels are empty. An uncertainty-aware multi-view stereo (MVS) network is leveraged to infer initial voxel locations of the physical surface in a sparse feature volume. Then for refining the recovered 3D geometry, deep features are attentively aggregated from multiview images at potential surface locations, and temporally fused. Besides achieving higher resolutions than before, our method is shown to produce more complete reconstructions with finer detail in many cases. Extensive evaluations on both public and self-collected datasets demonstrate a very competitive real-time reconstruction result for our method compared to state-of-the-art reconstruction methods in both indoor and outdoor settings. △ Less

Submitted 24 May, 2023; originally announced May 2023.

Comments: 8 pages, 5 figures, RA-L 2023

arXiv:2305.14895 [pdf, other]

doi 10.1088/1674-4527/acd593

The Lobster Eye Imager for Astronomy Onboard the SATech-01 Satellite

Authors: Z. X. Ling, X. J. Sun, C. Zhang, S. L. Sun, G. **, S. N. Zhang, X. F. Zhang, J. B. Chang, F. S. Chen, Y. F. Chen, Z. W. Cheng, W. Fu, Y. X. Han, H. Li, J. F. Li, Y. Li, Z. D. Li, P. R. Liu, Y. H. Lv, X. H. Ma, Y. J. Tang, C. B. Wang, R. J. Xie, Y. L. Xue, A. L. Yan , et al. (101 additional authors not shown)

Abstract: The Lobster Eye Imager for Astronomy (LEIA), a pathfinder of the Wide-field X-ray Telescope of the Einstein Probe (EP) mission, was successfully launched onboard the SATech-01 satellite of the Chinese Academy of Sciences on 27 July 2022. In this paper, we introduce the design and on-ground test results of the LEIA instrument. Using state-of-the-art Micro-Pore Optics (MPO), a wide field-of-view (Fo… ▽ More The Lobster Eye Imager for Astronomy (LEIA), a pathfinder of the Wide-field X-ray Telescope of the Einstein Probe (EP) mission, was successfully launched onboard the SATech-01 satellite of the Chinese Academy of Sciences on 27 July 2022. In this paper, we introduce the design and on-ground test results of the LEIA instrument. Using state-of-the-art Micro-Pore Optics (MPO), a wide field-of-view (FoV) of 346 square degrees (18.6 degrees * 18.6 degrees) of the X-ray imager is realized. An optical assembly composed of 36 MPO chips is used to focus incident X-ray photons, and four large-format complementary metal-oxide semiconductor (CMOS) sensors, each of 6 cm * 6 cm, are used as the focal plane detectors. The instrument has an angular resolution of 4 - 8 arcmin (in FWHM) for the central focal spot of the point spread function, and an effective area of 2 - 3 cm2 at 1 keV in essentially all the directions within the field of view. The detection passband is 0.5 - 4 keV in the soft X-rays and the sensitivity is 2 - 3 * 10-11 erg s-1 cm-2 (about 1 mini-Crab) at 1,000 second observation. The total weight of LEIA is 56 kg and the power is 85 W. The satellite, with a design lifetime of 2 years, operates in a Sun-synchronous orbit of 500 km with an orbital period of 95 minutes. LEIA is paving the way for future missions by verifying in flight the technologies of both novel focusing imaging optics and CMOS sensors for X-ray observation, and by optimizing the working setups of the instrumental parameters. In addition, LEIA is able to carry out scientific observations to find new transients and to monitor known sources in the soft X-ray band, albeit limited useful observing time available. △ Less

Submitted 24 May, 2023; originally announced May 2023.

Comments: Accepted by RAA

arXiv:2305.12687 [pdf, other]

Learn to Flap: Foil Non-parametric Path Planning via Deep Reinforcement Learning

Authors: Z. P. Wang, R. J. Lin, Z. Y. Zhao, P. M. Guo, N. Yang, D. X. Fan

Abstract: To optimize flap** foil performance, the application of deep reinforcement learning (DRL) on controlling foil non-parametric motion is conducted in the present study. Traditional control techniques and simplified motions cannot fully model nonlinear, unsteady and high-dimensional foil-vortex interactions. A DRL-training framework based on Proximal Policy Optimization and Transformer architecture… ▽ More To optimize flap** foil performance, the application of deep reinforcement learning (DRL) on controlling foil non-parametric motion is conducted in the present study. Traditional control techniques and simplified motions cannot fully model nonlinear, unsteady and high-dimensional foil-vortex interactions. A DRL-training framework based on Proximal Policy Optimization and Transformer architecture is proposed. The policy is initialized from the sinusoidal expert display. We first demonstrate the effectiveness of the proposed DRL-training framework which can optimize foil motion while enhancing foil generated thrust. By adjusting reward setting and action threshold, the DRL-optimized foil trajectories can gain further enhancement compared to sinusoidal motion. Via flow analysis of wake morphology and instantaneous pressure distributions, it is found that the DRL-optimized foil can adaptively adjust the phases between motion and shedding vortices to improve hydrodynamic performance. Our results give a hint for solving complex fluid manipulation problems through DRL method. △ Less

Submitted 25 May, 2023; v1 submitted 21 May, 2023; originally announced May 2023.

Comments: submitted to Journal of Fluid Mechanics rapids

arXiv:2305.07247 [pdf, other]

Provably Convergent Schrödinger Bridge with Applications to Probabilistic Time Series Imputation

Authors: Yu Chen, Wei Deng, Shikai Fang, Fengpei Li, Nicole Tianjiao Yang, Yikai Zhang, Kashif Rasul, Shandian Zhe, Anderson Schneider, Yuriy Nevmyvaka

Abstract: The Schrödinger bridge problem (SBP) is gaining increasing attention in generative modeling and showing promising potential even in comparison with the score-based generative models (SGMs). SBP can be interpreted as an entropy-regularized optimal transport problem, which conducts projections onto every other marginal alternatingly. However, in practice, only approximated projections are accessible… ▽ More The Schrödinger bridge problem (SBP) is gaining increasing attention in generative modeling and showing promising potential even in comparison with the score-based generative models (SGMs). SBP can be interpreted as an entropy-regularized optimal transport problem, which conducts projections onto every other marginal alternatingly. However, in practice, only approximated projections are accessible and their convergence is not well understood. To fill this gap, we present a first convergence analysis of the Schrödinger bridge algorithm based on approximated projections. As for its practical applications, we apply SBP to probabilistic time series imputation by generating missing values conditioned on observed data. We show that optimizing the transport cost improves the performance and the proposed algorithm achieves the state-of-the-art result in healthcare and environmental data while exhibiting the advantage of exploring both temporal and feature patterns in probabilistic time series imputation. △ Less

Submitted 10 September, 2023; v1 submitted 12 May, 2023; originally announced May 2023.

Comments: Accepted by ICML 2023

arXiv:2304.12684 [pdf, ps, other]

UAV-assisted IoT Monitoring Network: Adaptive Multiuser Access for Low-Latency and High-Reliability Under Bursty Traffic

Authors: Nilupuli Senadhira, Salman Durrani, Sheeraz A. Alvi, Nan Yang, Xiangyun Zhou

Abstract: In this work, we propose an adaptive system design for an Internet of Things (IoT) monitoring network with latency and reliability requirements, where IoT devices generate time-critical and event-triggered bursty traffic, and an unmanned aerial vehicle (UAV) aggregates and relays sensed data to the base station. Existing transmission schemes based on the overall average traffic rates over-utilize… ▽ More In this work, we propose an adaptive system design for an Internet of Things (IoT) monitoring network with latency and reliability requirements, where IoT devices generate time-critical and event-triggered bursty traffic, and an unmanned aerial vehicle (UAV) aggregates and relays sensed data to the base station. Existing transmission schemes based on the overall average traffic rates over-utilize network resources when traffic is smooth, and suffer from packet collisions when traffic is bursty which occurs in an event of interest. We address such problems by designing an adaptive transmission scheme employing multiuser shared access (MUSA) based grant-free non-orthogonal multiple access and use short packet communication for low latency of the IoT-to-UAV communication. Specifically, to accommodate bursty traffic, we design an analytical framework and formulate an optimization problem to maximize the performance by determining the optimal number of transmission time slots, subject to the stringent reliability and latency constraints. We compare the performance of the proposed scheme with a non-adaptive power-diversity based scheme with a fixed number of time slots. Our results show that the proposed scheme has superior reliability and stability in comparison to the state-of-the-art scheme at moderate to high average traffic rates, while satisfying the stringent latency requirements. △ Less

Submitted 16 May, 2024; v1 submitted 25 April, 2023; originally announced April 2023.

Comments: Submitted for possible journal publication

arXiv:2304.12550 [pdf, other]

Combining Adversaries with Anti-adversaries in Training

Authors: Xiaoling Zhou, Nan Yang, Ou Wu

Abstract: Adversarial training is an effective learning technique to improve the robustness of deep neural networks. In this study, the influence of adversarial training on deep learning models in terms of fairness, robustness, and generalization is theoretically investigated under more general perturbation scope that different samples can have different perturbation directions (the adversarial and anti-adv… ▽ More Adversarial training is an effective learning technique to improve the robustness of deep neural networks. In this study, the influence of adversarial training on deep learning models in terms of fairness, robustness, and generalization is theoretically investigated under more general perturbation scope that different samples can have different perturbation directions (the adversarial and anti-adversarial directions) and varied perturbation bounds. Our theoretical explorations suggest that the combination of adversaries and anti-adversaries (samples with anti-adversarial perturbations) in training can be more effective in achieving better fairness between classes and a better tradeoff between robustness and generalization in some typical learning scenarios (e.g., noisy label learning and imbalance learning) compared with standard adversarial training. On the basis of our theoretical findings, a more general learning objective that combines adversaries and anti-adversaries with varied bounds on each training sample is presented. Meta learning is utilized to optimize the combination weights. Experiments on benchmark datasets under different learning scenarios verify our theoretical findings and the effectiveness of the proposed methodology. △ Less

Submitted 18 May, 2023; v1 submitted 24 April, 2023; originally announced April 2023.

Comments: 8 pages, 5 figures

Journal ref: AAAI2023

arXiv:2304.06226 [pdf, other]

doi 10.1016/j.cpc.2023.109004

Prompt: Probability-Conserved Cross Section Biasing Monte Carlo Particle Transport System

Authors: Zi-Yi Pan, Ni Yang, Ming Tang, Peixun Shen, Xiao-Xiao Cai

Abstract: An open source software package for simulating thermal neutron propagation in geometry is presented. In this system, neutron propagation can be treated by either the particle transport method or the ray-tracing method. Supported by an accurate backend scattering physics engine, this system is capable of reproducing neutron scattering experiments in complex geometries and is expected to be used in… ▽ More An open source software package for simulating thermal neutron propagation in geometry is presented. In this system, neutron propagation can be treated by either the particle transport method or the ray-tracing method. Supported by an accurate backend scattering physics engine, this system is capable of reproducing neutron scattering experiments in complex geometries and is expected to be used in the areas of instrument characterisation, optimisation and data analysis. In this paper, the relevant theories are briefly introduced. The simulation flow and the user input syntax to control it are provided in detail. Five benchmarking simulations, focusing on different aspects of simulation and scattering techniques, are given to demonstrate the applications of this simulation system. They include an idealised total scattering instrument, a monochromatic powder diffractometer, a neutron guide, a chopper and an imaging setup for complex geometries. Simulated results are benchmarked against experimental data or well-established software packages when appropriate. Good agreements are observed. △ Less

Submitted 4 December, 2023; v1 submitted 12 April, 2023; originally announced April 2023.

Comments: 72 pages, 32 figures

arXiv:2304.04487 [pdf, other]

Inference with Reference: Lossless Acceleration of Large Language Models

Authors: Nan Yang, Tao Ge, Liang Wang, Binxing Jiao, Daxin Jiang, Linjun Yang, Rangan Majumder, Furu Wei

Abstract: We propose LLMA, an LLM accelerator to losslessly speed up Large Language Model (LLM) inference with references. LLMA is motivated by the observation that there are abundant identical text spans between the decoding result by an LLM and the reference that is available in many real world scenarios (e.g., retrieved documents). LLMA first selects a text span from the reference and copies its tokens t… ▽ More We propose LLMA, an LLM accelerator to losslessly speed up Large Language Model (LLM) inference with references. LLMA is motivated by the observation that there are abundant identical text spans between the decoding result by an LLM and the reference that is available in many real world scenarios (e.g., retrieved documents). LLMA first selects a text span from the reference and copies its tokens to the decoder and then efficiently checks the tokens' appropriateness as the decoding result in parallel within one decoding step. The improved computational parallelism allows LLMA to achieve over 2x speed-up for LLMs with identical generation results as greedy decoding in many practical generation scenarios where significant overlap between in-context reference and outputs exists (e.g., search engines and multi-turn conversations). △ Less

Submitted 10 April, 2023; originally announced April 2023.

Comments: 9 pages

arXiv:2303.13099 [pdf, other]

Multi-View Zero-Shot Open Intent Induction from Dialogues: Multi Domain Batch and Proxy Gradient Transfer

Authors: Hyukhun Koh, Haesung Pyun, Nakyeong Yang, Kyomin Jung

Abstract: In Task Oriented Dialogue (TOD) system, detecting and inducing new intents are two main challenges to apply the system in the real world. In this paper, we suggest the semantic multi-view model to resolve these two challenges: (1) SBERT for General Embedding (GE), (2) Multi Domain Batch (MDB) for dialogue domain knowledge, and (3) Proxy Gradient Transfer (PGT) for cluster-specialized semantic. MDB… ▽ More In Task Oriented Dialogue (TOD) system, detecting and inducing new intents are two main challenges to apply the system in the real world. In this paper, we suggest the semantic multi-view model to resolve these two challenges: (1) SBERT for General Embedding (GE), (2) Multi Domain Batch (MDB) for dialogue domain knowledge, and (3) Proxy Gradient Transfer (PGT) for cluster-specialized semantic. MDB feeds diverse dialogue datasets to the model at once to tackle the multi-domain problem by learning the multiple domain knowledge. We introduce a novel method PGT, which employs the Siamese network to fine-tune the model with a clustering method directly.Our model can learn how to cluster dialogue utterances by using PGT. Experimental results demonstrate that our multi-view model with MDB and PGT significantly improves the Open Intent Induction performance compared to baseline systems. △ Less

Submitted 13 August, 2023; v1 submitted 23 March, 2023; originally announced March 2023.

Comments: 8 pages, 3 figures, SIGDIAL DSTC 2023 workshop

arXiv:2303.12341 [pdf, other]

EasyDGL: Encode, Train and Interpret for Continuous-time Dynamic Graph Learning

Authors: Chao Chen, Haoyu Geng, Nianzu Yang, Xiaokang Yang, Junchi Yan

Abstract: Dynamic graphs arise in various real-world applications, and it is often welcomed to model the dynamics directly in continuous time domain for its flexibility. This paper aims to design an easy-to-use pipeline (termed as EasyDGL which is also due to its implementation by DGL toolkit) composed of three key modules with both strong fitting ability and interpretability. Specifically the proposed pipe… ▽ More Dynamic graphs arise in various real-world applications, and it is often welcomed to model the dynamics directly in continuous time domain for its flexibility. This paper aims to design an easy-to-use pipeline (termed as EasyDGL which is also due to its implementation by DGL toolkit) composed of three key modules with both strong fitting ability and interpretability. Specifically the proposed pipeline which involves encoding, training and interpreting: i) a temporal point process (TPP) modulated attention architecture to endow the continuous-time resolution with the coupled spatiotemporal dynamics of the observed graph with edge-addition events; ii) a principled loss composed of task-agnostic TPP posterior maximization based on observed events on the graph, and a task-aware loss with a masking strategy over dynamic graph, where the covered tasks include dynamic link prediction, dynamic node classification and node traffic forecasting; iii) interpretation of the model outputs (e.g., representations and predictions) with scalable perturbation-based quantitative analysis in the graph Fourier domain, which could more comprehensively reflect the behavior of the learned model. Extensive experimental results on public benchmarks show the superior performance of our EasyDGL for time-conditioned predictive tasks, and in particular demonstrate that EasyDGL can effectively quantify the predictive power of frequency content that a model learn from the evolving graph data. △ Less

Submitted 22 March, 2023; originally announced March 2023.

Comments: 9 figures, 7 tables

arXiv:2303.11339 [pdf, other]

FedMAE: Federated Self-Supervised Learning with One-Block Masked Auto-Encoder

Authors: Nan Yang, Xuanyu Chen, Charles Z. Liu, Dong Yuan, Wei Bao, Lizhen Cui

Abstract: Latest federated learning (FL) methods started to focus on how to use unlabeled data in clients for training due to users' privacy concerns, high labeling costs, or lack of expertise. However, current Federated Semi-Supervised/Self-Supervised Learning (FSSL) approaches fail to learn large-scale images because of the limited computing resources of local clients. In this paper, we introduce a new fr… ▽ More Latest federated learning (FL) methods started to focus on how to use unlabeled data in clients for training due to users' privacy concerns, high labeling costs, or lack of expertise. However, current Federated Semi-Supervised/Self-Supervised Learning (FSSL) approaches fail to learn large-scale images because of the limited computing resources of local clients. In this paper, we introduce a new framework FedMAE, which stands for Federated Masked AutoEncoder, to address the problem of how to utilize unlabeled large-scale images for FL. Specifically, FedMAE can pre-train one-block Masked AutoEncoder (MAE) using large images in lightweight client devices, and then cascades multiple pre-trained one-block MAEs in the server to build a multi-block ViT backbone for downstream tasks. Theoretical analysis and experimental results on image reconstruction and classification show that our FedMAE achieves superior performance compared to the state-of-the-art FSSL methods. △ Less

Submitted 20 March, 2023; originally announced March 2023.

arXiv:2303.07678 [pdf, other]

Query2doc: Query Expansion with Large Language Models

Authors: Liang Wang, Nan Yang, Furu Wei

Abstract: This paper introduces a simple yet effective query expansion approach, denoted as query2doc, to improve both sparse and dense retrieval systems. The proposed method first generates pseudo-documents by few-shot prompting large language models (LLMs), and then expands the query with generated pseudo-documents. LLMs are trained on web-scale text corpora and are adept at knowledge memorization. The ps… ▽ More This paper introduces a simple yet effective query expansion approach, denoted as query2doc, to improve both sparse and dense retrieval systems. The proposed method first generates pseudo-documents by few-shot prompting large language models (LLMs), and then expands the query with generated pseudo-documents. LLMs are trained on web-scale text corpora and are adept at knowledge memorization. The pseudo-documents from LLMs often contain highly relevant information that can aid in query disambiguation and guide the retrievers. Experimental results demonstrate that query2doc boosts the performance of BM25 by 3% to 15% on ad-hoc IR datasets, such as MS-MARCO and TREC DL, without any model fine-tuning. Furthermore, our method also benefits state-of-the-art dense retrievers in terms of both in-domain and out-of-domain results. △ Less

Submitted 11 October, 2023; v1 submitted 14 March, 2023; originally announced March 2023.

Comments: Accepted to EMNLP 2023

arXiv:2303.05205 [pdf, other]

Real-time scheduling of renewable power systems through planning-based reinforcement learning

Authors: Shaohuai Liu, **bo Liu, Weirui Ye, Nan Yang, Guanglun Zhang, Haiwang Zhong, Chongqing Kang, Qirong Jiang, Xuri Song, Fangchun Di, Yang Gao

Abstract: The growing renewable energy sources have posed significant challenges to traditional power scheduling. It is difficult for operators to obtain accurate day-ahead forecasts of renewable generation, thereby requiring the future scheduling system to make real-time scheduling decisions aligning with ultra-short-term forecasts. Restricted by the computation speed, traditional optimization-based method… ▽ More The growing renewable energy sources have posed significant challenges to traditional power scheduling. It is difficult for operators to obtain accurate day-ahead forecasts of renewable generation, thereby requiring the future scheduling system to make real-time scheduling decisions aligning with ultra-short-term forecasts. Restricted by the computation speed, traditional optimization-based methods can not solve this problem. Recent developments in reinforcement learning (RL) have demonstrated the potential to solve this challenge. However, the existing RL methods are inadequate in terms of constraint complexity, algorithm performance, and environment fidelity. We are the first to propose a systematic solution based on the state-of-the-art reinforcement learning algorithm and the real power grid environment. The proposed approach enables planning and finer time resolution adjustments of power generators, including unit commitment and economic dispatch, thus increasing the grid's ability to admit more renewable energy. The well-trained scheduling agent significantly reduces renewable curtailment and load shedding, which are issues arising from traditional scheduling's reliance on inaccurate day-ahead forecasts. High-frequency control decisions exploit the existing units' flexibility, reducing the power grid's dependence on hardware transformations and saving investment and operating costs, as demonstrated in experimental results. This research exhibits the potential of reinforcement learning in promoting low-carbon and intelligent power systems and represents a solid step toward sustainable electricity generation. △ Less

Submitted 13 March, 2023; v1 submitted 9 March, 2023; originally announced March 2023.

Comments: 12 pages, 7 figures

arXiv:2303.04772 [pdf, other]

Multilevel Diffusion: Infinite Dimensional Score-Based Diffusion Models for Image Generation

Authors: Paul Hagemann, Sophie Mildenberger, Lars Ruthotto, Gabriele Steidl, Nicole Tianjiao Yang

Abstract: Score-based diffusion models (SBDM) have recently emerged as state-of-the-art approaches for image generation. Existing SBDMs are typically formulated in a finite-dimensional setting, where images are considered as tensors of finite size. This paper develops SBDMs in the infinite-dimensional setting, that is, we model the training data as functions supported on a rectangular domain. Besides the qu… ▽ More Score-based diffusion models (SBDM) have recently emerged as state-of-the-art approaches for image generation. Existing SBDMs are typically formulated in a finite-dimensional setting, where images are considered as tensors of finite size. This paper develops SBDMs in the infinite-dimensional setting, that is, we model the training data as functions supported on a rectangular domain. Besides the quest for generating images at ever higher resolution, our primary motivation is to create a well-posed infinite-dimensional learning problem so that we can discretize it consistently on multiple resolution levels. We thereby intend to obtain diffusion models that generalize across different resolution levels and improve the efficiency of the training process. We demonstrate how to overcome two shortcomings of current SBDM approaches in the infinite-dimensional setting. First, we modify the forward process to ensure that the latent distribution is well-defined in the infinite-dimensional setting using the notion of trace class operators. We derive the reverse processes for finite approximations. Second, we illustrate that approximating the score function with an operator network is beneficial for multilevel training. After deriving the convergence of the discretization and the approximation of multilevel training, we implement an infinite-dimensional SBDM approach and show the first promising results on MNIST and Fashion-MNIST, underlining our developed theory. △ Less

Submitted 4 November, 2023; v1 submitted 8 March, 2023; originally announced March 2023.

MSC Class: 60H10; 65D18

arXiv:2302.12397 [pdf, other]

Cascaded ELM-based Joint Frame Synchronization and Channel Estimation over Rician Fading Channel with Hardware Imperfections

Authors: Chao** Qing, Chuangui Rao, Shuhai Tang, Na Yang, Jiafan Wang

Abstract: Due to the interdependency of frame synchronization (FS) and channel estimation (CE), joint FS and CE (JFSCE) schemes are proposed to enhance their functionalities and therefore boost the overall performance of wireless communication systems. Although traditional JFSCE schemes alleviate the influence between FS and CE, they show deficiencies in dealing with hardware imperfection (HI) and determini… ▽ More Due to the interdependency of frame synchronization (FS) and channel estimation (CE), joint FS and CE (JFSCE) schemes are proposed to enhance their functionalities and therefore boost the overall performance of wireless communication systems. Although traditional JFSCE schemes alleviate the influence between FS and CE, they show deficiencies in dealing with hardware imperfection (HI) and deterministic line-of-sight (LOS) path. To tackle this challenge, we proposed a cascaded ELM-based JFSCE to alleviate the influence of HI in the scenario of the Rician fading channel. Specifically, the conventional JFSCE method is first employed to extract the initial features, and thus forms the non-Neural Network (NN) solutions for FS and CE, respectively. Then, the ELM-based networks, named FS-NET and CE-NET, are cascaded to capture the NN solutions of FS and CE. Simulation and analysis results show that, compared with the conventional JFSCE methods, the proposed cascaded ELM-based JFSCE significantly reduces the error probability of FS and the normalized mean square error (NMSE) of CE, even against the impacts of parameter variations. △ Less

Submitted 23 February, 2023; originally announced February 2023.

Comments: 12 pages, 9 figures

arXiv:2302.11823 [pdf, other]

FedIL: Federated Incremental Learning from Decentralized Unlabeled Data with Convergence Analysis

Authors: Nan Yang, Dong Yuan, Charles Z Liu, Yongkun Deng, Wei Bao

Abstract: Most existing federated learning methods assume that clients have fully labeled data to train on, while in reality, it is hard for the clients to get task-specific labels due to users' privacy concerns, high labeling costs, or lack of expertise. This work considers the server with a small labeled dataset and intends to use unlabeled data in multiple clients for semi-supervised learning. We propose… ▽ More Most existing federated learning methods assume that clients have fully labeled data to train on, while in reality, it is hard for the clients to get task-specific labels due to users' privacy concerns, high labeling costs, or lack of expertise. This work considers the server with a small labeled dataset and intends to use unlabeled data in multiple clients for semi-supervised learning. We propose a new framework with a generalized model, Federated Incremental Learning (FedIL), to address the problem of how to utilize labeled data in the server and unlabeled data in clients separately in the scenario of Federated Learning (FL). FedIL uses the Iterative Similarity Fusion to enforce the server-client consistency on the predictions of unlabeled data and uses incremental confidence to establish a credible pseudo-label set in each client. We show that FedIL will accelerate model convergence by Cosine Similarity with normalization, proved by Banach Fixed Point Theorem. The code is available at https://anonymous.4open.science/r/fedil. △ Less

Submitted 23 February, 2023; originally announced February 2023.

arXiv:2302.08682 [pdf, other]

Random Padding Data Augmentation

Authors: Nan Yang, Laicheng Zhong, Fan Huang, Dong Yuan, Wei Bao

Abstract: The convolutional neural network (CNN) learns the same object in different positions in images, which can improve the recognition accuracy of the model. An implication of this is that CNN may know where the object is. The usefulness of the features' spatial information in CNNs has not been well investigated. In this paper, we found that the model's learning of features' position information hinder… ▽ More The convolutional neural network (CNN) learns the same object in different positions in images, which can improve the recognition accuracy of the model. An implication of this is that CNN may know where the object is. The usefulness of the features' spatial information in CNNs has not been well investigated. In this paper, we found that the model's learning of features' position information hindered the learning of the features' relationship. Therefore, we introduced Random Padding, a new type of padding method for training CNNs that impairs the architecture's capacity to learn position information by adding zero-padding randomly to half of the border of feature maps. Random Padding is parameter-free, simple to construct, and compatible with the majority of CNN-based recognition models. This technique is also complementary to data augmentations such as random crop**, rotation, flip** and erasing, and consistently improves the performance of image classification over strong baselines. △ Less

Submitted 16 February, 2023; originally announced February 2023.

arXiv:2302.02151 [pdf, other]

doi 10.1145/3543507.3583286

Contrastive Collaborative Filtering for Cold-Start Item Recommendation

Authors: Zhihui Zhou, Lilin Zhang, Ning Yang

Abstract: The cold-start problem is a long-standing challenge in recommender systems. As a promising solution, content-based generative models usually project a cold-start item's content onto a warm-start item embedding to capture collaborative signals from item content so that collaborative filtering can be applied. However, since the training of the cold-start recommendation models is conducted on warm da… ▽ More The cold-start problem is a long-standing challenge in recommender systems. As a promising solution, content-based generative models usually project a cold-start item's content onto a warm-start item embedding to capture collaborative signals from item content so that collaborative filtering can be applied. However, since the training of the cold-start recommendation models is conducted on warm datasets, the existent methods face the issue that the collaborative embeddings of items will be blurred, which significantly degenerates the performance of cold-start item recommendation. To address this issue, we propose a novel model called Contrastive Collaborative Filtering for Cold-start item Recommendation (CCFCRec), which capitalizes on the co-occurrence collaborative signals in warm training data to alleviate the issue of blurry collaborative embeddings for cold-start item recommendation. In particular, we devise a contrastive collaborative filtering (CF) framework, consisting of a content CF module and a co-occurrence CF module to generate the content-based collaborative embedding and the co-occurrence collaborative embedding for a training item, respectively. During the joint training of the two CF modules, we apply a contrastive learning between the two collaborative embeddings, by which the knowledge about the co-occurrence signals can be indirectly transferred to the content CF module, so that the blurry collaborative embeddings can be rectified implicitly by the memorized co-occurrence collaborative signals during the applying phase. Together with the sound theoretical analysis, the extensive experiments conducted on real datasets demonstrate the superiority of the proposed model. The codes and datasets are available on https://github.com/zzhin/CCFCRec. △ Less

Submitted 22 February, 2023; v1 submitted 4 February, 2023; originally announced February 2023.

Comments: This paper has been accepted by WWW '23

arXiv:2301.09069 [pdf, other]

Provable Unrestricted Adversarial Training without Compromise with Generalizability

Authors: Lilin Zhang, Ning Yang, Yanchao Sun, Philip S. Yu

Abstract: Adversarial training (AT) is widely considered as the most promising strategy to defend against adversarial attacks and has drawn increasing interest from researchers. However, the existing AT methods still suffer from two challenges. First, they are unable to handle unrestricted adversarial examples (UAEs), which are built from scratch, as opposed to restricted adversarial examples (RAEs), which… ▽ More Adversarial training (AT) is widely considered as the most promising strategy to defend against adversarial attacks and has drawn increasing interest from researchers. However, the existing AT methods still suffer from two challenges. First, they are unable to handle unrestricted adversarial examples (UAEs), which are built from scratch, as opposed to restricted adversarial examples (RAEs), which are created by adding perturbations bound by an $l_p$ norm to observed examples. Second, the existing AT methods often achieve adversarial robustness at the expense of standard generalizability (i.e., the accuracy on natural examples) because they make a tradeoff between them. To overcome these challenges, we propose a unique viewpoint that understands UAEs as imperceptibly perturbed unobserved examples. Also, we find that the tradeoff results from the separation of the distributions of adversarial examples and natural examples. Based on these ideas, we propose a novel AT approach called Provable Unrestricted Adversarial Training (PUAT), which can provide a target classifier with comprehensive adversarial robustness against both UAE and RAE, and simultaneously improve its standard generalizability. Particularly, PUAT utilizes partially labeled data to achieve effective UAE generation by accurately capturing the natural data distribution through a novel augmented triple-GAN. At the same time, PUAT extends the traditional AT by introducing the supervised loss of the target classifier into the adversarial loss and achieves the alignment between the UAE distribution, the natural data distribution, and the distribution learned by the classifier, with the collaboration of the augmented triple-GAN. Finally, the solid theoretical analysis and extensive experiments conducted on widely-used benchmarks demonstrate the superiority of PUAT. △ Less

Submitted 18 May, 2024; v1 submitted 22 January, 2023; originally announced January 2023.

arXiv:2301.07668 [pdf, other]

Behind the Scenes: Density Fields for Single View Reconstruction

Authors: Felix Wimbauer, Nan Yang, Christian Rupprecht, Daniel Cremers

Abstract: Inferring a meaningful geometric scene representation from a single image is a fundamental problem in computer vision. Approaches based on traditional depth map prediction can only reason about areas that are visible in the image. Currently, neural radiance fields (NeRFs) can capture true 3D including color, but are too complex to be generated from a single image. As an alternative, we propose to… ▽ More Inferring a meaningful geometric scene representation from a single image is a fundamental problem in computer vision. Approaches based on traditional depth map prediction can only reason about areas that are visible in the image. Currently, neural radiance fields (NeRFs) can capture true 3D including color, but are too complex to be generated from a single image. As an alternative, we propose to predict implicit density fields. A density field maps every location in the frustum of the input image to volumetric density. By directly sampling color from the available views instead of storing color in the density field, our scene representation becomes significantly less complex compared to NeRFs, and a neural network can predict it in a single forward pass. The prediction network is trained through self-supervision from only video data. Our formulation allows volume rendering to perform both depth prediction and novel view synthesis. Through experiments, we show that our method is able to predict meaningful geometry for regions that are occluded in the input image. Additionally, we demonstrate the potential of our approach on three datasets for depth prediction and novel-view synthesis. △ Less

Submitted 19 April, 2023; v1 submitted 18 January, 2023; originally announced January 2023.

Comments: Project Page: https://fwmb.github.io/bts/

arXiv:2301.01147 [pdf, other]

4Seasons: Benchmarking Visual SLAM and Long-Term Localization for Autonomous Driving in Challenging Conditions

Authors: Patrick Wenzel, Nan Yang, Rui Wang, Niclas Zeller, Daniel Cremers

Abstract: In this paper, we present a novel visual SLAM and long-term localization benchmark for autonomous driving in challenging conditions based on the large-scale 4Seasons dataset. The proposed benchmark provides drastic appearance variations caused by seasonal changes and diverse weather and illumination conditions. While significant progress has been made in advancing visual SLAM on small-scale datase… ▽ More In this paper, we present a novel visual SLAM and long-term localization benchmark for autonomous driving in challenging conditions based on the large-scale 4Seasons dataset. The proposed benchmark provides drastic appearance variations caused by seasonal changes and diverse weather and illumination conditions. While significant progress has been made in advancing visual SLAM on small-scale datasets with similar conditions, there is still a lack of unified benchmarks representative of real-world scenarios for autonomous driving. We introduce a new unified benchmark for jointly evaluating visual odometry, global place recognition, and map-based visual localization performance which is crucial to successfully enable autonomous driving in any condition. The data has been collected for more than one year, resulting in more than 300 km of recordings in nine different environments ranging from a multi-level parking garage to urban (including tunnels) to countryside and highway. We provide globally consistent reference poses with up to centimeter-level accuracy obtained from the fusion of direct stereo-inertial odometry with RTK GNSS. We evaluate the performance of several state-of-the-art visual odometry and visual localization baseline approaches on the benchmark and analyze their properties. The experimental results provide new insights into current approaches and show promising potential for future research. Our benchmark and evaluation protocols will be available at https://www.4seasons-dataset.com/. △ Less

Submitted 31 December, 2022; originally announced January 2023.

Comments: arXiv admin note: substantial text overlap with arXiv:2009.06364

arXiv:2301.00138 [pdf, ps, other]

Chaos and Entanglement in Non-Markovian Optomechanical Systems

Authors: Pengju Chen, Nan Yang, Austen Couvertier, Quanzhen Ding, Rupak Chatterjee, Ting Yu

Abstract: We study the chaotic motion of an optomechanical system coupled to a non-Markovian environment. We show that the environmental memory time can significantly affect chaos in an enhancing way. In addition to classical chaotic motion, the quantum entanglement in the presence of chaos is investigated. It is found that both the environmental memory and chaos can lift up bipartite entanglement in a non-… ▽ More We study the chaotic motion of an optomechanical system coupled to a non-Markovian environment. We show that the environmental memory time can significantly affect chaos in an enhancing way. In addition to classical chaotic motion, the quantum entanglement in the presence of chaos is investigated. It is found that both the environmental memory and chaos can lift up bipartite entanglement in a non-linear optomechanical system. These observations may help expand our understanding of the transition from classical to quantum dynamics. △ Less

Submitted 31 December, 2022; originally announced January 2023.

arXiv:2212.10657 [pdf, other]

The impact of gas accretion and AGN feedback on the scatter of the mass-metallicity relation

Authors: Nancy Yang, Dirk Scholte, Amelie Saintonge

Abstract: The gas-phase metallicity of galaxies encodes important information about galaxy evolution processes, in particular star formation, feedback, outflows and gas accretion, the relative importance of which can be extracted from systematic trends in the scatter of the mass-metallicity relation (MZR). Here, we use a sample of low redshift (0.02 < z < 0.055) galaxies from SDSS to investigate the nature… ▽ More The gas-phase metallicity of galaxies encodes important information about galaxy evolution processes, in particular star formation, feedback, outflows and gas accretion, the relative importance of which can be extracted from systematic trends in the scatter of the mass-metallicity relation (MZR). Here, we use a sample of low redshift (0.02 < z < 0.055) galaxies from SDSS to investigate the nature of the scatter around the MZR, the observables and physical processes causing it, and its dependence on galaxy mass. We use cold gas masses inferred from optical emission lines using the technique of Scholte & Saintonge (2023) to confirm that at fixed stellar mass, metallicity and gas mass are anti-correlated, but only for galaxies up to M*= 10^{10.5} Msun. In that mass regime, we find a link between the offset of a galaxy from the MZR and halo mass, using the amplitude of the two-point correlation function as a proxy for halo mass; at fixed stellar mass, the most gas-poor galaxies reside in the most massive halos. This observation is consistent with changes in gas accretion rates onto galaxies as a function of halo mass, with environmental effects acting on satellite galaxies also contributing. At higher stellar masses, the scatter of the MZR does no longer correlate with gas or halo mass. Instead, there is some indication of a link with AGN activity, as expected from models and simulations that metallicity is set by the interplay between gas in- and outflows, star formation, and AGN feedback, sha** the MZR and its scatter. △ Less

Submitted 26 December, 2023; v1 submitted 20 December, 2022; originally announced December 2022.

Comments: 10 pages, 6 figures. Accepted for publication in MNRAS. Updated to reflect minor changes made during the referring process

arXiv:2212.03533 [pdf, other]

Text Embeddings by Weakly-Supervised Contrastive Pre-training

Authors: Liang Wang, Nan Yang, Xiaolong Huang, Binxing Jiao, Linjun Yang, Daxin Jiang, Rangan Majumder, Furu Wei

Abstract: This paper presents E5, a family of state-of-the-art text embeddings that transfer well to a wide range of tasks. The model is trained in a contrastive manner with weak supervision signals from our curated large-scale text pair dataset (called CCPairs). E5 can be readily used as a general-purpose embedding model for any tasks requiring a single-vector representation of texts such as retrieval, clu… ▽ More This paper presents E5, a family of state-of-the-art text embeddings that transfer well to a wide range of tasks. The model is trained in a contrastive manner with weak supervision signals from our curated large-scale text pair dataset (called CCPairs). E5 can be readily used as a general-purpose embedding model for any tasks requiring a single-vector representation of texts such as retrieval, clustering, and classification, achieving strong performance in both zero-shot and fine-tuned settings. We conduct extensive evaluations on 56 datasets from the BEIR and MTEB benchmarks. For zero-shot settings, E5 is the first model that outperforms the strong BM25 baseline on the BEIR retrieval benchmark without using any labeled data. When fine-tuned, E5 obtains the best results on the MTEB benchmark, beating existing embedding models with 40x more parameters. △ Less

Submitted 22 February, 2024; v1 submitted 7 December, 2022; originally announced December 2022.

Comments: 17 pages, v2 fixes the SummEval numbers

arXiv:2212.02947 [pdf, other]

CNN-based Timing Synchronization for OFDM Systems Assisted by Initial Path Acquisition in Frequency Selective Fading Channel

Authors: Chao** Qing, Na Yang, Shuhai Tang, Chuangui Rao, Jiafan Wang, **liang Chen

Abstract: Multi-path fading seriously affects the accuracy of timing synchronization (TS) in orthogonal frequency division multiplexing (OFDM) systems. To tackle this issue, we propose a convolutional neural network (CNN)-based TS scheme assisted by initial path acquisition in this paper. Specifically, the classic cross-correlation method is first employed to estimate a coarse timing offset and capture an i… ▽ More Multi-path fading seriously affects the accuracy of timing synchronization (TS) in orthogonal frequency division multiplexing (OFDM) systems. To tackle this issue, we propose a convolutional neural network (CNN)-based TS scheme assisted by initial path acquisition in this paper. Specifically, the classic cross-correlation method is first employed to estimate a coarse timing offset and capture an initial path, which shrinks the TS search region. Then, a one-dimensional (1-D) CNN is developed to optimize the TS of OFDM systems. Due to the narrowed search region of TS, the CNN-based TS effectively locates the accurate TS point and inspires us to construct a lightweight network in terms of computational complexity and online running time. Compared with the compressed sensing-based TS method and extreme learning machine-based TS method, simulation results show that the proposed method can effectively improve the TS performance with the reduced computational complexity and online running time. Besides, the proposed TS method presents robustness against the variant parameters of multi-path fading channels. △ Less

Submitted 6 December, 2022; originally announced December 2022.

Comments: 5 pages, 3 figures

arXiv:2211.14603 [pdf, ps, other]

Analysis of Molecule Harvesting by Heterogeneous Receptors on MC Transmitters

Authors: Xinyu Huang, Yu Huang, Miaowen Wen, Nan Yang, Robert Schober

Abstract: This paper designs a molecule harvesting transmitter (TX) model, where the surface of a spherical TX is covered by heterogeneous receptors with different sizes and arbitrary locations. If molecules hit any receptor, they are absorbed by the TX immediately. Within the TX, molecules are stored in vesicles that are continuously generated and released by the TX via the membrane fusion process. Conside… ▽ More This paper designs a molecule harvesting transmitter (TX) model, where the surface of a spherical TX is covered by heterogeneous receptors with different sizes and arbitrary locations. If molecules hit any receptor, they are absorbed by the TX immediately. Within the TX, molecules are stored in vesicles that are continuously generated and released by the TX via the membrane fusion process. Considering a transparent receiver (RX) and molecular degradation during the propagation from the TX to the RX, we derive the molecule release rate and the fraction of molecules absorbed by the TX as well as the received signal at the RX. Notably, this analytical result is applicable for different numbers, sizes, and locations of receptors, and its accuracy is verified via particle-based simulations. Numerical results show that different vesicle generation rates result in the same number of molecules absorbed by the TX, but different peak received signals at the RX. △ Less

Submitted 17 October, 2023; v1 submitted 26 November, 2022; originally announced November 2022.

Comments: 7 pages, 4 figures. This work has been accepted by IEEE GLOBECOM 2023. Copyright may be transferred without notice, after which this version may no longer be accessible

arXiv:2211.13128 [pdf, other]

A Closed-loop Sleep Modulation System with FPGA-Accelerated Deep Learning

Authors: Mingzhe Sun, Aaron Zhou, Naize Yang, Yaqian Xu, Yuhan Hou, Xilin Liu

Abstract: Closed-loop sleep modulation is an emerging research paradigm to treat sleep disorders and enhance sleep benefits. However, two major barriers hinder the widespread application of this research paradigm. First, subjects often need to be wire-connected to rack-mount instrumentation for data acquisition, which negatively affects sleep quality. Second, conventional real-time sleep stage classificatio… ▽ More Closed-loop sleep modulation is an emerging research paradigm to treat sleep disorders and enhance sleep benefits. However, two major barriers hinder the widespread application of this research paradigm. First, subjects often need to be wire-connected to rack-mount instrumentation for data acquisition, which negatively affects sleep quality. Second, conventional real-time sleep stage classification algorithms give limited performance. In this work, we conquer these two limitations by develo** a sleep modulation system that supports closed-loop operations on the device. Sleep stage classification is performed using a lightweight deep learning (DL) model accelerated by a low-power field-programmable gate array (FPGA) device. The DL model uses a single channel electroencephalogram (EEG) as input. Two convolutional neural networks (CNNs) are used to capture general and detailed features, and a bidirectional long-short-term memory (LSTM) network is used to capture time-variant sequence features. An 8-bit quantization is used to reduce the computational cost without compromising performance. The DL model has been validated using a public sleep database containing 81 subjects, achieving a state-of-the-art classification accuracy of 85.8% and a F1-score of 79%. The developed model has also shown the potential to be generalized to different channels and input data lengths. Closed-loop in-phase auditory stimulation has been demonstrated on the test bench. △ Less

Submitted 18 November, 2022; originally announced November 2022.

arXiv:2211.05910 [pdf, other]

Efficient and Accurate Quantized Image Super-Resolution on Mobile NPUs, Mobile AI & AIM 2022 challenge: Report

Authors: Andrey Ignatov, Radu Timofte, Maurizio Denna, Abdel Younes, Ganzorig Gankhuyag, **gang Huh, Myeong Kyun Kim, Kihwan Yoon, Hyeon-Cheol Moon, Seungho Lee, Yoonsik Choe, **woo Jeong, Sungjei Kim, Maciej Smyl, Tomasz Latkowski, Pawel Kubik, Michal Sokolski, Yujie Ma, Jiahao Chao, Zhou Zhou, Hongfan Gao, Zhengfeng Yang, Zhenbing Zeng, Zhengyang Zhuge, Chenghua Li , et al. (71 additional authors not shown)

Abstract: Image super-resolution is a common task on mobile and IoT devices, where one often needs to upscale and enhance low-resolution images and video frames. While numerous solutions have been proposed for this problem in the past, they are usually not compatible with low-power mobile NPUs having many computational and memory constraints. In this Mobile AI challenge, we address this problem and propose… ▽ More Image super-resolution is a common task on mobile and IoT devices, where one often needs to upscale and enhance low-resolution images and video frames. While numerous solutions have been proposed for this problem in the past, they are usually not compatible with low-power mobile NPUs having many computational and memory constraints. In this Mobile AI challenge, we address this problem and propose the participants to design an efficient quantized image super-resolution solution that can demonstrate a real-time performance on mobile NPUs. The participants were provided with the DIV2K dataset and trained INT8 models to do a high-quality 3X image upscaling. The runtime of all models was evaluated on the Synaptics VS680 Smart Home board with a dedicated edge NPU capable of accelerating quantized neural networks. All proposed solutions are fully compatible with the above NPU, demonstrating an up to 60 FPS rate when reconstructing Full HD resolution images. A detailed description of all models developed in the challenge is provided in this paper. △ Less

Submitted 7 November, 2022; originally announced November 2022.

Comments: arXiv admin note: text overlap with arXiv:2105.07825, arXiv:2105.08826, arXiv:2211.04470, arXiv:2211.03885, arXiv:2211.05256

arXiv:2211.00874 [pdf, other]

Age of Information of Multi-user Mobile Edge Computing Systems

Authors: Zhifeng Tang, Zhuo Sun, Nan Yang, Xiangyun Zhou

Abstract: In this paper, we analyze the average age of information (AoI) and the average peak AoI (PAoI) of a multiuser mobile edge computing (MEC) system where a base station (BS) generates and transmits computation-intensive packets to user equipments (UEs). In this MEC system, we focus on three computing schemes: (i) The local computing scheme where all computational tasks are computed by the local serve… ▽ More In this paper, we analyze the average age of information (AoI) and the average peak AoI (PAoI) of a multiuser mobile edge computing (MEC) system where a base station (BS) generates and transmits computation-intensive packets to user equipments (UEs). In this MEC system, we focus on three computing schemes: (i) The local computing scheme where all computational tasks are computed by the local server at the UE, (ii) The edge computing scheme where all computational tasks are computed by the edge server at the BS, and (iii) The partial computing scheme where computational tasks are partially allocated at the edge server and the rest are computed by the local server. Considering exponentially distributed transmission time and computation time and adopting the first come first serve (FCFS) queuing policy, we derive closed-form expressions for the average AoI and average PAoI. To address the complexity of the average AoI expression, we derive simple upper and lower bounds on the average AoI, which allow us to explicitly examine the dependence of the optimal offloading decision on the MEC system parameters. Aided by simulation results, we verify our analysis and illustrate the impact of system parameters on the AoI performance. △ Less

Submitted 2 November, 2022; originally announced November 2022.

arXiv:2211.00869 [pdf, other]

Title2Event: Benchmarking Open Event Extraction with a Large-scale Chinese Title Dataset

Authors: Haolin Deng, Yanan Zhang, Yangfan Zhang, Wangyang Ying, Changlong Yu, Jun Gao, Wei Wang, Xiaoling Bai, Nan Yang, ** Ma, Xiang Chen, Tianhua Zhou

Abstract: Event extraction (EE) is crucial to downstream tasks such as new aggregation and event knowledge graph construction. Most existing EE datasets manually define fixed event types and design specific schema for each of them, failing to cover diverse events emerging from the online text. Moreover, news titles, an important source of event mentions, have not gained enough attention in current EE resear… ▽ More Event extraction (EE) is crucial to downstream tasks such as new aggregation and event knowledge graph construction. Most existing EE datasets manually define fixed event types and design specific schema for each of them, failing to cover diverse events emerging from the online text. Moreover, news titles, an important source of event mentions, have not gained enough attention in current EE research. In this paper, We present Title2Event, a large-scale sentence-level dataset benchmarking Open Event Extraction without restricting event types. Title2Event contains more than 42,000 news titles in 34 topics collected from Chinese web pages. To the best of our knowledge, it is currently the largest manually-annotated Chinese dataset for open event extraction. We further conduct experiments on Title2Event with different models and show that the characteristics of titles make it challenging for event extraction, addressing the significance of advanced study on this problem. The dataset and baseline codes are available at https://open-event-hub.github.io/title2event. △ Less

Submitted 2 November, 2022; originally announced November 2022.

Comments: EMNLP 2022

arXiv:2210.15672 [pdf, other]

Average Age of Information Penalty of Short-Packet Communications with Packet Management

Authors: Zhifeng Tang, Nan Yang, Xiangyun Zhou, Jemin Lee

Abstract: In this paper, we analyze the non-linear age of information (AoI) performance in a point-to-point short packet communication system, where a transmitter generates packets based on status updates and transmits the packets to a receiver. Specifically, we investigate three packet management strategies, namely, the non-preemption with no buffer strategy, the non-preemption with one buffer strategy, an… ▽ More In this paper, we analyze the non-linear age of information (AoI) performance in a point-to-point short packet communication system, where a transmitter generates packets based on status updates and transmits the packets to a receiver. Specifically, we investigate three packet management strategies, namely, the non-preemption with no buffer strategy, the non-preemption with one buffer strategy, and the preemption strategy. To characterize the level of the receiver's dissatisfaction on outdated data, we adopt a generalized α-βAoI penalty function into the analysis and derive closed-form expressions for the average AoI penalty achieved by the three packet management strategies. Simulation results are used to corroborate our analysis and explicitly evaluate the impact of various system parameters, such as the coding rate and status update generation rate, on the AoI performance. Additionally, we find that the value of αreflects the system transmission reliability. △ Less

Submitted 26 October, 2022; originally announced October 2022.

Comments: arXiv admin note: text overlap with arXiv:2210.15078

arXiv:2210.15514 [pdf, other]

Point-Voxel Adaptive Feature Abstraction for Robust Point Cloud Classification

Authors: Lifa Zhu, Changwei Lin, Chen Zheng, Ninghua Yang

Abstract: Great progress has been made in point cloud classification with learning-based methods. However, complex scene and sensor inaccuracy in real-world application make point cloud data suffer from corruptions, such as occlusion, noise and outliers. In this work, we propose Point-Voxel based Adaptive (PV-Ada) feature abstraction for robust point cloud classification under various corruptions. Specifica… ▽ More Great progress has been made in point cloud classification with learning-based methods. However, complex scene and sensor inaccuracy in real-world application make point cloud data suffer from corruptions, such as occlusion, noise and outliers. In this work, we propose Point-Voxel based Adaptive (PV-Ada) feature abstraction for robust point cloud classification under various corruptions. Specifically, the proposed framework iteratively voxelize the point cloud and extract point-voxel feature with shared local encoding and Transformer. Then, adaptive max-pooling is proposed to robustly aggregate the point cloud feature for classification. Experiments on ModelNet-C dataset demonstrate that PV-Ada outperforms the state-of-the-art methods. In particular, we rank the $2^{nd}$ place in ModelNet-C classification track of PointCloud-C Challenge 2022, with Overall Accuracy (OA) being 0.865. Code will be available at https://github.com/zhulf0804/PV-Ada. △ Less

Submitted 29 October, 2022; v1 submitted 27 October, 2022; originally announced October 2022.

Comments: Technical report

arXiv:2210.15078 [pdf, other]

doi 10.1109/JSAC.2023.3280986

Age of Information in Downlink Systems: Broadcast or Unicast Transmission?

Authors: Zhifeng Tang, Nan Yang, Parastoo Sadeghi, Xiangyun Zhou

Abstract: We analytically decide whether the broadcast transmission scheme or the unicast transmission scheme achieves the optimal age of information (AoI) performance of a multiuser system where a base station (BS) generates and transmits status updates to multiple user equipments (UEs). In the broadcast transmission scheme, the status update for all UEs is jointly encoded into a packet for transmission, w… ▽ More We analytically decide whether the broadcast transmission scheme or the unicast transmission scheme achieves the optimal age of information (AoI) performance of a multiuser system where a base station (BS) generates and transmits status updates to multiple user equipments (UEs). In the broadcast transmission scheme, the status update for all UEs is jointly encoded into a packet for transmission, while in the unicast transmission scheme, the status update for each UE is encoded individually and transmitted by following the round robin policy. For both transmission schemes, we examine three packet management strategies, namely the non-preemption strategy, the preemption in buffer strategy, and the preemption in serving strategy. We first derive new closed-form expressions for the average AoI achieved by two transmission schemes with three packet management strategies. Based on them, we compare the AoI performance of two transmission schemes in two systems, namely, the remote control system and the dynamic system. Aided by simulation results, we verify our analysis and investigate the impact of system parameters on the average AoI. For example, the unicast transmission scheme is more appropriate for the system with a large number UEs. Otherwise, the broadcast transmission scheme is more appropriate. △ Less

Submitted 7 July, 2023; v1 submitted 26 October, 2022; originally announced October 2022.

Showing 51–100 of 425 results for author: Yang, N