Search | arXiv e-print repository

Terahertz Communications for Massive Connectivity and Security in 6G and Beyond Era

Abstract: Terahertz (THz) communications (THzCom) has experienced a meteoric rise of interest, due to its benefits for ultra-high data rate transmission in the sixth generation (6G) and beyond era. Despite so, the research on exploring the potential of THzCom for other performance targets anticipated by 6G, including massive connectivity and security, is still in its infancy. In this article, we start with… ▽ More Terahertz (THz) communications (THzCom) has experienced a meteoric rise of interest, due to its benefits for ultra-high data rate transmission in the sixth generation (6G) and beyond era. Despite so, the research on exploring the potential of THzCom for other performance targets anticipated by 6G, including massive connectivity and security, is still in its infancy. In this article, we start with briefly describing the unique peculiarities of THz channels, and then discuss theoretical frameworks to facilitate the analysis and design of THz transmission for achieving massive connectivity and security. Then we discuss promising spectrum management strategies, including the exploration of multiple THz transmission windows and frequency reuse with multiplexing and signal processing, to substantially increase the number of supported users and identify to-be-tackled challenges. We further present important research directions based on the principles of physical layer security, such as new spectrum allocation policies and beamforming algorithms, to fight against eavesdrop** in THzCom systems, ushering in secure THzCom systems. △ Less

Submitted 25 October, 2022; originally announced October 2022.

Comments: This paper has been accepted for publication in IEEE Communications Magazine. Copyright may be transferred without notice, after which this version may no longer be accessible

arXiv:2210.13806 [pdf]

Deformation insensitive thermal conductance of the designed Si metamaterial

Authors: Lina Yang, Quan Zhang, Gengkai Hu, Nuo Yang

Abstract: The thermal management have been widely focused due to broad applications. Generally, the deformation can largely tune the thermal transport. The main challenge of flexible electronics/ materials is to maintain thermal conductance under large deformation. This work investigates the thermal conductance of a nano-designed Si metamaterial constructed with curved nanobeams by molecular dynamics simula… ▽ More The thermal management have been widely focused due to broad applications. Generally, the deformation can largely tune the thermal transport. The main challenge of flexible electronics/ materials is to maintain thermal conductance under large deformation. This work investigates the thermal conductance of a nano-designed Si metamaterial constructed with curved nanobeams by molecular dynamics simulation. Interestingly, it shows that the thermal conductance of the nano-designed Si metamaterial is insensitive under a large deformation (strain~-41%). The new feature comes from the designed curved nanobeams which makes a quasi-zero stiffness. Further calculations show that, when under a large deformation, the average stress in nanobeam is ultra-small (<151 MPa) and its phonon density of states are little changed. This work provides valuable insights on multifunction, such as both stable thermal and mechanical properties, of nano-designed metamaterials. △ Less

Submitted 9 February, 2023; v1 submitted 25 October, 2022; originally announced October 2022.

arXiv:2210.13759 [pdf, ps, other]

doi 10.1007/s10231-023-01336-9

On recognition of direct powers of finite simple linear groups by spectrum

Authors: N. Yang, I. B. Gorshkov, A. M. Staroletov, A. V. Vasil'ev

Abstract: The spectrum of a finite group is the set of its element orders. We give an affirmative answer to Problem 20.58(a) from the Kourovka Notebook proving that for every positive integer $k$, the $k$-th direct power of the simple linear group $L_{n}(2)$ is uniquely determined by its spectrum in the class of finite groups provided $n$ is a power of $2$ greater than or equal to $56k^2$. The spectrum of a finite group is the set of its element orders. We give an affirmative answer to Problem 20.58(a) from the Kourovka Notebook proving that for every positive integer $k$, the $k$-th direct power of the simple linear group $L_{n}(2)$ is uniquely determined by its spectrum in the class of finite groups provided $n$ is a power of $2$ greater than or equal to $56k^2$. △ Less

Submitted 7 February, 2023; v1 submitted 25 October, 2022; originally announced October 2022.

Comments: 17 pages

MSC Class: 20D06

Journal ref: Ann. Mat. Pura Appl., 202, 2699-2714 (2023)

arXiv:2210.12029 [pdf, other]

Adversarial Transformer for Repairing Human Airway Segmentation

Authors: Zeyu Tang, Nan Yang, Simon Walsh, Guang Yang

Abstract: Discontinuity in the delineation of peripheral bronchioles hinders the potential clinical application of automated airway segmentation models. Moreover, the deployment of such models is limited by the data heterogeneity across different centres, and pathological abnormalities also make achieving accurate robust segmentation in distal small airways difficult. Meanwhile, the diagnosis and prognosis… ▽ More Discontinuity in the delineation of peripheral bronchioles hinders the potential clinical application of automated airway segmentation models. Moreover, the deployment of such models is limited by the data heterogeneity across different centres, and pathological abnormalities also make achieving accurate robust segmentation in distal small airways difficult. Meanwhile, the diagnosis and prognosis of lung diseases often rely on evaluating structural changes in those anatomical regions. To address this gap, this paper presents a patch-scale adversarial-based refinement network that takes in preliminary segmentation along with original CT images and outputs a refined mask of the airway structure. The method is validated on three different datasets encompassing healthy cases, cases with cystic fibrosis and cases with COVID-19. The results are quantitatively evaluated by seven metrics and achieved more than a 15% rise in detected length ratio and detected branch ratio, showing promising performance compared to previously proposed models. The visual illustration also proves our refinement guided by a patch-scale discriminator and centreline objective functions is effective in detecting discontinuities and missing bronchioles. Furthermore, the generalizability of our refinement pipeline is tested on three previous models and improves their segmentation completeness significantly. △ Less

Submitted 21 October, 2022; originally announced October 2022.

Comments: 8 Pages, 7 figures

arXiv:2209.10734 [pdf, other]

CCR: Facial Image Editing with Continuity, Consistency and Reversibility

Authors: Nan Yang, Xin Luan, Huidi Jia, Zhi Han, Yandong Tang

Abstract: Three problems exist in sequential facial image editing: incontinuous editing, inconsistent editing, and irreversible editing. Incontinuous editing is that the current editing can not retain the previously edited attributes. Inconsistent editing is that swap** the attribute editing orders can not yield the same results. Irreversible editing means that operating on a facial image is irreversible,… ▽ More Three problems exist in sequential facial image editing: incontinuous editing, inconsistent editing, and irreversible editing. Incontinuous editing is that the current editing can not retain the previously edited attributes. Inconsistent editing is that swap** the attribute editing orders can not yield the same results. Irreversible editing means that operating on a facial image is irreversible, especially in sequential facial image editing. In this work, we put forward three concepts and corresponding definitions: editing continuity, consistency, and reversibility. Then, we propose a novel model to achieve the goal of editing continuity, consistency, and reversibility. A sufficient criterion is defined to determine whether a model is continuous, consistent, and reversible. Extensive qualitative and quantitative experimental results validate our proposed model and show that a continuous, consistent and reversible editing model has a more flexible editing function while preserving facial identity. Furthermore, we think that our proposed definitions and model will have wide and promising applications in multimedia processing. Code and data are available at https://github.com/mickoluan/CCR. △ Less

Submitted 21 September, 2022; originally announced September 2022.

Comments: 10 pages, 11 figures

arXiv:2209.09694 [pdf]

Modulating Thermal Conductivity via Targeted Phonon Excitation

Authors: Xiao Wan, Dongkai Pan, **g-Tao Lü, Sebastian Volz, Lifa Zhang, Qing Hao, Yangjun Qin, Zhicheng Zong, Nuo Yang

Abstract: Thermal conductivity is a critical material property in numerous applications, such as those related to thermoelectric devices and heat dissipation. Effectively modulating thermal conductivity has become a great concern in the field of heat conduction. In this study, a quantum strategy is proposed to modulate thermal conductivity by exciting targeted phonons. The results show that the thermal cond… ▽ More Thermal conductivity is a critical material property in numerous applications, such as those related to thermoelectric devices and heat dissipation. Effectively modulating thermal conductivity has become a great concern in the field of heat conduction. In this study, a quantum strategy is proposed to modulate thermal conductivity by exciting targeted phonons. The results show that the thermal conductivity of graphene can be tailored in the range of 1559 W/m-K (49%) to 4093 W/m-K (128%), compared with the intrinsic value of 3189 W/m-K. A similar trend is also observed for graphene nanoribbons. The results are obtained through both ab initio calculations and molecular dynamics simulations. This brand-new quantum strategy to modulate thermal conductivity paves a way for quantum heat conduction. △ Less

Submitted 5 April, 2023; v1 submitted 20 September, 2022; originally announced September 2022.

arXiv:2209.04164 [pdf, other]

doi 10.1109/GLOBECOM46510.2021.9685590

Joint Caching and Transmission in the Mobile Edge Network: A Multi-Agent Learning Approach

Authors: Qirui Mi, Ning Yang, Haifeng Zhang, Haijun Zhang, Jun Wang

Abstract: Joint caching and transmission optimization problem is challenging due to the deep coupling between decisions. This paper proposes an iterative distributed multi-agent learning approach to jointly optimize caching and transmission. The goal of this approach is to minimize the total transmission delay of all users. In this iterative approach, each iteration includes caching optimization and transmi… ▽ More Joint caching and transmission optimization problem is challenging due to the deep coupling between decisions. This paper proposes an iterative distributed multi-agent learning approach to jointly optimize caching and transmission. The goal of this approach is to minimize the total transmission delay of all users. In this iterative approach, each iteration includes caching optimization and transmission optimization. A multi-agent reinforcement learning (MARL)-based caching network is developed to cache popular tasks, such as answering which files to evict from the cache and which files to storage. Based on the cached files of the caching network, the transmission network transmits cached files for users by single transmission (ST) or joint transmission (JT) with multi-agent Bayesian learning automaton (MABLA) method. And then users access the edge servers with the minimum transmission delay. The experimental results demonstrate the performance of the proposed multi-agent learning approach. △ Less

Submitted 9 September, 2022; originally announced September 2022.

arXiv:2209.02934 [pdf, other]

Boundary Guided Semantic Learning for Real-time COVID-19 Lung Infection Segmentation System

Authors: Runmin Cong, Yumo Zhang, Ning Yang, Haisheng Li, Xueqi Zhang, Ruochen Li, Zewen Chen, Yao Zhao, Sam Kwong

Abstract: The coronavirus disease 2019 (COVID-19) continues to have a negative impact on healthcare systems around the world, though the vaccines have been developed and national vaccination coverage rate is steadily increasing. At the current stage, automatically segmenting the lung infection area from CT images is essential for the diagnosis and treatment of COVID-19. Thanks to the development of deep lea… ▽ More The coronavirus disease 2019 (COVID-19) continues to have a negative impact on healthcare systems around the world, though the vaccines have been developed and national vaccination coverage rate is steadily increasing. At the current stage, automatically segmenting the lung infection area from CT images is essential for the diagnosis and treatment of COVID-19. Thanks to the development of deep learning technology, some deep learning solutions for lung infection segmentation have been proposed. However, due to the scattered distribution, complex background interference and blurred boundaries, the accuracy and completeness of the existing models are still unsatisfactory. To this end, we propose a boundary guided semantic learning network (BSNet) in this paper. On the one hand, the dual-branch semantic enhancement module that combines the top-level semantic preservation and progressive semantic integration is designed to model the complementary relationship between different high-level features, thereby promoting the generation of more complete segmentation results. On the other hand, the mirror-symmetric boundary guidance module is proposed to accurately detect the boundaries of the lesion regions in a mirror-symmetric way. Experiments on the publicly available dataset demonstrate that our BSNet outperforms the existing state-of-the-art competitors and achieves a real-time inference speed of 44 FPS. △ Less

Submitted 7 September, 2022; originally announced September 2022.

Comments: Accepted by IEEE Transactions on Consumer Electronics 2022

arXiv:2208.13358 [pdf, other]

doi 10.1145/3511808.3557152

Billion-user Customer Lifetime Value Prediction: An Industrial-scale Solution from Kuaishou

Authors: Kunpeng Li, Guangcui Shao, Naijun Yang, Xiao Fang, Yang Song

Abstract: Customer Life Time Value (LTV) is the expected total revenue that a single user can bring to a business. It is widely used in a variety of business scenarios to make operational decisions when acquiring new customers. Modeling LTV is a challenging problem, due to its complex and mutable data distribution. Existing approaches either directly learn from posterior feature distributions or leverage st… ▽ More Customer Life Time Value (LTV) is the expected total revenue that a single user can bring to a business. It is widely used in a variety of business scenarios to make operational decisions when acquiring new customers. Modeling LTV is a challenging problem, due to its complex and mutable data distribution. Existing approaches either directly learn from posterior feature distributions or leverage statistical models that make strong assumption on prior distributions, both of which fail to capture those mutable distributions. In this paper, we propose a complete set of industrial-level LTV modeling solutions. Specifically, we introduce an Order Dependency Monotonic Network (ODMN) that models the ordered dependencies between LTVs of different time spans, which greatly improves model performance. We further introduce a Multi Distribution Multi Experts (MDME) module based on the Divide-and-Conquer idea, which transforms the severely imbalanced distribution modeling problem into a series of relatively balanced sub-distribution modeling problems hence greatly reduces the modeling complexity. In addition, a novel evaluation metric Mutual Gini is introduced to better measure the distribution difference between the estimated value and the ground-truth label based on the Lorenz Curve. The ODMN framework has been successfully deployed in many business scenarios of Kuaishou, and achieved great performance. Extensive experiments on real-world industrial data demonstrate the superiority of the proposed methods compared to state-of-the-art baselines including ZILN and Two-Stage XGBoost models. △ Less

Submitted 29 August, 2022; originally announced August 2022.

Comments: Accepted by CIKM 2022, 9 pages

arXiv:2208.04760 [pdf, other]

Time Lag Aware Sequential Recommendation

Authors: Lihua Chen, Ning Yang, Philip S Yu

Abstract: Although a variety of methods have been proposed for sequential recommendation, it is still far from being well solved partly due to two challenges. First, the existing methods often lack the simultaneous consideration of the global stability and local fluctuation of user preference, which might degrade the learning of a user's current preference. Second, the existing methods often use a scalar ba… ▽ More Although a variety of methods have been proposed for sequential recommendation, it is still far from being well solved partly due to two challenges. First, the existing methods often lack the simultaneous consideration of the global stability and local fluctuation of user preference, which might degrade the learning of a user's current preference. Second, the existing methods often use a scalar based weighting schema to fuse the long-term and short-term preferences, which is too coarse to learn an expressive embedding of current preference. To address the two challenges, we propose a novel model called Time Lag aware Sequential Recommendation (TLSRec), which integrates a hierarchical modeling of user preference and a time lag sensitive fine-grained fusion of the long-term and short-term preferences. TLSRec employs a hierarchical self-attention network to learn users' preference at both global and local time scales, and a neural time gate to adaptively regulate the contributions of the long-term and short-term preferences for the learning of a user's current preference at the aspect level and based on the lag between the current time and the time of the last behavior of a user. The extensive experiments conducted on real datasets verify the effectiveness of TLSRec. △ Less

Submitted 9 August, 2022; originally announced August 2022.

Comments: This paper has been accepted by CIKM 2022

arXiv:2208.04232 [pdf, other]

Learning Diverse Document Representations with Deep Query Interactions for Dense Retrieval

Authors: Zehan Li, Nan Yang, Liang Wang, Furu Wei

Abstract: In this paper, we propose a new dense retrieval model which learns diverse document representations with deep query interactions. Our model encodes each document with a set of generated pseudo-queries to get query-informed, multi-view document representations. It not only enjoys high inference efficiency like the vanilla dual-encoder models, but also enables deep query-document interactions in doc… ▽ More In this paper, we propose a new dense retrieval model which learns diverse document representations with deep query interactions. Our model encodes each document with a set of generated pseudo-queries to get query-informed, multi-view document representations. It not only enjoys high inference efficiency like the vanilla dual-encoder models, but also enables deep query-document interactions in document encoding and provides multi-faceted representations to better match different queries. Experiments on several benchmarks demonstrate the effectiveness of the proposed method, out-performing strong dual encoder baselines.The code is available at \url{https://github.com/jordane95/dual-cross-encoder △ Less

Submitted 8 August, 2022; originally announced August 2022.

arXiv:2208.03618 [pdf, ps, other]

An Unsupervised Learning Approach for Spectrum Allocation in Terahertz Communication Systems

Authors: Akram Shafie, Chunhui Li, Nan Yang, Xiangyun Zhou, Trung Q. Duong

Abstract: We propose a new spectrum allocation strategy, aided by unsupervised learning, for multiuser terahertz communication systems. In this strategy, adaptive sub-band bandwidth is considered such that the spectrum of interest can be divided into sub-bands with unequal bandwidths. This strategy reduces the variation in molecular absorption loss among the users, leading to the improved data rate performa… ▽ More We propose a new spectrum allocation strategy, aided by unsupervised learning, for multiuser terahertz communication systems. In this strategy, adaptive sub-band bandwidth is considered such that the spectrum of interest can be divided into sub-bands with unequal bandwidths. This strategy reduces the variation in molecular absorption loss among the users, leading to the improved data rate performance. We first formulate an optimization problem to determine the optimal sub-band bandwidth and transmit power, and then propose the unsupervised learning-based approach to obtaining the near-optimal solution to this problem. In the proposed approach, we first train a deep neural network (DNN) while utilizing a loss function that is inspired by the Lagrangian of the formulated problem. Then using the trained DNN, we approximate the near-optimal solutions. Numerical results demonstrate that comparing to existing approaches, our proposed unsupervised learning-based approach achieves a higher data rate, especially when the molecular absorption coefficient within the spectrum of interest varies in a highly non-linear manner. △ Less

Submitted 6 August, 2022; originally announced August 2022.

Comments: This paper has been accepted for publication in IEEE Global Communications Conferences (GLOBECOM) 2022. Copyright may be transferred without notice, after which this version may no longer be accessible

arXiv:2207.11021 [pdf, other]

Terahertz Communications for 6G and Beyond Wireless Networks: Challenges, Key Advancements, and Opportunities

Authors: Akram Shafie, Nan Yang, Chong Han, Josep Miquel Jornet, Markku Juntti, Thomas Kurner

Abstract: The unprecedented increase in wireless data traffic, predicted to occur within the next decade, is motivating academia and industries to look beyond contemporary wireless standards and conceptualize the sixth-generation (6G) wireless networks. Among various promising solutions, terahertz (THz) communications (THzCom) is recognized as a highly promising technology for the 6G and beyond era, due to… ▽ More The unprecedented increase in wireless data traffic, predicted to occur within the next decade, is motivating academia and industries to look beyond contemporary wireless standards and conceptualize the sixth-generation (6G) wireless networks. Among various promising solutions, terahertz (THz) communications (THzCom) is recognized as a highly promising technology for the 6G and beyond era, due to its unique potential to support terabit-per-second transmission in emerging applications. This article delves into key areas for develo** end-to-end THzCom systems, focusing on physical, link, and network layers. Specifically, we discuss the areas of THz spectrum management, THz antennas and beamforming, and the integration of other 6G-enabling technologies for THzCom. For each area, we identify the challenges imposed by the unique properties of the THz band. We then present main advancements and outline perspective research directions in each area to stimulate future research efforts for realizing THzCom in 6G and beyond wireless networks. △ Less

Submitted 22 July, 2022; originally announced July 2022.

Comments: This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible

arXiv:2207.02578 [pdf, other]

SimLM: Pre-training with Representation Bottleneck for Dense Passage Retrieval

Authors: Liang Wang, Nan Yang, Xiaolong Huang, Binxing Jiao, Linjun Yang, Daxin Jiang, Rangan Majumder, Furu Wei

Abstract: In this paper, we propose SimLM (Similarity matching with Language Model pre-training), a simple yet effective pre-training method for dense passage retrieval. It employs a simple bottleneck architecture that learns to compress the passage information into a dense vector through self-supervised pre-training. We use a replaced language modeling objective, which is inspired by ELECTRA, to improve th… ▽ More In this paper, we propose SimLM (Similarity matching with Language Model pre-training), a simple yet effective pre-training method for dense passage retrieval. It employs a simple bottleneck architecture that learns to compress the passage information into a dense vector through self-supervised pre-training. We use a replaced language modeling objective, which is inspired by ELECTRA, to improve the sample efficiency and reduce the mismatch of the input distribution between pre-training and fine-tuning. SimLM only requires access to unlabeled corpus, and is more broadly applicable when there are no labeled data or queries. We conduct experiments on several large-scale passage retrieval datasets, and show substantial improvements over strong baselines under various settings. Remarkably, SimLM even outperforms multi-vector approaches such as ColBERTv2 which incurs significantly more storage cost. Our code and model check points are available at https://github.com/microsoft/unilm/tree/master/simlm . △ Less

Submitted 12 May, 2023; v1 submitted 6 July, 2022; originally announced July 2022.

Comments: Accepted to ACL 2023

arXiv:2207.02401 [pdf, other]

Novel Spectrum Allocation Among Multiple Transmission Windows for Terahertz Communication Systems

Authors: Akram Shafie, Nan Yang, Chong Han, Josep M. Jornet

Abstract: This paper presents a novel spectrum allocation strategy for multiuser terahertz (THz) band communication systems when the to-be-allocated spectrum is composed of multiple transmission windows (TWs). This strategy explores the benefits of (i) allowing users to occupy sub-bands with unequal bandwidths and (ii) optimally avoiding using some spectra that exist at the edges of TWs where molecular abso… ▽ More This paper presents a novel spectrum allocation strategy for multiuser terahertz (THz) band communication systems when the to-be-allocated spectrum is composed of multiple transmission windows (TWs). This strategy explores the benefits of (i) allowing users to occupy sub-bands with unequal bandwidths and (ii) optimally avoiding using some spectra that exist at the edges of TWs where molecular absorption loss is high. To maximize the aggregated multiuser data rate, we formulate an optimization problem, with the primary focus on spectrum allocation. We then apply transformations and modifications to make the problem computationally tractable, and develop an iterative algorithm based on successive convex approximation to determine the optimal sub-band bandwidth and the unused spectra at the edges of TWs. Using numerical results, we show that a significantly higher data rate can be achieved by changing the sub-band bandwidth, as compared to equal sub-band bandwidth. We also show that a further data rate gain can be obtained by optimally determining the unused spectra at the edges of TWs, as compared to avoiding using pre-defined spectra at the edges of TWs. △ Less

Submitted 5 July, 2022; originally announced July 2022.

Comments: This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible

arXiv:2206.11502 [pdf]

A Review of Published Machine Learning Natural Language Processing Applications for Protocolling Radiology Imaging

Authors: Nihal Raju, Michael Woodburn, Stefan Kachel, Jack O'Shaughnessy, Laurence Sorace, Natalie Yang, Ruth P Lim

Abstract: Machine learning (ML) is a subfield of Artificial intelligence (AI), and its applications in radiology are growing at an ever-accelerating rate. The most studied ML application is the automated interpretation of images. However, natural language processing (NLP), which can be combined with ML for text interpretation tasks, also has many potential applications in radiology. One such application is… ▽ More Machine learning (ML) is a subfield of Artificial intelligence (AI), and its applications in radiology are growing at an ever-accelerating rate. The most studied ML application is the automated interpretation of images. However, natural language processing (NLP), which can be combined with ML for text interpretation tasks, also has many potential applications in radiology. One such application is automation of radiology protocolling, which involves interpreting a clinical radiology referral and selecting the appropriate imaging technique. It is an essential task which ensures that the correct imaging is performed. However, the time that a radiologist must dedicate to protocolling could otherwise be spent reporting, communicating with referrers, or teaching. To date, there have been few publications in which ML models were developed that use clinical text to automate protocol selection. This article reviews the existing literature in this field. A systematic assessment of the published models is performed with reference to best practices suggested by machine learning convention. Progress towards implementing automated protocolling in a clinical setting is discussed. △ Less

Submitted 23 June, 2022; originally announced June 2022.

Comments: 7 figures

MSC Class: 68T07

arXiv:2206.07949 [pdf, other]

AI Enlightens Wireless Communication: A Transformer Backbone for CSI Feedback

Authors: Han Xiao, Zhiqin Wang, Dexin Li, Wenqiang Tian, Xiaofeng Liu, Wendong Liu, Shi **, Jia Shen, Zhi Zhang, Ning Yang

Abstract: This paper is based on the background of the 2nd Wireless Communication Artificial Intelligence (AI) Competition (WAIC) which is hosted by IMT-2020(5G) Promotion Group 5G+AIWork Group, where the framework of the eigenvector-based channel state information (CSI) feedback problem is firstly provided. Then a basic Transformer backbone for CSI feedback referred to EVCsiNet-T is proposed. Moreover, a s… ▽ More This paper is based on the background of the 2nd Wireless Communication Artificial Intelligence (AI) Competition (WAIC) which is hosted by IMT-2020(5G) Promotion Group 5G+AIWork Group, where the framework of the eigenvector-based channel state information (CSI) feedback problem is firstly provided. Then a basic Transformer backbone for CSI feedback referred to EVCsiNet-T is proposed. Moreover, a series of potential enhancements for deep learning based (DL-based) CSI feedback including i) data augmentation, ii) loss function design, iii) training strategy, and iv) model ensemble are introduced. The experimental results involving the comparison between EVCsiNet-T and traditional codebook methods over different channels are further provided, which show the advanced performance and a promising prospect of Transformer on DL-based CSI feedback problem. △ Less

Submitted 16 June, 2022; originally announced June 2022.

arXiv:2206.05068 [pdf, other]

doi 10.1063/5.0118952

Boosting current-induced molecular dynamics with machine-learning potential

Authors: Gen Li, Bing-Zhong Hu, Wen-Hao Mao, Nuo Yang, **g-Tao Lü

Abstract: In a current-carrying single-molecular junction (SMJ), a hierarchy of hybrid energy transport processes takes place under a highly nonequilibrium situation, including energy transfer from electrons to molecular vibrations via electron-vibration interaction, energy redistribution within different vibrational modes via anharmonic coupling, and eventual energy transport to surrounding electrodes. A c… ▽ More In a current-carrying single-molecular junction (SMJ), a hierarchy of hybrid energy transport processes takes place under a highly nonequilibrium situation, including energy transfer from electrons to molecular vibrations via electron-vibration interaction, energy redistribution within different vibrational modes via anharmonic coupling, and eventual energy transport to surrounding electrodes. A comprehensive understanding of such processes is a prerequisite for their potential applications as single-molecular devices. $Ab$ $initio$ current-induced molecular dynamics (MD) is an ideal approach to address this complicated problem. But the computational cost hinders its usage in systematic study of realistic SMJs. Here, we achieve orders of magnitude improvement in the speed of MD simulation by employing machine-learning potential with accuracy comparable to density functional theory. Using this approach, we show that SMJs with graphene electrodes generate order of magnitude less heating than those with gold electrodes. Our work illustrates the superior heat transport property of graphene as electrodes for SMJs, thanks to its better phonon spectral overlap with molecular vibrations. △ Less

Submitted 10 June, 2022; originally announced June 2022.

Comments: 8 pages with supplemental materials

Journal ref: J. Chem. Phys. 157, 174303 (2022)

arXiv:2206.01246 [pdf, other]

doi 10.1103/PhysRevLett.130.237101

Stochastic gradient descent introduces an effective landscape-dependent regularization favoring flat solutions

Authors: Ning Yang, Chao Tang, Yuhai Tu

Abstract: Generalization is one of the most important problems in deep learning (DL). In the overparameterized regime in neural networks, there exist many low-loss solutions that fit the training data equally well. The key question is which solution is more generalizable. Empirical studies showed a strong correlation between flatness of the loss landscape at a solution and its generalizability, and stochast… ▽ More Generalization is one of the most important problems in deep learning (DL). In the overparameterized regime in neural networks, there exist many low-loss solutions that fit the training data equally well. The key question is which solution is more generalizable. Empirical studies showed a strong correlation between flatness of the loss landscape at a solution and its generalizability, and stochastic gradient descent (SGD) is crucial in finding the flat solutions. To understand how SGD drives the learning system to flat solutions, we construct a simple model whose loss landscape has a continuous set of degenerate (or near degenerate) minima. By solving the Fokker-Planck equation of the underlying stochastic learning dynamics, we show that due to its strong anisotropy the SGD noise introduces an additional effective loss term that decreases with flatness and has an overall strength that increases with the learning rate and batch-to-batch variation. We find that the additional landscape-dependent SGD-loss breaks the degeneracy and serves as an effective regularization for finding flat solutions. Furthermore, a stronger SGD noise shortens the convergence time to the flat solutions. However, we identify an upper bound for the SGD noise beyond which the system fails to converge. Our results not only elucidate the role of SGD for generalization they may also have important implications for hyperparameter selection for learning efficiently without divergence. △ Less

Submitted 2 June, 2022; originally announced June 2022.

Comments: Main text: 11 pages, 3 figures; supplementary materials: 19 pages, 5 figures

arXiv:2205.12520 [pdf, other]

Molecular Absorption Effect: A Double-edged Sword of Terahertz Communications

Authors: Chong Han, Weijun Gao, Nan Yang, Josep M. Jornet

Abstract: Communications in the terahertz band (THz) (0.1--10~THz) have been regarded as a promising technology for future 6G and beyond wireless systems, to overcome the challenges of evergrowing wireless data traffic and crowded spectrum. As the frequency increases from the microwave band to the THz band, new spectrum features pose unprecedented challenges to wireless communication system design. The mole… ▽ More Communications in the terahertz band (THz) (0.1--10~THz) have been regarded as a promising technology for future 6G and beyond wireless systems, to overcome the challenges of evergrowing wireless data traffic and crowded spectrum. As the frequency increases from the microwave band to the THz band, new spectrum features pose unprecedented challenges to wireless communication system design. The molecular absorption effect is one of the new THz spectrum properties, which enlarges the path loss and noise at specific frequencies. This brings in a double-edged sword for THz wireless communication systems. On one hand, from the data rate viewpoint, molecular absorption is detrimental, since it mitigates the received signal power and degrades the channel capacity. On the other hand, it is worth noticing that for wireless security and covertness, the molecular absorption effect can be utilized to safeguard THz communications among users. In this paper, the features of the molecular absorption effect and their impact on the THz system design are analyzed under various scenarios, with the ultimate goal of providing guidelines to how better exploit this unique THz phenomenon. Specifically, since the molecular absorption greatly depends on the propagation medium, different communication scenarios consisting of various media are discussed, including terrestrial, air and space, sea surface and nano-scale communications. Furthermore, two novel molecular absorption enlightened secure and covert communication schemes are presented, where the molecular absorption effect is utilized as the key and unique feature to boost security and covertness. △ Less

Submitted 25 May, 2022; originally announced May 2022.

arXiv:2205.07973 [pdf, other]

doi 10.1049/ntw2.12038

Many Field Packet Classification with Decomposition and Reinforcement Learning

Authors: Hasibul Jamil, Ning Yang, Ning Weng

Abstract: Scalable packet classification is a key requirement to support scalable network applications like firewalls, intrusion detection, and differentiated services. With ever increasing in the line-rate in core networks, it becomes a great challenge to design a scalable packet classification solution using hand-tuned heuristics approaches. In this paper, we present a scalable learning-based packet class… ▽ More Scalable packet classification is a key requirement to support scalable network applications like firewalls, intrusion detection, and differentiated services. With ever increasing in the line-rate in core networks, it becomes a great challenge to design a scalable packet classification solution using hand-tuned heuristics approaches. In this paper, we present a scalable learning-based packet classification engine by building an efficient data structure for different ruleset with many fields. Our method consists of the decomposition of fields into subsets and building separate decision trees on those subsets using a deep reinforcement learning procedure. To decompose given fields of a ruleset, we consider different grou** metrics like standard deviation of individual fields and introduce a novel metric called diversity index (DI). We examine different decomposition schemes and construct decision trees for each scheme using deep reinforcement learning and compare the results. The results show that the SD decomposition metrics results in 11.5% faster than DI metrics, 25% faster than random 2 and 40% faster than random 1. Furthermore, our learning-based selection method can be applied to varying rulesets due to its ruleset independence. △ Less

Submitted 16 May, 2022; originally announced May 2022.

Comments: 13 pages, published in IET Netw. arXiv admin note: substantial text overlap with arXiv:1902.10319 by other authors

ACM Class: C.2

Journal ref: IET Netw 2022 1-16

arXiv:2205.04157 [pdf, other]

Task-specific Compression for Multi-task Language Models using Attribution-based Pruning

Authors: Nakyeong Yang, Yunah Jang, Hwanhee Lee, Seohyeong Jung, Kyomin Jung

Abstract: Multi-task language models show outstanding performance for various natural language understanding tasks with only a single model. However, these language models utilize an unnecessarily large number of model parameters, even when used only for a specific task. This paper proposes a novel training-free compression method for multi-task language models using a pruning method. Specifically, we use a… ▽ More Multi-task language models show outstanding performance for various natural language understanding tasks with only a single model. However, these language models utilize an unnecessarily large number of model parameters, even when used only for a specific task. This paper proposes a novel training-free compression method for multi-task language models using a pruning method. Specifically, we use an attribution method to determine which neurons are essential for performing a specific task. We task-specifically prune unimportant neurons and leave only task-specific parameters. Furthermore, we extend our method to be applicable in low-resource and unsupervised settings. Since our compression method is training-free, it uses few computing resources and does not destroy the pre-trained knowledge of language models. Experimental results on the six widely-used datasets show that our proposed pruning method significantly outperforms baseline pruning methods. In addition, we demonstrate that our method preserves performance even in an unseen domain setting. △ Less

Submitted 11 February, 2023; v1 submitted 9 May, 2022; originally announced May 2022.

Comments: 11 pages, 4 figures

Journal ref: EACL 2023 Findings

arXiv:2204.13443 [pdf, ps, other]

Analysis of MC Systems Employing Receivers Covered by Heterogeneous Receptors

Authors: Xinyu Huang, Yuting Fang, Stuart T. Johnston, Mattew Faria, Nan Yang, Robert Schober

Abstract: This paper investigates the channel impulse response (CIR), i.e., the molecule hitting rate, of a molecular communication (MC) system employing an absorbing receiver (RX) covered by multiple non overlap** receptors. In this system, receptors are heterogeneous, i.e., they may have different sizes and arbitrary locations. Furthermore, we consider two types of transmitter (TX), namely a point TX an… ▽ More This paper investigates the channel impulse response (CIR), i.e., the molecule hitting rate, of a molecular communication (MC) system employing an absorbing receiver (RX) covered by multiple non overlap** receptors. In this system, receptors are heterogeneous, i.e., they may have different sizes and arbitrary locations. Furthermore, we consider two types of transmitter (TX), namely a point TX and a membrane fusion (MF)-based spherical TX. We assume the point TX or the center of the MF-based TX has a fixed distance to the center of the RX. Given this fixed distance, the TX can be at different locations and the CIR of the RX depends on the exact location of the TX. By averaging over all possible TX locations, we analyze the expected molecule hitting rate at the RX as a function of the sizes and locations of the receptors, where we assume molecule degradation may occur during the propagation of the signaling molecules. Notably, our analysis is valid for different numbers, a wide range of sizes, and arbitrary locations of the receptors, and its accuracy is confirmed via particle-based simulations. Exploiting our numerical results, we show that the expected number of absorbed molecules at the RX increases with the number of receptors, when the total area on the RX surface covered by receptors is fixed. Based on the derived analytical expressions, we compare different geometric receptor distributions by examining the expected number of absorbed molecules at the RX. We show that evenly distributed receptors result in a larger number of absorbed molecules than other distributions. We further compare three models that combine different types of TXs and RXs. △ Less

Submitted 28 April, 2022; originally announced April 2022.

Comments: This paper has been submitted to IEEE journals for possible publication. This paper was accepted for presentation in part at the 2022 IEEE International Conference on Communication (ICC). arXiv:2111.02020v2. arXiv admin note: text overlap with arXiv:2111.02020

arXiv:2204.08917 [pdf, other]

Global-and-Local Collaborative Learning for Co-Salient Object Detection

Authors: Runmin Cong, Ning Yang, Chongyi Li, Huazhu Fu, Yao Zhao, Qingming Huang, Sam Kwong

Abstract: The goal of co-salient object detection (CoSOD) is to discover salient objects that commonly appear in a query group containing two or more relevant images. Therefore, how to effectively extract inter-image correspondence is crucial for the CoSOD task. In this paper, we propose a global-and-local collaborative learning architecture, which includes a global correspondence modeling (GCM) and a local… ▽ More The goal of co-salient object detection (CoSOD) is to discover salient objects that commonly appear in a query group containing two or more relevant images. Therefore, how to effectively extract inter-image correspondence is crucial for the CoSOD task. In this paper, we propose a global-and-local collaborative learning architecture, which includes a global correspondence modeling (GCM) and a local correspondence modeling (LCM) to capture comprehensive inter-image corresponding relationship among different images from the global and local perspectives. Firstly, we treat different images as different time slices and use 3D convolution to integrate all intra features intuitively, which can more fully extract the global group semantics. Secondly, we design a pairwise correlation transformation (PCT) to explore similarity correspondence between pairwise images and combine the multiple local pairwise correspondences to generate the local inter-image relationship. Thirdly, the inter-image relationships of the GCM and LCM are integrated through a global-and-local correspondence aggregation (GLA) module to explore more comprehensive inter-image collaboration cues. Finally, the intra- and inter-features are adaptively integrated by an intra-and-inter weighting fusion (AEWF) module to learn co-saliency features and predict the co-saliency map. The proposed GLNet is evaluated on three prevailing CoSOD benchmark datasets, demonstrating that our model trained on a small dataset (about 3k images) still outperforms eleven state-of-the-art competitors trained on some large datasets (about 8k-200k images). △ Less

Submitted 19 April, 2022; originally announced April 2022.

Comments: Accepted by IEEE Transactions on Cybernetics 2022, project page: https://rmcong.github.io/proj_GLNet.html

arXiv:2204.06517 [pdf, other]

Learning Self-Modulating Attention in Continuous Time Space with Applications to Sequential Recommendation

Authors: Chao Chen, Haoyu Geng, Nianzu Yang, Junchi Yan, Daiyue Xue, Jian** Yu, Xiaokang Yang

Abstract: User interests are usually dynamic in the real world, which poses both theoretical and practical challenges for learning accurate preferences from rich behavior data. Among existing user behavior modeling solutions, attention networks are widely adopted for its effectiveness and relative simplicity. Despite being extensively studied, existing attentions still suffer from two limitations: i) conven… ▽ More User interests are usually dynamic in the real world, which poses both theoretical and practical challenges for learning accurate preferences from rich behavior data. Among existing user behavior modeling solutions, attention networks are widely adopted for its effectiveness and relative simplicity. Despite being extensively studied, existing attentions still suffer from two limitations: i) conventional attentions mainly take into account the spatial correlation between user behaviors, regardless the distance between those behaviors in the continuous time space; and ii) these attentions mostly provide a dense and undistinguished distribution over all past behaviors then attentively encode them into the output latent representations. This is however not suitable in practical scenarios where a user's future actions are relevant to a small subset of her/his historical behaviors. In this paper, we propose a novel attention network, named self-modulating attention, that models the complex and non-linearly evolving dynamic user preferences. We empirically demonstrate the effectiveness of our method on top-N sequential recommendation tasks, and the results on three large-scale real-world datasets show that our model can achieve state-of-the-art performance. △ Less

Submitted 29 March, 2022; originally announced April 2022.

Comments: Published in ICML 2021

arXiv:2204.04852 [pdf, ps, other]

doi 10.1103/PhysRevB.105.165423

A unified theory of second sound in two dimensional materials

Authors: Man-Yu Shang, Wen-Hao Mao, Nuo Yang, Baowen Li, **g-Tao Lü

Abstract: We develop a unified theory for the second sound in two dimensional materials. Previously studied drifting and driftless second sound are two limiting cases of the theory, corresponding to the drift and diffusive part of the energy flux, respectively. We find that due to the presence of quadratic flexural phonons the drifting second sound does not exist in the thermodynamic limit, while the driftl… ▽ More We develop a unified theory for the second sound in two dimensional materials. Previously studied drifting and driftless second sound are two limiting cases of the theory, corresponding to the drift and diffusive part of the energy flux, respectively. We find that due to the presence of quadratic flexural phonons the drifting second sound does not exist in the thermodynamic limit, while the driftless mode is less affected. This is understood as a result of infinite effective inertia of flexual phonons, due to their constant density states and divergent Bose-Einstein distribution in the long wave length limit. Consequently, the group velocity of the drifting mode is smaller than that of the driftless mode. However, upon tensile strain, the velocity of drifting mode becomes larger. Both of them increase with tensile strain due to the linearization of the flexural phonon dispersion. Our results clarify several puzzles encountered previously and pave the way for exploring wave-like heat transport beyond hydrodynamic regime. △ Less

Submitted 10 April, 2022; originally announced April 2022.

arXiv:2204.04612 [pdf, other]

Confidence Estimation Transformer for Long-term Renewable Energy Forecasting in Reinforcement Learning-based Power Grid Dispatching

Authors: Xinhang Li, Zihao Li, Nan Yang, Zheng Yuan, Qinwen Wang, Yiying Yang, Yupeng Huang, Xuri Song, Lei Li, Lin Zhang

Abstract: The expansion of renewable energy could help realizing the goals of peaking carbon dioxide emissions and carbon neutralization. Some existing grid dispatching methods integrating short-term renewable energy prediction and reinforcement learning (RL) have been proved to alleviate the adverse impact of energy fluctuations risk. However, these methods omit the long-term output prediction, which leads… ▽ More The expansion of renewable energy could help realizing the goals of peaking carbon dioxide emissions and carbon neutralization. Some existing grid dispatching methods integrating short-term renewable energy prediction and reinforcement learning (RL) have been proved to alleviate the adverse impact of energy fluctuations risk. However, these methods omit the long-term output prediction, which leads to stability and security problems on the optimal power flow. This paper proposes a confidence estimation Transformer for long-term renewable energy forecasting in reinforcement learning-based power grid dispatching (Conformer-RLpatching). Conformer-RLpatching predicts long-term active output of each renewable energy generator with an enhanced Transformer to boost the performance of hybrid energy grid dispatching. Furthermore, a confidence estimation method is proposed to reduce the prediction error of renewable energy. Meanwhile, a dispatching necessity evaluation mechanism is put forward to decide whether the active output of a generator needs to be adjusted. Experiments carried out on the SG-126 power grid simulator show that Conformer-RLpatching achieves great improvement over the second best algorithm DDPG in security score by 25.8% and achieves a better total reward compared with the golden medal team in the power grid dispatching competition sponsored by State Grid Corporation of China under the same simulation environment. Codes are outsourced in https://github.com/buptlxh/Conformer-RLpatching. △ Less

Submitted 10 April, 2022; originally announced April 2022.

arXiv:2203.16284 [pdf, other]

FIRe: Fast Inverse Rendering using Directional and Signed Distance Functions

Authors: Tarun Yenamandra, Ayush Tewari, Nan Yang, Florian Bernard, Christian Theobalt, Daniel Cremers

Abstract: Neural 3D implicit representations learn priors that are useful for diverse applications, such as single- or multiple-view 3D reconstruction. A major downside of existing approaches while rendering an image is that they require evaluating the network multiple times per camera ray so that the high computational time forms a bottleneck for downstream applications. We address this problem by introduc… ▽ More Neural 3D implicit representations learn priors that are useful for diverse applications, such as single- or multiple-view 3D reconstruction. A major downside of existing approaches while rendering an image is that they require evaluating the network multiple times per camera ray so that the high computational time forms a bottleneck for downstream applications. We address this problem by introducing a novel neural scene representation that we call the directional distance function (DDF). To this end, we learn a signed distance function (SDF) along with our DDF model to represent a class of shapes. Specifically, our DDF is defined on the unit sphere and predicts the distance to the surface along any given direction. Therefore, our DDF allows rendering images with just a single network evaluation per camera ray. Based on our DDF, we present a novel fast algorithm (FIRe) to reconstruct 3D shapes given a posed depth map. We evaluate our proposed method on 3D reconstruction from single-view depth images, where we empirically show that our algorithm reconstructs 3D shapes more accurately and it is more than 15 times faster (per iteration) than competing methods. △ Less

Submitted 19 December, 2023; v1 submitted 30 March, 2022; originally announced March 2022.

Comments: News: Accepted to WACV'24. Project page: https://vision.in.tum.de/research/geometry/fire

arXiv:2203.15338 [pdf, other]

Dynamic-subarray with Fixed Phase Shifters for Energy-efficient Terahertz Hybrid Beamforming under Partial CSI

Authors: Longfei Yan, Chong Han, Nan Yang, **hong Yuan

Abstract: Terahertz (THz) communications are regarded as a pillar technology for the 6G systems, by offering multi-ten-GHz bandwidth. To overcome the huge propagation loss while reducing the hardware complexity, THz ultra-massive (UM) MIMO systems with hybrid beamforming are proposed to offer high array gain. Notably, the adjustable-phase-shifters considered in most existing hybrid beamforming studies are p… ▽ More Terahertz (THz) communications are regarded as a pillar technology for the 6G systems, by offering multi-ten-GHz bandwidth. To overcome the huge propagation loss while reducing the hardware complexity, THz ultra-massive (UM) MIMO systems with hybrid beamforming are proposed to offer high array gain. Notably, the adjustable-phase-shifters considered in most existing hybrid beamforming studies are power-hungry and difficult to realize in the THz band. Moreover, due to the ultra-massive antennas, full channel-state-information (CSI) is challenging to obtain. To address these practical concerns, in this paper, an energy-efficient dynamic-subarray with fixed-phase-shifters (DS-FPS) architecture is proposed for THz hybrid beamforming. To compensate for the spectral efficiency loss caused by the fixed-phase of FPS, a switch network is inserted to enable dynamic connections. In addition, by considering the partial CSI, we propose a row-successive-decomposition (RSD) algorithm to design the hybrid beamforming matrices for DS-FPS. A row-by-row (RBR) algorithm is further proposed to reduce computational complexity. Extensive simulation results show that, the proposed DS-FPS architecture with the RSD and RBR algorithms achieves much higher energy efficiency than the existing architectures. Moreover, the DS-FPS architecture with partial CSI achieves 97% spectral efficiency of that with full CSI. △ Less

Submitted 29 March, 2022; originally announced March 2022.

Comments: 30 pages, 10 figures

arXiv:2203.12055 [pdf, other]

doi 10.1109/CCTA49430.2022.9966180

Energy-optimal Three-dimensional Path-following Control of Autonomous Underwater Vehicles under Ocean Currents

Authors: Niankai Yang, Chao Shen, Matthew Johnson-Roberson, **g Sun

Abstract: This paper presents a three-dimensional (3D) energy-optimal path-following control design for autonomous underwater vehicles subject to ocean currents. The proposed approach has a two-stage control architecture consisting of the setpoint computation and the setpoint tracking. In the first stage, the surge velocity, heave velocity, and pitch angle setpoints are optimized by minimizing the required… ▽ More This paper presents a three-dimensional (3D) energy-optimal path-following control design for autonomous underwater vehicles subject to ocean currents. The proposed approach has a two-stage control architecture consisting of the setpoint computation and the setpoint tracking. In the first stage, the surge velocity, heave velocity, and pitch angle setpoints are optimized by minimizing the required vehicle propulsion energy under currents, and the line-of-sight (LOS) guidance law is used to generate the yaw angle setpoint that ensures path following. In the second stage, two model predictive controllers are designed to control the vehicle motion in the horizontal and vertical planes by tracking the optimal setpoints. The proposed controller is compared with a conventional LOS-based control that maintains zero heave velocity relative to the current (i.e., relative heave velocity) and derives pitch angle setpoint using LOS guidance to reach the desired depth. Through simulations, we show that the proposed approach can achieve more than 13% energy saving on a lawnmower-type and an inspection mission under different ocean current conditions. The simulation results demonstrate that allowing motions with non-zero relative heave velocity improves energy efficiency in 3D path-following applications. △ Less

Submitted 2 January, 2023; v1 submitted 22 March, 2022; originally announced March 2022.

Comments: 8 pages, 7 figures

arXiv:2202.11482 [pdf]

doi 10.1016/j.physa.2022.128220

The evolution of cooperation in the public goods game on the scale-free community networks under multiple strategy updating rules

Authors: Mingzhen Zhang, Naiding Yang, Xianglin Zhu

Abstract: Social networks have a scale-free property and community structure, and many problems in life have the characteristic of public goods, such as resource shortage. Due to different preferences of individuals, there exist individuals who adopt heterogeneous strategies updating rules in the network. We investigate the evolution of cooperation in the scale-free community network with public goods games… ▽ More Social networks have a scale-free property and community structure, and many problems in life have the characteristic of public goods, such as resource shortage. Due to different preferences of individuals, there exist individuals who adopt heterogeneous strategies updating rules in the network. We investigate the evolution of cooperation in the scale-free community network with public goods games and the influence of multiple strategy updating rules. Here, two types of strategy updating rules are considered which are pairwise comparison rules and aspiration-driven rules. Numerical simulations are conducted and presented corresponding results. We find that community structure promotes the emergence of cooperation in public goods games. In the meantime, there is a "U" shape relationship between the frequency of cooperators and the proportion of the two strategy updating rules. With the variance in the proportion of the two strategy updating rules, pairwise comparison rules seem to be more sensitive. Compared with aspiration-driven rules, pairwise comparison rules play a more important role in promoting cooperation. Our work may be helpful to understand the evolution of cooperation in social networks. △ Less

Submitted 23 February, 2022; originally announced February 2022.

Comments: 6 figures, 11 pages

arXiv:2202.09212 [pdf, other]

Molecule Generation for Drug Design: a Graph Learning Perspective

Authors: Nianzu Yang, Huai** Wu, Kaipeng Zeng, Yang Li, Junchi Yan

Abstract: Machine learning, particularly graph learning, is gaining increasing recognition for its transformative impact across various fields. One such promising application is in the realm of molecule design and discovery, notably within the pharmaceutical industry. Our survey offers a comprehensive overview of state-of-the-art methods in molecule design, particularly focusing on \emph{de novo} drug desig… ▽ More Machine learning, particularly graph learning, is gaining increasing recognition for its transformative impact across various fields. One such promising application is in the realm of molecule design and discovery, notably within the pharmaceutical industry. Our survey offers a comprehensive overview of state-of-the-art methods in molecule design, particularly focusing on \emph{de novo} drug design, which incorporates (deep) graph learning techniques. We categorize these methods into three distinct groups: \emph{i)} \emph{all-at-once}, \emph{ii)} \emph{fragment-based}, and \emph{iii)} \emph{node-by-node}. Additionally, we introduce some key public datasets and outline the commonly used evaluation metrics for both the generation and optimization of molecules. In the end, we discuss the existing challenges in this field and suggest potential directions for future research. △ Less

Submitted 8 January, 2024; v1 submitted 18 February, 2022; originally announced February 2022.

arXiv:2202.06490 [pdf, other]

doi 10.1103/PhysRevB.104.245413

Temperature-dependent thermal transport of single molecular junctions from semi-classical Langevin molecular dynamics

Authors: Gen Li, Bing-Zhong Hu, Nuo Yang, **g-Tao Lü

Abstract: Thermal conductance of single molecular junctions at room temperature has been measured recently using picowatt-resolution scanning probes. However, fully understanding thermal transport in a much wider temperature range is needed for the exploration of energy transfer at single-molecular limit and the development of single-molecular devices. Here, employing a semiclassical Langevin molecular dyna… ▽ More Thermal conductance of single molecular junctions at room temperature has been measured recently using picowatt-resolution scanning probes. However, fully understanding thermal transport in a much wider temperature range is needed for the exploration of energy transfer at single-molecular limit and the development of single-molecular devices. Here, employing a semiclassical Langevin molecular dynamics method, a comparative study is performed on the thermal transport of an alkane chain between Au and graphene electrodes, respectively. We illustrate the different roles of quantum statistics and anharmonic interaction in the two types of junctions. For a graphene junction, quantum statistics is essential at room temperature, while the anharmonic interaction is negligible. For a Au junction, it is the other way. Our study paves the way for theoretically understanding thermal transport of realistic single-molecular junctions in the full temperature range by including both quantum statistics and anharmonic interaction within one theoretical framework. △ Less

Submitted 14 February, 2022; originally announced February 2022.

Comments: 7 pages, 7 figures

Journal ref: Phys. Rev. B 104, 245413 (2021)

arXiv:2201.05973 [pdf, other]

Multi-Sparse-Domain Collaborative Recommendation via Enhanced Comprehensive Aspect Preference Learning

Authors: Xiaoyun Zhao, Ning Yang, Philip S. Yu

Abstract: Cross-domain recommendation (CDR) has been attracting increasing attention of researchers for its ability to alleviate the data sparsity problem in recommender systems. However, the existing single-target or dual-target CDR methods often suffer from two drawbacks, the assumption of at least one rich domain and the heavy dependence on domain-invariant preference, which are impractical in real world… ▽ More Cross-domain recommendation (CDR) has been attracting increasing attention of researchers for its ability to alleviate the data sparsity problem in recommender systems. However, the existing single-target or dual-target CDR methods often suffer from two drawbacks, the assumption of at least one rich domain and the heavy dependence on domain-invariant preference, which are impractical in real world where sparsity is ubiquitous and might degrade the user preference learning. To overcome these issues, we propose a Multi-Sparse-Domain Collaborative Recommendation (MSDCR) model for multi-target cross-domain recommendation. Unlike traditional CDR methods, MSDCR treats the multiple relevant domains as all sparse and can simultaneously improve the recommendation performance in each domain. We propose a Multi-Domain Separation Network (MDSN) and a Gated Aspect Preference Enhancement (GAPE) module for MSDCR to enhance a user's domain-specific aspect preferences in a domain by transferring the complementary aspect preferences in other domains, during which the uniqueness of the domain-specific preference can be preserved through the adversarial training offered by MDSN and the complementarity can be adaptively determined by GAPE. Meanwhile, we propose a Multi-Domain Adaptation Network (MDAN) for MSDCR to capture a user's domain-invariant aspect preference. With the integration of the enhanced domain-specific aspect preference and the domain-invariant aspect preference, MSDCR can reach a comprehensive understanding of a user's preference in each sparse domain. At last, the extensive experiments conducted on real datasets demonstrate the remarkable superiority of MSDCR over the state-of-the-art single-domain recommendation models and CDR models. △ Less

Submitted 16 January, 2022; originally announced January 2022.

arXiv:2201.05970 [pdf, other]

Learning from Atypical Behavior: Temporary Interest Aware Recommendation Based on Reinforcement Learning

Authors: Ziwen Du, Ning Yang, Zhonghua Yu, Philip S. Yu

Abstract: Traditional robust recommendation methods view atypical user-item interactions as noise and aim to reduce their impact with some kind of noise filtering technique, which often suffers from two challenges. First, in real world, atypical interactions may signal users' temporary interest different from their general preference. Therefore, simply filtering out the atypical interactions as noise may be… ▽ More Traditional robust recommendation methods view atypical user-item interactions as noise and aim to reduce their impact with some kind of noise filtering technique, which often suffers from two challenges. First, in real world, atypical interactions may signal users' temporary interest different from their general preference. Therefore, simply filtering out the atypical interactions as noise may be inappropriate and degrade the personalization of recommendations. Second, it is hard to acquire the temporary interest since there are no explicit supervision signals to indicate whether an interaction is atypical or not. To address this challenges, we propose a novel model called Temporary Interest Aware Recommendation (TIARec), which can distinguish atypical interactions from normal ones without supervision and capture the temporary interest as well as the general preference of users. Particularly, we propose a reinforcement learning framework containing a recommender agent and an auxiliary classifier agent, which are jointly trained with the objective of maximizing the cumulative return of the recommendations made by the recommender agent. During the joint training process, the classifier agent can judge whether the interaction with an item recommended by the recommender agent is atypical, and the knowledge about learning temporary interest from atypical interactions can be transferred to the recommender agent, which makes the recommender agent able to alone make recommendations that balance the general preference and temporary interest of users. At last, the experiments conducted on real world datasets verify the effectiveness of TIARec. △ Less

Submitted 16 January, 2022; originally announced January 2022.

arXiv:2111.15198 [pdf, ps, other]

doi 10.1007/s40304-022-00288-5

Finite groups isospectral to simple groups

Authors: Maria A. Grechkoseeva, Victor D. Mazurov, Wujie Shi, Andrey V. Vasil'ev, Nanying Yang

Abstract: The spectrum of a finite group is the set of element orders of this group. The main goal of this paper is to survey results concerning recognition of finite simple groups by spectrum, in particular, to list all finite simple groups for which the recognition problem is solved. The spectrum of a finite group is the set of element orders of this group. The main goal of this paper is to survey results concerning recognition of finite simple groups by spectrum, in particular, to list all finite simple groups for which the recognition problem is solved. △ Less

Submitted 28 December, 2022; v1 submitted 30 November, 2021; originally announced November 2021.

Comments: Numbering of theorems, lemmas and other assertions has been changed to match that of the version published in Commun. Math. Stat

MSC Class: 20D05; 20D60

Journal ref: Commun. Math. Stat. vol.11, 169-194 (2023)

arXiv:2111.09066 [pdf, ps, other]

doi 10.1134/S0037446622020161

On the sharp Baer--Suzuki theorem for $π$-radicals: sporadic groups

Authors: Nanying Yang, Zhenfeng Wu, Danila O. Revin

Abstract: Let $π$ be a proper subset of the set of all primes. Denote by $r$ the smallest prime which does not belong to $π$ and set $m = r$ if $r = 2$ or $3$ and $m = r-1$ if $r \geqslant 5$. We study the following conjecture: a conjugacy class $D$ of a finite group $G$ is contained in the $π$-radical $\mathrm{O}_π(G)$ of $G$ if and only if every $m$ elements of $D$ generate a $π$-subgroup. We confirm this… ▽ More Let $π$ be a proper subset of the set of all primes. Denote by $r$ the smallest prime which does not belong to $π$ and set $m = r$ if $r = 2$ or $3$ and $m = r-1$ if $r \geqslant 5$. We study the following conjecture: a conjugacy class $D$ of a finite group $G$ is contained in the $π$-radical $\mathrm{O}_π(G)$ of $G$ if and only if every $m$ elements of $D$ generate a $π$-subgroup. We confirm this conjecture for each group $G$ whose nonabelian composition factors are isomorphic to sporadic or alternating groups. △ Less

Submitted 17 November, 2021; originally announced November 2021.

Comments: in Russian

MSC Class: 20D25; 20D05; 20E45

Journal ref: Sib. Math. J. 63, 387-394 (2022)

arXiv:2111.07418 [pdf, other]

TANDEM: Tracking and Dense Map** in Real-time using Deep Multi-view Stereo

Authors: Lukas Koestler, Nan Yang, Niclas Zeller, Daniel Cremers

Abstract: In this paper, we present TANDEM a real-time monocular tracking and dense map** framework. For pose estimation, TANDEM performs photometric bundle adjustment based on a sliding window of keyframes. To increase the robustness, we propose a novel tracking front-end that performs dense direct image alignment using depth maps rendered from a global model that is built incrementally from dense depth… ▽ More In this paper, we present TANDEM a real-time monocular tracking and dense map** framework. For pose estimation, TANDEM performs photometric bundle adjustment based on a sliding window of keyframes. To increase the robustness, we propose a novel tracking front-end that performs dense direct image alignment using depth maps rendered from a global model that is built incrementally from dense depth predictions. To predict the dense depth maps, we propose Cascade View-Aggregation MVSNet (CVA-MVSNet) that utilizes the entire active keyframe window by hierarchically constructing 3D cost volumes with adaptive view aggregation to balance the different stereo baselines between the keyframes. Finally, the predicted depth maps are fused into a consistent global map represented as a truncated signed distance function (TSDF) voxel grid. Our experimental results show that TANDEM outperforms other state-of-the-art traditional and learning-based monocular visual odometry (VO) methods in terms of camera tracking. Moreover, TANDEM shows state-of-the-art real-time 3D reconstruction performance. △ Less

Submitted 14 November, 2021; originally announced November 2021.

Comments: CoRL 2021. The manuscript contains the main paper and the supplementary materials. Project page: https://go.vision.in.tum.de/tandem

arXiv:2111.05629 [pdf, other]

doi 10.1109/TCOMM.2021.3139887

Spectrum Allocation with Adaptive Sub-band Bandwidth for Terahertz Communication Systems

Authors: Akram Shafie, Nan Yang, Sheeraz Alvi, Chong Han, Salman Durrani, Josep M. Jornet

Abstract: We study spectrum allocation for terahertz (THz) band communication (THzCom) systems, while considering the frequency and distance-dependent nature of THz channels. Different from existing studies, we explore multi-band-based spectrum allocation with adaptive sub-band bandwidth (ASB) by allowing the spectrum of interest to be divided into sub-bands with unequal bandwidths. Also, we investigate the… ▽ More We study spectrum allocation for terahertz (THz) band communication (THzCom) systems, while considering the frequency and distance-dependent nature of THz channels. Different from existing studies, we explore multi-band-based spectrum allocation with adaptive sub-band bandwidth (ASB) by allowing the spectrum of interest to be divided into sub-bands with unequal bandwidths. Also, we investigate the impact of sub-band assignment on multi-connectivity (MC) enabled THzCom systems, where users associate and communicate with multiple access points simultaneously. We formulate resource allocation problems, with the primary focus on spectrum allocation, to determine sub-band assignment, sub-band bandwidth, and optimal transmit power. Thereafter, we propose reasonable approximations and transformations, and develop iterative algorithms based on the successive convex approximation technique to analytically solve the formulated problems. Aided by numerical results, we show that by enabling and optimizing ASB, significantly higher throughput can be achieved as compared to adopting equal sub-band bandwidth, and this throughput gain is most profound when the power budget constraint is more stringent. We also show that our sub-band assignment strategy in MC-enabled THzCom systems outperforms the state-of-the-art sub-band assignment strategies and the performance gain is most profound when the spectrum with the lowest average molecular absorption coefficient is selected during spectrum allocation. △ Less

Submitted 4 July, 2022; v1 submitted 10 November, 2021; originally announced November 2021.

Comments: This work has been accepted for publication in IEEE Transaction on Communications

Journal ref: IEEE Transactions on Communications, vol. 70, no. 2, pp. 1407-1422, Feb. 2022

arXiv:2111.02638 [pdf, ps, other]

The Age of Information of Short-Packet Communications: Joint or Distributed Encoding?

Authors: Zhifeng Tang, Nan Yang, Parastoo Sadeghi, Xiangyun Zhou

Abstract: In this paper, we analyze the impact of different encoding schemes on the age of information (AoI) performance in a point-to-point system, where a source generates packets based on the status updates collected from multiple sensors and transmits the packets to a destination. In this system, we consider two encoding schemes, namely, the joint encoding scheme and the distributed encoding scheme. In… ▽ More In this paper, we analyze the impact of different encoding schemes on the age of information (AoI) performance in a point-to-point system, where a source generates packets based on the status updates collected from multiple sensors and transmits the packets to a destination. In this system, we consider two encoding schemes, namely, the joint encoding scheme and the distributed encoding scheme. In the joint encoding scheme, the status updates from all the sensors are jointly encoded into a packet for transmission. In the distributed encoding scheme, the status update from each sensor is encoded individually and the sensors' packets are transmitted following the round robin policy. To ensure the freshness of packets, the zero-wait policy is adopted in both schemes, where a new packet is immediately generated once the source finishes the transmission of the current packet. We derive closed-form expressions for the average AoI achieved by these two encoding schemes and compare their performances. Simulation results show that the distributed encoding scheme is more appropriate for systems with a relatively large number of sensors, compared with the joint encoding scheme. △ Less

Submitted 4 November, 2021; originally announced November 2021.

arXiv:2111.02020 [pdf, ps, other]

Analysis of Receiver Covered by Heterogeneous Receptors in Molecular Communications

Authors: Xinyu Huang, Yuting Fang, Stuart T. Johnston, Matthew Faria, Nan Yang, Robert Schober

Abstract: This paper analyzes the channel impulse response of an absorbing receiver (RX) covered by multiple non-overlap** heterogeneous receptors with different sizes and arbitrary locations in a molecular communication system. In this system, a point transmitter (TX) is assumed to be uniformly located on a virtual sphere at a fixed distance from the RX. Considering molecule degradation during the propag… ▽ More This paper analyzes the channel impulse response of an absorbing receiver (RX) covered by multiple non-overlap** heterogeneous receptors with different sizes and arbitrary locations in a molecular communication system. In this system, a point transmitter (TX) is assumed to be uniformly located on a virtual sphere at a fixed distance from the RX. Considering molecule degradation during the propagation from the TX to the RX, the expected molecule hitting rate at the RX over varying locations of the TX is analyzed as a function of the size and location of each receptor. Notably, this analytical result is applicable for different numbers, sizes, and locations of receptors, and its accuracy is demonstrated via particle-based simulations. Numerical results show that (i) the expected number of absorbed molecules at the RX increases with an increasing number of receptors, when the total area of receptors on the RX surface is fixed, and (ii) evenly distributed receptors lead to the largest expected number of absorbed molecules. △ Less

Submitted 15 February, 2022; v1 submitted 3 November, 2021; originally announced November 2021.

Comments: 6 pages, 4 figures. Accepted by IEEE International Conference on Communications (ICC) 2022

arXiv:2110.13640 [pdf, other]

s2s-ft: Fine-Tuning Pretrained Transformer Encoders for Sequence-to-Sequence Learning

Authors: Hangbo Bao, Li Dong, Wenhui Wang, Nan Yang, Furu Wei

Abstract: Pretrained bidirectional Transformers, such as BERT, have achieved significant improvements in a wide variety of language understanding tasks, while it is not straightforward to directly apply them for natural language generation. In this paper, we present a sequence-to-sequence fine-tuning toolkit s2s-ft, which adopts pretrained Transformers for conditional generation tasks. Inspired by UniLM, we… ▽ More Pretrained bidirectional Transformers, such as BERT, have achieved significant improvements in a wide variety of language understanding tasks, while it is not straightforward to directly apply them for natural language generation. In this paper, we present a sequence-to-sequence fine-tuning toolkit s2s-ft, which adopts pretrained Transformers for conditional generation tasks. Inspired by UniLM, we implement three sequence-to-sequence fine-tuning algorithms, namely, causal fine-tuning, masked fine-tuning, and pseudo-masked fine-tuning. By leveraging the existing pretrained bidirectional Transformers, experimental results show that s2s-ft achieves strong performance on several benchmarks of abstractive summarization, and question generation. Moreover, we demonstrate that the package s2s-ft supports both monolingual and multilingual NLG tasks. The s2s-ft toolkit is available at https://github.com/microsoft/unilm/tree/master/s2s-ft. △ Less

Submitted 26 October, 2021; originally announced October 2021.

Comments: Demo paper for the s2s-ft toolkit: https://github.com/microsoft/unilm/tree/master/s2s-ft

arXiv:2110.08088 [pdf]

doi 10.1038/s41467-022-29455-6

Local large temperature difference and ultra-wideband photothermoelectric response of the silver nanostructure film/carbon nanotube film heterostructure

Authors: Bocheng Lv, Weidong Wu, Yan Xie, Jia-Lin Zhu, Yang Cao, Wanyun Ma, Ning Yang, Weidong Chu, **quan Wei, Jia-Lin Sun

Abstract: Photothermoelectric materials have important applications in many fields. Here, we joined a silver nanostructure film (AgNSF) and a carbon nanotube film (CNTF) by van der Waals force to form a AgNSF/CNTF heterojunction, which shows excellent photothermal and photoelectric conversion properties. The local temperature difference and the output photovoltage increase rapidly when the heterojunction is… ▽ More Photothermoelectric materials have important applications in many fields. Here, we joined a silver nanostructure film (AgNSF) and a carbon nanotube film (CNTF) by van der Waals force to form a AgNSF/CNTF heterojunction, which shows excellent photothermal and photoelectric conversion properties. The local temperature difference and the output photovoltage increase rapidly when the heterojunction is irradiated by lasers with wavelengths ranging from ultraviolet to terahertz. The maximum of the local temperature difference reaches 205.9 K, which is significantly higher than that of other photothermoelectric materials reported in literatures. The photothermal and photoelectric responsivity depend on the wavelength of lasers, which are 175-601 K/W and 9.35-40.4 mV/W, respectively. We demonstrate that light absorption of the carbon nanotube is enhanced by local surface plasmons, and the output photovoltage is dominated by Seebeck effect. The AgNSF/CNTF heterostructure can be used as high-efficiency sensitive photothermal materials or as ultra-wideband fast-response photoelectric material. △ Less

Submitted 15 October, 2021; originally announced October 2021.

Comments: 11 figures

MSC Class: 78 ACM Class: J.2

arXiv:2110.04509 [pdf]

Improving the mass transfer rate and energy efficiency of solar still by enhancing the inner air circulation

Authors: Guilong Peng, Zhenwei Xu, Jiajun Ji, Senshan Sun, Nuo Yang

Abstract: Solar still is an eco-friendly and convenient desalination system that can provide fresh water for remote areas and emergencies. The energy efficiency and productivity of conventional solar still are unsatisfying and need improvement, which requires a deep understanding of the heat and mass transfer process in solar still. In this work, the effect of the inner air circulation on the system's heat… ▽ More Solar still is an eco-friendly and convenient desalination system that can provide fresh water for remote areas and emergencies. The energy efficiency and productivity of conventional solar still are unsatisfying and need improvement, which requires a deep understanding of the heat and mass transfer process in solar still. In this work, the effect of the inner air circulation on the system's heat and mass transfer performance and energy efficiency are studied theoretically and experimentally. The theoretical results reveal that a weak acceleration of the air circulation inside the SS will significantly increase its performance, due to the improved mass transfer process. By enhancing the inner air circulation, the evaporation and condensation in the solar still can reach up to the limit, and the theoretical energy efficiency reaches up to 87%, 91.5%, and 94.5%, for the input power density at 300 W/m2, 500 W/m2, and 700 W/m2, respectively. Besides, lower ambient temperature and higher ambient convective heat transfer coefficient will decrease the energy efficiency. Given the heat loss, the experimental energy efficiencies are only 3% to 6% lower than the theoretical results, which indicates that the great performance predicted by the theory can be realized in practical application. This work provides a new understanding and strategy for improving the performance of the solar still. △ Less

Submitted 9 October, 2021; originally announced October 2021.

arXiv:2109.05869 [pdf, ps, other]

Whittle Index Based Scheduling Policy for Minimizing the Cost of Age of Information

Authors: Zhifeng Tang, Zhuo Sun, Nan Yang, Xiangyun Zhou

Abstract: We design a new scheduling policy to minimize the general non-decreasing cost function of age of information (AoI) in a multiuser system. In this system, the base station stochastically generates time-sensitive packets and transmits them to corresponding user equipments via an unreliable channel. We first formulate the transmission scheduling problem as an average cost constrained Markov decision… ▽ More We design a new scheduling policy to minimize the general non-decreasing cost function of age of information (AoI) in a multiuser system. In this system, the base station stochastically generates time-sensitive packets and transmits them to corresponding user equipments via an unreliable channel. We first formulate the transmission scheduling problem as an average cost constrained Markov decision process problem. Through introducing the service charge, we derive the closed-form expression for the Whittle index, based on which we design the scheduling policy. Using numerical results, we demonstrate the performance gain of our designed scheduling policy compared to the existing policies, such as the optimal policy, the on-demand Whittle index policy, and the age greedy policy. △ Less

Submitted 13 September, 2021; originally announced September 2021.

arXiv:2108.06033 [pdf]

Realization of ultrabroadband THz/IR photoresponse in a bias-tunable ratchet photodetector

Authors: Peng Bai, Xiaohong Li, Ning Yang, Weidong Chu, Xueqi Bai, Siheng Huang, Yueheng Zhang, Wenzhong Shen, Zhanglong Fu, Dixiang Shao, Zhiyong Tan, Hua Li, Juncheng Cao, Lianhe Li, Edmund Harold Linfield, Yan Xie, Ziran Zhao

Abstract: High performance Terahertz (THz) photodetector has drawn wide attention and got great improvement due to its significant application in biomedical, astrophysics, nondestructive inspection, 6th generation communication system as well as national security application. Here we demonstrate a novel broadband photon-type THz/infrared (IR) photodetector based on the GaAs/AlxGa1-xAs ratchet structure. Thi… ▽ More High performance Terahertz (THz) photodetector has drawn wide attention and got great improvement due to its significant application in biomedical, astrophysics, nondestructive inspection, 6th generation communication system as well as national security application. Here we demonstrate a novel broadband photon-type THz/infrared (IR) photodetector based on the GaAs/AlxGa1-xAs ratchet structure. This kind of photodetector realizes a THz photon-response based on the electrically pumped hot hole injection and overcomes the internal workfunction related spectral response limit. An ultrabroadband photoresponse from 4 THz to 300 THz and a peak responsivity of 50.3 mA/W are realized at negative bias voltage of -1 V. The photodetector also presents a bias-tunable photon-response characteristic due to the asymmetric structure. The ratchet structure also induces an evident photocurrent even at zero bias voltage, which indicates the detector can be regard as a broadband photovoltaic-like detector. The rectification characteristic and high temperature operation possibility of the photodetector are also discussed. This work not only demonstrates a novel ultrabroadband THz/IR photodetector, but also provides a new method to study the light-responsive ratchet. △ Less

Submitted 12 August, 2021; originally announced August 2021.

arXiv:2108.02572 [pdf, other]

doi 10.1145/1122445.1122456

SINGA-Easy: An Easy-to-Use Framework for MultiModal Analysis

Authors: Naili Xing, Sai Ho Yeung, Chenghao Cai, Teck Khim Ng, Wei Wang, Kaiyuan Yang, Nan Yang, Meihui Zhang, Gang Chen, Beng Chin Ooi

Abstract: Deep learning has achieved great success in a wide spectrum of multimedia applications such as image classification, natural language processing and multimodal data analysis. Recent years have seen the development of many deep learning frameworks that provide a high-level programming interface for users to design models, conduct training and deploy inference. However, it remains challenging to bui… ▽ More Deep learning has achieved great success in a wide spectrum of multimedia applications such as image classification, natural language processing and multimodal data analysis. Recent years have seen the development of many deep learning frameworks that provide a high-level programming interface for users to design models, conduct training and deploy inference. However, it remains challenging to build an efficient end-to-end multimedia application with most existing frameworks. Specifically, in terms of usability, it is demanding for non-experts to implement deep learning models, obtain the right settings for the entire machine learning pipeline, manage models and datasets, and exploit external data sources all together. Further, in terms of adaptability, elastic computation solutions are much needed as the actual serving workload fluctuates constantly, and scaling the hardware resources to handle the fluctuating workload is typically infeasible. To address these challenges, we introduce SINGA-Easy, a new deep learning framework that provides distributed hyper-parameter tuning at the training stage, dynamic computational cost control at the inference stage, and intuitive user interactions with multimedia contents facilitated by model explanation. Our experiments on the training and deployment of multi-modality data analysis applications show that the framework is both usable and adaptable to dynamic inference loads. We implement SINGA-Easy on top of Apache SINGA and demonstrate our system with the entire machine learning life cycle. △ Less

Submitted 3 August, 2021; originally announced August 2021.

Comments: 10 pages, 10 figures

arXiv:2107.02583 [pdf, other]

Point Cloud Registration using Representative Overlap** Points

Authors: Lifa Zhu, Dongrui Liu, Changwei Lin, Rui Yan, Francisco Gómez-Fernández, Ninghua Yang, Ziyong Feng

Abstract: 3D point cloud registration is a fundamental task in robotics and computer vision. Recently, many learning-based point cloud registration methods based on correspondences have emerged. However, these methods heavily rely on such correspondences and meet great challenges with partial overlap. In this paper, we propose ROPNet, a new deep learning model using Representative Overlap** Points with di… ▽ More 3D point cloud registration is a fundamental task in robotics and computer vision. Recently, many learning-based point cloud registration methods based on correspondences have emerged. However, these methods heavily rely on such correspondences and meet great challenges with partial overlap. In this paper, we propose ROPNet, a new deep learning model using Representative Overlap** Points with discriminative features for registration that transforms partial-to-partial registration into partial-to-complete registration. Specifically, we propose a context-guided module which uses an encoder to extract global features for predicting point overlap score. To better find representative overlap** points, we use the extracted global features for coarse alignment. Then, we introduce a Transformer to enrich point features and remove non-representative points based on point overlap score and feature matching. A similarity matrix is built in a partial-to-complete mode, and finally, weighted SVD is adopted to estimate a transformation matrix. Extensive experiments over ModelNet40 using noisy and partially overlap** point clouds show that the proposed method outperforms traditional and learning-based methods, achieving state-of-the-art performance. The code is available at https://github.com/zhulf0804/ROPNet. △ Less

Submitted 6 July, 2021; originally announced July 2021.

arXiv:2106.15779 [pdf, other]

Dual Adversarial Variational Embedding for Robust Recommendation

Authors: Qiaomin Yi, Ning Yang, Philip S. Yu

Abstract: Robust recommendation aims at capturing true preference of users from noisy data, for which there are two lines of methods have been proposed. One is based on noise injection, and the other is to adopt the generative model Variational Auto-encoder (VAE). However, the existing works still face two challenges. First, the noise injection based methods often draw the noise from a fixed noise distribut… ▽ More Robust recommendation aims at capturing true preference of users from noisy data, for which there are two lines of methods have been proposed. One is based on noise injection, and the other is to adopt the generative model Variational Auto-encoder (VAE). However, the existing works still face two challenges. First, the noise injection based methods often draw the noise from a fixed noise distribution given in advance, while in real world, the noise distributions of different users and items may differ from each other due to personal behaviors and item usage patterns. Second, the VAE based models are not expressive enough to capture the true preference since VAE often yields an embedding space of a single modal, while in real world, user-item interactions usually exhibit multi-modality on user preference distribution. In this paper, we propose a novel model called Dual Adversarial Variational Embedding (DAVE) for robust recommendation, which can provide personalized noise reduction for different users and items, and capture the multi-modality of the embedding space, by combining the advantages of VAE and adversarial training between the introduced auxiliary discriminators and the variational inference networks. The extensive experiments conducted on real datasets verify the effectiveness of DAVE on robust recommendation. △ Less

Submitted 29 June, 2021; originally announced June 2021.

arXiv:2106.13605 [pdf, other]

doi 10.1103/PhysRevB.104.115411

Thermal dissipation in the quantum Hall regime in graphene

Authors: J. -Y. Fang, N. -X. Yang, Q. Yan, A. -M. Guo, Q. -F. Sun

Abstract: It is widely accepted that both backscattering and dissipation cannot occur in topological systems because of the topological protection. Here we show that the thermal dissipation can occur in the quantum Hall (QH) regime in graphene in the presence of dissipation sources, although the Hall plateaus and the zero longitudinal resistance still survive. Dissipation appears along the downstream chiral… ▽ More It is widely accepted that both backscattering and dissipation cannot occur in topological systems because of the topological protection. Here we show that the thermal dissipation can occur in the quantum Hall (QH) regime in graphene in the presence of dissipation sources, although the Hall plateaus and the zero longitudinal resistance still survive. Dissipation appears along the downstream chiral flow direction of the constriction in the Hall plateau regime, but it occurs mainly in the bulk in the Hall plateau transition regime. In addition, dissipation processes are accompanied with the evolution of the energy distribution from non-equilibrium to equilibrium. This indicates that topology neither prohibits the appearance of dissipation nor prohibits entropy increasing, which opens a new topic on the dissipation in topological systems. △ Less

Submitted 25 June, 2021; originally announced June 2021.

Comments: 10 pages, 9 figures

Journal ref: Phys. Rev. B 104, 115411 (2021)

Showing 101–150 of 425 results for author: Yang, N