-
Style-Aware Contrastive Learning for Multi-Style Image Captioning
Authors:
Yucheng Zhou,
Guodong Long
Abstract:
Existing multi-style image captioning methods show promising results in generating a caption with accurate visual content and desired linguistic style. However, existing methods overlook the relationship between linguistic style and visual content. To overcome this drawback, we propose style-aware contrastive learning for multi-style image captioning. First, we present a style-aware visual encoder with contrastive learning to mine potential visual content relevant to style. Moreover, we propose a style-aware triplet contrast objective to distinguish whether the image, style and caption match. To provide positive and negative samples for contrastive learning, we present three retrieval schemes: object-based retrieval, RoI-based retrieval and triplet-based retrieval, and design a dynamic trade-off function to calculate retrieval scores. Experimental results demonstrate that our approach achieves state-of-the-art performance. In addition, we conduct an extensive analysis to verify the effectiveness of our method.
Submitted 26 January, 2023;
originally announced January 2023.
-
Improving Cross-modal Alignment for Text-Guided Image Inpainting
Authors:
Yucheng Zhou,
Guodong Long
Abstract:
Text-guided image inpainting (TGII) aims to restore missing regions based on a given text in a damaged image. Existing methods are based on a strong vision encoder and a cross-modal fusion model to integrate cross-modal features. However, these methods allocate most of the computation to visual encoding, while devoting only light computation to modeling modality interactions. Moreover, they take cross-modal fusion for depth features, which ignores a fine-grained alignment between text and image. Recently, vision-language pre-trained models (VLPM), encapsulating rich cross-modal alignment knowledge, have advanced in most multimodal tasks. In this work, we propose a novel model for TGII by improving cross-modal alignment (CMA). The CMA model consists of a VLPM as a vision-language encoder, an image generator and global-local discriminators. To explore cross-modal alignment knowledge for image restoration, we introduce cross-modal alignment distillation and in-sample distribution distillation. In addition, we employ adversarial training to enhance the model to fill the missing region in complicated structures effectively. Experiments are conducted on two popular vision-language datasets. Results show that our model achieves state-of-the-art performance compared with other strong competitors.
Submitted 26 January, 2023;
originally announced January 2023.
-
Multimodal Event Transformer for Image-guided Story Ending Generation
Authors:
Yucheng Zhou,
Guodong Long
Abstract:
Image-guided story ending generation (IgSEG) aims to generate a story ending based on given story plots and an ending image. Existing methods focus on cross-modal feature fusion but overlook reasoning and mining implicit information from story plots and the ending image. To tackle this drawback, we propose a multimodal event transformer, an event-based reasoning framework for IgSEG. Specifically, we construct visual and semantic event graphs from story plots and the ending image, and leverage event-based reasoning to reason and mine implicit information in a single modality. Next, we connect visual and semantic event graphs and utilize cross-modal fusion to integrate different-modality features. In addition, we propose a multimodal injector to adaptively pass essential information to the decoder. Besides, we present an incoherence detection to enhance the model's understanding of story-plot context and the robustness of its graph modeling. Experimental results show that our method achieves state-of-the-art performance for image-guided story ending generation.
Submitted 26 January, 2023;
originally announced January 2023.
-
Prompt Federated Learning for Weather Forecasting: Toward Foundation Models on Meteorological Data
Authors:
Shengchao Chen,
Guodong Long,
Tao Shen,
Jing Jiang
Abstract:
To tackle the global climate challenge, there is an urgent need to develop a collaborative platform for comprehensive weather forecasting on large-scale meteorological data. Despite this urgency, heterogeneous meteorological sensors across countries and regions, inevitably causing multivariate heterogeneity and data exposure, become the main barrier. This paper develops a foundation model across regions capable of understanding complex meteorological data and providing weather forecasting. To relieve the data exposure concern across regions, a novel federated learning approach has been proposed to collaboratively learn a brand-new spatio-temporal Transformer-based foundation model across participants with heterogeneous meteorological data. Moreover, a novel prompt learning mechanism has been adopted to satisfy low-resourced sensors' communication and computational constraints. The effectiveness of the proposed method has been demonstrated on classical weather forecasting tasks using three meteorological datasets with multivariate time series.
Submitted 27 May, 2023; v1 submitted 22 January, 2023;
originally announced January 2023.
-
Federated Recommendation with Additive Personalization
Authors:
Zhiwei Li,
Guodong Long,
Tianyi Zhou
Abstract:
Building recommendation systems via federated learning (FL) is an emerging challenge for advancing next-generation Internet service and privacy protection. Existing approaches train a shared item embedding by FL while keeping the user embedding private on the client side. However, an item embedding identical for all clients cannot capture users' individual differences in perceiving the same item and thus leads to poor personalization. Moreover, dense item embedding in FL results in expensive communication cost and latency. To address these challenges, we propose Federated Recommendation with Additive Personalization (FedRAP), which learns a global view of items via FL and a personalized view locally on each user. FedRAP enforces sparsity of the global view to save FL's communication cost and encourages difference between the two views through regularization. We propose an effective curriculum to learn the local and global views progressively with increasing regularization weights. To produce recommendations for a user, FedRAP adds the two views together to obtain a personalized item embedding. FedRAP achieves the best performance in the FL setting on multiple benchmarks. It outperforms recent federated recommendation methods and several ablation study baselines.
Submitted 7 February, 2024; v1 submitted 22 January, 2023;
originally announced January 2023.
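The additive step described in the FedRAP abstract above (a shared global item view plus a per-user local view, summed into a personalized item embedding) can be illustrated with a small sketch. This is not the authors' implementation: the function names, the toy embeddings, the dot-product scoring, and the use of an L1 norm as the sparsity measure are all assumptions made for illustration.

```python
# Hypothetical sketch of additive personalization; every name and number here
# is illustrative and not taken from the FedRAP paper or its code.

def add_views(global_view, local_view):
    """Element-wise sum of the shared (global) and private (local) item embeddings."""
    return [
        [g + l for g, l in zip(global_row, local_row)]
        for global_row, local_row in zip(global_view, local_view)
    ]


def score_items(user_embedding, item_embeddings):
    """Dot product of the user embedding with each item embedding (assumed scorer)."""
    return [
        sum(u * e for u, e in zip(user_embedding, item_row))
        for item_row in item_embeddings
    ]


def l1_norm(matrix):
    """Sum of absolute values; one common way to measure or penalize density."""
    return sum(abs(x) for row in matrix for x in row)


# Three items with two-dimensional embeddings. The global view is mostly zeros,
# mimicking the sparsity the abstract says is enforced to save communication.
global_view = [[0.5, 0.0], [0.0, 0.0], [0.0, 1.0]]
local_view = [[0.1, 0.2], [0.3, -0.1], [0.0, 0.5]]
user_embedding = [1.0, 2.0]

personalized = add_views(global_view, local_view)
scores = score_items(user_embedding, personalized)
best_item = max(range(len(scores)), key=lambda i: scores[i])
print(scores, best_item)
```

In a federated setting, only the global view would be exchanged with the server, so its sparsity directly bounds the payload; the local view and the user embedding would stay on the device.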
-
Dual Personalization on Federated Recommendation
Authors:
Chunxu Zhang,
Guodong Long,
Tianyi Zhou,
Peng Yan,
Zijian Zhang,
Chengqi Zhang,
Bo Yang
Abstract:
Federated recommendation is a new Internet service architecture that aims to provide privacy-preserving recommendation services in federated settings. Existing solutions are used to combine distributed recommendation algorithms and privacy-preserving mechanisms. Thus it inherently takes the form of heavyweight models at the server and hinders the deployment of on-device intelligent models to end-users. This paper proposes a novel Personalized Federated Recommendation (PFedRec) framework to learn many user-specific lightweight models to be deployed on smart devices rather than a heavyweight model on a server. Moreover, we propose a new dual personalization mechanism to effectively learn fine-grained personalization on both users and items. The overall learning process is formulated into a unified federated optimization framework. Specifically, unlike previous methods that share exactly the same item embeddings across users in a federated system, dual personalization allows mild finetuning of item embeddings for each user to generate user-specific views for item representations which can be integrated into existing federated recommendation methods to gain improvements immediately. Experiments on multiple benchmark datasets have demonstrated the effectiveness of PFedRec and the dual personalization mechanism. Moreover, we provide visualizations and in-depth analysis of the personalization techniques in item embedding, which shed novel insights on the design of recommender systems in federated settings. The code is available.
Submitted 13 May, 2023; v1 submitted 16 January, 2023;
originally announced January 2023.
-
Factoring integers with sublinear resources on a superconducting quantum processor
Authors:
Bao Yan,
Ziqi Tan,
Shijie Wei,
Haocong Jiang,
Weilong Wang,
Hong Wang,
Lan Luo,
Qianheng Duan,
Yiting Liu,
Wenhao Shi,
Yangyang Fei,
Xiangdong Meng,
Yu Han,
Zheng Shan,
Jiachen Chen,
Xuhao Zhu,
Chuanyu Zhang,
Feitong Jin,
Hekang Li,
Chao Song,
Zhen Wang,
Zhi Ma,
H. Wang,
Gui-Lu Long
Abstract:
Shor's algorithm has seriously challenged information security based on public key cryptosystems. However, to break the widely used RSA-2048 scheme, one needs millions of physical qubits, which is far beyond current technical capabilities. Here, we report a universal quantum algorithm for integer factorization by combining the classical lattice reduction with a quantum approximate optimization algorithm (QAOA). The number of qubits required is $O(\log N/\log\log N)$, which is sublinear in the bit length of the integer $N$, making it the most qubit-saving factorization algorithm to date. We demonstrate the algorithm experimentally by factoring integers up to 48 bits with 10 superconducting qubits, the largest integer factored on a quantum device. We estimate that a quantum circuit with 372 physical qubits and a depth of thousands is necessary to challenge RSA-2048 using our algorithm. Our study shows great promise in expediting the application of current noisy quantum computers, and paves the way to factor large integers of realistic cryptographic significance.
Submitted 23 December, 2022;
originally announced December 2022.
-
Fine-Grained Distillation for Long Document Retrieval
Authors:
Yucheng Zhou,
Tao Shen,
Xiubo Geng,
Chongyang Tao,
Guodong Long,
Can Xu,
Daxin Jiang
Abstract:
Long document retrieval aims to fetch query-relevant documents from a large-scale collection, where knowledge distillation has become the de facto approach to improving a retriever by mimicking a heterogeneous yet powerful cross-encoder. However, in contrast to passages or sentences, retrieval on long documents suffers from the scope hypothesis that a long document may cover multiple topics. This maximizes their structure heterogeneity and poses a granular-mismatch issue, leading to an inferior distillation efficacy. In this work, we propose a new learning framework, fine-grained distillation (FGD), for long-document retrievers. While preserving the conventional dense retrieval paradigm, it first produces global-consistent representations crossing different fine granularity and then applies multi-granular aligned distillation merely during training. In experiments, we evaluate our framework on two long-document retrieval benchmarks, which show state-of-the-art performance.
Submitted 20 December, 2022;
originally announced December 2022.
-
Single-photon-memory measurement-device-independent quantum secure direct communication
Authors:
Xiang-Jie Li,
Dong Pan,
Gui-Lu Long,
Lajos Hanzo
Abstract:
Quantum secure direct communication (QSDC) uses the quantum channel to transmit information reliably and securely. In order to eliminate the security loopholes resulting from practical detectors, the measurement-device-independent (MDI) QSDC protocol has been proposed. However, block-based transmission of quantum states is utilized in MDI-QSDC, which requires practical quantum memory that is still unavailable at the time of writing. For circumventing this impediment, we propose a single-photon-memory MDI QSDC protocol (SPMQC) for dispensing with high-performance quantum memory. The performance of the proposed protocol is characterized by simulations considering realistic experimental parameters, and the results show that it is feasible to implement SPMQC by relying on present-day technology.
Submitted 11 December, 2022;
originally announced December 2022.
-
Federated Learning on Non-IID Graphs via Structural Knowledge Sharing
Authors:
Yue Tan,
Yixin Liu,
Guodong Long,
Jing Jiang,
Qinghua Lu,
Chengqi Zhang
Abstract:
Graph neural networks (GNNs) have shown their superiority in modeling graph data. Owing to the advantages of federated learning, federated graph learning (FGL) enables clients to train strong GNN models in a distributed manner without sharing their private data. A core challenge in federated systems is the non-IID problem, which also widely exists in real-world graph data. For example, local data of clients may come from diverse datasets or even domains, e.g., social networks and molecules, increasing the difficulty for FGL methods to capture commonly shared knowledge and learn a generalized encoder. From real-world graph datasets, we observe that some structural properties are shared by various domains, presenting great potential for sharing structural knowledge in FGL. Inspired by this, we propose FedStar, an FGL framework that extracts and shares the common underlying structure information for inter-graph federated learning tasks. To explicitly extract the structure information rather than encoding them along with the node features, we define structure embeddings and encode them with an independent structure encoder. Then, the structure encoder is shared across clients while the feature-based knowledge is learned in a personalized way, making FedStar capable of capturing more structure-based domain-invariant information and avoiding feature misalignment issues. We perform extensive experiments over both cross-dataset and cross-domain non-IID FGL settings, demonstrating the superiority of FedStar.
Submitted 23 November, 2022;
originally announced November 2022.
-
Near-Term Quantum Computing Techniques: Variational Quantum Algorithms, Error Mitigation, Circuit Compilation, Benchmarking and Classical Simulation
Authors:
He-Liang Huang,
Xiao-Yue Xu,
Chu Guo,
Guojing Tian,
Shi-Jie Wei,
Xiaoming Sun,
Wan-Su Bao,
Gui-Lu Long
Abstract:
Quantum computing is a game-changing technology for global academia, research centers and industries including computational science, mathematics, finance, pharmaceutical, materials science, chemistry and cryptography. Although it has seen a major boost in the last decade, we are still a long way from reaching the maturity of a full-fledged quantum computer. That said, we will be in the Noisy Intermediate-Scale Quantum (NISQ) era for a long time, working on quantum computing systems with dozens or even thousands of qubits. An outstanding challenge, then, is to come up with an application that can reliably carry out a nontrivial task of interest on the near-term quantum devices with non-negligible quantum noise. To address this challenge, several near-term quantum computing techniques, including variational quantum algorithms, error mitigation, quantum circuit compilation and benchmarking protocols, have been proposed to characterize and mitigate errors, and to implement algorithms with a certain resistance to noise, so as to enhance the capabilities of near-term quantum devices and explore the boundaries of their ability to realize useful applications. Besides, the development of near-term quantum devices is inseparable from efficient classical simulation, which plays a vital role in quantum algorithm design and verification, error-tolerant verification and other applications. This review will provide a thorough introduction to these near-term quantum computing techniques, report on their progress, and finally discuss the future prospects of these techniques, which we hope will motivate researchers to undertake additional studies in this field.
Submitted 27 December, 2022; v1 submitted 16 November, 2022;
originally announced November 2022.
-
The weak coupling theory of all dimensional loop quantum gravity
Authors:
Gaoping Long,
Chun-Yen Lin
Abstract:
The weak coupling loop quantum theory with Abelian gauge group provides us a new perspective to study the weak coupling properties of LQG. In this paper, the weak coupling theory of all dimensional loop quantum gravity is established based on a symplectic-morphism between the $SO(D+1)$ holonomy-flux phase space and the $U(1)^{\frac{D(D+1)}{2}}$ holonomy-flux phase space. More explicitly, the Gaussian, simplicity, diffeomorphism and scalar constraint operators in $SO(D+1)$ loop quantum gravity will be generalized to the $U(1)^{\frac{D(D+1)}{2}}$ loop quantum theory based on the symplectic-morphism, and the $U(1)^{\frac{D(D+1)}{2}}$ loop quantum theory equipped with these constraint operators gives the weak coupling $U(1)^{\frac{D(D+1)}{2}}$ loop quantum gravity, whose corresponding Hilbert space is composed of the $U(1)^{\frac{D(D+1)}{2}}$ heat-kernel coherent states which are peaked at the weak coupling region of the $U(1)^{\frac{D(D+1)}{2}}$ holonomy-flux phase space.
Submitted 16 November, 2022; v1 submitted 14 November, 2022;
originally announced November 2022.
-
CCPrefix: Counterfactual Contrastive Prefix-Tuning for Many-Class Classification
Authors:
Yang Li,
Canran Xu,
Guodong Long,
Tao Shen,
Chongyang Tao,
Jing Jiang
Abstract:
Recently, prefix-tuning was proposed to efficiently adapt pre-trained language models to a broad spectrum of natural language classification tasks. It leverages soft prefix as task-specific indicators and language verbalizers as categorical-label mentions to narrow the formulation gap from pre-training language models. However, when the label space increases considerably (i.e., many-class classification), such a tuning technique suffers from a verbalizer ambiguity problem since the many-class labels are represented by semantic-similar verbalizers in short language phrases. To overcome this, inspired by the human-decision process that the most ambiguous classes would be mulled over for each instance, we propose a brand-new prefix-tuning method, Counterfactual Contrastive Prefix-tuning (CCPrefix), for many-class classification. Basically, an instance-dependent soft prefix, derived from fact-counterfactual pairs in the label space, is leveraged to complement the language verbalizers in many-class classification. We conduct experiments on many-class benchmark datasets in both the fully supervised setting and the few-shot setting, which indicates that our model outperforms former baselines.
Submitted 12 February, 2024; v1 submitted 10 November, 2022;
originally announced November 2022.
-
Unsupervised Knowledge Graph Construction and Event-centric Knowledge Infusion for Scientific NLI
Authors:
Chenglin Wang,
Yucheng Zhou,
Guodong Long,
Xiaodong Wang,
Xiaowei Xu
Abstract:
With the advance of natural language inference (NLI), a rising demand for NLI is to handle scientific texts. Existing methods depend on pre-trained models (PTM) which lack domain-specific knowledge. To tackle this drawback, we introduce a scientific knowledge graph to generalize PTM to the scientific domain. However, existing knowledge graph construction approaches suffer from some drawbacks, i.e., expensive labeled data, failure to apply in other domains, long inference time and difficulty extending to large corpora. Therefore, we propose an unsupervised knowledge graph construction method to build a scientific knowledge graph (SKG) without any labeled data. Moreover, to alleviate the noise effect from SKG and complement knowledge in sentences better, we propose an event-centric knowledge infusion method to integrate external knowledge into each event that is a fine-grained semantic unit in sentences. Experimental results show that our method achieves state-of-the-art performance and demonstrate the effectiveness and reliability of SKG.
Submitted 27 October, 2022; v1 submitted 27 October, 2022;
originally announced October 2022.
-
A Probabilistic Imaginary Time Evolution Algorithm Based on Non-unitary Quantum Circuit
Authors:
Hao-Nan Xie,
Shi-Jie Wei,
Fan Yang,
Zheng-An Wang,
Chi-Tong Chen,
Heng Fan,
Gui-Lu Long
Abstract:
Imaginary time evolution is a powerful tool applied in quantum physics, while existing classical algorithms for simulating imaginary time evolution suffer from high computational complexity as the quantum systems become larger and more complex. In this work, we propose a probabilistic algorithm for implementing imaginary time evolution based on a non-unitary quantum circuit. We demonstrate the feasibility of this method by solving the ground state energy of several quantum many-body systems, including H2, LiH molecules and the quantum Ising chain. Moreover, we perform experiments on superconducting and trapped ion cloud platforms respectively to find the ground state energy of H2 and its most stable molecular structure. We also analyze the success probability of the algorithm, which is a polynomial of the output error, and introduce an approach to increase the success probability by rearranging the terms of the Hamiltonian.
Submitted 11 October, 2022;
originally announced October 2022.
-
Multifaceted Hierarchical Report Identification for Non-Functional Bugs in Deep Learning Frameworks
Authors:
Guoming Long,
Tao Chen,
Georgina Cosma
Abstract:
Non-functional bugs (e.g., performance- or accuracy-related bugs) in Deep Learning (DL) frameworks can lead to some of the most devastating consequences. Reporting those bugs on a repository such as GitHub is a standard route to fix them. Yet, given the growing number of new GitHub reports for DL frameworks, it is intrinsically difficult for developers to distinguish those that reveal non-functional bugs among the others, and assign them to the right contributor for investigation in a timely manner. In this paper, we propose MHNurf - an end-to-end tool for automatically identifying non-functional bug related reports in DL frameworks. The core of MHNurf is a Multifaceted Hierarchical Attention Network (MHAN) that tackles three unaddressed challenges: (1) learning the semantic knowledge, but doing so by (2) considering the hierarchy (e.g., words/tokens in sentences/statements) and focusing on the important parts (i.e., words, tokens, sentences, and statements) of a GitHub report, while (3) independently extracting information from different types of features, i.e., content, comment, code, command, and label.
To evaluate MHNurf, we leverage 3,721 GitHub reports from five DL frameworks for conducting experiments. The results show that MHNurf works the best with a combination of content, comment, and code, which considerably outperforms the classic HAN where only the content is used. MHNurf also produces significantly more accurate results than nine other state-of-the-art classifiers with strong statistical significance, i.e., up to 71% AUC improvement and has the best Scott-Knott rank on four frameworks while 2nd on the remaining one. To facilitate reproduction and promote future research, we have made our dataset, code, and detailed supplementary results publicly available at: https://github.com/ideas-labo/APSEC2022-MHNurf.
Submitted 4 October, 2022;
originally announced October 2022.
-
Federated Learning from Pre-Trained Models: A Contrastive Learning Approach
Authors:
Yue Tan,
Guodong Long,
Jie Ma,
Lu Liu,
Tianyi Zhou,
Jing Jiang
Abstract:
Federated Learning (FL) is a machine learning paradigm that allows decentralized clients to learn collaboratively without sharing their private data. However, excessive computation and communication demands pose challenges to current FL frameworks, especially when training large-scale models. To prevent these issues from hindering the deployment of FL systems, we propose a lightweight framework where clients jointly learn to fuse the representations generated by multiple fixed pre-trained models rather than training a large-scale model from scratch. This leads us to a more practical FL problem by considering how to capture more client-specific and class-relevant information from the pre-trained models and jointly improve each client's ability to exploit those off-the-shelf models. In this work, we design a Federated Prototype-wise Contrastive Learning (FedPCL) approach which shares knowledge across clients through their class prototypes and builds client-specific representations in a prototype-wise contrastive manner. Sharing prototypes rather than learnable model parameters allows each client to fuse the representations in a personalized way while keeping the shared knowledge in a compact form for efficient communication. We perform a thorough evaluation of the proposed FedPCL in the lightweight framework, measuring and visualizing its ability to fuse various pre-trained models on popular FL datasets.
Submitted 20 September, 2022;
originally announced September 2022.
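The FedPCL abstract above describes clients sharing class prototypes instead of model parameters. A minimal sketch of that exchange follows, assuming a prototype is the per-class mean of a client's feature vectors and that the server combines prototypes with an unweighted average; both choices, along with the function names and toy data, are assumptions for illustration and may differ from the paper. The contrastive training step itself is not shown.

```python
# Hypothetical sketch of prototype sharing; the mean-based prototype and the
# unweighted server average are assumptions, not details confirmed by the abstract.

def class_prototypes(features, labels):
    """Return {label: mean feature vector} for one client's local data."""
    grouped = {}
    for vector, label in zip(features, labels):
        grouped.setdefault(label, []).append(vector)
    return {
        label: [sum(column) / len(vectors) for column in zip(*vectors)]
        for label, vectors in grouped.items()
    }


def aggregate_prototypes(client_prototypes):
    """Average each class prototype over the clients that actually hold that class."""
    grouped = {}
    for prototypes in client_prototypes:
        for label, vector in prototypes.items():
            grouped.setdefault(label, []).append(vector)
    return {
        label: [sum(column) / len(vectors) for column in zip(*vectors)]
        for label, vectors in grouped.items()
    }


# Client A holds classes 0 and 1; client B holds only class 0 (a non-IID split).
client_a = class_prototypes([[1.0, 0.0], [3.0, 0.0], [0.0, 2.0]], [0, 0, 1])
client_b = class_prototypes([[4.0, 2.0]], [0])
global_prototypes = aggregate_prototypes([client_a, client_b])
print(global_prototypes)
```

Each client sends one vector per class it holds, so the payload grows with the number of classes and the feature dimension rather than with the size of the model.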
-
Optomechanical compensatory cooling mechanism with exceptional points
Authors:
Guo-Qing Qin,
Xuan Mao,
Hao Zhang,
Peng-Yu Wen,
Gui-Qin Li,
Dong Ruan,
Gui-Lu Long
Abstract:
The ground-state cooling of Brillouin scattering optomechanical systems is limited by defects in practical samples. In this paper, we propose a new compensatory cooling mechanism for Brillouin scattering optomechanical systems with exceptional points (EPs). By using EPs in both the optical and mechanical modes, the limited cooling process is compensated effectively. The dual-EP system, which is discovered in this work for the first time, can be induced by two defects with specific relative angles and can actively manipulate the coupling strength of not only the optical modes but also the Brillouin phonon modes. Our results provide new tools to manipulate the optomechanical interaction in multi-mode systems and open the possibility of quantum state transfer and quantum interface protocols based on phonon cooling in quantum applications.
Submitted 7 September, 2022;
originally announced September 2022.
-
On the gauge reduction with respect to simplicity constraint in all dimensional loop quantum gravity
Authors:
Gaoping Long,
Xiangdong Zhang
Abstract:
In this paper, we discuss the gauge reduction with respect to the simplicity constraint in both the classical and quantum theory of all dimensional loop quantum gravity (LQG). With the gauge reduction with respect to the edge-simplicity constraint carried out and the anomalous vertex simplicity constraint imposed weakly in the holonomy-flux phase space, the simplicity reduced holonomy can be established. However, we find that the simplicity reduced holonomy cannot capture the degrees of freedom of intrinsic curvature, so it fails to yield a correct scalar constraint operator in all dimensional LQG following the standard strategy. To tackle this problem, we establish a new type of holonomy corresponding to the simplicity reduced connection, which captures the degrees of freedom of both intrinsic and extrinsic curvature properly. Based on this new type of holonomy, we propose three new strategies to construct the scalar constraint operators, which serve as valuable candidates to study the dynamics of all dimensional LQG in the future.
Submitted 5 September, 2022;
originally announced September 2022.
-
The formation of the stripped envelope type II b Supernova progenitors: Rotation, Metallicity and Overshooting
Authors:
Gang Long,
Hanfeng Song,
Georges Meynet,
Andre Maeder,
Ruiyu Zhang,
Ying Qin,
Sylvia Ekström,
Cyril Georgy,
Liuyan Zhao
Abstract:
Type IIb supernovae are believed to originate from core-collapse progenitors that have kept only a very thin hydrogen envelope. We aim to explore how some physical factors, such as rotation, metallicity, overshooting, and the initial orbital period in binaries, significantly affect the Roche lobe overflow and the formation of type IIb supernovae. It is found that binaries are the main channel capable of producing type IIb supernova progenitors for initial masses below 20 $M_{\odot}$. The formation of type IIb supernova progenitors is extremely sensitive to the initial orbital period. A less massive hydrogen envelope indicates a smaller radius and a higher effective temperature, and vice versa. Binary systems with initial periods between 300 and 720 days produce red supergiant progenitors. Those with initial periods between 50 and 300 days produce yellow supergiant progenitors, and those with initial periods shorter than 50 days produce blue supergiant progenitors. Both rapid rotation and larger overshooting can enlarge the carbon-oxygen core mass and lead to a higher core temperature and lower central density at the pre-collapse phase. They are also beneficial to surface nitrogen enrichment but restrict the efficiency of the first dredge-up. SN IIb progenitors with low metallicity have smaller hydrogen envelope masses and radii than their high-metallicity counterparts. Ultra-stripped binary models have a systematically higher $\rm ^{12}C$ mass fraction left in the core, which has an important influence on the compactness of type IIb progenitors.
Submitted 24 August, 2022;
originally announced August 2022.
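The period ranges quoted in the abstract above can be written as a small lookup, shown here as a sketch. The function name is hypothetical, and how the boundary values (exactly 50, 300 or 720 days) are assigned is an assumption, since the abstract does not say which side they fall on.

```python
def progenitor_type(initial_period_days):
    """Map an initial orbital period (days) to the type IIb progenitor class
    quoted in the abstract. Boundary handling is an assumption."""
    if initial_period_days <= 0:
        raise ValueError("orbital period must be positive")
    if initial_period_days < 50:
        return "blue supergiant"
    if initial_period_days < 300:
        return "yellow supergiant"
    if initial_period_days <= 720:
        return "red supergiant"
    # The abstract gives no classification beyond 720 days.
    return None


examples = {p: progenitor_type(p) for p in (10, 100, 500, 1000)}
```

Periods above 720 days return `None` because the abstract states no outcome for them.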
-
The radiation emitted from axion dark matter in a homogeneous magnetic field, and possibilities for detection
Authors:
Shuo Xu,
Siyu Chen,
Hong-Hao Zhang,
Guangbo Long
Abstract:
We study the direct radiation excited by oscillating axion (or axion-like particle) dark matter in a homogeneous magnetic field and its detection scheme. We derive the analytical expression of the axion-induced radiated power for a cylindrical uniform magnetic field. In the long-wave limit, the radiation power is proportional to the square of the B-field volume and the axion mass $m_a$, whereas it oscillates on approaching the short-wave limit, where the peak powers are proportional to the side area of the cylindrical magnetic field and $m_a^{-2}$. The maximum power is located at mass $m_a\sim\frac{3\pi}{4R}$ for fixed radius $R$. Based on this characteristic of the power, we discuss a scheme to detect axions in the mass range $1-10^4$\,neV, where four detectors of different bandwidths surround the B-field. The expected sensitivity for $m_a\lesssim1\,\mu$eV under typical parameter values can far exceed the existing constraints.
Submitted 24 August, 2022; v1 submitted 22 August, 2022;
originally announced August 2022.
-
Simultaneous ground-state cooling of multiple degenerate mechanical modes through cross-Kerr effect
Authors:
Pengyu Wen,
Xuan Mao,
Min Wang,
Chuan Wang,
Gui-Qin Li,
Gui-Lu Long
Abstract:
Simultaneous ground-state cooling of multiple degenerate mechanical modes is a difficult problem in optomechanical systems due to the dark mode effect. Here we propose a universal and scalable method to break the dark mode effect of two degenerate mechanical modes by introducing the cross-Kerr (CK) nonlinearity. At most four stable steady states can be achieved in our scheme in the presence of the CK effect, in contrast to the bistable behavior of the standard optomechanical system. Under constant input laser power, the effective detuning and mechanical resonant frequency can be modulated by the CK nonlinearity, which results in an optimal CK coupling strength for cooling. Similarly, there is an optimal input laser power for cooling when the CK coupling strength stays fixed. Our scheme can be extended to break the dark mode effect of multiple degenerate mechanical modes by introducing more than one CK effect. To achieve simultaneous ground-state cooling of N degenerate mechanical modes, N-1 CK effects with different strengths are needed. Our proposal provides new insights into dark mode control and might pave the way to manipulating multiple quantum states in macroscopic systems.
Submitted 20 August, 2022;
originally announced August 2022.
-
Dynamical encircling exceptional point in largely detuned multimode optomechanical system
Authors:
Dan Long,
Xuan Mao,
Guo-Qing Qin,
Hao Zhang,
Min Wang,
Gui-Qin Li,
Gui-Lu Long
Abstract:
Dynamical encircling of an exceptional point (EP) shows a number of intriguing physical phenomena and has potential applications. To enrich the manipulation of optical systems in experiments, we study the dynamical encircling of an EP, i.e., the state transfer process, in a largely detuned multimode optomechanical system. The state transfer process is investigated with respect to several factors: the location of the start point, the orientation, and the initial state of the trajectories around the EP in parameter space. Results show that nonreciprocal and chiral topological energy transfer between two optical modes is performed successfully by tuning the effective optomechanical coupling in the multimode system with large detuning. Moreover, the effect of the evolution speed of the system parameters is also discussed. Our work demonstrates the fundamental physics around the EP in the large-detuning domain of multimode optomechanical systems and provides an alternative for manipulating optical modes in non-Hermitian systems.
Submitted 1 August, 2022;
originally announced August 2022.
-
Towards Robust Ranker for Text Retrieval
Authors:
Yucheng Zhou,
Tao Shen,
Xiubo Geng,
Chongyang Tao,
Can Xu,
Guodong Long,
Binxing Jiao,
Daxin Jiang
Abstract:
A ranker plays an indispensable role in the de facto 'retrieval & rerank' pipeline, but its training still lags behind -- learning from moderate negatives and/or serving as an auxiliary module for a retriever. In this work, we first identify two major barriers to a robust ranker, i.e., inherent label noise caused by a well-trained retriever and non-ideal negatives sampled for a highly capable ranker. We therefore propose using multiple retrievers as negative generators to improve the ranker's robustness, where i) involving extensive out-of-distribution label noise renders the ranker robust against each noise distribution, and ii) diverse hard negatives from a joint distribution are relatively close to the ranker's negative distribution, leading to more challenging and thus more effective training. To evaluate our robust ranker (dubbed R$^2$anker), we conduct experiments in various settings on the popular passage retrieval benchmark, including BM25-reranking, full-ranking, retriever distillation, etc. The empirical results verify the new state-of-the-art effectiveness of our model.
Submitted 16 June, 2022;
originally announced June 2022.
-
Optomechanically induced transparency and directional amplification in a non-Hermitian optomechanical lattice
Authors:
Pengyu Wen,
Min Wang,
Gui-Lu Long
Abstract:
Cavity optomechanics is important in both quantum information processing and basic physics research. In this paper, we propose an optomechanical lattice which manifests non-Hermitian physics. We first use the non-Bloch band theory to investigate the energy spectrum and transmission properties of an optomechanical lattice. The generalized Brillouin zone of the system is calculated with the help of the resultant, and the energy spectra under the periodic boundary condition (PBC) and the open boundary condition are then given. By introducing a probe laser on different sites, we observe the directional amplification of the system. The direction of the amplification is analyzed in combination with the non-Hermitian skin effect. The frequency that supports the amplification is analyzed by considering the PBC energy spectrum. By introducing a probe laser on one site, we investigate the onsite transmission properties. Optomechanically induced transparency (OMIT) can be achieved in our system. By varying the parameters and size of the system, the OMIT peak can be effectively modulated or even turned into optomechanically induced amplification. Our system shows potential as a single-way signal filter, and our model can be extended to other non-Hermitian bosonic models which may possess topological features and the bipolar non-Hermitian skin effect.
Submitted 14 June, 2022;
originally announced June 2022.
-
QCSH: a Full Quantum Computer Nuclear Shell-Model Package
Authors:
Peng Lv,
Shi-Jie Wei,
Hao-Nan Xie,
Gui-Lu Long
Abstract:
The nucleus is a typical many-body quantum system. Full calculation of a nuclear system is far beyond the capacity of current classical computers. With the fast development of hardware, the prospect of using quantum computers in nuclear physics is drawing closer. Here, we report a full quantum package, QCSH, for solving the nuclear shell model on a quantum computer. QCSH uses the linear combination of unitaries formalism of quantum computing, and performs all calculations on a quantum computer. The complexities of QCSH in qubit resources and in the number of basic gates are both polynomial in the nuclear size. QCSH can already provide meaningful results in the near term. As examples, the binding energies of twelve light nuclei, $^{2}$H, $^{3}$H, $^{3}$He, $^{4}$He, $^{6}$Li, $^{7}$Li, $^{12}$C, $^{14}$N, $^{16}$O, $^{17}$O, $^{23}$Na and $^{40}$Ca, are calculated using QCSH in a classical quantum emulator. The binding energy of the deuteron has already been experimentally studied using QCSH on a superconducting quantum computing device. QCSH works not only on near-term quantum devices, but also on future large-scale quantum computers. With the development of quantum devices, nuclear systems constitute another promising area for demonstrating practical quantum advantage.
Submitted 24 May, 2022;
originally announced May 2022.
-
UnifieR: A Unified Retriever for Large-Scale Retrieval
Authors:
Tao Shen,
Xiubo Geng,
Chongyang Tao,
Can Xu,
Guodong Long,
Kai Zhang,
Daxin Jiang
Abstract:
Large-scale retrieval aims to recall relevant documents from a huge collection given a query. It relies on representation learning to embed documents and queries into a common semantic encoding space. According to the encoding space, recent retrieval methods based on pre-trained language models (PLMs) can be coarsely categorized into either dense-vector or lexicon-based paradigms. These two paradigms unveil the PLMs' representation capability at different granularities, i.e., global sequence-level compression and local word-level contexts, respectively. Inspired by their complementary global-local contextualization and distinct representing views, we propose a new learning framework, UnifieR, which unifies dense-vector and lexicon-based retrieval in one model with a dual-representing capability. Experiments on passage retrieval benchmarks verify its effectiveness in both paradigms. A uni-retrieval scheme is further presented with even better retrieval quality. Lastly, we evaluate the model on the BEIR benchmark to verify its transferability.
Submitted 4 June, 2023; v1 submitted 23 May, 2022;
originally announced May 2022.
-
FedNoiL: A Simple Two-Level Sampling Method for Federated Learning with Noisy Labels
Authors:
Zhuowei Wang,
Tianyi Zhou,
Guodong Long,
Bo Han,
Jing Jiang
Abstract:
Federated learning (FL) aims at training a global model on the server side while the training data are collected and located at the local devices. Hence, the labels in practice are usually annotated by clients of varying expertise or criteria and thus contain different amounts of noise. Local training on noisy labels can easily result in overfitting to noisy labels, which is devastating to the global model through aggregation. Although recent robust FL methods take malicious clients into account, they have not addressed local noisy labels on each device and their impact on the global model. In this paper, we develop a simple two-level sampling method "FedNoiL" that (1) selects clients for more robust global aggregation on the server; and (2) selects clean labels and correct pseudo-labels at the client end for more robust local training. The sampling probabilities are built upon clean label detection by the global model. Moreover, we investigate different schedules for changing the local epochs between aggregations over the course of FL, which notably improves the communication and computation efficiency in the noisy-label setting. In experiments with homogeneous/heterogeneous data distributions and noise ratios, we observed that direct combinations of SOTA FL methods with SOTA noisy-label learning methods can easily fail but our method consistently achieves better and robust performance.
Submitted 20 May, 2022;
originally announced May 2022.
-
Quantum oscillations in field-induced correlated insulators of a moiré superlattice
Authors:
Le Liu,
Yanbang Chu,
Guang Yang,
Yalong Yuan,
Fanfan Wu,
Yiru Ji,
Jinpeng Tian,
Rong Yang,
Kenji Watanabe,
Takashi Taniguchi,
Gen Long,
Dongxia Shi,
Jianpeng Liu,
Jie Shen,
Li Lu,
Wei Yang,
Guangyu Zhang
Abstract:
We report an observation of quantum oscillations (QOs) in the correlated insulators with valley anisotropy of twisted double bilayer graphene (TDBG). The anomalous QOs are best captured in the magnetoresistivity oscillations of the insulators at v = -2, with a period of 1/B and an oscillation amplitude as high as 150 kΩ. The QOs can survive up to ~10 K, and above 12 K, the insulating behaviors are dominant. The QOs of the insulator are strongly D-dependent: the carrier density extracted from the 1/B periodicity decreases almost linearly with D from -0.7 to -1.1 V/nm, suggesting a reduced Fermi surface; the effective mass from Lifshitz-Kosevich analysis depends nonlinearly on D, reaching a minimal value of 0.1 me at D ≈ -1.0 V/nm. Similar observations of QOs are also found at v = 2, as well as in other devices without a graphite gate. We interpret the D-sensitive QOs of the correlated insulators in the picture of band inversion. By reconstructing an inverted band model with the measured effective mass and Fermi surface, the density of states at the gap, calculated from thermally broadened Landau levels, agrees qualitatively with the observed QOs in the insulators. While more theoretical understanding is needed in the future to fully account for the anomalous QOs in this moiré system, our study suggests that TDBG is an excellent platform to discover exotic phases where correlation and topology are at play.
Submitted 14 May, 2023; v1 submitted 20 May, 2022;
originally announced May 2022.
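The abstract above extracts a carrier density from the 1/B periodicity. A minimal sketch of that step follows, using the standard Onsager relation $n = g\,e\,B_F/h$ with $B_F = 1/\Delta(1/B)$. The function name is hypothetical, and the degeneracy factor $g$ and the example period are assumptions for illustration, since the abstract does not state them.

```python
E_CHARGE = 1.602176634e-19  # elementary charge in C (exact SI value)
PLANCK_H = 6.62607015e-34   # Planck constant in J*s (exact SI value)


def carrier_density_from_period(period_inv_tesla, degeneracy=1):
    """Carrier density (m^-2) from the period of oscillations in 1/B.

    period_inv_tesla: oscillation period Delta(1/B) in 1/T.
    degeneracy: Landau-level degeneracy g (assumed; not given in the abstract).
    """
    if period_inv_tesla <= 0:
        raise ValueError("period must be positive")
    b_f = 1.0 / period_inv_tesla  # oscillation frequency B_F in tesla
    return degeneracy * E_CHARGE * b_f / PLANCK_H


# Example values only: a 0.1 T^-1 period with fourfold degeneracy.
density = carrier_density_from_period(0.1, degeneracy=4)
```

With these example inputs the result is about $9.7\times10^{15}$ m$^{-2}$, that is, roughly $9.7\times10^{11}$ cm$^{-2}$. A longer period in 1/B corresponds to a smaller density, consistent with the reduced Fermi surface described in the abstract.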
-
The thermodynamics of isolated horizons in loop quantum gravity
Authors:
Shupeng Song,
Gaoping Long,
Cong Zhang,
Xiangdong Zhang
Abstract:
The statistical mechanical calculation of the thermodynamical properties of non-rotating isolated horizons is studied in the loop quantum gravity framework. By employing the Hawking temperature and horizon mass of isolated horizons as physical inputs, the microcanonical ensemble associated with the system is well established. As a result, the black hole entropy and other thermodynamical quantities can be computed and are consistent with Hawking's well-known semiclassical analysis. Moreover, the values of the Immirzi parameter of loop quantum gravity for the higher dimensional case and the 4-dimensional U(1) case are also obtained.
Submitted 19 August, 2023; v1 submitted 20 May, 2022;
originally announced May 2022.
-
Experimental demonstration of phase-matching and Sagnac effect in a millimeter-scale wedged resonator gyroscope
Authors:
Xuan Mao,
Hong Yang,
Dan Long,
Min Wang,
Peng-Yu Wen,
Yun-Qi Hu,
Bo-Yang Wang,
Gui-Qin Li,
Jian-Cun Gao,
Gui-Lu Long
Abstract:
The highly efficient coupling of light from conventional optical components to optical mode volumes lies at the heart of chip-based micro-devices, and is determined by the phase-matching between the propagation constants of the fiber taper and the whispering-gallery mode (WGM) of the resonator. Optical gyroscopes, typically realized as fiber-optic gyroscopes and ring-laser gyroscopes, have been the mainstay in diverse applications such as positioning and inertial sensing. Here, the phase-matching is theoretically analyzed and experimentally verified. We observe the Sagnac effect in a millimeter-scale wedged resonator gyroscope, a type of device which has attracted considerable attention and been rapidly promoted in recent years. We demonstrate a bidirectional pump and probe scheme, which directly measures the frequency beat caused by the Sagnac effect. We establish the linear response between the detected beat frequency and the rotation velocity. Clockwise and counterclockwise rotation can also be distinguished according to the value of the frequency beat. The experimental results verify the feasibility of developing gyroscopes in WGM resonator systems and pave the way for future development.
Submitted 13 May, 2022;
originally announced May 2022.
-
A Variational Quantum Attack for AES-like Symmetric Cryptography
Authors:
ZeGuo Wang,
ShiJie Wei,
Gui-Lu Long,
Lajos Hanzo
Abstract:
We propose a variational quantum attack algorithm (VQAA) for classical AES-like symmetric cryptography, as exemplified by the simplified data encryption standard (S-DES). In the VQAA, the known ciphertext is encoded as the ground state of a Hamiltonian that is constructed through a regular graph, and the ground state can be found using a variational approach. We designed the ansatz and cost function for the variational quantum attack on S-DES. Surprisingly, the VQAA is sometimes even faster than Grover's algorithm, as demonstrated by our simulation results. The relationships among the entanglement entropy, concurrence and the cost function are investigated, which indicate that entanglement plays a crucial role in the speedup.
Submitted 6 May, 2022;
originally announced May 2022.
-
Efficient Pipeline Planning for Expedited Distributed DNN Training
Authors:
Ziyue Luo,
Xiaodong Yi,
Guoping Long,
Shiqing Fan,
Chuan Wu,
Jun Yang,
Wei Lin
Abstract:
To train modern large DNN models, pipeline parallelism has recently emerged, which distributes the model across GPUs and enables different devices to process different microbatches in pipeline. Earlier pipeline designs allow multiple versions of model parameters to co-exist (similar to asynchronous training), and cannot ensure the same model convergence and accuracy performance as without pipelining. Synchronous pipelining has recently been proposed which ensures model performance by enforcing a synchronization barrier between training iterations. Nonetheless, the synchronization barrier requires waiting for gradient aggregation from all microbatches and thus delays the training progress. Optimized pipeline planning is needed to minimize such wait and hence the training time, which has not been well studied in the literature. This paper designs efficient, near-optimal algorithms for expediting synchronous pipeline-parallel training of modern large DNNs over arbitrary inter-GPU connectivity. Our algorithm framework comprises two components: a pipeline partition and device mapping algorithm, and a pipeline scheduler that decides processing order of microbatches over the partitions, which together minimize the per-iteration training time. We conduct thorough theoretical analysis, extensive testbed experiments and trace-driven simulation, and demonstrate our scheme can accelerate training up to 157% compared with state-of-the-art designs.
Submitted 22 April, 2022;
originally announced April 2022.
-
On Reporting Performance and Accuracy Bugs for Deep Learning Frameworks: An Exploratory Study from GitHub
Authors:
Guoming Long,
Tao Chen
Abstract:
The tremendous success of Deep Learning (DL) has significantly boosted the number of open-sourced DL frameworks hosted on GitHub. Among others, performance and accuracy bugs are critical factors that affect the reputation of these DL frameworks; therefore, understanding the practice of discovering and investigating them for DL is important. In this paper, we conduct an exploratory study on the nature of reporting performance and accuracy bugs for DL frameworks, aiming to improve our knowledge on this topic. Our study covers the 10 most popular open-sourced DL frameworks on GitHub (e.g., TensorFlow, Keras, and PyTorch), based on which we sample 664 representative performance and accuracy bug reports out of a total population of 22,522. Through systematic analysis of these samples, our key findings are: (1) low speed is the primary reason that a performance bug related report is submitted but we see no consistent pattern for accuracy related ones; (2) most of the reports are about issues encountered in the training stage; (3) only a small proportion of the reports provide insufficient information to investigate; (4) the majority of the performance and accuracy bug reports (from 69% to 100%) are not related to the actual bug or regarded as unclassified; (5) around 50% of the performance and accuracy bug reports, which indeed reveal bugs, are not resolved by direct patches. Deriving from the above, we discuss a set of actionable implications to the researchers, maintainers, and report submitters on this subject. To promote open science, the labeled dataset has been made publicly available at https://tinyurl.com/4x3tap9w.
Submitted 16 April, 2022;
originally announced April 2022.
-
General Hamiltonian Representation of ML Detection Relying on the Quantum Approximate Optimization Algorithm
Authors:
Jingjing Cui,
Gui Lu Long,
Lajos Hanzo
Abstract:
The quantum approximate optimization algorithm (QAOA) conceived for solving combinatorial optimization problems has attracted significant interest since it can be run on the existing noisy intermediate-scale quantum (NISQ) devices. A primary step of using the QAOA is the efficient Hamiltonian construction based on different problem instances. Hence, we solve the maximum likelihood (ML) detection problem for general constellations by appropriately adapting the QAOA, which gives rise to a new paradigm in communication systems. We first transform the ML detection problem into a weighted minimum $N$-satisfiability (WMIN-$N$-SAT) problem, where we formulate the objective function of the WMIN-$N$-SAT as a pseudo Boolean function. Furthermore, we formalize the connection between the degree of the objective function and the Gray-labelled modulation constellations. Explicitly, we show a series of results exploring the connection between the coefficients of the monomials and the patterns of the associated constellation points, which substantially simplifies the objective function with respect to the problem Hamiltonian of the QAOA. In particular, for an M-ary Gray-mapped quadrature amplitude modulation (MQAM) constellation, we show that the specific qubits encoding the in-phase components and those encoding the quadrature components are independent in the quantum system of interest, which allows the in-phase and quadrature components to be detected separately using the QAOA. Furthermore, we characterize the degree of the objective function in the WMIN-$N$-SAT problem corresponding to the ML detection of multiple-input and multiple-output (MIMO) channels. Finally, we evaluate the approximation ratio of the QAOA for the ML detection problem of quadrature phase shift keying (QPSK) relying on QAOA circuits of different depths.
Submitted 11 April, 2022;
originally announced April 2022.
-
Twisted geometry coherent states in all dimensional loop quantum gravity: II. Ehrenfest Property
Authors:
Gaoping Long
Abstract:
In the preceding paper of this series of articles we constructed the twisted geometry coherent states in all dimensional loop quantum gravity and established their peakedness properties. In this paper we establish the "Ehrenfest property" of these coherent states which are labelled by the twisted geometry parameters. By this we mean that the expectation values of the polynomials of the elementary operators as well as the operators which are not polynomial functions of the elementary operators, reproduce, to zeroth order in $\hbar$, the values of the corresponding classical functions at the twisted geometry space point where the coherent state is peaked.
Submitted 13 April, 2022; v1 submitted 6 April, 2022;
originally announced April 2022.
-
Dynamics simulation and numerical analysis of arbitrary time-dependent $\mathcal{PT}$-symmetric system based on density operators
Authors:
Xiaogang Li,
Chao Zheng,
Jiancun Gao,
Guilu Long
Abstract:
$\mathcal{PT}$-symmetric systems have attracted extensive attention in recent years because of their unique properties and applications. Simulating a $\mathcal{PT}$-symmetric system in a conventional quantum mechanical system has both fundamental theoretical significance and practical value. We propose a dynamics simulation scheme for arbitrary time-dependent $\mathcal{PT}$-symmetric systems based on density operators, and the results are compatible with previous methods based on pure-state vectors. Building on this scheme, we study the influence of quantum noise on the simulation results using the technique of vectorization of density operators and matrixization of superoperators (VDMS), and we show that depolarizing (Dep) noise is the most damaging and should be avoided as much as possible. We also give a numerical analysis. We find that the problem of the chronological product usually has to be solved not only in the numerical calculation but also in the experiment, because the dilated higher-dimensional Hamiltonian is usually time-dependent. Through theoretical analysis and numerical calculation, we find that, to meet the target calculation accuracy while saving computing resources, the time step of the calculation and the cut-off term of the Magnus series have to be carefully balanced.
Submitted 30 October, 2022; v1 submitted 16 March, 2022;
originally announced March 2022.
-
ClarET: Pre-training a Correlation-Aware Context-To-Event Transformer for Event-Centric Generation and Classification
Authors:
Yucheng Zhou,
Tao Shen,
Xiubo Geng,
Guodong Long,
Daxin Jiang
Abstract:
Generating new events given context with correlated ones plays a crucial role in many event-centric reasoning tasks. Existing works either limit their scope to specific scenarios or overlook event-level correlations. In this paper, we propose to pre-train a general Correlation-aware context-to-Event Transformer (ClarET) for event-centric reasoning. To achieve this, we propose three novel event-centric objectives, i.e., whole event recovering, contrastive event-correlation encoding and prompt-based event locating, which highlight event-level correlations with effective training. The proposed ClarET is applicable to a wide range of event-centric reasoning scenarios, considering its versatility of (i) event-correlation types (e.g., causal, temporal, contrast), (ii) application formulations (i.e., generation and classification), and (iii) reasoning types (e.g., abductive, counterfactual and ending reasoning). Empirical fine-tuning results, as well as zero- and few-shot learning, on 9 benchmarks (5 generation and 4 classification tasks covering 4 reasoning types with diverse event correlations), verify its effectiveness and generalization ability.
Submitted 9 March, 2022; v1 submitted 4 March, 2022;
originally announced March 2022.
-
Personalized Federated Learning With Graph
Authors:
Fengwen Chen,
Guodong Long,
Zonghan Wu,
Tianyi Zhou,
Jing Jiang
Abstract:
Knowledge sharing and model personalization are two key components in the conceptual framework of personalized federated learning (PFL). Existing PFL methods focus on proposing new model personalization mechanisms while simply implementing knowledge sharing by aggregating models from all clients, regardless of their relation graph. This paper aims to enhance the knowledge-sharing process in PFL by leveraging the graph-based structural information among clients. We propose a novel structured federated learning (SFL) framework to learn both the global and personalized models simultaneously using client-wise relation graphs and clients' private data. We cast SFL with graph into a novel optimization problem that can model the client-wise complex relations and graph-based structural topology in a unified framework. Moreover, in addition to using an existing relation graph, SFL can be extended to learn the hidden relations among clients. Experiments on traffic and image benchmark datasets demonstrate the effectiveness of the proposed method. The implementation code is available on GitHub.
Submitted 30 April, 2022; v1 submitted 1 March, 2022;
originally announced March 2022.
-
Global Correlation and Local Information Flows in Controllable Non-Markovian Open Quantum Dynamics
Authors:
Xin-Yu Chen,
Na-Na Zhang,
Wan-Ting He,
Xiang-Yu Kong,
Ming-Jie Tao,
Fu-Guo Deng,
Qing Ai,
Gui-Lu Long
Abstract:
In a fully controllable experimental platform for studying non-Markovian open quantum dynamics, we show that non-Markovianity can be investigated from both global and local aspects. By mixing random unitary dynamics, we demonstrate non-Markovian and Markovian open quantum dynamics. From the global point of view, by tuning the base frequency we demonstrate the transition from Markovianity to non-Markovianity as measured by the quantum mutual information (QMI). In a Markovian open quantum process, the QMI decays monotonically, while it may rise temporarily in a non-Markovian process. However, under some circumstances, it is not sufficient to investigate the non-Markovianity of the open quantum dynamics globally. As an essential supplement, we further utilize the quantum Fisher information (QFI) flow to locally characterize the non-Markovianity in different channels. We demonstrate that the QMI in combination with the QFI flow is capable of measuring the non-Markovianity of multi-channel open quantum dynamics.
Submitted 28 February, 2022;
originally announced February 2022.
-
On the Convergence of Clustered Federated Learning
Authors:
Jie Ma,
Guodong Long,
Tianyi Zhou,
Jing Jiang,
Chengqi Zhang
Abstract:
Knowledge sharing and model personalization are essential components for tackling the non-IID challenge in federated learning (FL). Most existing FL methods focus on two extremes: 1) learning a shared model to serve all clients with non-IID data, and 2) learning a personalized model for each client, namely personalized FL. There is a trade-off solution, namely clustered FL or cluster-wise personalized FL, which aims to group similar clients into one cluster and then learn a shared model for all clients within that cluster. This paper revisits clustered FL by formulating it as a bi-level optimization framework that can unify existing methods. We propose a new theoretical analysis framework to prove convergence by considering the clusterability among clients. In addition, we embody this framework in an algorithm named Weighted Clustered Federated Learning (WeCFL). Empirical analysis verifies the theoretical results and demonstrates the effectiveness of the proposed WeCFL under the proposed cluster-wise non-IID settings.
Submitted 7 June, 2022; v1 submitted 12 February, 2022;
originally announced February 2022.
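The clustered FL idea described in this abstract (group similar clients, then learn one shared model per cluster) can be illustrated with a generic alternating loop. The following is only a minimal sketch under stated assumptions, not the WeCFL algorithm itself: client models are assumed to be flat parameter vectors, similarity is assumed to be Euclidean distance, and the function name `clustered_weighted_average` and its parameters are hypothetical.

```python
import numpy as np

def clustered_weighted_average(client_models, client_sizes, n_clusters,
                               n_rounds=10, seed=0):
    """Generic clustered aggregation sketch (hypothetical helper, not WeCFL).

    Alternates between (1) assigning each client to its nearest cluster model
    and (2) replacing each cluster model with the data-size-weighted average
    of the client models assigned to it.
    """
    rng = np.random.default_rng(seed)
    models = np.asarray(client_models, dtype=float)
    sizes = np.asarray(client_sizes, dtype=float)
    # Initialise cluster models from distinct, randomly chosen clients.
    init = rng.choice(len(models), size=n_clusters, replace=False)
    centers = models[init].copy()
    for _ in range(n_rounds):
        # Step 1: assign every client to the closest cluster model.
        dists = np.linalg.norm(models[:, None, :] - centers[None, :, :], axis=2)
        labels = dists.argmin(axis=1)
        # Step 2: weighted average inside each non-empty cluster.
        for c in range(n_clusters):
            mask = labels == c
            if mask.any():
                centers[c] = np.average(models[mask], axis=0, weights=sizes[mask])
    return labels, centers

# Toy example: two well-separated groups of clients with unequal data sizes.
toy_models = [[0.0, 0.0], [0.1, 0.0], [10.0, 10.0], [10.1, 10.0]]
toy_sizes = [1, 3, 1, 1]
labels, centers = clustered_weighted_average(toy_models, toy_sizes, n_clusters=2)
```

In this toy run the first two clients end up sharing one cluster model and the last two share the other; the weighting by `client_sizes` pulls each cluster model toward clients that hold more data.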
-
An Evolutionary Pathway for the Quantum Internet Relying on Secure Classical Repeaters
Authors:
Gui-Lu Long,
Dong Pan,
Yu-Bo Sheng,
Qikun Xue,
Jianhua Lu,
Lajos Hanzo
Abstract:
Until quantum repeaters become mature, quantum networks remain restricted either to limited areas of directly connected nodes or to nodes connected to a common node. We circumvent this limitation by conceiving quantum networks using secure classical repeaters combined with the quantum secure direct communication (QSDC) principle, which is a compelling form of quantum communication that directly transmits information over a quantum channel. The final component of this promising solution is our classical quantum-resistant algorithm. Explicitly, in these networks, the ciphertext gleaned from a quantum-resistant algorithm is transmitted using QSDC along the nodes, where it is read out and then transmitted to the next node. At the repeaters, the information is protected by our quantum-resistant algorithm, which is secure even in the face of a quantum computer. Hence, our solution offers secure end-to-end communication across the entire network, since it is capable of both eavesdropping detection and prevention in the emerging quantum internet. It is compatible with operational networks, and will enjoy the compelling services of the popular Internet, including authentication. Hence, it smooths the transition from the classical Internet to the Quantum Internet (Qinternet) by following a gradual evolutionary upgrade. It will act as an alternative network in quantum computing networks in the future. We present the first experimental demonstration of a secure classical repeater based hybrid quantum network constructed by a serial concatenation of an optical fiber and a free-space communication link. In conclusion, secure repeater networks may indeed be constructed using existing technology and continue to support a seamless evolutionary pathway to the future Qinternet of quantum computers.
Submitted 7 February, 2022;
originally announced February 2022.
-
Can one-zone hadronuclear model explain the hard-TeV spectrum of BL Lac objects?
Authors:
Wei-Jian Li,
Rui Xue,
Guang-Bo Long,
Ze-Rui Wang,
Shigehiro Nagataki,
Da-Hai Yan,
Jian-Cheng Wang
Abstract:
Context. The intrinsic TeV emission of some BL Lacs is characterized by a hard spectrum (the hard-TeV spectrum) after correcting for the extragalactic background light. The hard-TeV spectra pose a challenge to conventional one-zone models, including the leptonic model, the photohadronic model, the proton synchrotron model, etc.
Aims. In this work, we study whether the one-zone hadronuclear (pp) model can be used to interpret the hard-TeV spectra of BL Lacs without introducing extreme parameters.
Methods. We give analytical calculations to study whether a parameter space exists, and whether the charge neutrality condition of the jet can be satisfied, when interpreting the hard-TeV spectra of BL Lacs without introducing a super-Eddington jet power.
Results. We find that in a sample of hard-TeV BL Lacs collected by Xue et al. (2019a), only the hard-TeV spectrum of 1ES 0229+200 could be explained by gamma rays from $\pi^0$ decay produced in the pp interactions, but at the cost of setting a small radius of the radiation region that is comparable to the Schwarzschild radius of the central black hole. Combining this with previous studies of other one-zone models, we suggest that the hard-TeV spectra of BL Lacs cannot be explained by any one-zone model without introducing extreme parameters, and should originate from multiple radiation regions.
Submitted 29 January, 2022;
originally announced January 2022.
-
Dual-Frequency Quantum Phase Estimation Mitigates the Spectral Leakage of Quantum Algorithms
Authors:
Yifeng Xiong,
Soon Xin Ng,
Gui-Lu Long,
Lajos Hanzo
Abstract:
Quantum phase estimation is an important component in diverse quantum algorithms. However, it suffers from spectral leakage, when the reciprocal of the record length is not an integer multiple of the unknown phase, which incurs an accuracy degradation. For the existing single-sample estimation scheme, window-based methods have been proposed for spectral leakage mitigation. As a further advance, we propose a dual-frequency estimator, which asymptotically approaches the Cramer-Rao bound, when multiple samples are available. Numerical results show that the proposed estimator outperforms the existing window-based methods, when the number of samples is sufficiently high.
Submitted 19 March, 2022; v1 submitted 23 January, 2022;
originally announced January 2022.
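The spectral leakage mentioned in this abstract can be seen in the outcome distribution of textbook single-sample quantum phase estimation: when the unknown phase falls exactly on the grid of multiples of $1/2^n$, one outcome has probability 1, and otherwise probability spreads over neighbouring outcomes. The sketch below evaluates that standard distribution classically with NumPy; it illustrates only the leakage effect, not the dual-frequency estimator proposed in the paper, and the function name `qpe_distribution` is a hypothetical helper.

```python
import numpy as np

def qpe_distribution(phase, n_qubits):
    """Outcome probabilities of textbook QPE for an eigenphase in [0, 1).

    The amplitude of outcome k is
        (1 / N) * sum_t exp(2*pi*i * t * (phase - k / N)),  N = 2**n_qubits.
    """
    size = 2 ** n_qubits
    t = np.arange(size)        # index of the controlled-power term
    k = np.arange(size)        # measurement outcomes
    offsets = phase - k / size
    amps = np.exp(2j * np.pi * np.outer(offsets, t)).sum(axis=1) / size
    return np.abs(amps) ** 2

# Phase exactly on the 3-qubit grid: all probability sits on outcome 3.
on_grid = qpe_distribution(3 / 8, 3)

# Phase between grid points: probability leaks to neighbouring outcomes.
off_grid = qpe_distribution(0.4, 3)
```

Comparing `on_grid` with `off_grid` shows the accuracy degradation that window-based and multi-sample estimators aim to mitigate: the most likely outcome is still the nearest grid point, but it is no longer obtained with certainty.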
-
Revisiting the constraints on primordial black hole abundance with the isotropic gamma ray background
Authors:
Siyu Chen,
Hong-Hao Zhang,
Guangbo Long
Abstract:
We revisit the constraints on primordial black holes (PBHs) in the mass range $10^{13}-10^{18}$ g by comparing the 100\,keV-5\,GeV gamma-ray background with the isotropic flux from PBH Hawking radiation (HR). We investigate three effects that may update the constraints on the PBH abundance: (i) reliably calculating the secondary spectra of HR for energies below 5\,GeV, (ii) the contributions to the measured isotropic flux from the Galactic PBH HR and from annihilation radiation due to evaporated positrons, and (iii) inclusion of the astrophysical background from gamma-ray sources. The conservative constraint is improved by more than an order of magnitude at $2\times10^{16}$g$\lesssim M\lesssim 10^{17}$g over the past relevant work, where effect (ii) is dominant. After further accounting for the astrophysical background, the more than tenfold improvement extends to a much wider mass range, $10^{15}$g$\lesssim M\lesssim 2\times 10^{17}$g.
Submitted 16 February, 2022; v1 submitted 31 December, 2021;
originally announced December 2021.
-
Quantum secure direct communication with private dense coding using general preshared quantum state
Authors:
Jiawei Wu,
Gui-Lu Long,
Masahito Hayashi
Abstract:
We study quantum secure direct communication by using a general preshared quantum state and a generalization of dense coding. In this scenario, Alice is allowed to apply a unitary on the preshared state to encode her message, and the set of allowed unitaries forms a group. To decode the message, Bob is allowed to apply a measurement across his own system and the system he receives. In the worst-case scenario, we guarantee that Eve obtains no information about the message even when Eve accesses the joint system composed of the system that she intercepts and her original system of the preshared state. For a practical application, we propose a concrete protocol and derive an upper bound on the information leakage in the finite-length setting. We also discuss how to apply our scenario to the case with the discrete Weyl-Heisenberg representation when the preshared state is unknown.
Submitted 22 May, 2022; v1 submitted 30 December, 2021;
originally announced December 2021.
-
A full circuit-based quantum algorithm for excited-states in quantum chemistry
Authors:
Jingwei Wen,
Zhengan Wang,
Chitong Chen,
Junxiang Xiao,
Hang Li,
Ling Qian,
Zhiguo Huang,
Heng Fan,
Shijie Wei,
Guilu Long
Abstract:
Using quantum computers to investigate quantum chemistry is an important research field. In addition to the ground-state problems that have been widely studied, the determination of excited states plays a crucial role in the prediction and modeling of chemical reactions and other physical processes. Here, we propose a non-variational, fully circuit-based quantum algorithm for obtaining the excited-state spectrum of a quantum chemistry Hamiltonian. Compared with previous classical-quantum hybrid variational algorithms, our method eliminates the classical optimization process, reduces the resource cost caused by the interaction between different systems, and achieves a faster convergence rate and stronger robustness against noise without barren plateaus. The parameter update for determining the next energy level depends naturally on the energy measurement outputs of the previous energy level and can be realized by modifying only the state preparation process of the ancillary system, introducing little additional resource overhead. Numerical simulations of the algorithm with hydrogen, LiH, H2O and NH3 molecules are presented. Furthermore, we offer an experimental demonstration of the algorithm on a superconducting quantum computing platform, and the results show good agreement with theoretical expectations. The algorithm can be widely applied to various Hamiltonian spectrum determination problems on fault-tolerant quantum computers.
Submitted 3 January, 2024; v1 submitted 28 December, 2021;
originally announced December 2021.
-
Tunable partial polarization beam splitter and optomechanically induced Faraday effect
Authors:
Xuan Mao,
Guo-Qing Qin,
Hong Yang,
Zeguo Wang,
Min Wang,
Gui-Qin Li,
Peng Xue,
Gui-Lu Long
Abstract:
A polarization beam splitter (PBS) is a crucial photonic element for separately extracting transverse-electric (TE) and transverse-magnetic (TM) polarizations from propagating light fields. Here, we propose a concise, continuously tunable and all-optical partial PBS in a vector optomechanical system which contains two orthogonally polarized cavity modes with degenerate frequency. The results show that one can manipulate the polarization states of different output fields by tuning the polarization angle of the pumping field, and that the system functions as a partial PBS when the pump laser is polarized vertically or horizontally. As a significant application of the tunable PBS, we propose a scheme for implementing quantum walks in resonator arrays without the aid of other auxiliary systems. Furthermore, we investigate the optomechanically induced Faraday effect in the vector optomechanical system, which enables arbitrary tailoring of the input light, and the behavior of the polarization angles of the output fields in the under-coupled, critically coupled, and over-coupled regimes. Our findings show that the optomechanical system is a potential platform for manipulating polarization states in multimode resonators and for advancing applications related to polarization modulation.
Submitted 20 December, 2021;
originally announced December 2021.
-
Close binary evolution based on Gaia DR2: the origin of late WC-type Wolf-Rayet stars with low luminosity
Authors:
Weiguo Peng,
Hanfeng Song,
Georges Meynet,
Andre Maeder,
Fabio Barblan,
Ruiyu Zhang,
Sylvia Ekström,
Cyril Georgy,
Gang Long,
Liuyan Zhao,
Ying Qin
Abstract:
The observed late-type WC Wolf-Rayet stars (WC7-9) with low luminosity, $\rm \log L/L_{\odot} < 5.4$ in the HR diagram, cannot be reproduced satisfactorily by the evolutionary tracks of single stars. Mass transfer due to Roche lobe overflow drastically modifies the internal structure and surface compositions of the two components. Therefore, binaries provide a very promising evolutionary channel to produce these WC stars.
Submitted 25 November, 2021;
originally announced November 2021.
-
The effective dynamics of weak coupling loop quantum gravity
Authors:
Gaoping Long,
Yongge Ma
Abstract:
By taking the limit in which Newton's gravitational constant tends to zero, weak coupling loop quantum gravity can be formulated as a $U(1)^3$ gauge theory instead of the original $SU(2)$ gauge theory. In this paper, a parametrization of the $SU(2)$ holonomy-flux variables by the $U(1)^3$ holonomy-flux variables is introduced, and the Hamiltonian operator based on this parametrization is obtained for weak coupling loop quantum gravity. It is shown that the effective dynamics obtained from the coherent state path integrals in $U(1)^3$ and $SU(2)$ loop quantum gravity, respectively, are consistent with each other in the weak coupling limit, provided that the expectation values of the Hamiltonian operators on the coherent states in these two theories coincide with their respective classical expressions.
Submitted 23 November, 2021;
originally announced November 2021.