Search | arXiv e-print repository

Value-Penalized Auxiliary Control from Examples for Learning without Rewards or Demonstrations

Authors: Trevor Ablett, Bryan Chan, Jayce Haoran Wang, Jonathan Kelly

Abstract: Learning from examples of success is an appealing approach to reinforcement learning that eliminates many of the disadvantages of using hand-crafted reward functions or full expert-demonstration trajectories, both of which can be difficult to acquire, biased, or suboptimal. However, learning from examples alone dramatically increases the exploration challenge, especially for complex tasks. This wo… ▽ More Learning from examples of success is an appealing approach to reinforcement learning that eliminates many of the disadvantages of using hand-crafted reward functions or full expert-demonstration trajectories, both of which can be difficult to acquire, biased, or suboptimal. However, learning from examples alone dramatically increases the exploration challenge, especially for complex tasks. This work introduces value-penalized auxiliary control from examples (VPACE); we significantly improve exploration in example-based control by adding scheduled auxiliary control and examples of auxiliary tasks. Furthermore, we identify a value-calibration problem, where policy value estimates can exceed their theoretical limits based on successful data. We resolve this problem, which is exacerbated by learning auxiliary tasks, through the addition of an above-success-level value penalty. Across three simulated and one real robotic manipulation environment, and 21 different main tasks, we show that our approach substantially improves learning efficiency. Videos, code, and datasets are available at https://papers.starslab.ca/vpace. △ Less

Submitted 3 July, 2024; originally announced July 2024.

Comments: Submitted to the Conference on Robot Learning (CoRL'24), Munich, Germany, Nov. 6-9, 2024

arXiv:2405.09798 [pdf, other]

Many-Shot In-Context Learning in Multimodal Foundation Models

Authors: Yixing Jiang, Jeremy Irvin, Ji Hun Wang, Muhammad Ahmed Chaudhry, Jonathan H. Chen, Andrew Y. Ng

Abstract: Large language models are well-known to be effective at few-shot in-context learning (ICL). Recent advancements in multimodal foundation models have enabled unprecedentedly long context windows, presenting an opportunity to explore their capability to perform ICL with many more demonstrating examples. In this work, we evaluate the performance of multimodal foundation models scaling from few-shot t… ▽ More Large language models are well-known to be effective at few-shot in-context learning (ICL). Recent advancements in multimodal foundation models have enabled unprecedentedly long context windows, presenting an opportunity to explore their capability to perform ICL with many more demonstrating examples. In this work, we evaluate the performance of multimodal foundation models scaling from few-shot to many-shot ICL. We benchmark GPT-4o and Gemini 1.5 Pro across 10 datasets spanning multiple domains (natural imagery, medical imagery, remote sensing, and molecular imagery) and tasks (multi-class, multi-label, and fine-grained classification). We observe that many-shot ICL, including up to almost 2,000 multimodal demonstrating examples, leads to substantial improvements compared to few-shot (<100 examples) ICL across all of the datasets. Further, Gemini 1.5 Pro performance continues to improve log-linearly up to the maximum number of tested examples on many datasets. Given the high inference costs associated with the long prompts required for many-shot ICL, we also explore the impact of batching multiple queries in a single API call. We show that batching up to 50 queries can lead to performance improvements under zero-shot and many-shot ICL, with substantial gains in the zero-shot setting on multiple datasets, while drastically reducing per-query cost and latency. Finally, we measure ICL data efficiency of the models, or the rate at which the models learn from more demonstrating examples. We find that while GPT-4o and Gemini 1.5 Pro achieve similar zero-shot performance across the datasets, Gemini 1.5 Pro exhibits higher ICL data efficiency than GPT-4o on most datasets. Our results suggest that many-shot ICL could enable users to efficiently adapt multimodal foundation models to new applications and domains. Our codebase is publicly available at https://github.com/stanfordmlgroup/ManyICL . △ Less

Submitted 16 May, 2024; originally announced May 2024.

arXiv:2312.06932 [pdf, other]

Predictive variational autoencoder for learning robust representations of time-series data

Authors: Julia Huiming Wang, Dexter Tsin, Tatiana Engel

Abstract: Variational autoencoders (VAEs) have been used extensively to discover low-dimensional latent factors governing neural activity and animal behavior. However, without careful model selection, the uncovered latent factors may reflect noise in the data rather than true underlying features, rendering such representations unsuitable for scientific interpretation. Existing solutions to this problem invo… ▽ More Variational autoencoders (VAEs) have been used extensively to discover low-dimensional latent factors governing neural activity and animal behavior. However, without careful model selection, the uncovered latent factors may reflect noise in the data rather than true underlying features, rendering such representations unsuitable for scientific interpretation. Existing solutions to this problem involve introducing additional measured variables or data augmentations specific to a particular data type. We propose a VAE architecture that predicts the next point in time and show that it mitigates the learning of spurious features. In addition, we introduce a model selection metric based on smoothness over time in the latent space. We show that together these two constraints on VAEs to be smooth over time produce robust latent representations and faithfully recover latent factors on synthetic datasets. △ Less

Submitted 11 December, 2023; originally announced December 2023.

Comments: 16 pages, 4 main figures, 4 supplemental figures, accepted for publication at Unireps Workshop in 37th Conference on Neural Information Processing Systems (NeurIPS 2023)

arXiv:2311.17449 [pdf, other]

Weakly-semi-supervised object detection in remotely sensed imagery

Authors: Ji Hun Wang, Jeremy Irvin, Beri Kohen Behar, Ha Tran, Raghav Samavedam, Quentin Hsu, Andrew Y. Ng

Abstract: Deep learning for detecting objects in remotely sensed imagery can enable new technologies for important applications including mitigating climate change. However, these models often require large datasets labeled with bounding box annotations which are expensive to curate, prohibiting the development of models for new tasks and geographies. To address this challenge, we develop weakly-semi-superv… ▽ More Deep learning for detecting objects in remotely sensed imagery can enable new technologies for important applications including mitigating climate change. However, these models often require large datasets labeled with bounding box annotations which are expensive to curate, prohibiting the development of models for new tasks and geographies. To address this challenge, we develop weakly-semi-supervised object detection (WSSOD) models on remotely sensed imagery which can leverage a small amount of bounding boxes together with a large amount of point labels that are easy to acquire at scale in geospatial data. We train WSSOD models which use large amounts of point-labeled images with varying fractions of bounding box labeled images in FAIR1M and a wind turbine detection dataset, and demonstrate that they substantially outperform fully supervised models trained with the same amount of bounding box labeled images on both datasets. Furthermore, we find that the WSSOD models trained with 2-10x fewer bounding box labeled images can perform similarly to or outperform fully supervised models trained on the full set of bounding-box labeled images. We believe that the approach can be extended to other remote sensing tasks to reduce reliance on bounding box labels and increase development of models for impactful applications. △ Less

Submitted 29 November, 2023; originally announced November 2023.

Comments: Tackling Climate Change with Machine Learning at NeurIPS 2023

arXiv:2308.04761 [pdf, other]

Feature Matching Data Synthesis for Non-IID Federated Learning

Authors: Zijian Li, Yuchang Sun, Jiawei Shao, Yuyi Mao, Jessie Hui Wang, Jun Zhang

Abstract: Federated learning (FL) has emerged as a privacy-preserving paradigm that trains neural networks on edge devices without collecting data at a central server. However, FL encounters an inherent challenge in dealing with non-independent and identically distributed (non-IID) data among devices. To address this challenge, this paper proposes a hard feature matching data synthesis (HFMDS) method to sha… ▽ More Federated learning (FL) has emerged as a privacy-preserving paradigm that trains neural networks on edge devices without collecting data at a central server. However, FL encounters an inherent challenge in dealing with non-independent and identically distributed (non-IID) data among devices. To address this challenge, this paper proposes a hard feature matching data synthesis (HFMDS) method to share auxiliary data besides local models. Specifically, synthetic data are generated by learning the essential class-relevant features of real samples and discarding the redundant features, which helps to effectively tackle the non-IID issue. For better privacy preservation, we propose a hard feature augmentation method to transfer real features towards the decision boundary, with which the synthetic data not only improve the model generalization but also erase the information of real features. By integrating the proposed HFMDS method with FL, we present a novel FL framework with data augmentation to relieve data heterogeneity. The theoretical analysis highlights the effectiveness of our proposed data synthesis method in solving the non-IID challenge. Simulation results further demonstrate that our proposed HFMDS-FL algorithm outperforms the baselines in terms of accuracy, privacy preservation, and computational cost on various benchmark datasets. △ Less

Submitted 9 August, 2023; originally announced August 2023.

Comments: 16 pages

arXiv:2305.15706 [pdf, other]

pFedSim: Similarity-Aware Model Aggregation Towards Personalized Federated Learning

Authors: Jiahao Tan, Yipeng Zhou, Gang Liu, Jessie Hui Wang, Shui Yu

Abstract: The federated learning (FL) paradigm emerges to preserve data privacy during model training by only exposing clients' model parameters rather than original data. One of the biggest challenges in FL lies in the non-IID (not identical and independently distributed) data (a.k.a., data heterogeneity) distributed on clients. To address this challenge, various personalized FL (pFL) methods are proposed… ▽ More The federated learning (FL) paradigm emerges to preserve data privacy during model training by only exposing clients' model parameters rather than original data. One of the biggest challenges in FL lies in the non-IID (not identical and independently distributed) data (a.k.a., data heterogeneity) distributed on clients. To address this challenge, various personalized FL (pFL) methods are proposed such as similarity-based aggregation and model decoupling. The former one aggregates models from clients of a similar data distribution. The later one decouples a neural network (NN) model into a feature extractor and a classifier. Personalization is captured by classifiers which are obtained by local training. To advance pFL, we propose a novel pFedSim (pFL based on model similarity) algorithm in this work by combining these two kinds of methods. More specifically, we decouple a NN model into a personalized feature extractor, obtained by aggregating models from similar clients, and a classifier, which is obtained by local training and used to estimate client similarity. Compared with the state-of-the-art baselines, the advantages of pFedSim include: 1) significantly improved model accuracy; 2) low communication and computation overhead; 3) a low risk of privacy leakage; 4) no requirement for any external public information. To demonstrate the superiority of pFedSim, extensive experiments are conducted on real datasets. The results validate the superb performance of our algorithm which can significantly outperform baselines under various heterogeneous data settings. △ Less

Submitted 25 May, 2023; originally announced May 2023.

arXiv:2303.15790 [pdf, other]

doi 10.1007/s11467-023-1333-z

STCF Conceptual Design Report: Volume 1 -- Physics & Detector

Authors: M. Achasov, X. C. Ai, R. Aliberti, L. P. An, Q. An, X. Z. Bai, Y. Bai, O. Bakina, A. Barnyakov, V. Blinov, V. Bobrovnikov, D. Bodrov, A. Bogomyagkov, A. Bondar, I. Boyko, Z. H. Bu, F. M. Cai, H. Cai, J. J. Cao, Q. H. Cao, Z. Cao, Q. Chang, K. T. Chao, D. Y. Chen, H. Chen , et al. (413 additional authors not shown)

Abstract: The Super $τ$-Charm facility (STCF) is an electron-positron collider proposed by the Chinese particle physics community. It is designed to operate in a center-of-mass energy range from 2 to 7 GeV with a peak luminosity of $0.5\times 10^{35}{\rm cm}^{-2}{\rm s}^{-1}$ or higher. The STCF will produce a data sample about a factor of 100 larger than that by the present $τ$-Charm factory -- the BEPCII,… ▽ More The Super $τ$-Charm facility (STCF) is an electron-positron collider proposed by the Chinese particle physics community. It is designed to operate in a center-of-mass energy range from 2 to 7 GeV with a peak luminosity of $0.5\times 10^{35}{\rm cm}^{-2}{\rm s}^{-1}$ or higher. The STCF will produce a data sample about a factor of 100 larger than that by the present $τ$-Charm factory -- the BEPCII, providing a unique platform for exploring the asymmetry of matter-antimatter (charge-parity violation), in-depth studies of the internal structure of hadrons and the nature of non-perturbative strong interactions, as well as searching for exotic hadrons and physics beyond the Standard Model. The STCF project in China is under development with an extensive R\&D program. This document presents the physics opportunities at the STCF, describes conceptual designs of the STCF detector system, and discusses future plans for detector R\&D and physics case studies. △ Less

Submitted 5 October, 2023; v1 submitted 28 March, 2023; originally announced March 2023.

Journal ref: Front. Phys. 19(1), 14701 (2024)

arXiv:2206.05507 [pdf, other]

Federated Learning with GAN-based Data Synthesis for Non-IID Clients

Authors: Zijian Li, Jiawei Shao, Yuyi Mao, Jessie Hui Wang, Jun Zhang

Abstract: Federated learning (FL) has recently emerged as a popular privacy-preserving collaborative learning paradigm. However, it suffers from the non-independent and identically distributed (non-IID) data among clients. In this paper, we propose a novel framework, named Synthetic Data Aided Federated Learning (SDA-FL), to resolve this non-IID challenge by sharing synthetic data. Specifically, each client… ▽ More Federated learning (FL) has recently emerged as a popular privacy-preserving collaborative learning paradigm. However, it suffers from the non-independent and identically distributed (non-IID) data among clients. In this paper, we propose a novel framework, named Synthetic Data Aided Federated Learning (SDA-FL), to resolve this non-IID challenge by sharing synthetic data. Specifically, each client pretrains a local generative adversarial network (GAN) to generate differentially private synthetic data, which are uploaded to the parameter server (PS) to construct a global shared synthetic dataset. To generate confident pseudo labels for the synthetic dataset, we also propose an iterative pseudo labeling mechanism performed by the PS. A combination of the local private dataset and synthetic dataset with confident pseudo labels leads to nearly identical data distributions among clients, which improves the consistency among local models and benefits the global aggregation. Extensive experiments evidence that the proposed framework outperforms the baseline methods by a large margin in several benchmark datasets under both the supervised and semi-supervised settings. △ Less

Submitted 11 June, 2022; originally announced June 2022.

Comments: 8 pages. To be published in the International Workshop on Trustworthy Federated Learning in Conjunction with IJCAI 2022 (FL-IJCAI'22)

arXiv:2112.10389 [pdf, other]

Decentralized Stochastic Proximal Gradient Descent with Variance Reduction over Time-varying Networks

Authors: Xuanjie Li, Yuedong Xu, Jessie Hui Wang, Xin Wang, John C. S. Lui

Abstract: In decentralized learning, a network of nodes cooperate to minimize an overall objective function that is usually the finite-sum of their local objectives, and incorporates a non-smooth regularization term for the better generalization ability. Decentralized stochastic proximal gradient (DSPG) method is commonly used to train this type of learning models, while the convergence rate is retarded by… ▽ More In decentralized learning, a network of nodes cooperate to minimize an overall objective function that is usually the finite-sum of their local objectives, and incorporates a non-smooth regularization term for the better generalization ability. Decentralized stochastic proximal gradient (DSPG) method is commonly used to train this type of learning models, while the convergence rate is retarded by the variance of stochastic gradients. In this paper, we propose a novel algorithm, namely DPSVRG, to accelerate the decentralized training by leveraging the variance reduction technique. The basic idea is to introduce an estimator in each node, which tracks the local full gradient periodically, to correct the stochastic gradient at each iteration. By transforming our decentralized algorithm into a centralized inexact proximal gradient algorithm with variance reduction, and controlling the bounds of error sequences, we prove that DPSVRG converges at the rate of $O(1/T)$ for general convex objectives plus a non-smooth term with $T$ as the number of iterations, while DSPG converges at the rate $O(\frac{1}{\sqrt{T}})$. Our experiments on different applications, network topologies and learning models demonstrate that DPSVRG converges much faster than DSPG, and the loss function of DPSVRG decreases smoothly along with the training epochs. △ Less

Submitted 23 January, 2022; v1 submitted 20 December, 2021; originally announced December 2021.

Comments: 16 pages, 14 figures

MSC Class: 68T05 ACM Class: I.2.11

arXiv:2112.10313 [pdf, other]

Semi-Decentralized Federated Edge Learning with Data and Device Heterogeneity

Authors: Yuchang Sun, Jiawei Shao, Yuyi Mao, Jessie Hui Wang, Jun Zhang

Abstract: Federated edge learning (FEEL) has attracted much attention as a privacy-preserving paradigm to effectively incorporate the distributed data at the network edge for training deep learning models. Nevertheless, the limited coverage of a single edge server results in an insufficient number of participated client nodes, which may impair the learning performance. In this paper, we investigate a novel… ▽ More Federated edge learning (FEEL) has attracted much attention as a privacy-preserving paradigm to effectively incorporate the distributed data at the network edge for training deep learning models. Nevertheless, the limited coverage of a single edge server results in an insufficient number of participated client nodes, which may impair the learning performance. In this paper, we investigate a novel framework of FEEL, namely semi-decentralized federated edge learning (SD-FEEL), where multiple edge servers are employed to collectively coordinate a large number of client nodes. By exploiting the low-latency communication among edge servers for efficient model sharing, SD-FEEL can incorporate more training data, while enjoying much lower latency compared with conventional federated learning. We detail the training algorithm for SD-FEEL with three main steps, including local model update, intra-cluster, and inter-cluster model aggregations. The convergence of this algorithm is proved on non-independent and identically distributed (non-IID) data, which also helps to reveal the effects of key parameters on the training efficiency and provides practical design guidelines. Meanwhile, the heterogeneity of edge devices may cause the straggler effect and deteriorate the convergence speed of SD-FEEL. To resolve this issue, we propose an asynchronous training algorithm with a staleness-aware aggregation scheme for SD-FEEL, of which, the convergence performance is also analyzed. The simulation results demonstrate the effectiveness and efficiency of the proposed algorithms for SD-FEEL and corroborate our analysis. △ Less

Submitted 25 April, 2023; v1 submitted 19 December, 2021; originally announced December 2021.

arXiv:2109.10728 [pdf, ps, other]

doi 10.1103/PhysRevB.104.125127

Superconductivity in PtPb$_{4}$ with Possible Nontrivial Band Topology

Authors: C. Q. Xu, B. Li, L. Zhang, J. Pollanen, X. L. Yi, X. Z. Xing, Y. Liu, J. H. Wang, Zengwei Zhu, Z. X. Shi, Xiaofeng Xu, X. Ke

Abstract: Superconductivity in topological quantum materials is much sought after as it represents the key avenue to searching for topological superconductors, which host a full pairing gap in the bulk but Majorana bound states at the surface. To date, however, the simultaneous realization of nontrivial band topology and superconductivity in the same material under ambient conditions remains rare. In this p… ▽ More Superconductivity in topological quantum materials is much sought after as it represents the key avenue to searching for topological superconductors, which host a full pairing gap in the bulk but Majorana bound states at the surface. To date, however, the simultaneous realization of nontrivial band topology and superconductivity in the same material under ambient conditions remains rare. In this paper, we study both superconducting and topological properties of a binary compound PtPb$_{4}$ ($T_c$ $\sim$ 2.7 K) that was recently reported to exhibit large Rashba splitting, inherent to the heavy 5$d$ Pt and 6$p$ Pb. We show that in PtPb$_{4}$, the specific heat jump at $T_c$ reaches $ΔC/γT_{c}$$\sim$1.70$\pm0.04$, larger than 1.43 expected for the weak-coupling BCS superconductors. Moreover, the measurement of quantum oscillation suggests the possibility for a topological band structure, which is further studied by density functional theory calculations. Our study may stimulate future experimental and theoretical investigations in this intriguing material. △ Less

Submitted 22 September, 2021; originally announced September 2021.

Journal ref: Physical Review B 104, 125127 (2021)

arXiv:2107.13962 [pdf, other]

The Robustness of Graph k-shell Structure under Adversarial Attacks

Authors: B. Zhou, Y. Q. Lv, Y. C. Mao, J. H. Wang, S. Q. Yu, Q. Xuan

Abstract: The k-shell decomposition plays an important role in unveiling the structural properties of a network, i.e., it is widely adopted to find the densest part of a network across a broad range of scientific fields, including Internet, biological networks, social networks, etc. However, there arises concern about the robustness of the k-shell structure when networks suffer from adversarial attacks. Her… ▽ More The k-shell decomposition plays an important role in unveiling the structural properties of a network, i.e., it is widely adopted to find the densest part of a network across a broad range of scientific fields, including Internet, biological networks, social networks, etc. However, there arises concern about the robustness of the k-shell structure when networks suffer from adversarial attacks. Here, we introduce and formalize the problem of the k-shell attack and develop an efficient strategy to attack the k-shell structure by rewiring a small number of links. To the best of our knowledge, it is the first time to study the robustness of graph k-shell structure under adversarial attacks. In particular, we propose a Simulated Annealing (SA) based k-shell attack method and testify it on four real-world social networks. The extensive experiments validate that the k-shell structure of a network is robust under random perturbation, but it is quite vulnerable under adversarial attack, e.g., in Dolphin and Throne networks, more than 40% nodes change their k-shell values when only 10% links are changed based on our SA-based k-shell attack. Such results suggest that a single structural feature could also be significantly disturbed when only a small fraction of links are changed purposefully in a network. Therefore, it could be an interesting topic to improve the robustness of various network properties against adversarial attack in the future. △ Less

Submitted 29 July, 2021; originally announced July 2021.

arXiv:2104.12678 [pdf, other]

Semi-Decentralized Federated Edge Learning for Fast Convergence on Non-IID Data

Authors: Yuchang Sun, Jiawei Shao, Yuyi Mao, Jessie Hui Wang, Jun Zhang

Abstract: Federated edge learning (FEEL) has emerged as an effective approach to reduce the large communication latency in Cloud-based machine learning solutions, while preserving data privacy. Unfortunately, the learning performance of FEEL may be compromised due to limited training data in a single edge cluster. In this paper, we investigate a novel framework of FEEL, namely semi-decentralized federated e… ▽ More Federated edge learning (FEEL) has emerged as an effective approach to reduce the large communication latency in Cloud-based machine learning solutions, while preserving data privacy. Unfortunately, the learning performance of FEEL may be compromised due to limited training data in a single edge cluster. In this paper, we investigate a novel framework of FEEL, namely semi-decentralized federated edge learning (SD-FEEL). By allowing model aggregation across different edge clusters, SD-FEEL enjoys the benefit of FEEL in reducing the training latency, while improving the learning performance by accessing richer training data from multiple edge clusters. A training algorithm for SD-FEEL with three main procedures in each round is presented, including local model updates, intra-cluster and inter-cluster model aggregations, which is proved to converge on non-independent and identically distributed (non-IID) data. We also characterize the interplay between the network topology of the edge servers and the communication overhead of inter-cluster model aggregation on the training performance. Experiment results corroborate our analysis and demonstrate the effectiveness of SD-FFEL in achieving faster convergence than traditional federated learning architectures. Besides, guidelines on choosing critical hyper-parameters of the training algorithm are also provided. △ Less

Submitted 31 December, 2021; v1 submitted 26 April, 2021; originally announced April 2021.

arXiv:2009.06805 [pdf, other]

doi 10.1038/s41535-022-00477-z

Dual topological superconducting states in the layered titanium-based oxypnictide superconductor BaTi$_2$Sb$_2$O

Authors: Z. Huang, W. L. Liu, H. Y. Wang, Y. L. Su, Z. T. Liu, X. B. Shi, S. Y. Gao, Z. C. Jiang, Z. H. Liu, J. S. Liu, X. L. Lu, Y. C. Yang, J. X. Zhang, S. C. Huan, W. Xia, J. H. Wang, Y. S. Wu, X. Wang, N. Yu, Y. B. Huang, S. Qiao, J. Li, W. W. Zhao, Y. F. Guo, G. Li , et al. (1 additional authors not shown)

Abstract: Topological superconductors have long been predicted to host Majorana zero modes which obey non-Abelian statistics and have potential for realizing non-decoherence topological quantum computation. However, material realization of topological superconductors is still a challenge in condensed matter physics. Utilizing high-resolution angle-resolved photoemission spectroscopy and first-principles cal… ▽ More Topological superconductors have long been predicted to host Majorana zero modes which obey non-Abelian statistics and have potential for realizing non-decoherence topological quantum computation. However, material realization of topological superconductors is still a challenge in condensed matter physics. Utilizing high-resolution angle-resolved photoemission spectroscopy and first-principles calculations, we predict and then unveil the coexistence of topological Dirac semimetal and topological insulator states in the vicinity of Fermi energy ($E_F$) in the titanium-based oxypnictide superconductor BaTi$_2$Sb$_2$O. Further spin-resolved measurements confirm its spin-helical surface states around $E_F$, which are topologically protected and give an opportunity for realization of Majorana zero modes and Majorana flat bands in one material. Hosting dual topological superconducting states, the intrinsic superconductor BaTi$_2$Sb$_2$O is expected to be a promising platform for further investigation of topological superconductivity. △ Less

Submitted 14 September, 2020; originally announced September 2020.

Comments: 6 pages, 4 figures

Journal ref: npj Quantum Mater. 7, 70 (2022)

arXiv:2003.08542 [pdf, other]

doi 10.1103/PhysRevA.102.022619

Error analysis in suppression of unwanted qubit interactions for a parametric gate in a tunable superconducting circuit

Authors: X. Y. Han, T. Q. Cai, X. G. Li, Y. K. Wu, Y. W. Ma, Y. L. Ma, J. H. Wang, H. Y. Zhang, Y. P. Song, L. M. Duan

Abstract: We experimentally demonstrate a parametric iSWAP gate in a superconducting circuit based on a tunable coupler for achieving a continuous tunability to eliminate unwanted qubit interactions. We implement the twoqubit iSWAP gate by applying a fast-flux bias modulation pulse on the coupler to turn on parametric exchange interaction between computational qubits. The controllable interaction can provid… ▽ More We experimentally demonstrate a parametric iSWAP gate in a superconducting circuit based on a tunable coupler for achieving a continuous tunability to eliminate unwanted qubit interactions. We implement the twoqubit iSWAP gate by applying a fast-flux bias modulation pulse on the coupler to turn on parametric exchange interaction between computational qubits. The controllable interaction can provide an extra degree of freedom to verify the optimal condition for constructing the parametric gate. Aiming to fully investigate error sources of the two-qubit gates, we perform quantum process tomography measurements and numerical simulations as varying static ZZ coupling strength. We quantitatively calculate the dynamic ZZ coupling parasitizing in two-qubit gate operation, and extract the particular gate error from the decoherence, dynamic ZZ coupling and high-order oscillation terms. Our results reveal that the main gate error comes from the decoherence, while the increase in the dynamic ZZ coupling and high-order oscillation error degrades the parametric gate performance. This approach, which has not yet been previously explored, provides a guiding principle to improve gate fidelity of parametric iSWAP gate by suppression of the unwanted qubit interactions. This controllable interaction, together with the parametric modulation technique, is desirable for crosstalk free multiqubit quantum circuits and quantum simulation applications. △ Less

Submitted 27 August, 2020; v1 submitted 18 March, 2020; originally announced March 2020.

Journal ref: Phys. Rev. A 102, 022619 (2020)

arXiv:2001.10237 [pdf, ps, other]

Faster Activity and Data Detection in Massive Random Access: A Multi-armed Bandit Approach

Authors: Jialin Dong, Jun Zhang, Yuanming Shi, Jessie Hui Wang

Abstract: This paper investigates the grant-free random access with massive IoT devices. By embedding the data symbols in the signature sequences, joint device activity detection and data decoding can be achieved, which, however, significantly increases the computational complexity. Coordinate descent algorithms that enjoy a low per-iteration complexity have been employed to solve the detection problem, but… ▽ More This paper investigates the grant-free random access with massive IoT devices. By embedding the data symbols in the signature sequences, joint device activity detection and data decoding can be achieved, which, however, significantly increases the computational complexity. Coordinate descent algorithms that enjoy a low per-iteration complexity have been employed to solve the detection problem, but previous works typically employ a random coordinate selection policy which leads to slow convergence. In this paper, we develop multi-armed bandit approaches for more efficient detection via coordinate descent, which make a delicate trade-off between exploration and exploitation in coordinate selection. Specifically, we first propose a bandit based strategy, i.e., Bernoulli sampling, to speed up the convergence rate of coordinate descent, by learning which coordinates will result in more aggressive descent of the objective function. To further improve the convergence rate, an inner multi-armed bandit problem is established to learn the exploration policy of Bernoulli sampling. Both convergence rate analysis and simulation results are provided to show that the proposed bandit based algorithms enjoy faster convergence rates with a lower time complexity compared with the state-of-the-art algorithm. Furthermore, our proposed algorithms are applicable to different scenarios, e.g., massive random access with low-precision analog-to-digital converters (ADCs). △ Less

Submitted 28 January, 2020; originally announced January 2020.

Comments: 30 pages, 5 figures

arXiv:1708.04090 [pdf, other]

doi 10.1103/PhysRevLett.120.067202

Ultralow-temperature thermal conductivity of the Kitaev honeycomb magnet $α$-RuCl$_3$ across the field-induced phase transition

Authors: Y. J. Yu, Y. Xu, K. J. Ran, J. M. Ni, Y. Y. Huang, J. H. Wang, J. S. Wen, S. Y. Li

Abstract: Recently, there have been increasingly hot debates on whether there exists a quantum spin liquid in the Kitaev honeycomb magnet $α$-RuCl$_3$ in high magnetic field. To investigate this issue, we perform the ultralow-temperature thermal conductivity measurements on the single crystals of $α$-RuCl$_3$ down to 80 mK and up to 9 T. Our experiments clearly show a field-induced phase transition occurrin… ▽ More Recently, there have been increasingly hot debates on whether there exists a quantum spin liquid in the Kitaev honeycomb magnet $α$-RuCl$_3$ in high magnetic field. To investigate this issue, we perform the ultralow-temperature thermal conductivity measurements on the single crystals of $α$-RuCl$_3$ down to 80 mK and up to 9 T. Our experiments clearly show a field-induced phase transition occurring at $H_c$ $\approx$ 7.5 T, above which the zigzag magnetic order is completely suppressed. The minimum of thermal conductivity at 7.5 T is attributed to the strong scattering of phonons by the magnetic fluctuations. Most importantly, above 7.5 T, we do not observe any significant contribution of thermal conductivity from gapless magnetic excitations, which puts a strong constraint on the nature of the high-field phase of $α$-RuCl$_3$. △ Less

Submitted 14 September, 2017; v1 submitted 14 August, 2017; originally announced August 2017.

Comments: 6 pages, 4 figures

Journal ref: Phys. Rev. Lett. 120, 067202 (2018)

arXiv:1707.00216 [pdf]

doi 10.1103/PhysRevMaterials.1.064201

Enhanced electron correlations in the new binary stannide PdSn4: a homologue of the Dirac nodal arc semimetal PtSn4

Authors: C. Q. Xu, W. Zhou, R. Sankar, X. Z. Xing, Z. X. Shi, Z. D. Han, B. Qian, J. H. Wang, Zengwei Zhu, J. L. Zhang, A. F. Bangura, N. E. Hussey, Xiaofeng Xu

Abstract: The advent of nodal-line semi-metals, i.e. systems in which the conduction and valence bands cross each other along a closed trajectory (line or loop) inside the Brillouin zone, has opened up a new arena for the exploration of topological condensed matter in which, due to a vanishing density of states near the Fermi level, electron correlation effects may also play an important role. In spite of t… ▽ More The advent of nodal-line semi-metals, i.e. systems in which the conduction and valence bands cross each other along a closed trajectory (line or loop) inside the Brillouin zone, has opened up a new arena for the exploration of topological condensed matter in which, due to a vanishing density of states near the Fermi level, electron correlation effects may also play an important role. In spite of this conceptual richness however, material realization of nodal-line (loop) fermions is rare, with PbTaSe2, ZrSiS and PtSn4 the only promising known candidates. Here we report the synthesis and physical properties of a new compound PdSn4 that is isostructural with PtSn4 yet possesses quasiparticles with significantly enhanced effective masses. In addition, PdSn4 displays an unusual polar angular magnetoresistance which at a certain field orientation, varies linearly with field up to 55 Tesla. Our study suggests that, in association with its homologue PtSn4 whose low-lying excitations were recently claimed to possess Dirac node arcs, PdSn4 may be a promising candidate in the search for novel topological states with enhanced correlation effects. △ Less

Submitted 1 July, 2017; originally announced July 2017.

Comments: 6 figures, 1 table

Journal ref: Phys. Rev. Materials 1, 064201 (2017)

arXiv:1605.01535 [pdf]

Tip Pressure Induced Incoherent Energy Gap in CaFe2As2

Authors: J. -X. Yin, J. H. Wang, Z. Wu, A. Li, X. J. Liang, H. Q. Mao, G. F. Chen, B. Lv, C. -W. Chu, H. Ding, S. H. Pan

Abstract: In CaFe2As2, superconductivity can be achieved by applying a modest c-axis pressure of several kbar. Here we use scanning tunneling microscopy/spectroscopy (STM/S) to explore the STM tip pressure effect on single crystals of CaFe2As2. When performing STM/S measurements, the tip-sample interaction can be controlled to act repulsive with reduction of the junction resistance, thus to apply a tip pres… ▽ More In CaFe2As2, superconductivity can be achieved by applying a modest c-axis pressure of several kbar. Here we use scanning tunneling microscopy/spectroscopy (STM/S) to explore the STM tip pressure effect on single crystals of CaFe2As2. When performing STM/S measurements, the tip-sample interaction can be controlled to act repulsive with reduction of the junction resistance, thus to apply a tip pressure on the sample. We find that an incoherent energy gap emerges at the Fermi level in the differential conductance spectrum when the tip pressure is increased. This energy gap is of the similar order of magnitude as the superconducting gap in the chemical doped compound Ca0.4Na0.6Fe2As2 and disappears at the temperature well below that of the bulk magnetic ordering. Moreover, we also observe the rhombic distortion of the As lattice, which agrees with the orthorhombic distortion of the underlying Fe lattice. These findings suggest that the STM tip pressure can induce the local Cooper pairing in the orthorhombic phase of CaFe2As2. △ Less

Submitted 5 May, 2016; originally announced May 2016.

Journal ref: Chin. Phys. Lett. 33, 067401 (2016)

arXiv:1602.01930 [pdf, ps, other]

On The Robustness of Price-Anticipating Kelly Mechanism

Authors: Yuedong Xu, Zhujun Xiao, Tianyu Ni, Jessie Hui Wang, Xin Wang, Eitan Altman

Abstract: The price-anticipating Kelly mechanism (PAKM) is one of the most extensively used strategies to allocate divisible resources for strategic users in communication networks and computing systems. The users are deemed as selfish and also benign, each of which maximizes his individual utility of the allocated resources minus his payment to the network operator. However, in many applications a user can… ▽ More The price-anticipating Kelly mechanism (PAKM) is one of the most extensively used strategies to allocate divisible resources for strategic users in communication networks and computing systems. The users are deemed as selfish and also benign, each of which maximizes his individual utility of the allocated resources minus his payment to the network operator. However, in many applications a user can use his payment to reduce the utilities of his opponents, thus playing a misbehaving role. It remains mysterious to what extent the misbehaving user can damage or influence the performance of benign users and the network operator. In this work, we formulate a non-cooperative game consisting of a finite amount of benign users and one misbehaving user. The maliciousness of this misbehaving user is captured by his willingness to pay to trade for unit degradation in the utilities of benign users. The network operator allocates resources to all the users via the price-anticipating Kelly mechanism. We present six important performance metrics with regard to the total utility and the total net utility of benign users, and the revenue of network operator under three different scenarios: with and without the misbehaving user, and the maximum. We quantify the robustness of PAKM against the misbehaving actions by deriving the upper and lower bounds of these metrics. With new approaches, all the theoretical bounds are applicable to an arbitrary population of benign users. Our study reveals two important insights: i) the performance bounds are very sensitive to the misbehaving user's willingness to pay at certain ranges; ii) the network operator acquires more revenues in the presence of the misbehaving user which might disincentivize his countermeasures against the misbehaving actions. △ Less

Submitted 4 October, 2021; v1 submitted 5 February, 2016; originally announced February 2016.

Comments: 21

arXiv:1202.1356 [pdf, ps, other]

doi 10.1088/0004-637X/748/1/44

The GRB 071112C: A Case Study of Different Mechanisms in X-ray and Optical Temporal Evolution

Authors: K. Y. Huang, Y. Urata, Y. H. Tung, H. M. Lin, L. P. Xin, M. Yoshida, W. Zheng, C. Akerlof, S. Y. Wang, W. H. Ip, M. J. Lehner, F. B. Bianco, N. Kawai, D. Kuroda, S. L. Marshall, M. E. Schwamb, Y. Qiu, J. H. Wang, C. Y. Wen, J. Wei, K. Yanagisawa, Z. W. Zhang

Abstract: We present the study on GRB 071112C X-ray and optical light curves. In these two wavelength ranges, we have found different temporal properties. The R-band light curve showed an initial rise followed by a single power-law decay, while the X-ray light curve was described by a single power-law decay plus a flare-like feature. Our analysis shows that the observed temporal evolution cannot be describe… ▽ More We present the study on GRB 071112C X-ray and optical light curves. In these two wavelength ranges, we have found different temporal properties. The R-band light curve showed an initial rise followed by a single power-law decay, while the X-ray light curve was described by a single power-law decay plus a flare-like feature. Our analysis shows that the observed temporal evolution cannot be described by the external shock model in which the X-ray and optical emission are produced by the same emission mechanism. No significant color changes in multi-band light curves and a reasonable value of the initial Lorentz factor (Γ0 = 275 \pm 20) in a uniform ISM support the afterglow onset scenario as the correct interpretation for the early R-band rise. The result suggests the optical flux is dominated by afterglow. Our further investigations show that the X-ray flux could be created by an additional feature related to energy injection and X-ray afterglow. Different theoretical interpretations indicate the additional feature in X-ray can be explained by either late internal dissipation or local inverse-Compton scattering in the external shock. △ Less

Submitted 7 February, 2012; originally announced February 2012.

Comments: 20 pages, 3 figures, accepted for publication in ApJ

arXiv:cond-mat/9908206 [pdf, ps, other]

doi 10.1103/PhysRevB.60.R15051

C-axis Penetration Depth and Inter-layer Conductivity in the Thallium Based Cuprate Superconductors

Authors: D. Dulic, D. van der Marel, A. A. Tsvetkov, W. N. Hardy, Z. F. Ren, J. H. Wang, B. A. Willemsen

Abstract: The c-axis Josephson plasmon in optimally doped single-layer and bi-layer high Tc cuprates Tl2201 and Tl2212 have been investigated using infrared spectroscopy. We observed the plasma frequencies for these two compounds at 27.8 and 25.6 cm-1 respectively, which we interpret as a Josephson resonance across the TlO blocking layers. No maximum in the temperature dependence of the c-axis conductivit… ▽ More The c-axis Josephson plasmon in optimally doped single-layer and bi-layer high Tc cuprates Tl2201 and Tl2212 have been investigated using infrared spectroscopy. We observed the plasma frequencies for these two compounds at 27.8 and 25.6 cm-1 respectively, which we interpret as a Josephson resonance across the TlO blocking layers. No maximum in the temperature dependence of the c-axis conductivity was observed below Tc, indicating that even in the superconducting state a coherent quasi-particle contribution to the c-axis conductivity is absent or very weak, in contrast to the behaviour of the ab-plane conductivity. △ Less

Submitted 13 August, 1999; originally announced August 1999.

Comments: 4 pages, 3 figures

arXiv:cond-mat/9906187 [pdf, ps, other]

doi 10.1103/PhysRevB.60.13196

Systematics of c-axis Phonons in the Thallium and Bismuth Based Cuprate Superconductors

Authors: A. A. Tsvetkov, D. Dulic, D. van der Marel, A. Damascelli, G. A. Kaljushnaia, J. I. Gorina, N. N. Senturina, N. N. Kolesnikov, Z. F. Ren, J. H. Wang, A. A. Menovsky, T. T. M. Palstra

Abstract: We present grazing incidence reflectivity measurements in the far infrared region at temperatures above and below Tc for a series of thallium (Tl2Ba2CuO6, Tl2Ba2CaCu2O8) and bismuth (Bi2Sr2CuO6, Bi2Sr2CaCu2O8, and Bi(2-x)Pb(x)Sr2CaCu2O8) based cuprate superconductors. From the spectra, which are dominated by the c-axis phonons, longitudinal frequencies (LO) are directly obtained. The reflectivit… ▽ More We present grazing incidence reflectivity measurements in the far infrared region at temperatures above and below Tc for a series of thallium (Tl2Ba2CuO6, Tl2Ba2CaCu2O8) and bismuth (Bi2Sr2CuO6, Bi2Sr2CaCu2O8, and Bi(2-x)Pb(x)Sr2CaCu2O8) based cuprate superconductors. From the spectra, which are dominated by the c-axis phonons, longitudinal frequencies (LO) are directly obtained. The reflectivity curves are well fitted by a series of Lorentz oscillators. In this way the transverse (TO) phonon frequencies were accurately determined. On the basis of the comparative study of the Bi and Tl based cuprates with different number of CuO2 layers per unit cell, we suggest modifications of the assignment of the main oxygen modes. We compare the LO frequencies in Bi2Sr2CaCu2O8 and Tl2Ba2Ca2Cu3O10 obtained from intrinsic Josephson junction characteristics with our measurements, and explain the discrepancy in LO frequencies obtained by the two different methods. △ Less

Submitted 13 June, 1999; originally announced June 1999.

Comments: 8 pages Revtex, 6 eps figures, 3 tables, to appear in Phys. Rev. B

Journal ref: Phys. Rev. B 60, 13196-13205 (1999)

arXiv:cond-mat/9811184 [pdf, ps, other]

doi 10.1038/26439

Global and Local Measures of the Intrinsic Josephson Coupling in Tl2Ba2CuO6

Authors: A. A. Tsvetkov, D. van der Marel, K. A. Moler, J. R. Kirtley, J. L. de Boer, A. Meetsma, Z. F. Ren, N. Koleshnikov, D. Dulic, A. Damascelli, M. Grueninger, J. Schuetzmann, J. W. van der Eb, H. S. Somal, J. H. Wang

Abstract: The Intlerlayer Josephson coupling between the planes of Tl2Ba2CuO6 was determined using infrared spectroscopy and magnetic flux vortex imaging. These methods give a consistent value of $ω_J$= 28 cm$^{-1}$ which, when combined with the condensation energy produces a discrepancy of at least an order of magnitude with deductions based on the interlayer tunneling model. The Intlerlayer Josephson coupling between the planes of Tl2Ba2CuO6 was determined using infrared spectroscopy and magnetic flux vortex imaging. These methods give a consistent value of $ω_J$= 28 cm$^{-1}$ which, when combined with the condensation energy produces a discrepancy of at least an order of magnitude with deductions based on the interlayer tunneling model. △ Less

Submitted 16 November, 1998; v1 submitted 13 November, 1998; originally announced November 1998.

Comments: 5 pages, article, epsf, 3 encapsulated postscript figures

Report number: GmGd-98-3-2

Journal ref: Nature 395, 360 (1998)

arXiv:cond-mat/9702021 [pdf, ps, other]

doi 10.1103/PhysRevB.55.11118

Experimental Test of the Inter-Layer Pairing Models for High-Tc Superconductivity Using Grazing Incidence Infrared Reflectometry

Authors: J. Schutzmann, H. S. Somal, A. A. Tsvetkov, D. van der Marel, G. E. J. Koops, N. Koleshnikov, Z. F. Ren, J. H. Wang, E. Bruck, A. A. Menovsky

Abstract: From measurements of the far-infrared reflectivity at grazing angles of incidence with p-polarized light we determined the c-axis Josephson plasma frequencies of the single layer high T_c cuprates Tl_2Ba_2CuO_6 and La_{2-x}Sr_xCuO_4. We detected a strong plasma resonance at 50 cm^{-1} for La_{2-x}Sr_xCuO_4 in excellent agreement with previously published results. For Tl_2Ba_2CuO_6 we were able t… ▽ More From measurements of the far-infrared reflectivity at grazing angles of incidence with p-polarized light we determined the c-axis Josephson plasma frequencies of the single layer high T_c cuprates Tl_2Ba_2CuO_6 and La_{2-x}Sr_xCuO_4. We detected a strong plasma resonance at 50 cm^{-1} for La_{2-x}Sr_xCuO_4 in excellent agreement with previously published results. For Tl_2Ba_2CuO_6 we were able to determine an upper limit of the unscreened c-axis Josephson plasma frequency 100 cm^{-1} or a c-axis penetration depth > 15 μm. The small value of $ω_J$ stands in contrast to recent a prediction based on the inter-layer tunneling mechanism of superconductivity. △ Less

Submitted 3 February, 1997; originally announced February 1997.

Comments: 4 pages, Phys. Rev. B, in press, Revtex, 4 postscript figures

Report number: GmGd-96-8-1

Journal ref: Phys. Rev. B 55 (1997), 11118

Showing 1–25 of 25 results for author: Wang, J H