Search | arXiv e-print repository

3D Cascade RCNN: High Quality Object Detection in Point Clouds

Authors: Qi Cai, Yingwei Pan, Ting Yao, Tao Mei

Abstract: Recent progress on 2D object detection has featured Cascade RCNN, which capitalizes on a sequence of cascade detectors to progressively improve proposal quality, towards high-quality object detection. However, there has not been evidence in support of building such cascade structures for 3D object detection, a challenging detection scenario with highly sparse LiDAR point clouds. In this work, we p… ▽ More Recent progress on 2D object detection has featured Cascade RCNN, which capitalizes on a sequence of cascade detectors to progressively improve proposal quality, towards high-quality object detection. However, there has not been evidence in support of building such cascade structures for 3D object detection, a challenging detection scenario with highly sparse LiDAR point clouds. In this work, we present a simple yet effective cascade architecture, named 3D Cascade RCNN, that allocates multiple detectors based on the voxelized point clouds in a cascade paradigm, pursuing higher quality 3D object detector progressively. Furthermore, we quantitatively define the sparsity level of the points within 3D bounding box of each object as the point completeness score, which is exploited as the task weight for each proposal to guide the learning of each stage detector. The spirit behind is to assign higher weights for high-quality proposals with relatively complete point distribution, while down-weight the proposals with extremely sparse points that often incur noise during training. This design of completeness-aware re-weighting elegantly upgrades the cascade paradigm to be better applicable for the sparse input data, without increasing any FLOP budgets. Through extensive experiments on both the KITTI dataset and Waymo Open Dataset, we validate the superiority of our proposed 3D Cascade RCNN, when comparing to state-of-the-art 3D object detection techniques. The source code is publicly available at \url{https://github.com/caiqi/Cascasde-3D}. △ Less

Submitted 15 November, 2022; originally announced November 2022.

Comments: IEEE Transactions on Image Processing (TIP) 2022. The source code is publicly available at \url{https://github.com/caiqi/Cascasde-3D}

arXiv:2211.02806 [pdf]

Modified EDAS Method Based on Cumulative Prospect Theory for Multiple Attributes Group Decision Making with Interval-valued Intuitionistic Fuzzy Information

Authors: **g Wang, Qiang Cai, Guiwu Wei, Ningna Liao

Abstract: The Interval-valued intuitionistic fuzzy sets (IVIFSs) based on the intuitionistic fuzzy sets combines the classical decision method is in its research and application is attracting attention. After comparative analysis, there are multiple classical methods with IVIFSs information have been applied into many practical issues. In this paper, we extended the classical EDAS method based on cumulative… ▽ More The Interval-valued intuitionistic fuzzy sets (IVIFSs) based on the intuitionistic fuzzy sets combines the classical decision method is in its research and application is attracting attention. After comparative analysis, there are multiple classical methods with IVIFSs information have been applied into many practical issues. In this paper, we extended the classical EDAS method based on cumulative prospect theory (CPT) considering the decision makers (DMs) psychological factor under IVIFSs. Taking the fuzzy and uncertain character of the IVIFSs and the psychological preference into consideration, the original EDAS method based on the CPT under IVIFSs (IVIF-CPT-MABAC) method is built for MAGDM issues. Meanwhile, information entropy method is used to evaluate the attribute weight. Finally, a numerical example for project selection of green technology venture capital has been given and some comparisons is used to illustrate advantages of IVIF-CPT-MABAC method and some comparison analysis and sensitivity analysis are applied to prove this new methods effectiveness and stability. △ Less

Submitted 4 November, 2022; originally announced November 2022.

Comments: 48 pages

MSC Class: 91B06 ACM Class: F.2.2

arXiv:2210.06906 [pdf, other]

Hierarchical and Progressive Image Matting

Authors: Yu Qiao, Yuhao Liu, Ziqi Wei, Yuxin Wang, Qiang Cai, Guofeng Zhang, Xin Yang

Abstract: Most matting researches resort to advanced semantics to achieve high-quality alpha mattes, and direct low-level features combination is usually explored to complement alpha details. However, we argue that appearance-agnostic integration can only provide biased foreground details and alpha mattes require different-level feature aggregation for better pixel-wise opacity perception. In this paper, we… ▽ More Most matting researches resort to advanced semantics to achieve high-quality alpha mattes, and direct low-level features combination is usually explored to complement alpha details. However, we argue that appearance-agnostic integration can only provide biased foreground details and alpha mattes require different-level feature aggregation for better pixel-wise opacity perception. In this paper, we propose an end-to-end Hierarchical and Progressive Attention Matting Network (HAttMatting++), which can better predict the opacity of the foreground from single RGB images without additional input. Specifically, we utilize channel-wise attention to distill pyramidal features and employ spatial attention at different levels to filter appearance cues. This progressive attention mechanism can estimate alpha mattes from adaptive semantics and semantics-indicated boundaries. We also introduce a hybrid loss function fusing Structural SIMilarity (SSIM), Mean Square Error (MSE), Adversarial loss, and sentry supervision to guide the network to further improve the overall foreground structure. Besides, we construct a large-scale and challenging image matting dataset comprised of 59, 600 training images and 1000 test images (a total of 646 distinct foreground alpha mattes), which can further improve the robustness of our hierarchical and progressive aggregation model. Extensive experiments demonstrate that the proposed HAttMatting++ can capture sophisticated foreground structures and achieve state-of-the-art performance with single RGB images as input. △ Less

Submitted 13 October, 2022; originally announced October 2022.

Comments: 23 pages, 11 Figures, ACM TOMM accepted

arXiv:2210.00547 [pdf, ps, other]

doi 10.1140/epjp/s13360-022-03284-4

Multiple electromagnetically induced transparency without a control field in an atomic array coupled to a waveguide

Authors: W. Z. Jia, Q. Y. Cai

Abstract: We investigate multiple electromagnetically induced transparency (EIT) in a waveguide quantum electrodynamics (wQED) system containing an atom array. By analyzing the effective Hamiltonian of the system, we find that in terms of the single-excitation collective states, a properly designed $N$-atom array can be mapped into a driven ($N+1$)-level system that can produce multiple EIT-type phenomenon.… ▽ More We investigate multiple electromagnetically induced transparency (EIT) in a waveguide quantum electrodynamics (wQED) system containing an atom array. By analyzing the effective Hamiltonian of the system, we find that in terms of the single-excitation collective states, a properly designed $N$-atom array can be mapped into a driven ($N+1$)-level system that can produce multiple EIT-type phenomenon. The corresponding scattering spectra of the atom-array wQED system are discussed both in the single-photon sector and beyond the single-photon limit. The most significant feather of this type of EIT scheme is control-field-free, which may provide an alternative way to produce EIT-like phenomenon in wQED system when external control fields are not available. The results given in our paper may provide good guidance for future experiments on multiple EIT without a control field in wQED system. △ Less

Submitted 2 October, 2022; originally announced October 2022.

Comments: 15 pages, 5 figures

arXiv:2209.12807 [pdf, other]

Out-of-Distribution Detection with Hilbert-Schmidt Independence Optimization

Authors: **gyang Lin, Yu Wang, Qi Cai, Yingwei Pan, Ting Yao, Hongyang Chao, Tao Mei

Abstract: Outlier detection tasks have been playing a critical role in AI safety. There has been a great challenge to deal with this task. Observations show that deep neural network classifiers usually tend to incorrectly classify out-of-distribution (OOD) inputs into in-distribution classes with high confidence. Existing works attempt to solve the problem by explicitly imposing uncertainty on classifiers w… ▽ More Outlier detection tasks have been playing a critical role in AI safety. There has been a great challenge to deal with this task. Observations show that deep neural network classifiers usually tend to incorrectly classify out-of-distribution (OOD) inputs into in-distribution classes with high confidence. Existing works attempt to solve the problem by explicitly imposing uncertainty on classifiers when OOD inputs are exposed to the classifier during training. In this paper, we propose an alternative probabilistic paradigm that is both practically useful and theoretically viable for the OOD detection tasks. Particularly, we impose statistical independence between inlier and outlier data during training, in order to ensure that inlier data reveals little information about OOD data to the deep estimator during training. Specifically, we estimate the statistical dependence between inlier and outlier data through the Hilbert-Schmidt Independence Criterion (HSIC), and we penalize such metric during training. We also associate our approach with a novel statistical test during the inference time coupled with our principled motivation. Empirical results show that our method is effective and robust for OOD detection on various benchmarks. In comparison to SOTA models, our approach achieves significant improvement regarding FPR95, AUROC, and AUPR metrics. Code is available: \url{https://github.com/jylins/hood}. △ Less

Submitted 26 September, 2022; originally announced September 2022.

Comments: Source code is available at \url{https://github.com/jylins/hood}

arXiv:2209.11433 [pdf, other]

The Kriston AI System for the VoxCeleb Speaker Recognition Challenge 2022

Authors: Qutang Cai, Guoqiang Hong, Zhijian Ye, Ximin Li, Haizhou Li

Abstract: This technical report describes our system for track 1, 2 and 4 of the VoxCeleb Speaker Recognition Challenge 2022 (VoxSRC-22). By combining several ResNet variants, our submission for track 1 attained a minDCF of 0:090 with EER 1:401%. By further incorporating three fine-tuned pre-trained models, our submission for track 2 achieved a minDCF of 0:072 with EER 1:119%. For track 4, our system consis… ▽ More This technical report describes our system for track 1, 2 and 4 of the VoxCeleb Speaker Recognition Challenge 2022 (VoxSRC-22). By combining several ResNet variants, our submission for track 1 attained a minDCF of 0:090 with EER 1:401%. By further incorporating three fine-tuned pre-trained models, our submission for track 2 achieved a minDCF of 0:072 with EER 1:119%. For track 4, our system consisted of voice activity detection (VAD), speaker embedding extraction, agglomerative hierarchical clustering (AHC) followed by a re-clustering step based on a Bayesian hidden Markov model and overlapped speech detection and handling. Our submission for track 4 achieved a diarisation error rate (DER) of 4.86%. The submissions all ranked the 2nd places for the corresponding tracks. △ Less

Submitted 23 September, 2022; originally announced September 2022.

Comments: System description of VoxSRC 2022: track 1, 2 and 4

arXiv:2206.06289 [pdf, other]

Silver-Bullet-3D at ManiSkill 2021: Learning-from-Demonstrations and Heuristic Rule-based Methods for Object Manipulation

Authors: Yingwei Pan, Yehao Li, Yiheng Zhang, Qi Cai, Fuchen Long, Zhaofan Qiu, Ting Yao, Tao Mei

Abstract: This paper presents an overview and comparative analysis of our systems designed for the following two tracks in SAPIEN ManiSkill Challenge 2021: No Interaction Track: The No Interaction track targets for learning policies from pre-collected demonstration trajectories. We investigate both imitation learning-based approach, i.e., imitating the observed behavior using classical supervised learning… ▽ More This paper presents an overview and comparative analysis of our systems designed for the following two tracks in SAPIEN ManiSkill Challenge 2021: No Interaction Track: The No Interaction track targets for learning policies from pre-collected demonstration trajectories. We investigate both imitation learning-based approach, i.e., imitating the observed behavior using classical supervised learning techniques, and offline reinforcement learning-based approaches, for this track. Moreover, the geometry and texture structures of objects and robotic arms are exploited via Transformer-based networks to facilitate imitation learning. No Restriction Track: In this track, we design a Heuristic Rule-based Method (HRM) to trigger high-quality object manipulation by decomposing the task into a series of sub-tasks. For each sub-task, the simple rule-based controlling strategies are adopted to predict actions that can be applied to robotic arms. To ease the implementations of our systems, all the source codes and pre-trained models are available at \url{https://github.com/caiqi/Silver-Bullet-3D/}. △ Less

Submitted 13 June, 2022; originally announced June 2022.

Comments: Accepted by ICLR 2022 Workshop on Generalizable Policy Learning in Physical World. Top-performing systems for both no interaction and no restriction tracks in SAPIEN ManiSkill Challenge 2021. The source code and model are publicly available at: https://github.com/caiqi/Silver-Bullet-3D/

arXiv:2206.02620 [pdf, other]

ResAct: Reinforcing Long-term Engagement in Sequential Recommendation with Residual Actor

Authors: Wanqi Xue, Qingpeng Cai, Ruohan Zhan, Dong Zheng, Peng Jiang, Kun Gai, Bo An

Abstract: Long-term engagement is preferred over immediate engagement in sequential recommendation as it directly affects product operational metrics such as daily active users (DAUs) and dwell time. Meanwhile, reinforcement learning (RL) is widely regarded as a promising framework for optimizing long-term engagement in sequential recommendation. However, due to expensive online interactions, it is very dif… ▽ More Long-term engagement is preferred over immediate engagement in sequential recommendation as it directly affects product operational metrics such as daily active users (DAUs) and dwell time. Meanwhile, reinforcement learning (RL) is widely regarded as a promising framework for optimizing long-term engagement in sequential recommendation. However, due to expensive online interactions, it is very difficult for RL algorithms to perform state-action value estimation, exploration and feature extraction when optimizing long-term engagement. In this paper, we propose ResAct which seeks a policy that is close to, but better than, the online-serving policy. In this way, we can collect sufficient data near the learned policy so that state-action values can be properly estimated, and there is no need to perform online exploration. ResAct optimizes the policy by first reconstructing the online behaviors and then improving it via a Residual Actor. To extract long-term information, ResAct utilizes two information-theoretical regularizers to confirm the expressiveness and conciseness of features. We conduct experiments on a benchmark dataset and a large-scale industrial dataset which consists of tens of millions of recommendation requests. Experimental results show that our method significantly outperforms the state-of-the-art baselines in various long-term engagement optimization tasks. △ Less

Submitted 16 June, 2023; v1 submitted 31 May, 2022; originally announced June 2022.

Comments: Accpetd by ICLR 2023

arXiv:2205.13476 [pdf, ps, other]

Embed to Control Partially Observed Systems: Representation Learning with Provable Sample Efficiency

Authors: Lingxiao Wang, Qi Cai, Zhuoran Yang, Zhaoran Wang

Abstract: Reinforcement learning in partially observed Markov decision processes (POMDPs) faces two challenges. (i) It often takes the full history to predict the future, which induces a sample complexity that scales exponentially with the horizon. (ii) The observation and state spaces are often continuous, which induces a sample complexity that scales exponentially with the extrinsic dimension. Addressing… ▽ More Reinforcement learning in partially observed Markov decision processes (POMDPs) faces two challenges. (i) It often takes the full history to predict the future, which induces a sample complexity that scales exponentially with the horizon. (ii) The observation and state spaces are often continuous, which induces a sample complexity that scales exponentially with the extrinsic dimension. Addressing such challenges requires learning a minimal but sufficient representation of the observation and state histories by exploiting the structure of the POMDP. To this end, we propose a reinforcement learning algorithm named Embed to Control (ETC), which learns the representation at two levels while optimizing the policy.~(i) For each step, ETC learns to represent the state with a low-dimensional feature, which factorizes the transition kernel. (ii) Across multiple steps, ETC learns to represent the full history with a low-dimensional embedding, which assembles the per-step feature. We integrate (i) and (ii) in a unified framework that allows a variety of estimators (including maximum likelihood estimators and generative adversarial networks). For a class of POMDPs with a low-rank structure in the transition kernel, ETC attains an $O(1/ε^2)$ sample complexity that scales polynomially with the horizon and the intrinsic dimension (that is, the rank). Here $ε$ is the optimality gap. To our best knowledge, ETC is the first sample-efficient algorithm that bridges representation learning and policy optimization in POMDPs with infinite observation and state spaces. △ Less

Submitted 31 March, 2024; v1 submitted 26 May, 2022; originally announced May 2022.

Comments: Accepted by ICLR 2022

arXiv:2205.13248 [pdf, other]

Constrained Reinforcement Learning for Short Video Recommendation

Authors: Qingpeng Cai, Ruohan Zhan, Chi Zhang, Jie Zheng, Guangwei Ding, **hua Gong, Dong Zheng, Peng Jiang

Abstract: The wide popularity of short videos on social media poses new opportunities and challenges to optimize recommender systems on the video-sharing platforms. Users provide complex and multi-faceted responses towards recommendations, including watch time and various types of interactions with videos. As a result, established recommendation algorithms that concern a single objective are not adequate to… ▽ More The wide popularity of short videos on social media poses new opportunities and challenges to optimize recommender systems on the video-sharing platforms. Users provide complex and multi-faceted responses towards recommendations, including watch time and various types of interactions with videos. As a result, established recommendation algorithms that concern a single objective are not adequate to meet this new demand of optimizing comprehensive user experiences. In this paper, we formulate the problem of short video recommendation as a constrained Markov Decision Process (MDP), where platforms want to optimize the main goal of user watch time in long term, with the constraint of accommodating the auxiliary responses of user interactions such as sharing/downloading videos. To solve the constrained MDP, we propose a two-stage reinforcement learning approach based on actor-critic framework. At stage one, we learn individual policies to optimize each auxiliary response. At stage two, we learn a policy to (i) optimize the main response and (ii) stay close to policies learned at the first stage, which effectively guarantees the performance of this main policy on the auxiliaries. Through extensive simulations, we demonstrate effectiveness of our approach over alternatives in both optimizing the main goal as well as balancing the others. We further show the advantage of our approach in live experiments of short video recommendations, where it significantly outperforms other baselines in terms of watch time and interactions from video views. Our approach has been fully launched in the production system to optimize user experiences on the platform. △ Less

Submitted 26 May, 2022; originally announced May 2022.

arXiv:2204.09787 [pdf, ps, other]

Reinforcement Learning from Partial Observation: Linear Function Approximation with Provable Sample Efficiency

Authors: Qi Cai, Zhuoran Yang, Zhaoran Wang

Abstract: We study reinforcement learning for partially observed Markov decision processes (POMDPs) with infinite observation and state spaces, which remains less investigated theoretically. To this end, we make the first attempt at bridging partial observability and function approximation for a class of POMDPs with a linear structure. In detail, we propose a reinforcement learning algorithm (Optimistic Exp… ▽ More We study reinforcement learning for partially observed Markov decision processes (POMDPs) with infinite observation and state spaces, which remains less investigated theoretically. To this end, we make the first attempt at bridging partial observability and function approximation for a class of POMDPs with a linear structure. In detail, we propose a reinforcement learning algorithm (Optimistic Exploration via Adversarial Integral Equation or OP-TENET) that attains an $ε$-optimal policy within $O(1/ε^2)$ episodes. In particular, the sample complexity scales polynomially in the intrinsic dimension of the linear structure and is independent of the size of the observation and state spaces. The sample efficiency of OP-TENET is enabled by a sequence of ingredients: (i) a Bellman operator with finite memory, which represents the value function in a recursive manner, (ii) the identification and estimation of such an operator via an adversarial integral equation, which features a smoothed discriminator tailored to the linear structure, and (iii) the exploration of the observation and state spaces via optimism, which is based on quantifying the uncertainty in the adversarial integral equation. △ Less

Submitted 31 March, 2024; v1 submitted 20 April, 2022; originally announced April 2022.

arXiv:2203.10247 [pdf, other]

doi 10.1109/TIP.2023.3279977

HIPA: Hierarchical Patch Transformer for Single Image Super Resolution

Authors: Qing Cai, Yiming Qian, **xing Li, Jun Lv, Yee-Hong Yang, Feng Wu, David Zhang

Abstract: Transformer-based architectures start to emerge in single image super resolution (SISR) and have achieved promising performance. Most existing Vision Transformers divide images into the same number of patches with a fixed size, which may not be optimal for restoring patches with different levels of texture richness. This paper presents HIPA, a novel Transformer architecture that progressively reco… ▽ More Transformer-based architectures start to emerge in single image super resolution (SISR) and have achieved promising performance. Most existing Vision Transformers divide images into the same number of patches with a fixed size, which may not be optimal for restoring patches with different levels of texture richness. This paper presents HIPA, a novel Transformer architecture that progressively recovers the high resolution image using a hierarchical patch partition. Specifically, we build a cascaded model that processes an input image in multiple stages, where we start with tokens with small patch sizes and gradually merge to the full resolution. Such a hierarchical patch mechanism not only explicitly enables feature aggregation at multiple resolutions but also adaptively learns patch-aware features for different image regions, e.g., using a smaller patch for areas with fine details and a larger patch for textureless regions. Meanwhile, a new attention-based position encoding scheme for Transformer is proposed to let the network focus on which tokens should be paid more attention by assigning different weights to different tokens, which is the first time to our best knowledge. Furthermore, we also propose a new multi-reception field attention module to enlarge the convolution reception field from different branches. The experimental results on several public datasets demonstrate the superior performance of the proposed HIPA over previous methods quantitatively and qualitatively. △ Less

Submitted 6 June, 2023; v1 submitted 19 March, 2022; originally announced March 2022.

arXiv:2203.02533 [pdf, other]

BoostMIS: Boosting Medical Image Semi-supervised Learning with Adaptive Pseudo Labeling and Informative Active Annotation

Authors: Wenqiao Zhang, Lei Zhu, James Hallinan, Andrew Makmur, Shengyu Zhang, Qingpeng Cai, Beng Chin Ooi

Abstract: In this paper, we propose a novel semi-supervised learning (SSL) framework named BoostMIS that combines adaptive pseudo labeling and informative active annotation to unleash the potential of medical image SSL models: (1) BoostMIS can adaptively leverage the cluster assumption and consistency regularization of the unlabeled data according to the current learning status. This strategy can adaptively… ▽ More In this paper, we propose a novel semi-supervised learning (SSL) framework named BoostMIS that combines adaptive pseudo labeling and informative active annotation to unleash the potential of medical image SSL models: (1) BoostMIS can adaptively leverage the cluster assumption and consistency regularization of the unlabeled data according to the current learning status. This strategy can adaptively generate one-hot "hard" labels converted from task model predictions for better task model training. (2) For the unselected unlabeled images with low confidence, we introduce an Active learning (AL) algorithm to find the informative samples as the annotation candidates by exploiting virtual adversarial perturbation and model's density-aware entropy. These informative candidates are subsequently fed into the next training cycle for better SSL label propagation. Notably, the adaptive pseudo-labeling and informative active annotation form a learning closed-loop that are mutually collaborative to boost medical image SSL. To verify the effectiveness of the proposed method, we collected a metastatic epidural spinal cord compression (MESCC) dataset that aims to optimize MESCC diagnosis and classification for improved specialist referral and treatment. We conducted an extensive experimental study of BoostMIS on MESCC and another public dataset COVIDx. The experimental results verify our framework's effectiveness and generalisability for different medical image datasets with a significant improvement over various state-of-the-art methods. △ Less

Submitted 21 March, 2022; v1 submitted 4 March, 2022; originally announced March 2022.

Comments: 11 pages

Journal ref: CVPR 2022

arXiv:2202.10059 [pdf, other]

doi 10.22331/q-2023-12-06-1201

Improving the performance of twin-field quantum key distribution with advantage distillation technology

Authors: Hong-Wei Li, Rui-Qiang Wang, Chun-Mei Zhang, Qing-Yu Cai

Abstract: In this work, we apply the advantage distillation method to improve the performance of a practical twin-field quantum key distribution system under collective attack. Compared with the previous analysis result given by Maeda, Sasaki and Koashi [Nature Communication 10, 3140 (2019)], the maximal transmission distance obtained by our analysis method will be increased from 420 km to 470 km. By increa… ▽ More In this work, we apply the advantage distillation method to improve the performance of a practical twin-field quantum key distribution system under collective attack. Compared with the previous analysis result given by Maeda, Sasaki and Koashi [Nature Communication 10, 3140 (2019)], the maximal transmission distance obtained by our analysis method will be increased from 420 km to 470 km. By increasing the loss-independent misalignment error to 12%, the previous analysis method can not overcome the rate-distance bound. However, our analysis method can still overcome the rate-distance bound when the misalignment error is 16%. More surprisingly, we prove that twin-field quantum key distribution can generate positive secure key even if the misalignment error is close to 50%, thus our analysis method can significantly improve the performance of a practical twin-field quantum key distribution system. △ Less

Submitted 4 December, 2023; v1 submitted 21 February, 2022; originally announced February 2022.

Comments: accepted by Quantum

Journal ref: Quantum 7, 1201 (2023)

arXiv:2201.09674 [pdf, ps, other]

Euler's transformation, zeta functions and generalizations of Wallis' formula

Authors: Qianqian Cai, Su Hu, Min-Soo Kim

Abstract: In this note, we extend Euler's transformation formula from the alternating series to more general series. Then we give new expressions for the Riemann zeta function $ζ(s)$ by the generalized difference operator $Δ_{c}$, which provide analytic continuation of $ζ(s)$ and new ways to evaluate the special values of $ζ(-m)$ for $m=0,1,2,\ldots$. Applying these results, we further extend Huylebrouck's… ▽ More In this note, we extend Euler's transformation formula from the alternating series to more general series. Then we give new expressions for the Riemann zeta function $ζ(s)$ by the generalized difference operator $Δ_{c}$, which provide analytic continuation of $ζ(s)$ and new ways to evaluate the special values of $ζ(-m)$ for $m=0,1,2,\ldots$. Applying these results, we further extend Huylebrouck's generalization of Wallis' well-known formula for $π$ in the half planes Re$(s)>0$ and Re$(s)>-1$, respectively. They imply several interesting special cases including $$ \frac{2π}{3^{\frac{3}{2}}}=\frac{3^{\frac{4}{3}}}{2^{\frac{4}{3}}} \frac{2^{\frac{1}{3}}\cdot3^{\frac{1}{3}}\cdot3^{\frac{1}{3}}\cdot4^{\frac{1}{3}}\cdot6^{\frac{2}{3}}\cdot6^{\frac{2}{3}}}{4^{\frac{1}{3}}\cdot4^{\frac{1}{3}}\cdot5^{\frac{1}{3}}\cdot5^{\frac{1}{3}}\cdot4^{\frac{2}{3}}\cdot5^{\frac{2}{3}}}\cdots, $$ $$ 3^{γ-\frac{\log 3}{2}}=\frac{3^{\frac{1}{3}}\cdot3^{\frac{1}{3}}}{2^{\frac{1}{2}}\cdot4^{\frac{1}{4}}} \frac{6^{\frac{1}{6}}\cdot6^{\frac{1}{6}}}{5^{\frac{1}{5}}\cdot7^{\frac{1}{7}}}\frac{9^{\frac{1}{9}}\cdot9^{\frac{1}{9}}}{8^{\frac{1}{8}}\cdot10^{\frac{1}{10}}}\cdots, $$ and $$ \left(3\left(\frac{2πe^γ}{A^{12}}\right)^{2}\right)^{\frac{π^2}{18}}=\frac{3^{\frac{1}{3^2}}\cdot3^{\frac{1}{3^2}}}{2^{\frac{1}{2^2}}\cdot4^{\frac{1}{4^2}}} \frac{6^{\frac{1}{6^2}}\cdot6^{\frac{1}{6^2}}}{5^{\frac{1}{5^2}}\cdot7^{\frac{1}{7^2}}}\frac{9^{\frac{1}{9^2}}\cdot9^{\frac{1}{9^2}}}{8^{\frac{1}{8^2}}\cdot10^{\frac{1}{10^2}}}\cdots,$$ where $γ$ is the Euler-Mascheroni constant and $A$ is the Glaisher-Kinkelin constant. △ Less

Submitted 25 May, 2023; v1 submitted 24 January, 2022; originally announced January 2022.

Comments: 14 pages

MSC Class: 11M06; 11B68; 11Y60; 40G05; 26B20

arXiv:2201.04808 [pdf, ps, other]

doi 10.1103/PhysRevA.104.033710

Coherent single-photon scattering spectra for a giant-atom waveguide-QED system beyond dipole approximation

Authors: Q. Y. Cai, W. Z. Jia

Abstract: We investigate the single-photon scattering spectra of a giant atom coupled to a one dimensional waveguide via multiple connection points or a continuous coupling region. Using a full quantum mechanical method, we obtain the general analytic expressions for the single-photon scattering coefficients, which are valid in both the Markovian and the non-arkovian regimes. We summarize the influences of… ▽ More We investigate the single-photon scattering spectra of a giant atom coupled to a one dimensional waveguide via multiple connection points or a continuous coupling region. Using a full quantum mechanical method, we obtain the general analytic expressions for the single-photon scattering coefficients, which are valid in both the Markovian and the non-arkovian regimes. We summarize the influences of the non-dipole effects, mainly caused by the phases accumulated by photons traveling between coupling points, on the scattering spectra. We find that under the Markovian limit, the phase decay is detuning-independent, resulting in Lorentzian lineshapes characterized by the Lamb shifts and the effective decay rates. While in the non-Markovian regime, the accumulated phases become detuning-dependent, giving rise to non-Lorentzian lineshapes, characterized by multiple side peaks and total transmission points. Another interesting phenomenon in the non-Markovian regime is generation of broad photonic band gap by a single giant atom. We further generalize the case of discrete coupling points to the continuum limit with atom coupling to the waveguide via a continuous area, and analyze the scattering spectra for some typical distributions of coupling strength. △ Less

Submitted 13 January, 2022; originally announced January 2022.

Comments: 12 pages, 8 figures

Journal ref: Phys. Rev. A 104, 033710 (2021)

arXiv:2112.06558 [pdf, other]

MAGIC: Multimodal relAtional Graph adversarIal inferenCe for Diverse and Unpaired Text-based Image Captioning

Authors: Wenqiao Zhang, Haochen Shi, Jiannan Guo, Shengyu Zhang, Qingpeng Cai, Juncheng Li, Sihui Luo, Yueting Zhuang

Abstract: Text-based image captioning (TextCap) requires simultaneous comprehension of visual content and reading the text of images to generate a natural language description. Although a task can teach machines to understand the complex human environment further given that text is omnipresent in our daily surroundings, it poses additional challenges in normal captioning. A text-based image intuitively cont… ▽ More Text-based image captioning (TextCap) requires simultaneous comprehension of visual content and reading the text of images to generate a natural language description. Although a task can teach machines to understand the complex human environment further given that text is omnipresent in our daily surroundings, it poses additional challenges in normal captioning. A text-based image intuitively contains abundant and complex multimodal relational content, that is, image details can be described diversely from multiview rather than a single caption. Certainly, we can introduce additional paired training data to show the diversity of images' descriptions, this process is labor-intensive and time-consuming for TextCap pair annotations with extra texts. Based on the insight mentioned above, we investigate how to generate diverse captions that focus on different image parts using an unpaired training paradigm. We propose the Multimodal relAtional Graph adversarIal inferenCe (MAGIC) framework for diverse and unpaired TextCap. This framework can adaptively construct multiple multimodal relational graphs of images and model complex relationships among graphs to represent descriptive diversity. Moreover, a cascaded generative adversarial network is developed from modeled graphs to infer the unpaired caption generation in image-sentence feature alignment and linguistic coherence levels. We validate the effectiveness of MAGIC in generating diverse captions from different relational information items of an image. Experimental results show that MAGIC can generate very promising outcomes without using any image-caption training pairs. △ Less

Submitted 4 March, 2022; v1 submitted 13 December, 2021; originally announced December 2021.

Journal ref: AAAI 2022

arXiv:2111.14481 [pdf]

doi 10.1007/s00340-022-07791-1

A measurement method of transverse light-shift in atomic spin co-magnetometer

Authors: Li Xing, Wei Quan, Tianxiao Song, Qingzhong Cai, Wen Ye

Abstract: We disclose a method to obtain the transverse light-shift along the probe light of a single-axis alkali metal-noble gas co-magnetometer. The relationship between transverse compensating field and light-shift is deduced through the steady-state solution of Bloch equations. The variety of probe light intensity is used to obtain the residual magnetic field, and step modulation tests are applied to ac… ▽ More We disclose a method to obtain the transverse light-shift along the probe light of a single-axis alkali metal-noble gas co-magnetometer. The relationship between transverse compensating field and light-shift is deduced through the steady-state solution of Bloch equations. The variety of probe light intensity is used to obtain the residual magnetic field, and step modulation tests are applied to acquire the total spin-relaxation rate of electron spins and self-compensation point. Finally, the transverse light-shift is reduced from -0.115 nT to -0.039 nT by optimizing the probe light wavelength, and the value of the calibration coefficient can be increased simultaneously. △ Less

Submitted 29 November, 2021; originally announced November 2021.

arXiv:2110.13612 [pdf, ps, other]

An explicit and non-iterative moving-least-squares immersed-boundary method with low boundary velocity error

Authors: Wenyuan Chen, Shufan Zou, Qingdong Cai, Yantao Yang

Abstract: In this work, based on the moving-least-squares immersed boundary method, we proposed a new technique to improve the calculation of the volume force representing the body boundary. For boundary with simple geometry, we theoretically analyse the error between the desired volume force at boundary and the actual force given by the original method. The ratio between the two forces is very close to a c… ▽ More In this work, based on the moving-least-squares immersed boundary method, we proposed a new technique to improve the calculation of the volume force representing the body boundary. For boundary with simple geometry, we theoretically analyse the error between the desired volume force at boundary and the actual force given by the original method. The ratio between the two forces is very close to a constant. Numerical experiments reveal that for complex geometry, this ratio exhibits very narrow distribution around certain value. A spatially uniform coefficient is then introduced to correct the force and fixed by the least-square method over all boundary markers. Such method is explicit and non-iterative, and can be easily implemented into the existing scheme. Several test cases have been simulated with stationary and moving boundaries. Our new method can reduce the residual boundary velocity to the level comparable to that given by the iterative method, but requires much less computing time. Moreover, the new method can be readily combined with the iterative method and further reduces the residual boundary velocity. △ Less

Submitted 18 October, 2021; originally announced October 2021.

Comments: 21 pages, 10 figures, 4 tables

arXiv:2110.10396 [pdf, other]

UPPRESSO: Untraceable and Unlinkable Privacy-PREserving Single Sign-On Services

Authors: Chengqian Guo, **gqiang Lin, Quanwei Cai, Wei Wang, Fengjun Li, Qiongxiao Wang, Jiwu **g, Bin Zhao

Abstract: Single sign-on (SSO) allows a user to maintain only the credential at the identity provider (IdP), to login to numerous RPs. However, SSO introduces extra privacy threats, compared with traditional authentication mechanisms, as (a) the IdP could track all RPs which a user is visiting, and (b) collusive RPs could learn a user's online profile by linking his identities across these RPs. This paper p… ▽ More Single sign-on (SSO) allows a user to maintain only the credential at the identity provider (IdP), to login to numerous RPs. However, SSO introduces extra privacy threats, compared with traditional authentication mechanisms, as (a) the IdP could track all RPs which a user is visiting, and (b) collusive RPs could learn a user's online profile by linking his identities across these RPs. This paper proposes a privacypreserving SSO system, called UPPRESSO, to protect a user's login activities against both the curious IdP and collusive RPs. We analyze the identity dilemma between the security requirements and these privacy concerns, and convert the SSO privacy problems into an identity transformation challenge. In each login instance, an ephemeral pseudo-identity (denoted as PID_RP ) of the RP, is firstly negotiated between the user and the RP. PID_RP is sent to the IdP and designated in the identity token, so the IdP is not aware of the visited RP. Meanwhile, PID_RP is used by the IdP to transform the permanent user identity ID_U into an ephemeral user pseudo-identity (denoted as PID_U ) in the identity token. On receiving the identity token, the RP transforms PID_U into a permanent account (denoted as Acct) of the user, by an ephemeral trapdoor in the negotiation. Given a user, the account at each RP is unique and different from ID_U, so collusive RPs cannot link his identities across these RPs. We build the UPPRESSO prototype on top of MITREid Connect, an open-source implementation of OIDC. The extensive evaluation shows that UPPRESSO fulfills the requirements of both security and privacy and introduces reasonable overheads. △ Less

Submitted 2 September, 2022; v1 submitted 20 October, 2021; originally announced October 2021.

arXiv:2110.10012 [pdf, other]

doi 10.3847/1538-4357/ac3135

Decimetric type U solar radio bursts and associated EUV phenomena on 2011 February 9

Authors: Guannan Gao, Qiangwei Cai, Shaojie Guo, Min Wang

Abstract: A GOES M1.9 flare took place in active region AR 11153 on February 9,2011. With the resolution of 200 kHz and a time cadence of 80 ms, the reverse-drifting (RS) type III bursts, intermittent sequence of type U bursts, drifting pulsation structure (DPS), and fine structures were observed by the Yunnan Observatories Solar Radio Spectrometer(YNSRS). Combined information revealed by the multi-waveleng… ▽ More A GOES M1.9 flare took place in active region AR 11153 on February 9,2011. With the resolution of 200 kHz and a time cadence of 80 ms, the reverse-drifting (RS) type III bursts, intermittent sequence of type U bursts, drifting pulsation structure (DPS), and fine structures were observed by the Yunnan Observatories Solar Radio Spectrometer(YNSRS). Combined information revealed by the multi-wavelength data indicated that after the DPS which observed by YNSRS, the generation rate of type U bursts suddenly increased 5 times than before. In this event, the generation rate of type U bursts may depend on the magnetic reconnection rate. Our observations are consistent with previous numerical simulations results. After the first plasmoid produced (plasma instability occurred), the magnetic reconnection rate increased suddenly 5-8 times than before. Furthermore, after the DPS, the frequency range of turnover frequency of type U bursts is obviously broadened 3 times than before, which indicates the fluctuation amplitude of the density in the loop-top. Our observations also support the numerical simulations during the flare impulsive phase. The turbulence occurs at the top of the flare loop, the plasmoids can trap the non-thermal particles and cause the density fluctuation at the loop-top. The observations are generally consistent with the results of numerical simulations, hel** us to better understand the characteristics of the whole physical process of eruption. △ Less

Submitted 19 October, 2021; originally announced October 2021.

Comments: 14 pages,10 figures, accepted by ApJ

arXiv:2110.00215 [pdf]

doi 10.1002/aelm.202200017

Thermal Conductivity of BAs under Pressure

Authors: Songrui Hou, Bo Sun, Fei Tian, Qingan Cai, Shanming Wang, Wanyue Peng, Xi Chen, Zhifeng Ren, Chen Li, Richard Wilson

Abstract: The thermal conductivity of boron arsenide (BAs) is believed to be influenced by phonon scattering selection rules due to its special phonon dispersion. Compression of BAs leads to significant changes in phonon dispersion, which allows for a test of first principles theories for how phonon dispersion affects three- and four-phonon scattering rates. This study reports the thermal conductivity of BA… ▽ More The thermal conductivity of boron arsenide (BAs) is believed to be influenced by phonon scattering selection rules due to its special phonon dispersion. Compression of BAs leads to significant changes in phonon dispersion, which allows for a test of first principles theories for how phonon dispersion affects three- and four-phonon scattering rates. This study reports the thermal conductivity of BAs from 0 to 30 GPa. Thermal conductivity vs. pressure of BAs is measured by time-domain thermoreflectance with a diamond anvil cell. In stark contrast to what is typical for nonmetallic crystals, BAs is observed to have a pressure independent thermal conductivity below 30 GPa. The thermal conductivity of nonmetallic crystals typically increases upon compression. The unusual pressure independence of thermal conductivity of BAs shows the important relationship between phonon dispersion properties and three- and four-phonon scattering rates. △ Less

Submitted 16 February, 2023; v1 submitted 1 October, 2021; originally announced October 2021.

arXiv:2108.10219 [pdf, other]

Study of Proximal Normalized Subband Adaptive Algorithm for Acoustic Echo Cancellation

Authors: Gang Guo, Yi Yu, Rodrigo C. de Lamare, Zongsheng Zheng, Lu Lu, Qiangming Cai

Abstract: In this paper, we propose a novel normalized subband adaptive filter algorithm suited for sparse scenarios, which combines the proportionate and sparsity-aware mechanisms. The proposed algorithm is derived based on the proximal forward-backward splitting and the soft-thresholding methods. We analyze the mean and mean square behaviors of the algorithm, which is supported by simulations. In addition… ▽ More In this paper, we propose a novel normalized subband adaptive filter algorithm suited for sparse scenarios, which combines the proportionate and sparsity-aware mechanisms. The proposed algorithm is derived based on the proximal forward-backward splitting and the soft-thresholding methods. We analyze the mean and mean square behaviors of the algorithm, which is supported by simulations. In addition, an adaptive approach for the choice of the thresholding parameter in the proximal step is also proposed based on the minimization of the mean square deviation. Simulations in the contexts of system identification and acoustic echo cancellation verify the superiority of the proposed algorithm over its counterparts. △ Less

Submitted 14 August, 2021; originally announced August 2021.

Comments: 12 figures, 13 pages

arXiv:2108.06631 [pdf, ps, other]

Direct Observation of Chiral Phonons by Inelastic X-ray Scattering

Authors: Qingan Cai, Olle Hellman, Bin Wei, Qiyang Sun, Ayman H. Said, Thomas Gog, Barry Winn, Chen Li

Abstract: Phonon chirality has attracted intensive attention since it breaks the traditional cognition that phonons are linear propagating bosons. This new quasiparticle property has been extensively studied theoretically and experimentally. However, characterization of the phonon chirality throughout the full Brillouin zone is still not possible due to the lack of available experimental tools. In this work… ▽ More Phonon chirality has attracted intensive attention since it breaks the traditional cognition that phonons are linear propagating bosons. This new quasiparticle property has been extensively studied theoretically and experimentally. However, characterization of the phonon chirality throughout the full Brillouin zone is still not possible due to the lack of available experimental tools. In this work, phonon dispersion and chirality of tungsten carbide were investigated by millielectronvolt energy-resolution inelastic X-ray scattering. The atomistic calculation indicates that in-plane longitudinal and transverse acoustic phonons near K and K$^\prime$ points are circularly polarized due to the broken inversion symmetry. Anomalous inelastic X-ray scattering by these circularly polarized phonons was observed and attributed to their chirality. Our results show that inelastic X-ray scattering can be utilized to characterize phonon chirality in materials and suggest that a revision to the phonon scattering function is necessary. △ Less

Submitted 14 August, 2021; originally announced August 2021.

arXiv:2108.05492 [pdf, ps, other]

Some Results on $k$-Critical $P_5$-Free Graphs

Authors: Qingqiong Cai, Jan Goedgebeur, Shenwei Huang

Abstract: A graph $G$ is $k$-vertex-critical if $G$ has chromatic number $k$ but every proper induced subgraph of $G$ has chromatic number less than $k$. The study of $k$-vertex-critical graphs for graph classes is an important topic in algorithmic graph theory because if the number of such graphs that are in a given hereditary graph class is finite, then there is a polynomial-time algorithm to decide if a… ▽ More A graph $G$ is $k$-vertex-critical if $G$ has chromatic number $k$ but every proper induced subgraph of $G$ has chromatic number less than $k$. The study of $k$-vertex-critical graphs for graph classes is an important topic in algorithmic graph theory because if the number of such graphs that are in a given hereditary graph class is finite, then there is a polynomial-time algorithm to decide if a graph in the class is $(k-1)$-colorable. In this paper, we prove that for every fixed integer $k\ge 1$, there are only finitely many $k$-vertex-critical ($P_5$,gem)-free graphs and $(P_5,\overline{P_3+P_2})$-free graphs. To prove the results we use a known structure theorem for ($P_5$,gem)-free graphs combined with properties of $k$-vertex-critical graphs. Moreover, we characterize all $k$-vertex-critical ($P_5$,gem)-free graphs and $(P_5,\overline{P_3+P_2})$-free graphs for $k \in \{4,5\}$ using a computer generation algorithm. △ Less

Submitted 11 August, 2021; originally announced August 2021.

Comments: 15 pages, 4 figures. arXiv admin note: substantial text overlap with arXiv:2005.03441

arXiv:2108.04526 [pdf, other]

A Survey on Deep Reinforcement Learning for Data Processing and Analytics

Authors: Qingpeng Cai, Can Cui, Yiyuan Xiong, Wei Wang, Zhongle Xie, Meihui Zhang

Abstract: Data processing and analytics are fundamental and pervasive. Algorithms play a vital role in data processing and analytics where many algorithm designs have incorporated heuristics and general rules from human knowledge and experience to improve their effectiveness. Recently, reinforcement learning, deep reinforcement learning (DRL) in particular, is increasingly explored and exploited in many are… ▽ More Data processing and analytics are fundamental and pervasive. Algorithms play a vital role in data processing and analytics where many algorithm designs have incorporated heuristics and general rules from human knowledge and experience to improve their effectiveness. Recently, reinforcement learning, deep reinforcement learning (DRL) in particular, is increasingly explored and exploited in many areas because it can learn better strategies in complicated environments it is interacting with than statically designed algorithms. Motivated by this trend, we provide a comprehensive review of recent works focusing on utilizing DRL to improve data processing and analytics. First, we present an introduction to key concepts, theories, and methods in DRL. Next, we discuss DRL deployment on database systems, facilitating data processing and analytics in various aspects, including data organization, scheduling, tuning, and indexing. Then, we survey the application of DRL in data processing and analytics, ranging from data preparation, natural language processing to healthcare, fintech, etc. Finally, we discuss important open challenges and future research directions of using DRL in data processing and analytics. △ Less

Submitted 4 February, 2022; v1 submitted 10 August, 2021; originally announced August 2021.

Comments: 39 pages, 3 figures and 3 tables

arXiv:2108.02696 [pdf, other]

A Low Rank Promoting Prior for Unsupervised Contrastive Learning

Authors: Yu Wang, **gyang Lin, Qi Cai, Yingwei Pan, Ting Yao, Hongyang Chao, Tao Mei

Abstract: Unsupervised learning is just at a tip** point where it could really take off. Among these approaches, contrastive learning has seen tremendous progress and led to state-of-the-art performance. In this paper, we construct a novel probabilistic graphical model that effectively incorporates the low rank promoting prior into the framework of contrastive learning, referred to as LORAC. In contrast t… ▽ More Unsupervised learning is just at a tip** point where it could really take off. Among these approaches, contrastive learning has seen tremendous progress and led to state-of-the-art performance. In this paper, we construct a novel probabilistic graphical model that effectively incorporates the low rank promoting prior into the framework of contrastive learning, referred to as LORAC. In contrast to the existing conventional self-supervised approaches that only considers independent learning, our hypothesis explicitly requires that all the samples belonging to the same instance class lie on the same subspace with small dimension. This heuristic poses particular joint learning constraints to reduce the degree of freedom of the problem during the search of the optimal network parameterization. Most importantly, we argue that the low rank prior employed here is not unique, and many different priors can be invoked in a similar probabilistic way, corresponding to different hypotheses about underlying truth behind the contrastive features. Empirical evidences show that the proposed algorithm clearly surpasses the state-of-the-art approaches on multiple benchmarks, including image classification, object detection, instance segmentation and keypoint detection. △ Less

Submitted 5 August, 2021; originally announced August 2021.

arXiv:2107.07959 [pdf]

doi 10.1016/j.mtphys.2021.100599

Giant Anisotropic in-Plane Thermal Conduction Induced by Anomalous Phonons in Nearly-Equilaterally Structured PdSe2

Authors: Bin Wei, Junyan Liu, Qingan Cai, Ahmet Alatas, Ayman H. said, Chen Li, Jiawang Hong

Abstract: In two-dimensional materials, structure difference induces the difference in phonon dispersions, leading to the anisotropy of in-plane thermal transport. Here, we report an exceptional case in layered PdSe2, where the bonding, force constants, and lattice constants are nearly-equal along the in-plane crystallographic axis directions. The phonon dispersions show significant differences between the… ▽ More In two-dimensional materials, structure difference induces the difference in phonon dispersions, leading to the anisotropy of in-plane thermal transport. Here, we report an exceptional case in layered PdSe2, where the bonding, force constants, and lattice constants are nearly-equal along the in-plane crystallographic axis directions. The phonon dispersions show significant differences between the Gamma-X and Gamma-Y directions, leading to the anisotropy of in-plane thermal conductivity with a ratio up to 1.8. Such anisotropy is not only unexpected in equilaterally structured (in-plane) materials but also comparable to the record in the non-equilaterally structured material reported to date. By combining inelastic X-ray scattering and first-principles calculations, we attribute such anisotropy to the low-energy phonons along Gamma-X, in particular, their lower group velocities and "avoided-crossing" behavior. The different bucking structures between a- (zigzag-type) and b-axis (flat-type) are mainly responsible for the unique phonon dynamics properties of PdSe2. The present results illustrate the unusual thermal conduction mechanism of the equilaterally structured materials and provide valuable insights on thermal management in electronic devices. △ Less

Submitted 16 July, 2021; originally announced July 2021.

Comments: 16 figures, 2 tables

Journal ref: MTPHYS 100599 2021

arXiv:2106.10922 [pdf]

doi 10.1038/s42005-021-00727-9

Matryoshka Phonon Twinning in alpha-GaN

Authors: Bin Wei, Qingan Cai, Qiyang Sun, Yaokun Su, Ayman H. Said, Douglas L. Abernathy, Jiawang Hong, Chen Li

Abstract: Understanding lattice dynamics is crucial for effective thermal management in high-power electronic devices because phonons dominate thermal transport in most semiconductors. This study utilizes complementary inelastic X-ray and neutron scattering techniques and reports the temperature-dependent phonon dynamics of alpha-GaN, one of the most important third-generation power semiconductors. A promin… ▽ More Understanding lattice dynamics is crucial for effective thermal management in high-power electronic devices because phonons dominate thermal transport in most semiconductors. This study utilizes complementary inelastic X-ray and neutron scattering techniques and reports the temperature-dependent phonon dynamics of alpha-GaN, one of the most important third-generation power semiconductors. A prominent Matryoshka phonon dispersion is discovered with the scattering tools and confirmed by the first-principles calculations. Such Matryoshka twinning throughout the three-dimension reciprocal space is demonstrated to amplify the anharmonicity of the related phonon modes through creating abundant three-phonon scattering channels and cutting the phonon lifetime of affected modes by more than 50%. Such phonon topology effectively contributes to the reduction of the in-plane thermal transport, thus the anisotropic thermal conductivity of alpha-GaN. The results not only have significant implications for engineering the thermal performance and other phonon-related properties of alpha-GaN, but also offer valuable insights on the role of anomalous phonon topology in thermal transport of other technically important semiconductors. △ Less

Submitted 21 June, 2021; originally announced June 2021.

Comments: 34 pages, 15 figures

Journal ref: COMMUNICATIONS PHYSICS 2021

arXiv:2106.10662 [pdf, other]

FedXGBoost: Privacy-Preserving XGBoost for Federated Learning

Authors: Nhan Khanh Le, Yang Liu, Quang Minh Nguyen, Qingchen Liu, Fangzhou Liu, Quanwei Cai, Sandra Hirche

Abstract: Federated learning is the distributed machine learning framework that enables collaborative training across multiple parties while ensuring data privacy. Practical adaptation of XGBoost, the state-of-the-art tree boosting framework, to federated learning remains limited due to high cost incurred by conventional privacy-preserving methods. To address the problem, we propose two variants of federate… ▽ More Federated learning is the distributed machine learning framework that enables collaborative training across multiple parties while ensuring data privacy. Practical adaptation of XGBoost, the state-of-the-art tree boosting framework, to federated learning remains limited due to high cost incurred by conventional privacy-preserving methods. To address the problem, we propose two variants of federated XGBoost with privacy guarantee: FedXGBoost-SMM and FedXGBoost-LDP. Our first protocol FedXGBoost-SMM deploys enhanced secure matrix multiplication method to preserve privacy with lossless accuracy and lower overhead than encryption-based techniques. Developed independently, the second protocol FedXGBoost-LDP is heuristically designed with noise perturbation for local differential privacy, and empirically evaluated on real-world and synthetic datasets. △ Less

Submitted 12 August, 2021; v1 submitted 20 June, 2021; originally announced June 2021.

arXiv:2105.11775 [pdf]

doi 10.1117/1.AP.3.5.056002

Active spintronic-metasurface terahertz emitters with tunable chirality

Authors: Changqin Liu, Sheng Zhang, Shunjia Wang, Qingnan Cai, Peng Wang, Chuanshan Tian, Lei Zhou, Yizheng Wu, Zhensheng Tao

Abstract: The ability to manipulate the electric-field vector of broadband terahertz waves is essential for applications of terahertz technologies in many areas, and can open up new possibilities for nonlinear terahertz spectroscopy and coherent control. Here, we propose a novel laser-driven terahertz emitter, consisting of metasurface-patterned magnetic multilayer heterostructures. Such hybrid terahertz em… ▽ More The ability to manipulate the electric-field vector of broadband terahertz waves is essential for applications of terahertz technologies in many areas, and can open up new possibilities for nonlinear terahertz spectroscopy and coherent control. Here, we propose a novel laser-driven terahertz emitter, consisting of metasurface-patterned magnetic multilayer heterostructures. Such hybrid terahertz emitters can combine the advantages of spintronic emitters for being ultrabroadband, efficient and flexible, as well as those of metasurfaces for the unique capability to manipulate terahertz waves with high precision and degree of freedom. Taking a stripe-patterned metasurface as an example, we demonstrate the generation of broadband terahertz waves with tunable chirality. Based on experimental and theoretical studies, the interplay between the laser-induced spintronic-origin currents and the metasurface-induced transient charges/currents are investigated, revealing the strong influence on the device functionality originated from both the light-matter interactions in individual metasurface units and the dynamic coupling between them. Our work not only offers a flexible, reliable and cost-effective solution for chiral terahertz wave generation and manipulation, but also opens a new pathway to metasurface-tailored spintronic devices for efficient vector-control of electromagnetic waves in the terahertz regime. △ Less

Submitted 25 May, 2021; originally announced May 2021.

Journal ref: Advanced Photonics 3, 056002 (2021)

arXiv:2103.01530 [pdf]

A Pose-only Solution to Visual Reconstruction and Navigation

Authors: Qi Cai, Lilian Zhang, Yuanxin Wu, Wenxian Yu, Dewen Hu

Abstract: Visual navigation and three-dimensional (3D) scene reconstruction are essential for robotics to interact with the surrounding environment. Large-scale scenes and critical camera motions are great challenges facing the research community to achieve this goal. We raised a pose-only imaging geometry framework and algorithms that can help solve these challenges. The representation is a linear function… ▽ More Visual navigation and three-dimensional (3D) scene reconstruction are essential for robotics to interact with the surrounding environment. Large-scale scenes and critical camera motions are great challenges facing the research community to achieve this goal. We raised a pose-only imaging geometry framework and algorithms that can help solve these challenges. The representation is a linear function of camera global translations, which allows for efficient and robust camera motion estimation. As a result, the spatial feature coordinates can be analytically reconstructed and do not require nonlinear optimization. Experiments demonstrate that the computational efficiency of recovering the scene and associated camera poses is significantly improved by 2-4 orders of magnitude. This solution might be promising to unlock real-time 3D visual computing in many forefront applications. △ Less

Submitted 20 September, 2021; v1 submitted 2 March, 2021; originally announced March 2021.

arXiv:2102.10871

Decreasing emissions and increasing sink capacity to support China in achieving carbon neutrality before 2060

Authors: Pengfei Han, Ning Zeng, Wen Zhang, Qixiang Cai, Ruqi Yang, Bo Yao, Xiaohui Lin, Guocheng Wang, Di Liu, Yongqiang Yu

Abstract: In September 2020, President ** announced that China strives to achieve carbon neutrality before 2060. This ambitious and bold commitment was well received by the global community. However, the technology and pathway are not so clear. Here, we conducted an extensive review covering more than 200 published papers and summarized the key technologies to achieve carbon neutrality. We projected… ▽ More In September 2020, President ** announced that China strives to achieve carbon neutrality before 2060. This ambitious and bold commitment was well received by the global community. However, the technology and pathway are not so clear. Here, we conducted an extensive review covering more than 200 published papers and summarized the key technologies to achieve carbon neutrality. We projected sectoral CO2 emissions for 2020-2050 based on our previous studies and published scenarios. We applied a medium sink scenario for terrestrial sinks due to the potential resource competition and included an ocean sink, which has generally not been included in previous estimates. We analyzed and revisited China's historical terrestrial carbon sink capacity from 1980-2020 based on multiple models and a literature review. To achieve neutrality, it is necessary to increase sink capacity and decrease emissions from many sources. On the one hand, critical measures to reduce emissions include decreasing the use of fossil fuels; substantially increasing the proportion of the renewable energy and nuclear energy. On the other hand, the capacity of future carbon sinks is projected to decrease due to the natural evolution of terrestrial ecosystems, and anthropogenic management practices are needed to increase sink capacity, including increasing the forest sinks through national ecological restoration projects and large-scale land greening campaigns; increasing wood harvesting and storage; and develo** CCUS. This paper provides basic source and sink data,and established and promising new technologies for decreasing emissions and increasing sinks for use by the scientific community and policy makers. △ Less

Submitted 17 December, 2023; v1 submitted 22 February, 2021; originally announced February 2021.

Comments: needs further revisions in policy part

arXiv:2101.11229 [pdf]

Mechanisms behind high CO2/CH4 selectivity using ZIF-8 metal organic frameworks with encapsulated ionic liquids: a computational study

Authors: Tianhao Yu, Qiong Cai, Guo** Lian, Yinge Bai, Xiaochun Zhang, ** Zhang, Lei Liu

Abstract: CO2/CH4 separation using ionic liquids (ILs) encapsulated metal-organic frameworks (MOFs), especially ZIF-8, has shown promise as a new technique for separating CO2 from CH4. However, the mechanisms behind the high CO2/CH4 selectivity of the method remains indistinct. Here we report the progress of understanding the mechanisms from examining the ZIF-8 aperture configuration variation using DFT and… ▽ More CO2/CH4 separation using ionic liquids (ILs) encapsulated metal-organic frameworks (MOFs), especially ZIF-8, has shown promise as a new technique for separating CO2 from CH4. However, the mechanisms behind the high CO2/CH4 selectivity of the method remains indistinct. Here we report the progress of understanding the mechanisms from examining the ZIF-8 aperture configuration variation using DFT and MD simulations. The results indicate that the pristine aperture configuration exhibits the best separation performance, and the addition of ILs prevents the apertures from large swing (i.e. configuration variation). Subsequently, the effect of IL viscosity on the layout variation was investigated. MD simulations also show that the pristine aperture configuration is more stabilized by ILs with large viscosity (0-87Cp). Further increase of IL viscosity above 87Cp did not result in noticeable changes in the aperture stability. △ Less

Submitted 27 January, 2021; originally announced January 2021.

arXiv:2101.10779 [pdf, other]

Superradiant detection of microscopic optical dipolar interactions

Authors: Ling**g Ji, Yizun He, Qingnan Cai, Zhening Fang, Yuzhuo Wang, Liyang Qiu, Lei Zhou, Saijun Wu, Stefano Grava, Darrick E. Chang

Abstract: The interaction between light and cold atoms is a complex phenomenon potentially featuring many-body resonant dipole interactions. A major obstacle toward exploring these quantum resources of the system is macroscopic light propagation effects, which not only limit the available time for the microscopic correlations to locally build up, but also create a directional, superradiant emission backgrou… ▽ More The interaction between light and cold atoms is a complex phenomenon potentially featuring many-body resonant dipole interactions. A major obstacle toward exploring these quantum resources of the system is macroscopic light propagation effects, which not only limit the available time for the microscopic correlations to locally build up, but also create a directional, superradiant emission background whose variations can overwhelm the microscopic effects. In this Letter, we demonstrate a method to perform ``background-free'' detection of the microscopic optical dynamics in a laser-cooled atomic ensemble. This is made possible by transiently suppressing the macroscopic optical propagation over a substantial time, before a recall of superradiance that imprints the effect of the accumulated microscopic dynamics into an efficiently detectable outgoing field. We apply this technique to unveil and precisely characterize a density-dependent, microscopic dipolar dephasing effect that generally limits the lifetime of optical spin-wave order in ensemble-based atom-light interfaces. △ Less

Submitted 12 October, 2023; v1 submitted 26 January, 2021; originally announced January 2021.

Comments: 18 pages, 13 figures, improved data with substantial revision

arXiv:2011.04301 [pdf, other]

doi 10.1103/PhysRevA.103.052419

Microwave Quantum Illumination via Cavity Magnonics

Authors: Qizhi Cai, **kun Liao, Bohai Shen, Guangcan Guo, Qiang Zhou

Abstract: Quantum illumination (QI) is a quantum sensing protocol mainly for target detection which uses entangled signal-idler photon pairs to enhance the detection efficiency of low-reflectivity objects immersed in thermal noisy environments. Especially, due to the naturally occurring background radiation, the photon emitted toward potential targets more appropriately lies in the microwave region. Here, w… ▽ More Quantum illumination (QI) is a quantum sensing protocol mainly for target detection which uses entangled signal-idler photon pairs to enhance the detection efficiency of low-reflectivity objects immersed in thermal noisy environments. Especially, due to the naturally occurring background radiation, the photon emitted toward potential targets more appropriately lies in the microwave region. Here, we propose a hybrid quantum source based on cavity magnonics for microwave QI, where the medium that bridges the optical and the microwave modes is magnon, the quanta of spin wave. Within experimentally accessible parameters, significant microwave-optical quantum resources of interest can be generated, leading to orders of magnitude lower detecting error probability compared with the electro-optomechanical prototype quantum radar and any classical microwave radar with equal transmitted energy. △ Less

Submitted 16 November, 2020; v1 submitted 9 November, 2020; originally announced November 2020.

Comments: 15 pages, 5 figures

Journal ref: Phys. Rev. A 103, 052419 (2021)

arXiv:2010.13025 [pdf]

Global to local impacts on atmospheric CO2 caused by COVID-19 lockdown

Authors: Ning Zeng, Pengfei Han, Di Liu, Zhiqiang Liu, Tomohiro Oda, Cory Martin, Zhu Liu, Bo Yao, Wanqi Sun, Pucai Wang, Qixiang Cai, Russell Dickerson, Shamil Maksyutov

Abstract: The world-wide lockdown in response to the COVID-19 pandemic in year 2020 led to economic slowdown and large reduction of fossil fuel CO2 emissions, but it is unclear how much it would reduce atmospheric CO2 concentration, and whether it can be observed. We estimated that a 7.9% reduction in emissions for 4 months would result in a 0.25 ppm decrease in the Northern Hemisphere CO2, an increment tha… ▽ More The world-wide lockdown in response to the COVID-19 pandemic in year 2020 led to economic slowdown and large reduction of fossil fuel CO2 emissions, but it is unclear how much it would reduce atmospheric CO2 concentration, and whether it can be observed. We estimated that a 7.9% reduction in emissions for 4 months would result in a 0.25 ppm decrease in the Northern Hemisphere CO2, an increment that is within the capability of current CO2 analyzers, but is a few times smaller than natural CO2 variabilities caused by weather and the biosphere such as El Nino. We used a state-of-the-art atmospheric transport model to simulate CO2, driven by a new daily fossil fuel emissions dataset and hourly biospheric fluxes from a carbon cycle model forced with observed climate variability. Our results show a 0.13 ppm decrease in atmospheric column CO2 anomaly averaged over 50S-50N for the period February-April 2020 relative to a 10-year climatology. A similar decrease was observed by the carbon satellite GOSAT3. Using model sensitivity experiments, we further found that COVID, the biosphere and weather contributed 54%, 23%, and 23% respectively. This seemingly small change stands out as the largest sub-annual anomaly in the last 10 years. Measurements from global ground stations were analyzed. At city scale, on-road CO2 enhancement measured in Bei**g shows reduction of 20-30 ppm, consistent with drastically reduced traffic during the lockdown. The ability of our current carbon monitoring systems in detecting the small and short-lasting COVID signal on the background of fossil fuel CO2 accumulated over the last two centuries is encouraging. The COVID-19 pandemic is an unintended experiment whose impact suggests that to keep atmospheric CO2 at a climate-safe level will require sustained effort of similar magnitude and improved accuracy and expanded spatiotemporal coverage of our monitoring systems. △ Less

Submitted 24 October, 2020; originally announced October 2020.

arXiv:2010.09177 [pdf, other]

Softmax Deep Double Deterministic Policy Gradients

Authors: Ling Pan, Qingpeng Cai, Longbo Huang

Abstract: A widely-used actor-critic reinforcement learning algorithm for continuous control, Deep Deterministic Policy Gradients (DDPG), suffers from the overestimation problem, which can negatively affect the performance. Although the state-of-the-art Twin Delayed Deep Deterministic Policy Gradient (TD3) algorithm mitigates the overestimation issue, it can lead to a large underestimation bias. In this pap… ▽ More A widely-used actor-critic reinforcement learning algorithm for continuous control, Deep Deterministic Policy Gradients (DDPG), suffers from the overestimation problem, which can negatively affect the performance. Although the state-of-the-art Twin Delayed Deep Deterministic Policy Gradient (TD3) algorithm mitigates the overestimation issue, it can lead to a large underestimation bias. In this paper, we propose to use the Boltzmann softmax operator for value function estimation in continuous control. We first theoretically analyze the softmax operator in continuous action space. Then, we uncover an important property of the softmax operator in actor-critic algorithms, i.e., it helps to smooth the optimization landscape, which sheds new light on the benefits of the operator. We also design two new algorithms, Softmax Deep Deterministic Policy Gradients (SD2) and Softmax Deep Double Deterministic Policy Gradients (SD3), by building the softmax operator upon single and double estimators, which can effectively improve the overestimation and underestimation bias. We conduct extensive experiments on challenging continuous control tasks, and results show that SD3 outperforms state-of-the-art methods. △ Less

Submitted 18 October, 2020; originally announced October 2020.

Comments: NeurIPS 2020

arXiv:2010.05131 [pdf, other]

Segmenting Epipolar Line

Authors: Shengjie Li, Qi Cai, Yuanxin Wu

Abstract: Identifying feature correspondence between two images is a fundamental procedure in three-dimensional computer vision. Usually the feature search space is confined by the epipolar line. Using the cheirality constraint, this paper finds that the feature search space can be restrained to one of two or three segments of the epipolar line that are defined by the epipole and a so-called virtual infinit… ▽ More Identifying feature correspondence between two images is a fundamental procedure in three-dimensional computer vision. Usually the feature search space is confined by the epipolar line. Using the cheirality constraint, this paper finds that the feature search space can be restrained to one of two or three segments of the epipolar line that are defined by the epipole and a so-called virtual infinity point. △ Less

Submitted 10 October, 2020; originally announced October 2020.

Comments: 5 pages, 6 figures

arXiv:2009.14776 [pdf, other]

Joint Contrastive Learning with Infinite Possibilities

Authors: Qi Cai, Yu Wang, Yingwei Pan, Ting Yao, Tao Mei

Abstract: This paper explores useful modifications of the recent development in contrastive learning via novel probabilistic modeling. We derive a particular form of contrastive loss named Joint Contrastive Learning (JCL). JCL implicitly involves the simultaneous learning of an infinite number of query-key pairs, which poses tighter constraints when searching for invariant features. We derive an upper bound… ▽ More This paper explores useful modifications of the recent development in contrastive learning via novel probabilistic modeling. We derive a particular form of contrastive loss named Joint Contrastive Learning (JCL). JCL implicitly involves the simultaneous learning of an infinite number of query-key pairs, which poses tighter constraints when searching for invariant features. We derive an upper bound on this formulation that allows analytical solutions in an end-to-end training manner. While JCL is practically effective in numerous computer vision applications, we also theoretically unveil the certain mechanisms that govern the behavior of JCL. We demonstrate that the proposed formulation harbors an innate agency that strongly favors similarity within each instance-specific class, and therefore remains advantageous when searching for discriminative features among distinct instances. We evaluate these proposals on multiple benchmarks, demonstrating considerable improvements over existing algorithms. Code is publicly available at: https://github.com/caiqi/Joint-Contrastive-Learning. △ Less

Submitted 10 October, 2020; v1 submitted 30 September, 2020; originally announced September 2020.

Comments: NeurIPS 2020 Spotlight; Code is publicly available at: https://github.com/caiqi/Joint-Contrastive-Learning

arXiv:2009.06903 [pdf, other]

A Robust and Reliable Point Cloud Recognition Network Under Rigid Transformation

Authors: Dongrui Liu, Chuanchuan Chen, Changqing Xu, Qi Cai, Lei Chu, Fei Wen, Robert Caiming Qiu

Abstract: Point cloud recognition is an essential task in industrial robotics and autonomous driving. Recently, several point cloud processing models have achieved state-of-the-art performances. However, these methods lack rotation robustness, and their performances degrade severely under random rotations, failing to extend to real-world scenarios with varying orientations. To this end, we propose a method… ▽ More Point cloud recognition is an essential task in industrial robotics and autonomous driving. Recently, several point cloud processing models have achieved state-of-the-art performances. However, these methods lack rotation robustness, and their performances degrade severely under random rotations, failing to extend to real-world scenarios with varying orientations. To this end, we propose a method named Self Contour-based Transformation (SCT), which can be flexibly integrated into various existing point cloud recognition models against arbitrary rotations. SCT provides efficient rotation and translation invariance by introducing Contour-Aware Transformation (CAT), which linearly transforms Cartesian coordinates of points to translation and rotation-invariant representations. We prove that CAT is a rotation and translation-invariant transformation based on the theoretical analysis. Furthermore, the Frame Alignment module is proposed to enhance discriminative feature extraction by capturing contours and transforming self contour-based frames into intra-class frames. Extensive experimental results show that SCT outperforms the state-of-the-art approaches under arbitrary rotations in effectiveness and efficiency on synthetic and real-world benchmarks. Furthermore, the robustness and generality evaluations indicate that SCT is robust and is applicable to various point cloud processing models, which highlights the superiority of SCT in industrial applications. △ Less

Submitted 28 December, 2021; v1 submitted 15 September, 2020; originally announced September 2020.

Comments: 13 pages, 11 figures

arXiv:2009.05187 [pdf, ps, other]

doi 10.1016/j.physletb.2020.135747

Wheeler-DeWitt equation rejects quantum effects of grown-up universes as a candidate for dark energy

Authors: Dongshan He, Qing-yu Cai

Abstract: In this paper, we study the changes of quantum effects of a growing universe by using Wheeler-DeWitt equation (WDWE) together with de Broglie-Bohm quantum trajectory approach. From WDWE, we obtain the quantum modified Friedmann equations which have additional terms called quantum potential compared to standard Friedmann equations. The quantum potential governs the behavior of the early universe, p… ▽ More In this paper, we study the changes of quantum effects of a growing universe by using Wheeler-DeWitt equation (WDWE) together with de Broglie-Bohm quantum trajectory approach. From WDWE, we obtain the quantum modified Friedmann equations which have additional terms called quantum potential compared to standard Friedmann equations. The quantum potential governs the behavior of the early universe, providing energy for inflation, while it decreases rapidly as the universe grows. The quantum potential of the grown-up universe is much smaller than that required for accelerating expansion. This indicates that quantum effects of our universe cannot be treated as a candidate for dark energy. △ Less

Submitted 10 September, 2020; originally announced September 2020.

Comments: Any comments are welcome!

Journal ref: Phys. Lett. B 809, 135747 (2020)

arXiv:2008.02257 [pdf, other]

doi 10.1021/acs.nanolett.1c01905

Signature of Many-Body Localization of Phonons in Strongly Disordered Superlattices

Authors: Thanh Nguyen, Nina Andrejevic, Hoi Chun Po, Qichen Song, Yoichiro Tsurimaki, Nathan C. Drucker, Ahmet Alatas, Ercan E. Alp, Bogdan M. Leu, Alessandro Cunsolo, Yong Q. Cai, Lijun Wu, Joseph A. Garlow, Yimei Zhu, Hong Lu, Arthur C. Gossard, Alexander A. Puretzky, David B. Geohegan, Shengxi Huang, Mingda Li

Abstract: Many-body localization (MBL) has attracted significant attention due to its immunity to thermalization, role in logarithmic entanglement entropy growth, and opportunities to reach exotic quantum orders. However, experimental realization of MBL in solid-state systems has remained challenging. Here we report evidence of a possible phonon MBL phase in disordered GaAs/AlAs superlattices. Through grazi… ▽ More Many-body localization (MBL) has attracted significant attention due to its immunity to thermalization, role in logarithmic entanglement entropy growth, and opportunities to reach exotic quantum orders. However, experimental realization of MBL in solid-state systems has remained challenging. Here we report evidence of a possible phonon MBL phase in disordered GaAs/AlAs superlattices. Through grazing-incidence inelastic X-ray scattering, we observe a strong deviation of the phonon population from equilibrium in samples doped with ErAs nanodots at low temperature, signaling a departure from thermalization. This behavior occurs within finite phonon energy and wavevector windows, suggesting a localization-thermalization crossover. We support our observation by proposing a theoretical model for the effective phonon Hamiltonian in disordered superlattices, and showing that it can be mapped exactly to a disordered 1D Bose-Hubbard model with a known MBL phase. Our work provides momentum-resolved experimental evidence of phonon localization, extending the scope of MBL to disordered solid-state systems. △ Less

Submitted 14 September, 2021; v1 submitted 5 August, 2020; originally announced August 2020.

Journal ref: Nano Lett. 2021, 21, 17, 7419-7425

arXiv:2008.01657 [pdf]

doi 10.1038/ncomms15815

Mechanical Properties of Atomically Thin Boron Nitride and the Role of Interlayer Interactions

Authors: Aleksey Falin, Qiran Cai, Elton J. G. Santos, Declan Scullion, Dong Qian, Rui Zhang, Zhi Yang, Shaoming Huang, Kenji Watanabe, Takashi Taniguchi, Matthew R. Barnett, Ying Chen, Rodney S. Ruoff, Lu Hua Li

Abstract: Atomically thin boron nitride (BN) nanosheets are important two-dimensional nanomaterials with many unique properties distinct from those of graphene, but the investigation of their mechanical properties still greatly lacks. Here we report that high-quality single-crystalline mono- and few-layer BN nanosheets are one of the strongest electrically insulating materials. More intriguingly, few-layer… ▽ More Atomically thin boron nitride (BN) nanosheets are important two-dimensional nanomaterials with many unique properties distinct from those of graphene, but the investigation of their mechanical properties still greatly lacks. Here we report that high-quality single-crystalline mono- and few-layer BN nanosheets are one of the strongest electrically insulating materials. More intriguingly, few-layer BN shows mechanical behaviors quite different from those of few-layer graphene under indentation. In striking contrast to graphene, whose strength decreases by more than 30% when the number of layers increases from 1 to 8, the mechanical strength of BN nanosheets is not sensitive to increasing thickness. We attribute this difference to the distinct interlayer interactions and hence sliding tendencies in these two materials under indentation. The significantly better mechanical integrity of BN nanosheets makes them a more attractive candidate than graphene for several applications, e.g. as mechanical reinforcements. △ Less

Submitted 2 August, 2020; originally announced August 2020.

Journal ref: NATURE COMMUNICATIONS | 8:15815 | 2017

arXiv:2008.01656 [pdf]

doi 10.1039/c6nr09312d

Raman Signature and Phonon Dispersion of Atomically Thin Boron Nitride

Authors: Qiran Cai, Declan Scullion, Aleksey Falin, Kenji Watanabe, Takashi Taniguchi, Ying Chen, Elton J. G. Santos, Lu Hua Li

Abstract: Raman spectroscopy has become an essential technique to characterize and investigate graphene and many other two-dimensional materials. However, there still lacks consensus on the Raman signature and phonon dispersion of atomically thin boron nitride (BN), which has many unique properties distinct from graphene. Such a knowledge gap greatly affects the understanding of basic physical and chemical… ▽ More Raman spectroscopy has become an essential technique to characterize and investigate graphene and many other two-dimensional materials. However, there still lacks consensus on the Raman signature and phonon dispersion of atomically thin boron nitride (BN), which has many unique properties distinct from graphene. Such a knowledge gap greatly affects the understanding of basic physical and chemical properties of atomically thin BN as well as the use of Raman spectroscopy to study these nanomaterials. Here, we use both experiment and simulation to reveal the intrinsic Raman signature of monolayer and few-layer BN. We find experimentally that atomically thin BN without interaction with substrate has a G band frequency similar to that of bulk hexagonal BN, but strain induced by substrate can cause pronounced Raman shifts. This is in excellent agreement with our first-principles density functional theory (DFT) calculations at two levels of theory, including van der Waals dispersion forces (opt-vdW) and a fractional of the exact exchange from Hartree-Fock (HF) theory through hybrid HSE06 functional. Both calculations demonstrate that the intrinsic E2g mode of BN does not depend sensibly on the number of layers. Our simulations also suggest the importance of the exact exchange mixing parameter in calculating the vibrational modes in BN, as it determines the fraction of HF exchange included in the DFT calculations. △ Less

Submitted 2 August, 2020; originally announced August 2020.

Journal ref: Nanoscale 9, 3059-3067(2017)

arXiv:2008.00451 [pdf]

doi 10.1021/acsnano.9b06858

Atomically Thin Boron Nitride as an Ideal Spacer for Metal-Enhanced Fluorescence

Authors: Wei Gan, Christos Tserkezis, Qiran Cai, Alexey Falin, Srikanth Mateti, Minh Nguyen, Igor Aharonovich, Kenji Watanabe, Takashi Taniguchi, Fumin Huang, Li Song, Lingxue Kong, Ying Chen, Lu Hua Li

Abstract: The metal-enhanced fluorescence (MEF) considerably enhances the luminescence for various applications, but its performance largely depends on the dielectric spacer between the fluorophore and plasmonic system. It is still challenging to produce a defect-free spacer having an optimized thickness with a subnanometer accuracy that enables reusability without affecting the enhancement. In this study,… ▽ More The metal-enhanced fluorescence (MEF) considerably enhances the luminescence for various applications, but its performance largely depends on the dielectric spacer between the fluorophore and plasmonic system. It is still challenging to produce a defect-free spacer having an optimized thickness with a subnanometer accuracy that enables reusability without affecting the enhancement. In this study, we demonstrate the use of atomically thin hexagonal boron nitride (BN) as an ideal MEF spacer owing to its multifold advantages over the traditional dielectric thin films. With rhodamine 6G as a representative fluorophore, it largely improves the enhancement factor (up to ~95+-5), sensitivity (10^-8 M), reproducibility, and reusability (~90% of the plasmonic activity is retained after 30 cycles of heating at 350 °C in air) of MEF. This can be attributed to its two-dimensional structure, thickness control at the atomic level, defect-free quality, high affinities to aromatic fluorophores, good thermal stability, and excellent impermeability. The atomically thin BN spacers could increase the use of MEF in different fields and industries. △ Less

Submitted 2 August, 2020; originally announced August 2020.

Journal ref: ACS Nano 13, 12184-12191(2019)

arXiv:2008.00447 [pdf]

doi 10.1021/acsami.0c01157

Two-dimensional van der Waals Heterostructures for Synergistically Improved Surface Enhanced Raman Spectroscopy

Authors: Qiran Cai, Wei Gan, Alexey Falin, Kenji Watanabe, Takashi Taniguchi, **cheng Zhuang, Weichang Hao, Shaoming Huang, Tao Tao, Ying Chen, Lu Hua Li

Abstract: Surface enhanced Raman spectroscopy (SERS) is a precise and non-invasive analytical technique that is widely used in chemical analysis, environmental protection, food processing, pharmaceutics, and diagnostic biology. However, it is still a challenge to produce highly sensitive and reusable SERS substrates with minimum fluorescence background. In this work, we propose the use of van der Waals hete… ▽ More Surface enhanced Raman spectroscopy (SERS) is a precise and non-invasive analytical technique that is widely used in chemical analysis, environmental protection, food processing, pharmaceutics, and diagnostic biology. However, it is still a challenge to produce highly sensitive and reusable SERS substrates with minimum fluorescence background. In this work, we propose the use of van der Waals heterostructures of two-dimensional materials (2D materials) to cover plasmonic metal nanoparticles to solve this challenge. The heterostructures of atomically thin boron nitride (BN) and graphene provide synergistic effects: (1) electrons could tunnel through the atomically thin BN, allowing the charge transfer between graphene and probe molecules to suppress fluorescence background; (2) the SERS sensitivity is enhanced by graphene via chemical enhancement mechanism (CM) in addition to electromagnetic field mechanism (EM); (3) the atomically thin BN protects the underlying graphene and Ag nanoparticles from oxidation during heating for regeneration at 360 °C in the air so that the SERS substrates could be reused. These advances will facilitate wider applications of SERS, especially on the detection of fluorescent molecules with higher sensitivity. △ Less

Submitted 2 August, 2020; originally announced August 2020.

Journal ref: ACS Applied Materials Interfaces 12, 21985-21991(2020)

arXiv:2008.00443 [pdf]

doi 10.1103/PhysRevLett.125.085902

Outstanding Thermal Conductivity of Single Atomic Layer Isotope-Modified Boron Nitride

Authors: Qiran Cai, Declan Scullion, Wei Gan, Alexey Falin, Pavel Cizek, Song Liu, James H. Edgar, Rong Liu, Bruce C. C. Cowie, Elton J. G. Santos, Lu Hua Li

Abstract: Materials with high thermal conductivities (k) is valuable to solve the challenge of waste heat dissipation in highly integrated and miniaturized modern devices. Herein, we report the first synthesis of atomically thin isotopically pure hexagonal boron nitride (BN) and its one of the highest k among all semiconductors and electric insulators. Single atomic layer (1L) BN enriched with 11B has a k u… ▽ More Materials with high thermal conductivities (k) is valuable to solve the challenge of waste heat dissipation in highly integrated and miniaturized modern devices. Herein, we report the first synthesis of atomically thin isotopically pure hexagonal boron nitride (BN) and its one of the highest k among all semiconductors and electric insulators. Single atomic layer (1L) BN enriched with 11B has a k up to 1009 W/mK at room temperature. We find that the isotope engineering mainly suppresses the out-of-plane optical (ZO) phonon scatterings in BN, which subsequently reduces acoustic-optical scatterings between ZO and transverse acoustic (TA) and longitudinal acoustic (LA) phonons. On the other hand, reducing the thickness to single atomic layer diminishes the interlayer interactions and hence Umklapp scatterings of the out-of-plane acoustic (ZA) phonons, though this thickness-induced k enhancement is not as dramatic as that in naturally occurring BN. With many of its unique properties, atomically thin monoisotopic BN is promising on heat management in van der Waals (vdW) devices and future flexible electronics. The isotope engineering of atomically thin BN may also open up other appealing applications and opportunities in 2D materials yet to be explored. △ Less

Submitted 21 August, 2020; v1 submitted 2 August, 2020; originally announced August 2020.

Journal ref: Physical Review Letters 125, 085902 (2020)

arXiv:2007.03672 [pdf, other]

Long-term Human Motion Prediction with Scene Context

Authors: Zhe Cao, Hang Gao, Karttikeya Mangalam, Qi-Zhi Cai, Minh Vo, Jitendra Malik

Abstract: Human movement is goal-directed and influenced by the spatial layout of the objects in the scene. To plan future human motion, it is crucial to perceive the environment -- imagine how hard it is to navigate a new room with lights off. Existing works on predicting human motion do not pay attention to the scene context and thus struggle in long-term prediction. In this work, we propose a novel three… ▽ More Human movement is goal-directed and influenced by the spatial layout of the objects in the scene. To plan future human motion, it is crucial to perceive the environment -- imagine how hard it is to navigate a new room with lights off. Existing works on predicting human motion do not pay attention to the scene context and thus struggle in long-term prediction. In this work, we propose a novel three-stage framework that exploits scene context to tackle this task. Given a single scene image and 2D pose histories, our method first samples multiple human motion goals, then plans 3D human paths towards each goal, and finally predicts 3D human pose sequences following each path. For stable training and rigorous evaluation, we contribute a diverse synthetic dataset with clean annotations. In both synthetic and real datasets, our method shows consistent quantitative and qualitative improvements over existing methods. △ Less

Submitted 31 July, 2020; v1 submitted 7 July, 2020; originally announced July 2020.

Comments: ECCV 2020 Oral. Dataset & Code: https://github.com/ZheC/GTA-IM-Dataset Video: https://people.eecs.berkeley.edu/~zhecao/hmp/index.html

arXiv:2006.13182 [pdf, ps, other]

On the Global Optimality of Model-Agnostic Meta-Learning

Authors: Lingxiao Wang, Qi Cai, Zhuoran Yang, Zhaoran Wang

Abstract: Model-agnostic meta-learning (MAML) formulates meta-learning as a bilevel optimization problem, where the inner level solves each subtask based on a shared prior, while the outer level searches for the optimal shared prior by optimizing its aggregated performance over all the subtasks. Despite its empirical success, MAML remains less understood in theory, especially in terms of its global optimali… ▽ More Model-agnostic meta-learning (MAML) formulates meta-learning as a bilevel optimization problem, where the inner level solves each subtask based on a shared prior, while the outer level searches for the optimal shared prior by optimizing its aggregated performance over all the subtasks. Despite its empirical success, MAML remains less understood in theory, especially in terms of its global optimality, due to the nonconvexity of the meta-objective (the outer-level objective). To bridge such a gap between theory and practice, we characterize the optimality gap of the stationary points attained by MAML for both reinforcement learning and supervised learning, where the inner-level and outer-level problems are solved via first-order optimization methods. In particular, our characterization connects the optimality gap of such stationary points with (i) the functional geometry of inner-level objectives and (ii) the representation power of function approximators, including linear models and neural networks. To the best of our knowledge, our analysis establishes the global optimality of MAML with nonconvex meta-objectives for the first time. △ Less

Submitted 23 June, 2020; originally announced June 2020.

Comments: 41 pages; accepted to ICML; initial draft submitted in Feb, 2020

Showing 51–100 of 253 results for author: Cai, Q