
Showing 1–50 of 83 results for author: Zeng, P

  1. arXiv:2406.13362  [pdf, other]

    cs.CV cs.CL cs.LG

    VisualRWKV: Exploring Recurrent Neural Networks for Visual Language Models

    Authors: Haowen Hou, Peigen Zeng, Fei Ma, Fei Richard Yu

    Abstract: Visual Language Models (VLMs) have rapidly progressed with the recent success of large language models. However, there have been few attempts to incorporate efficient linear Recurrent Neural Networks (RNNs) architectures into VLMs. In this study, we introduce VisualRWKV, the first application of a linear RNN model to multimodal learning tasks, leveraging the pre-trained RWKV language model. We pro…

    Submitted 19 June, 2024; originally announced June 2024.

    Comments: 18 pages,14 tables,6 figures

  2. arXiv:2406.13150  [pdf]

    eess.IV cs.CV

    MCAD: Multi-modal Conditioned Adversarial Diffusion Model for High-Quality PET Image Reconstruction

    Authors: Jiaqi Cui, Xinyi Zeng, Pinxian Zeng, Bo Liu, Xi Wu, Jiliu Zhou, Yan Wang

    Abstract: Radiation hazards associated with standard-dose positron emission tomography (SPET) images remain a concern, whereas the quality of low-dose PET (LPET) images fails to meet clinical requirements. Therefore, there is great interest in reconstructing SPET images from LPET images. However, prior studies focus solely on image data, neglecting vital complementary information from other modalities, e.g.…

    Submitted 18 June, 2024; originally announced June 2024.

    Comments: Early accepted by MICCAI2024

  3. arXiv:2406.12131  [pdf, other]

    cs.CL

    Gram2Vec: An Interpretable Document Vectorizer

    Authors: Peter Zeng, Eric Sclafani, Owen Rambow

    Abstract: We present Gram2Vec, a grammatical style embedding algorithm that embeds documents into a higher dimensional space by extracting the normalized relative frequencies of grammatical features present in the text. Compared to neural approaches, Gram2Vec offers inherent interpretability based on how the feature vectors are generated. In our demo, we present a way to visualize a mapping of authors to do…

    Submitted 17 June, 2024; originally announced June 2024.

    Comments: 6 pages, 2 figures

  4. arXiv:2406.04307  [pdf, other]

    quant-ph cond-mat.str-el physics.comp-ph

    High-precision and low-depth eigenstate property estimation: theory and resource estimation

    Authors: Jinzhao Sun, Pei Zeng, Tom Gur, M. S. Kim

    Abstract: Estimating the eigenstate properties of quantum many-body systems is a long-standing, challenging problem for both classical and quantum computing. For the task of eigenstate preparation, quantum signal processing (QSP) has established near-optimal query complexity $O( Δ^{-1} \log(ε^{-1}) )$ by querying the block encoding of the Hamiltonian $H$ where $Δ$ is the energy gap and $ε$ is the target pre…

    Submitted 6 June, 2024; originally announced June 2024.

    Comments: 48 pages, 7 figures, and 4 tables

  5. arXiv:2405.12710  [pdf, other]

    cs.CV

    Text-Video Retrieval with Global-Local Semantic Consistent Learning

    Authors: Haonan Zhang, Pengpeng Zeng, Lianli Gao, Jingkuan Song, Yihang Duan, Xinyu Lyu, Hengtao Shen

    Abstract: Adapting large-scale image-text pre-training models, e.g., CLIP, to the video domain represents the current state-of-the-art for text-video retrieval. The primary approaches involve transferring text-video pairs to a common embedding space and leveraging cross-modal interactions on specific entities for semantic alignment. Though effective, these paradigms entail prohibitive computational costs, l…

    Submitted 21 May, 2024; originally announced May 2024.

    Comments: 9 pages

  6. arXiv:2405.11299  [pdf, other]

    cs.DB cs.LG

    The CAP Principle for LLM Serving: A Survey of Long-Context Large Language Model Serving

    Authors: Pai Zeng, Zhenyu Ning, Jieru Zhao, Weihao Cui, Mengwei Xu, Liwei Guo, Xusheng Chen, Yizhou Shan

    Abstract: We survey the large language model (LLM) serving area to understand the intricate dynamics between cost-efficiency and accuracy, which is magnified by the growing need for longer contextual understanding when deploying models at a massive scale. Our findings reveal that works in this space optimize along three distinct but conflicting goals: improving serving context length (C), improving serving…

    Submitted 26 May, 2024; v1 submitted 18 May, 2024; originally announced May 2024.

  7. arXiv:2403.07284  [pdf, other]

    cs.CV

    SparseLIF: High-Performance Sparse LiDAR-Camera Fusion for 3D Object Detection

    Authors: Hongcheng Zhang, Liu Liang, Pengxin Zeng, Xiao Song, Zhe Wang

    Abstract: Sparse 3D detectors have received significant attention since the query-based paradigm embraces low latency without explicit dense BEV feature construction. However, these detectors achieve worse performance than their dense counterparts. In this paper, we find the key to bridging the performance gap is to enhance the awareness of rich representations in two modalities. Here, we present a high-per…

    Submitted 11 March, 2024; originally announced March 2024.

  8. arXiv:2403.02451  [pdf, other]

    cs.CL

    Views Are My Own, but Also Yours: Benchmarking Theory of Mind Using Common Ground

    Authors: Adil Soubki, John Murzaku, Arash Yousefi Jordehi, Peter Zeng, Magdalena Markowska, Seyed Abolghasem Mirroshandel, Owen Rambow

    Abstract: Evaluating the theory of mind (ToM) capabilities of language models (LMs) has recently received a great deal of attention. However, many existing benchmarks rely on synthetic data, which risks misaligning the resulting experiments with human behavior. We introduce the first ToM dataset based on naturally occurring spoken dialogs, Common-ToM, and show that LMs struggle to demonstrate ToM. We then s…

    Submitted 5 June, 2024; v1 submitted 4 March, 2024; originally announced March 2024.

    Journal ref: ACL 2024 Findings

  9. Image2Points:A 3D Point-based Context Clusters GAN for High-Quality PET Image Reconstruction

    Authors: Jiaqi Cui, Yan Wang, Lu Wen, Pinxian Zeng, Xi Wu, Jiliu Zhou, Dinggang Shen

    Abstract: To obtain high-quality Positron emission tomography (PET) images while minimizing radiation exposure, numerous methods have been proposed to reconstruct standard-dose PET (SPET) images from the corresponding low-dose PET (LPET) images. However, these methods heavily rely on voxel-based representations, which fall short of adequately accounting for the precise structure and fine-grained context, le…

    Submitted 1 February, 2024; originally announced February 2024.

    Comments: Accepted by ICASSP 2024

  10. arXiv:2312.12478  [pdf, other]

    cs.CV

    ProS: Prompting-to-simulate Generalized knowledge for Universal Cross-Domain Retrieval

    Authors: Kaipeng Fang, Jingkuan Song, Lianli Gao, Pengpeng Zeng, Zhi-Qi Cheng, Xiyao Li, Heng Tao Shen

    Abstract: The goal of Universal Cross-Domain Retrieval (UCDR) is to achieve robust performance in generalized test scenarios, wherein data may belong to strictly unknown domains and categories during training. Recently, pre-trained models with prompt tuning have shown strong generalization capabilities and attained noteworthy achievements in various downstream tasks, such as few-shot learning and video-text…

    Submitted 29 February, 2024; v1 submitted 19 December, 2023; originally announced December 2023.

  11. arXiv:2310.20578  [pdf, other]

    quant-ph

    Fault-Tolerant Operation of Bosonic Qubits with Discrete-Variable Ancillae

    Authors: Qian Xu, Pei Zeng, Daohong Xu, Liang Jiang

    Abstract: Fault-tolerant quantum computation with bosonic qubits often necessitates the use of noisy discrete-variable ancillae. In this work, we establish a comprehensive and practical fault-tolerance framework for such a hybrid system and synthesize it with fault-tolerant protocols by combining bosonic quantum error correction (QEC) and advanced quantum control techniques. We introduce essential building…

    Submitted 31 October, 2023; originally announced October 2023.

    Comments: 23 pages, 10 figures. Comments are welcome

  12. arXiv:2310.16428  [pdf, ps, other]

    stat.AP

    Similarity-driven and Task-driven Models for Diversity of Opinion in Crowdsourcing Markets

    Authors: Chen Jason Zhang, Yunrui Liu, Pengcheng Zeng, Ting Wu, Lei Chen, Pan Hui, Fei Hao

    Abstract: The recent boom in crowdsourcing has opened up a new avenue for utilizing human intelligence in the realm of data analysis. This innovative approach provides a powerful means for connecting online workers to tasks that cannot effectively be done solely by machines or conducted by professional experts due to cost constraints. Within the field of social science, four elements are required to constru…

    Submitted 28 February, 2024; v1 submitted 25 October, 2023; originally announced October 2023.

    Comments: 37 pages, 11 figures

  13. arXiv:2309.03789  [pdf, other]

    quant-ph

    Pilot-reference-free continuous-variable quantum key distribution with efficient decoy-state analysis

    Authors: Anran Jin, Xingjian Zhang, Liang Jiang, Richard V. Penty, Pei Zeng

    Abstract: Continuous-variable quantum key distribution (CV QKD) using optical coherent detectors is practically favorable due to its low implementation cost, flexibility of wavelength division multiplexing, and compatibility with standard coherent communication technologies. However, the security analysis and parameter estimation of CV QKD are complicated due to the infinite-dimensional latent Hilbert space…

    Submitted 7 September, 2023; originally announced September 2023.

    Comments: 27 pages, 6 figures, 6 tables. Comments are welcomed

  14. arXiv:2308.05365  [pdf]

    eess.IV cs.CV

    TriDo-Former: A Triple-Domain Transformer for Direct PET Reconstruction from Low-Dose Sinograms

    Authors: Jiaqi Cui, Pinxian Zeng, Xinyi Zeng, Peng Wang, Xi Wu, Jiliu Zhou, Yan Wang, Dinggang Shen

    Abstract: To obtain high-quality positron emission tomography (PET) images while minimizing radiation exposure, various methods have been proposed for reconstructing standard-dose PET (SPET) images from low-dose PET (LPET) sinograms directly. However, current methods often neglect boundaries during sinogram-to-image reconstruction, resulting in high-frequency distortion in the frequency domain and diminishe…

    Submitted 10 August, 2023; originally announced August 2023.

  15. arXiv:2308.04802  [pdf, other]

    cs.CV

    Generalized Unbiased Scene Graph Generation

    Authors: Xinyu Lyu, Lianli Gao, Junlin Xie, Pengpeng Zeng, Yulu Tian, Jie Shao, Heng Tao Shen

    Abstract: Existing Unbiased Scene Graph Generation (USGG) methods only focus on addressing the predicate-level imbalance that high-frequency classes dominate predictions of rare ones, while overlooking the concept-level imbalance. Actually, even if predicates themselves are balanced, there is still a significant concept-imbalance within them due to the long-tailed distribution of contexts (i.e., subject-obj…

    Submitted 9 August, 2023; originally announced August 2023.

  16. arXiv:2306.17496  [pdf, other]

    cs.IT

    Performance Analysis for Polar Codes under Successive Cancellation List Decoding with Fixed List Size

    Authors: Jinnan Piao, Dong Li, Xueting Yu, Zhibo Li, Ming Yang, Jindi Liu, Peng Zeng

    Abstract: In this paper, we first indicate that the block error event of polar codes under successive cancellation list (SCL) decoding is composed of path loss (PL) error event and path selection (PS) error event, where the PL error event is that correct codeword is lost during the SCL decoding and the PS error event is that correct codeword is reserved in the decoded list but not selected as the decoded co…

    Submitted 6 July, 2023; v1 submitted 30 June, 2023; originally announced June 2023.

  17. Semantic Invariant Multi-view Clustering with Fully Incomplete Information

    Authors: Pengxin Zeng, Mouxing Yang, Yiding Lu, Changqing Zhang, Peng Hu, Xi Peng

    Abstract: Robust multi-view learning with incomplete information has received significant attention due to issues such as incomplete correspondences and incomplete instances that commonly affect real-world multi-view applications. Existing approaches heavily rely on paired samples to realign or impute defective ones, but such preconditions cannot always be satisfied in practice due to the complexity of data…

    Submitted 21 December, 2023; v1 submitted 22 May, 2023; originally announced May 2023.

  18. arXiv:2305.07481  [pdf, other]

    stat.CO

    Extended ADMM for general penalized quantile regression with linear constraints in big data

    Authors: Yongxin Liu, Peng Zeng

    Abstract: Quantile regression (QR) can be used to describe the comprehensive relationship between a response and predictors. Prior domain knowledge and assumptions in application are usually formulated as constraints of parameters to improve the estimation efficiency. This paper develops methods based on multi-block ADMM to fit general penalized QR with linear constraints of regression coefficients. Differe…

    Submitted 12 May, 2023; originally announced May 2023.

  19. arXiv:2304.08915  [pdf, other]

    cs.NE cs.LG

    Differentiable Genetic Programming for High-dimensional Symbolic Regression

    Authors: Peng Zeng, Xiaotian Song, Andrew Lensen, Yuwei Ou, Yanan Sun, Mengjie Zhang, Jiancheng Lv

    Abstract: Symbolic regression (SR) is the process of discovering hidden relationships from data with mathematical expressions, which is considered an effective way to reach interpretable machine learning (ML). Genetic programming (GP) has been the dominator in solving SR problems. However, as the scale of SR problems increases, GP often poorly demonstrates and cannot effectively address the real-world high-…

    Submitted 18 April, 2023; originally announced April 2023.

  20. arXiv:2304.08339  [pdf, other]

    cond-mat.mtrl-sci

    Development of Nb-GaAs based superconductor semiconductor hybrid platform by combining in-situ dc magnetron sputtering and molecular beam epitaxy

    Authors: Clemens Todt, Sjoerd Telkamp, Filip Krizek, Christian Reichl, Mihai Gabureac, Rüdiger Schott, Erik Cheah, Peng Zeng, Thomas Weber, Arnold Müller, Christof Vockenhuber, Mohsen Bahrami Panah, Werner Wegscheider

    Abstract: We present Nb thin films deposited in-situ on GaAs by combining molecular beam epitaxy and magnetron sputtering within an ultra-high vacuum cluster. Nb films deposited at varying power, and a reference film from a commercial system, are compared. The results show clear variation between the in-situ and ex-situ deposition which we relate to differences in magnetron sputtering conditions and chamber…

    Submitted 18 April, 2023; v1 submitted 17 April, 2023; originally announced April 2023.

    Comments: 12 pages paper, 9 pages supplementary, 6 figures paper, 7 figures supplementary

  21. arXiv:2303.13019  [pdf, other]

    cs.IT

    Construction Methods Based Minimum Weight Distribution for Polar Codes with Successive Cancellation List Decoding

    Authors: Jinnan Piao, Dong Li, Jindi Liu, Xueting Yu, Zhibo Li, Ming Yang, Peng Zeng

    Abstract: In this paper, we focus on the construction methods based MWD for polar codes to improve the performance with successive cancellation list (SCL) decoding. We first propose an ordered and nested reliability sequence, namely MWD sequence, to improve the ML performance of polar codes and apply fast construction without the original channel information. In the MWD sequence, the synthetic channels are…

    Submitted 22 March, 2023; originally announced March 2023.

  22. arXiv:2303.04296  [pdf, ps, other]

    math.OC

    Event-Triggered Active Disturbance Rejection Control for Uncertain Random Nonlinear Systems

    Authors: Ze-Hao Wu, Feiqi Deng, Pengyu Zeng, Hua-Cheng Zhou, Hongyi Li

    Abstract: In this paper, event-triggered active disturbance rejection control (ADRC) is first addressed for a class of uncertain random nonlinear systems driven by bounded noise and colored noise. The event-triggered extended state observer (ESO) and ADRC controller are designed, where two respective event-triggering mechanisms with a fixed positive lower bound for the inter-execution times are proposed. Th…

    Submitted 8 May, 2024; v1 submitted 7 March, 2023; originally announced March 2023.

    Comments: arXiv admin note: text overlap with arXiv:2302.02395

  23. arXiv:2301.06795  [pdf, other]

    cond-mat.mtrl-sci cond-mat.mes-hall

    Control over epitaxy and the role of the InAs/Al interface in hybrid two-dimensional electron gas systems

    Authors: E. Cheah, D. Z. Haxell, R. Schott, P. Zeng, E. Paysen, S. C. ten Kate, M. Coraiola, M. Landstetter, A. B. Zadeh, A. Trampert, M. Sousa, H. Riel, F. Nichele, W. Wegscheider, F. Krizek

    Abstract: In-situ synthesised semiconductor/superconductor hybrid structures became an important material platform in condensed matter physics. Their development enabled a plethora of novel quantum transport experiments with focus on Andreev and Majorana physics. The combination of InAs and Al has become the workhorse material and has been successfully implemented in the form of one-dimensional structures a…

    Submitted 17 January, 2023; originally announced January 2023.

    Comments: 12 pages, 7 figures and supplementary material

    Journal ref: Physical Review Materials 7, 073403 (2023)

  24. arXiv:2212.04566  [pdf, other]

    quant-ph

    Simple and high-precision Hamiltonian simulation by compensating Trotter error with linear combination of unitary operations

    Authors: Pei Zeng, Jinzhao Sun, Liang Jiang, Qi Zhao

    Abstract: Trotter and linear-combination-of-unitary (LCU) are two popular Hamiltonian simulation methods. We propose Hamiltonian simulation algorithms using LCU to compensate Trotter error, which enjoy both of their advantages. By adding few gates after the Kth-order Trotter, we realize a better time scaling than 2Kth-order Trotter. Our first algorithm exponentially improves the accuracy scaling of the Kth-…

    Submitted 8 December, 2022; originally announced December 2022.

    Comments: 74 pages, 15 figures. Comments are welcome

  25. arXiv:2212.01209  [pdf, other]

    cs.AI eess.SP

    FECAM: Frequency Enhanced Channel Attention Mechanism for Time Series Forecasting

    Authors: Maowei Jiang, Pengyu Zeng, Kai Wang, Huan Liu, Wenbo Chen, Haoran Liu

    Abstract: Time series forecasting is a long-standing challenge due to the real-world information is in various scenario (e.g., energy, weather, traffic, economics, earthquake warning). However some mainstream forecasting model forecasting result is derailed dramatically from ground truth. We believe it's the reason that model's lacking ability of capturing frequency information which richly contains in real…

    Submitted 2 December, 2022; originally announced December 2022.

    Comments: 11pages.10 figures,conference. arXiv admin note: text overlap with arXiv:2205.14415 by other authors

  26. arXiv:2211.14017  [pdf, other]

    cs.CV eess.IV

    Learnable Blur Kernel for Single-Image Defocus Deblurring in the Wild

    Authors: Jucai Zhai, Pengcheng Zeng, Chihao Ma, Yong Zhao, Jie Chen

    Abstract: Recent research showed that the dual-pixel sensor has made great progress in defocus map estimation and image defocus deblurring. However, extracting real-time dual-pixel views is troublesome and complex in algorithm deployment. Moreover, the deblurred image generated by the defocus deblurring network lacks high-frequency details, which is unsatisfactory in human perception. To overcome this issue…

    Submitted 25 November, 2022; originally announced November 2022.

    Comments: 9 pages, 7 figures

  27. arXiv:2211.10541  [pdf, ps, other]

    math.ST stat.CO

    Phase transition and higher order analysis of $L_q$ regularization under dependence

    Authors: Hanwen Huang, Peng Zeng, Qinglong Yang

    Abstract: We study the problem of estimating a $k$-sparse signal ${\mbox{$β$}}_0\in{\bf R}^p$ from a set of noisy observations ${\bf y}\in{\bf R}^n$ under the model ${\bf y}={\bf X}{\mbox{$β$}}+{\bf w}$, where ${\bf X}\in{\bf R}^{n\times p}$ is the measurement matrix the row of which is drawn from distribution $N(0,{\mbox{$Σ$}})$. We consider the class of $L_q$-regularized least squares (LQLS) given by the…

    Submitted 1 December, 2022; v1 submitted 18 November, 2022; originally announced November 2022.

    Comments: 35 pages, 11 figures

  28. arXiv:2211.09469  [pdf, other]

    cs.CV

    Visual Commonsense-aware Representation Network for Video Captioning

    Authors: Pengpeng Zeng, Haonan Zhang, Lianli Gao, Xiangpeng Li, Jin Qian, Heng Tao Shen

    Abstract: Generating consecutive descriptions for videos, i.e., Video Captioning, requires taking full advantage of visual representation along with the generation process. Existing video captioning methods focus on making an exploration of spatial-temporal representations and their relationships to produce inferences. However, such methods only exploit the superficial association contained in the video its…

    Submitted 17 November, 2022; originally announced November 2022.

  29. arXiv:2211.09460  [pdf, other]

    cs.CV

    Progressive Tree-Structured Prototype Network for End-to-End Image Captioning

    Authors: Pengpeng Zeng, Jinkuan Zhu, Jingkuan Song, Lianli Gao

    Abstract: Studies of image captioning are shifting towards a trend of a fully end-to-end paradigm by leveraging powerful visual pre-trained models and transformer-based generation architecture for more flexible model training and faster inference speed. State-of-the-art approaches simply extract isolated concepts or attributes to assist description generation. However, such approaches do not consider the hi…

    Submitted 17 November, 2022; originally announced November 2022.

  30. arXiv:2209.12396  [pdf, other]

    cs.LG cs.CY

    Deep Fair Clustering via Maximizing and Minimizing Mutual Information: Theory, Algorithm and Metric

    Authors: Pengxin Zeng, Yunfan Li, Peng Hu, Dezhong Peng, Jiancheng Lv, Xi Peng

    Abstract: Fair clustering aims to divide data into distinct clusters while preventing sensitive attributes (\textit{e.g.}, gender, race, RNA sequencing technique) from dominating the clustering. Although a number of works have been conducted and achieved huge success recently, most of them are heuristical, and there lacks a unified theory for algorithm design. In this work, we fill this blank by developing…

    Submitted 20 April, 2023; v1 submitted 25 September, 2022; originally announced September 2022.

  31. Experimental mode-pairing measurement-device-independent quantum key distribution without global phase-locking

    Authors: Hao-Tao Zhu, Yizhi Huang, Hui Liu, Pei Zeng, Mi Zou, Yunqi Dai, Shibiao Tang, Hao Li, Lixing You, Zhen Wang, Yu-Ao Chen, Xiongfeng Ma, Teng-Yun Chen, Jian-Wei Pan

    Abstract: In the past two decades, quantum key distribution networks based on telecom fibers have been implemented on metropolitan and intercity scales. One of the bottlenecks lies in the exponential decay of the key rate with respect to the transmission distance. Recently proposed schemes mainly focus on achieving longer distances by creating a long-arm single-photon interferometer over two communication p…

    Submitted 9 February, 2023; v1 submitted 11 August, 2022; originally announced August 2022.

    Comments: 19 pages, 9 figures, 7 tables

    Journal ref: Phys. Rev. Lett. 130, 030801 (2023)

  32. arXiv:2207.07913  [pdf, other]

    cs.CV

    Dual-branch Hybrid Learning Network for Unbiased Scene Graph Generation

    Authors: Chaofan Zheng, Lianli Gao, Xinyu Lyu, Pengpeng Zeng, Abdulmotaleb El Saddik, Heng Tao Shen

    Abstract: The current studies of Scene Graph Generation (SGG) focus on solving the long-tailed problem for generating unbiased scene graphs. However, most de-biasing methods overemphasize the tail predicates and underestimate head ones throughout training, thereby wrecking the representation ability of head predicate features. Furthermore, these impaired features from head predicates harm the learning of ta…

    Submitted 16 July, 2022; originally announced July 2022.

  33. arXiv:2207.04602  [pdf, other]

    cs.CV cs.AI

    Adaptive Fine-Grained Predicates Learning for Scene Graph Generation

    Authors: Xinyu Lyu, Lianli Gao, Pengpeng Zeng, Heng Tao Shen, Jingkuan Song

    Abstract: The performance of current Scene Graph Generation (SGG) models is severely hampered by hard-to-distinguish predicates, e.g., woman-on/standing on/walking on-beach. As general SGG models tend to predict head predicates and re-balancing strategies prefer tail categories, none of them can appropriately handle hard-to-distinguish predicates. To tackle this issue, inspired by fine-grained image classif…

    Submitted 10 July, 2022; originally announced July 2022.

    Comments: arXiv admin note: text overlap with arXiv:2204.02597

  34. arXiv:2206.11653  [pdf, other]

    cs.CV

    Learning To Generate Scene Graph from Head to Tail

    Authors: Chaofan Zheng, Xinyu Lyu, Yuyu Guo, Pengpeng Zeng, Jingkuan Song, Lianli Gao

    Abstract: Scene Graph Generation (SGG) represents objects and their interactions with a graph structure. Recently, many works are devoted to solving the imbalanced problem in SGG. However, underestimating the head predicates in the whole training process, they wreck the features of head predicates that provide general features for tail ones. Besides, assigning excessive attention to the tail predicates lead…

    Submitted 23 June, 2022; originally announced June 2022.

  35. arXiv:2206.09302  [pdf, other]

    cs.IT eess.SP

    Delay-aware Multiple Access Design for Intelligent Reflecting Surface Aided Uplink Transmission

    Authors: Piao Zeng, Guangji Chen, Qingqing Wu, Deli Qiao, Abbas Jamalipour

    Abstract: In this paper, we develop a hybrid multiple access (MA) protocol for an intelligent reflecting surface (IRS) aided uplink transmission network by incorporating the IRS-aided time-division MA (I-TDMA) protocol and the IRS-aided non-orthogonal MA (I-NOMA) protocol as special cases. Two typical communication scenarios, namely the transmit power limited case and the transmit energy limited case are co…

    Submitted 26 June, 2023; v1 submitted 18 June, 2022; originally announced June 2022.

    Comments: Submitted to TWC

  36. arXiv:2206.01923  [pdf, other]

    cs.CV

    From Pixels to Objects: Cubic Visual Attention for Visual Question Answering

    Authors: Jingkuan Song, Pengpeng Zeng, Lianli Gao, Heng Tao Shen

    Abstract: Recently, attention-based Visual Question Answering (VQA) has achieved great success by utilizing question to selectively target different visual areas that are related to the answer. Existing visual attention models are generally planar, i.e., different channels of the last conv-layer feature map of an image share the same weight. This conflicts with the attention mechanism because CNN features a…

    Submitted 4 June, 2022; originally announced June 2022.

  37. arXiv:2206.01017  [pdf, other]

    cs.CV

    Structured Two-stream Attention Network for Video Question Answering

    Authors: Lianli Gao, Pengpeng Zeng, Jingkuan Song, Yuan-Fang Li, Wu Liu, Tao Mei, Heng Tao Shen

    Abstract: To date, visual question answering (VQA) (i.e., image QA and video QA) is still a holy grail in vision and language understanding, especially for video QA. Compared with image QA that focuses primarily on understanding the associations between image region-level details and corresponding questions, video QA requires a model to jointly reason across both spatial and long-range temporal structures o…

    Submitted 2 June, 2022; originally announced June 2022.

  38. arXiv:2205.09523  [pdf, other]

    stat.ML cs.LG

    scICML: Information-theoretic Co-clustering-based Multi-view Learning for the Integrative Analysis of Single-cell Multi-omics data

    Authors: Pengcheng Zeng, Zhixiang Lin

    Abstract: Modern high-throughput sequencing technologies have enabled us to profile multiple molecular modalities from the same single cell, providing unprecedented opportunities to assay celluar heterogeneity from multiple biological layers. However, the datasets generated from these technologies tend to have high level of noise and are highly sparse, bringing challenges to data analysis. In this paper, we…

    Submitted 19 May, 2022; originally announced May 2022.

    Comments: 11 pages; 1 figure

  39. arXiv:2205.09307  [pdf, other]

    cs.CV

    Support-set based Multi-modal Representation Enhancement for Video Captioning

    Authors: Xiaoya Chen, Jingkuan Song, Pengpeng Zeng, Lianli Gao, Heng Tao Shen

    Abstract: Video captioning is a challenging task that necessitates a thorough comprehension of visual scenes. Existing methods follow a typical one-to-one mapping, which concentrates on a limited sample space while ignoring the intrinsic semantic associations between samples, resulting in rigid and uninformative expressions. To address this issue, we propose a novel and flexible framework, namely Support-se…

    Submitted 18 May, 2022; originally announced May 2022.

  40. Scalable fast benchmarking for individual quantum gates with local twirling

    Authors: Yihong Zhang, Wenjun Yu, Pei Zeng, Guoding Liu, Xiongfeng Ma

    Abstract: With the development of controllable quantum systems, fast and practical characterization for multi-qubit gates is essential for building high-fidelity quantum computing devices. The usual way to fulfill this requirement via randomized benchmarking asks for the complicated implementation of numerous multi-qubit twirling gates. How to efficiently and reliably estimate the fidelity of a quantum proc…

    Submitted 9 February, 2023; v1 submitted 19 March, 2022; originally announced March 2022.

    Comments: 30 pages, 7 figures

    Journal ref: Photonics Research Vol. 11, Issue 1, pp. 81-99 (2023)

  41. arXiv:2202.05992  [pdf]

    physics.optics

    Soliton Microcombs in Integrated Chalcogenide Microresonators

    Authors: Di Xia, Zelin Yang, Pingyang Zeng, Bin Zhang, Jiayue Wu, Zifu Wang, Jiaxin Zhao, Mingqi Gao, Yufei Huang, Jianteng Huang, Liyang Luo, Dong Liu, Shuixian Yang, Hairun Guo, Zhaohui Li

    Abstract: Photonic integrated microcombs have enabled advanced applications in optical communication, microwave synthesis, and optical metrology, which in nature unveil an optical dissipative soliton pattern under cavity-enhanced nonlinear processes. The most decisive factor of microcombs lies in the photonic material platforms, where materials with high nonlinearity and in capacity of high-quality chip int…

    Submitted 12 February, 2022; originally announced February 2022.

    Comments: 22 pages, 5 figures

    Journal ref: Laser & Photonics Reviews 16 202200219 (2022)

  42. arXiv:2201.11924  [pdf, other]

    cs.RO

    Close the Optical Sensing Domain Gap by Physics-Grounded Active Stereo Sensor Simulation

    Authors: Xiaoshuai Zhang, Rui Chen, Ang Li, Fanbo Xiang, Yuzhe Qin, Jiayuan Gu, Zhan Ling, Minghua Liu, Peiyu Zeng, Songfang Han, Zhiao Huang, Tongzhou Mu, Jing Xu, Hao Su

    Abstract: In this paper, we focus on the simulation of active stereovision depth sensors, which are popular in both academic and industry communities. Inspired by the underlying mechanism of the sensors, we designed a fully physics-grounded simulation pipeline that includes material acquisition, ray-tracing-based infrared (IR) image rendering, IR noise simulation, and depth estimation. The pipeline is able…

    Submitted 5 January, 2023; v1 submitted 27 January, 2022; originally announced January 2022.

    Comments: The paper will appear in the IEEE Transactions on Robotics. 20 pages, 14 figures, 10 tables

  43. Quantum key distribution surpassing the repeaterless rate-transmittance bound without global phase locking

    Authors: Pei Zeng, Hongyi Zhou, Weijie Wu, Xiongfeng Ma

    Abstract: Quantum key distribution -- the establishment of information-theoretically secure keys based on quantum physics -- is mainly limited by its practical performance, which is characterised by the dependence of the key rate on the channel transmittance $R(η)$. Recently, schemes based on single-photon interference have been proposed to improve the key rate to $R=O(\sqrt{η})$ by overcoming the point-to-po…

    Submitted 30 January, 2022; v1 submitted 12 January, 2022; originally announced January 2022.

    Comments: 56 pages, 24 figures. Comments are welcome

    Journal ref: Nat Commun 13, 3903 (2022)

  44. arXiv:2111.13855  [pdf, other]

    quant-ph

    Quantum Complementarity Approach to Device-Independent Security

    Authors: Xingjian Zhang, Pei Zeng, Tian Ye, Hoi-Kwong Lo, Xiongfeng Ma

    Abstract: Complementarity is an essential feature of quantum mechanics. The preparation of an eigenstate of one observable implies complete randomness in its complementary observable. In quantum cryptography, complementarity allows us to formulate security analyses in terms of phase-error correction. However, in the device-independent regime that offers security without device characterization, the concept…

    Submitted 11 October, 2022; v1 submitted 27 November, 2021; originally announced November 2021.

    Comments: 57 pages, 21 figures, 4 tables; In this version, we have (1) added security statements for general device-independent tasks; (2) updated the finite-size analysis with Kato's inequality; (3) presented more numerical simulation results, including a detailed presentation of analysing the reported data from the recent ion-trap DIQKD experiment; (4) fixed a few typos

  45. arXiv:2111.11600  [pdf, other]

    cs.IT eess.SP

    Throughput Maximization for Active Intelligent Reflecting Surface Aided Wireless Powered Communications

    Authors: Piao Zeng, Deli Qiao, Qingqing Wu, Yuan Wu

    Abstract: This paper considers an active intelligent reflecting surface (IRS)-aided wireless powered communication network (WPCN), where devices first harvest energy and then transmit information to a hybrid access point (HAP). Different from the existing works on passive IRS-aided WPCNs, this is the first work that introduces the active IRS in WPCNs. To guarantee fairness, the problem is formulated as an a…

    Submitted 11 January, 2022; v1 submitted 22 November, 2021; originally announced November 2021.

    Comments: Submitted to Wireless Communications Letters

  46. Bootstrapping Calabi-Yau Quantum Mechanics

    Authors: Bao-ning Du, Min-xin Huang, Pei-xuan Zeng

    Abstract: Recently, a novel bootstrap method for numerical calculations in matrix models and quantum mechanical systems is proposed. We apply the method to certain quantum mechanical systems derived from some well-known local toric Calabi-Yau geometries, where the exact quantization conditions have been conjecturally related to topological string theory. We find that the bootstrap method provides a promisin…

    Submitted 16 November, 2021; originally announced November 2021.

    Comments: 21 pages, 21 figures

    Report number: USTC-ICTS/PCFT-21-43

  47. arXiv:2109.15304  [pdf, other]

    quant-ph cond-mat.stat-mech cond-mat.str-el

    Universal quantum algorithmic cooling on a quantum computer

    Authors: Pei Zeng, Jinzhao Sun, Xiao Yuan

    Abstract: Quantum cooling, a deterministic process that drives any state to the lowest eigenstate, has been widely used from studying ground state properties of chemistry and condensed matter quantum physics, to general optimization problems. However, the cooling procedure is generally non-unitary, hence its realization on a quantum computer either requires deep circuits or assumes specific input states wit…

    Submitted 2 June, 2022; v1 submitted 30 September, 2021; originally announced September 2021.

    Comments: 35 pages, 7 figures. Comments are welcome

  48. A Model-free Variable Screening Method Based on Leverage Score

    Authors: Wenxuan Zhong, Yiwen Liu, Peng Zeng

    Abstract: With rapid advances in information technology, massive datasets are collected in all fields of science, such as biology, chemistry, and social science. Useful or meaningful information is extracted from these data often through statistical learning or model fitting. In massive datasets, both sample size and number of predictors can be large, in which case conventional methods face computational ch…

    Submitted 20 September, 2021; originally announced September 2021.

    Comments: Journal of the American Statistical Association, published online: 21 Jun 2021

  49. Reference-frame-independent design of phase-matching quantum key distribution

    Authors: Anran Jin, Pei Zeng, Richard V. Penty, Xiongfeng Ma

    Abstract: The recently proposed phase-matching quantum key distribution offers means to overcome the linear key rate-transmittance bound. Since the key information is encoded onto the phases of coherent states, the misalignment between the two remote reference frames would yield errors and significantly degrade the key generation rate from the ideal case. In this work, we propose a reference-frame-independe…

    Submitted 15 September, 2021; originally announced September 2021.

    Comments: 20 pages, 8 figures

    Journal ref: Phys. Rev. Applied 16, 034017 (2021)

  50. arXiv:2109.03233  [pdf, other]

    eess.IV

    Contrastive Learning with Temporal Correlated Medical Images: A Case Study using Lung Segmentation in Chest X-Rays

    Authors: Dewen Zeng, John N. Kheir, Peng Zeng, Yiyu Shi

    Abstract: Contrastive learning has been proved to be a promising technique for image-level representation learning from unlabeled data. Many existing works have demonstrated improved results by applying contrastive learning in classification and object detection tasks for either natural images or medical images. However, its application to medical image segmentation tasks has been limited. In this work, we…

    Submitted 16 September, 2021; v1 submitted 6 September, 2021; originally announced September 2021.

    Comments: 7 pages, submitted to ICCAD'21 special session