Skip to main content

Showing 51–100 of 281 results for author: Qu, L

.
  1. arXiv:2401.05676  [pdf, other

    cs.CV

    Exploring Self- and Cross-Triplet Correlations for Human-Object Interaction Detection

    Authors: Weibo Jiang, Weihong Ren, Jiandong Tian, Liangqiong Qu, Zhiyong Wang, Honghai Liu

    Abstract: Human-Object Interaction (HOI) detection plays a vital role in scene understanding, which aims to predict the HOI triplet in the form of <human, object, action>. Existing methods mainly extract multi-modal features (e.g., appearance, object semantics, human pose) and then fuse them together to directly predict HOI triplets. However, most of these methods focus on seeking for self-triplet aggregati… ▽ More

    Submitted 11 January, 2024; originally announced January 2024.

  2. arXiv:2401.05153  [pdf, other

    cs.CV eess.IV

    CrossDiff: Exploring Self-Supervised Representation of Pansharpening via Cross-Predictive Diffusion Model

    Authors: Yinghui Xing, Litao Qu, Shizhou Zhang, Kai Zhang, Yanning Zhang

    Abstract: Fusion of a panchromatic (PAN) image and corresponding multispectral (MS) image is also known as pansharpening, which aims to combine abundant spatial details of PAN and spectral information of MS. Due to the absence of high-resolution MS images, available deep-learning-based methods usually follow the paradigm of training at reduced resolution and testing at both reduced and full resolution. When… ▽ More

    Submitted 13 January, 2024; v1 submitted 10 January, 2024; originally announced January 2024.

  3. arXiv:2312.10864  [pdf, ps, other

    cs.IR

    On-Device Recommender Systems: A Tutorial on The New-Generation Recommendation Paradigm

    Authors: Hongzhi Yin, Tong Chen, Liang Qu, Bin Cui

    Abstract: Given the sheer volume of contemporary e-commerce applications, recommender systems (RSs) have gained significant attention in both academia and industry. However, traditional cloud-based RSs face inevitable challenges, such as resource-intensive computation, reliance on network access, and privacy breaches. In response, a new paradigm called on-device recommender systems (ODRSs) has emerged recen… ▽ More

    Submitted 17 December, 2023; originally announced December 2023.

    Comments: Technical tutorial; to appear at The Web Conference 2024

  4. arXiv:2312.08844  [pdf, ps, other

    math.NT

    Oriented Supersingular Elliptic Curves and Eichler Orders

    Authors: Guanju Xiao, Zijian Zhou, Longjiang Qu

    Abstract: Let $p>3$ be a prime and $E$ be a supersingular elliptic curve defined over $\mathbb{F}_{p^2}$. Let $c$ be a prime with $c < 3p/16$ and $G$ be a subgroup of $E[c]$ of order $c$. The pair $(E,G)$ is called a supersingular elliptic curve with level-$c$ structure, and the endomorphism ring $\text{End}(E,G)$ is isomorphic to an Eichler order with level $c$. We construct two kinds of Eichler orders… ▽ More

    Submitted 14 December, 2023; originally announced December 2023.

    Comments: 26 pages. arXiv admin note: text overlap with arXiv:2203.02097

  5. arXiv:2312.05103  [pdf, other

    cs.CL cs.CY cs.LG

    TMID: A Comprehensive Real-world Dataset for Trademark Infringement Detection in E-Commerce

    Authors: Tongxin Hu, Zhuang Li, Xin **, Lizhen Qu, Xin Zhang

    Abstract: Annually, e-commerce platforms incur substantial financial losses due to trademark infringements, making it crucial to identify and mitigate potential legal risks tied to merchant information registered to the platforms. However, the absence of high-quality datasets hampers research in this area. To address this gap, our study introduces TMID, a novel dataset to detect trademark infringement in me… ▽ More

    Submitted 8 December, 2023; originally announced December 2023.

    Comments: EMNLP 2023, Industry Track

  6. arXiv:2311.14968  [pdf, other

    cs.IR

    Hide Your Model: A Parameter Transmission-free Federated Recommender System

    Authors: Wei Yuan, Chaoqun Yang, Liang Qu, Quoc Viet Hung Nguyen, Jianxin Li, Hongzhi Yin

    Abstract: With the growing concerns regarding user data privacy, Federated Recommender System (FedRec) has garnered significant attention recently due to its privacy-preserving capabilities. Existing FedRecs generally adhere to a learning protocol in which a central server shares a global recommendation model with clients, and participants achieve collaborative learning by frequently communicating the model… ▽ More

    Submitted 12 February, 2024; v1 submitted 25 November, 2023; originally announced November 2023.

    Comments: Accepted by ICDE2024

  7. A Phase-resolved View of the Low-frequency Quasiperiodic Oscillations from the Black Hole Binary MAXI J1820+070

    Authors: Qing C. Shui, S. Zhang, Shuang N. Zhang, Yu P. Chen, Ling D. Kong, Peng J. Wang, **g Q. Peng, L. Ji, A. Santangelo, Hong X. Yin, ** L. Qu, L. Tao, Ming Y. Ge, Y. Huang, L. Zhang, Hong H. Liu, P. Zhang, W. Yu, Z. Chang, J. Li, Wen T. Ye, Pan P. Li, Zhuo L. Yu, Z. Yan

    Abstract: Although low-frequency quasiperiodic oscillations (LFQPOs) are commonly detected in the X-ray light curves of accreting black hole X-ray binaries, their origin still remains elusive. In this study, we conduct phase-resolved spectroscopy in a broad energy band for LFQPOs in MAXI J1820+070 during its 2018 outburst, utilizing Insight-HXMT observations. By employing the Hilbert-Huang transform method,… ▽ More

    Submitted 8 November, 2023; v1 submitted 6 November, 2023; originally announced November 2023.

    Comments: Accepted for publication in The Astrophysical Journal

  8. arXiv:2310.16652  [pdf, other

    cs.LG

    How Robust is Federated Learning to Communication Error? A Comparison Study Between Uplink and Downlink Channels

    Authors: Lin** Qu, Shenghui Song, Chi-Ying Tsui, Yuyi Mao

    Abstract: Because of its privacy-preserving capability, federated learning (FL) has attracted significant attention from both academia and industry. However, when being implemented over wireless networks, it is not clear how much communication error can be tolerated by FL. This paper investigates the robustness of FL to the uplink and downlink communication error. Our theoretical analysis reveals that the r… ▽ More

    Submitted 12 January, 2024; v1 submitted 25 October, 2023; originally announced October 2023.

    Comments: Accepted by IEEE WCNC 2024

  9. Can ChatGPT Perform Reasoning Using the IRAC Method in Analyzing Legal Scenarios Like a Lawyer?

    Authors: Xiaoxi Kang, Lizhen Qu, Lay-Ki Soon, Adnan Trakic, Terry Yue Zhuo, Patrick Charles Emerton, Genevieve Grant

    Abstract: Large Language Models (LLMs), such as ChatGPT, have drawn a lot of attentions recently in the legal domain due to its emergent ability to tackle a variety of legal tasks. However, it is still unknown if LLMs are able to analyze a legal case and perform reasoning in the same manner as lawyers. Therefore, we constructed a novel corpus consisting of scenarios pertain to Contract Acts Malaysia and Aus… ▽ More

    Submitted 2 November, 2023; v1 submitted 23 October, 2023; originally announced October 2023.

    Comments: EMNLP 2023 Findings

    Report number: 2023.findings-emnlp.929

    Journal ref: 2023.findings-emnlp.929

  10. arXiv:2310.04412  [pdf, other

    cs.CV

    FedConv: Enhancing Convolutional Neural Networks for Handling Data Heterogeneity in Federated Learning

    Authors: Peiran Xu, Zeyu Wang, Jieru Mei, Liangqiong Qu, Alan Yuille, Cihang Xie, Yuyin Zhou

    Abstract: Federated learning (FL) is an emerging paradigm in machine learning, where a shared model is collaboratively learned using data from multiple devices to mitigate the risk of data leakage. While recent studies posit that Vision Transformer (ViT) outperforms Convolutional Neural Networks (CNNs) in addressing data heterogeneity in FL, the specific architectural components that underpin this advantage… ▽ More

    Submitted 6 October, 2023; originally announced October 2023.

    Comments: 9 pages, 6 figures. Equal contribution by P. Xu and Z. Wang

  11. arXiv:2309.14858  [pdf, ps, other

    astro-ph.HE hep-ph

    Timing properties of the X-ray accreting pulsar RX J0440.9+4431 studied with Insight-HXMT and NICER

    Authors: P. P. Li, L. Tao, Y. L. Tuo, M. Y. Ge, L. D. Kong, L. Zhang, Q. C. Bu, L. Ji, J. L. Qu, S. Zhang, S. N. Zhang, Y. Huang, X. Ma, W. T. Ye, Q. C. Zhao, R. C. Ma, S. J. Zhao, X. Hou, Z. X. Yang, P. J. Wang, S. M. Jia, Q. C. Shui, J. Guan

    Abstract: RX J0440.9+4431, a Be/X-ray binary, had its brightest outburst in 2022 since its discovery, with a peak X-ray flux of 2.25 Crab (as recorded by Swift/BAT, 15-50 keV). We analyze the timing properties of this giant outburst using data from Insight-HXMT and NICER, focusing on the evolution of the pulse profile and pulse fraction. We observe that when the luminosity reached around ~ 3*10^{37} er s^{-… ▽ More

    Submitted 27 September, 2023; v1 submitted 26 September, 2023; originally announced September 2023.

    Comments: 15 pages, 8 figures, accepted for publication in MNRAS

  12. arXiv:2309.05519  [pdf, other

    cs.AI cs.CL cs.LG

    NExT-GPT: Any-to-Any Multimodal LLM

    Authors: Shengqiong Wu, Hao Fei, Leigang Qu, Wei Ji, Tat-Seng Chua

    Abstract: While recently Multimodal Large Language Models (MM-LLMs) have made exciting strides, they mostly fall prey to the limitation of only input-side multimodal understanding, without the ability to produce content in multiple modalities. As we humans always perceive the world and communicate with people through various modalities, develo** any-to-any MM-LLMs capable of accepting and delivering conte… ▽ More

    Submitted 25 June, 2024; v1 submitted 11 September, 2023; originally announced September 2023.

    Comments: ICML 2024 (Oral)

  13. arXiv:2308.13712  [pdf, other

    cs.CV cs.LG

    Residual Denoising Diffusion Models

    Authors: Jiawei Liu, Qiang Wang, Huijie Fan, Yinong Wang, Yandong Tang, Liangqiong Qu

    Abstract: We propose residual denoising diffusion models (RDDM), a novel dual diffusion process that decouples the traditional single denoising diffusion process into residual diffusion and noise diffusion. This dual diffusion framework expands the denoising-based diffusion models, initially uninterpretable for image restoration, into a unified and interpretable model for both image generation and restorati… ▽ More

    Submitted 22 March, 2024; v1 submitted 25 August, 2023; originally announced August 2023.

    Comments: Accepted to CVPR2024

  14. arXiv:2308.05095  [pdf, other

    cs.CV cs.AI

    LayoutLLM-T2I: Eliciting Layout Guidance from LLM for Text-to-Image Generation

    Authors: Leigang Qu, Shengqiong Wu, Hao Fei, Liqiang Nie, Tat-Seng Chua

    Abstract: In the text-to-image generation field, recent remarkable progress in Stable Diffusion makes it possible to generate rich kinds of novel photorealistic images. However, current models still face misalignment issues (e.g., problematic spatial relation understanding and numeration failure) in complex natural scenes, which impedes the high-faithfulness text-to-image generation. Although recent efforts… ▽ More

    Submitted 12 August, 2023; v1 submitted 9 August, 2023; originally announced August 2023.

    Comments: Accepted by ACM MM 2023

  15. arXiv:2308.03610  [pdf, other

    cs.CV

    AvatarVerse: High-quality & Stable 3D Avatar Creation from Text and Pose

    Authors: Huichao Zhang, Bowen Chen, Hao Yang, Liao Qu, Xu Wang, Li Chen, Chao Long, Feida Zhu, Kang Du, Min Zheng

    Abstract: Creating expressive, diverse and high-quality 3D avatars from highly customized text descriptions and pose guidance is a challenging task, due to the intricacy of modeling and texturing in 3D that ensure details and various styles (realistic, fictional, etc). We present AvatarVerse, a stable pipeline for generating expressive high-quality 3D avatars from nothing but text descriptions and pose guid… ▽ More

    Submitted 7 August, 2023; originally announced August 2023.

  16. arXiv:2307.15901  [pdf, other

    physics.optics

    Bright Second Harmonic Emission from Photonic Crystal Vertical Cavity

    Authors: Lun Qu, Zhidong Gu, Chenyang Li, Yuan Qin, Yiting Zhang, Di Zhang, Jiaxian Zhao, Qiang Liu, Chunyan **, Lishuan Wang, Wei Wu, Wei Cai, Huasong Liu, Mengxin Ren, **gjun Xu

    Abstract: We present a study on photonic vertical cavities consisting of nonlinear materials embedded in photonic crystals (PhCs) for resonantly enhancing second harmonic generation (SHG). Previous attempts at SHG in such structures have been limited to efficiencies of 10$^{-7}$ to 10$^{-5}$, but we demonstrate here a high SHG efficiency of 0.28% by constructing a vertical cavity with a lithium niobate memb… ▽ More

    Submitted 29 July, 2023; originally announced July 2023.

  17. Detection of a strong ~2.5 Hz modulation in the Newly Discovered Millisecond Pulsar MAXI J1816-195

    Authors: P. P. Li, L. Tao, L. Zhang, Q. C. Bu, J. L. Qu, L. Ji, P. J. Wang, Y. P. Chen, S. Zhang, R. C. Ma, Z. X. Yang, W. T. Ye, S. J. Zhao, Q. C. Zhao, Y. Huang, X. Ma, E. L. Qiao, S. M. Jia, S. N. Zhang

    Abstract: MAXI J181-195 is a newly discovered accreting millisecond X-ray pulsar that went outburst in June 2022. Through timing analysis with NICER and NuSTAR observations, we find a transient modulation at ~2.5 Hz during the decay period of MAXI J1816-195. The modulation is strongly correlated with a spectral hardening, and its fractional rms amplitude increases with energy. These results suggest that the… ▽ More

    Submitted 26 July, 2023; originally announced July 2023.

    Comments: 12 pages, 13 figures

  18. arXiv:2307.13953  [pdf, other

    cs.CV cs.SD eess.AS

    The Hidden Dance of Phonemes and Visage: Unveiling the Enigmatic Link between Phonemes and Facial Features

    Authors: Liao Qu, Xianwei Zou, Xiang Li, Yandong Wen, Rita Singh, Bhiksha Raj

    Abstract: This work unveils the enigmatic link between phonemes and facial features. Traditional studies on voice-face correlations typically involve using a long period of voice input, including generating face images from voices and reconstructing 3D face meshes from voices. However, in situations like voice-based crimes, the available voice evidence may be short and limited. Additionally, from a physiolo… ▽ More

    Submitted 26 July, 2023; originally announced July 2023.

    Comments: Interspeech 2023

  19. arXiv:2307.12810  [pdf, other

    cs.IR

    HeteFedRec: Federated Recommender Systems with Model Heterogeneity

    Authors: Wei Yuan, Liang Qu, Lizhen Cui, Yongxin Tong, Xiaofang Zhou, Hongzhi Yin

    Abstract: Owing to the nature of privacy protection, federated recommender systems (FedRecs) have garnered increasing interest in the realm of on-device recommender systems. However, most existing FedRecs only allow participating clients to collaboratively train a recommendation model of the same public parameter size. Training a model of the same size for all clients can lead to suboptimal performance sinc… ▽ More

    Submitted 5 December, 2023; v1 submitted 24 July, 2023; originally announced July 2023.

  20. Intermittent QPO properties of MAXI J1820+070 revealed by Insight-HXMT

    Authors: P. Zhang, R. Soria, S. Zhang, L. Ji, L. D. Kong, Y. P. Chen, S. N. Zhang, Z. Chang, M. Y. Ge, J. Li, G. C. Liu, Q. Z. Liu, X. Ma, J. Q. Peng, J. L. Qu, Q. C. Shui, L. Tao, H. J. Tian, P. J. Wang, J. Z. Yan, X. Y. Zeng

    Abstract: We investigate the dynamical properties of low frequency quasi-periodic oscillations (QPOs) observed from the black hole X-ray binary MAXI J1820+070 during the early part of its 2018 outburst, when the system was in a bright hard state. To this aim, we use a series of observations from the Hard X-ray Modulation Telescope Insight-HXMT, and apply a wavelet decomposition (weighted wavelet Z-transform… ▽ More

    Submitted 15 July, 2023; originally announced July 2023.

    Comments: 8 pages, 4 figures

    Journal ref: A&A 677, A178 (2023)

  21. arXiv:2307.05254  [pdf, other

    cs.CV

    OpenAL: An Efficient Deep Active Learning Framework for Open-Set Pathology Image Classification

    Authors: Linhao Qu, Yingfan Ma, Zhiwei Yang, Manning Wang, Zhijian Song

    Abstract: Active learning (AL) is an effective approach to select the most informative samples to label so as to reduce the annotation cost. Existing AL methods typically work under the closed-set assumption, i.e., all classes existing in the unlabeled sample pool need to be classified by the target model. However, in some practical clinical tasks, the unlabeled pool may contain not only the target classes… ▽ More

    Submitted 11 July, 2023; originally announced July 2023.

    Comments: Accepted by MICCAI2023

  22. arXiv:2307.02249  [pdf, other

    cs.CV

    Rethinking Multiple Instance Learning for Whole Slide Image Classification: A Good Instance Classifier is All You Need

    Authors: Linhao Qu, Yingfan Ma, Xiaoyuan Luo, Manning Wang, Zhijian Song

    Abstract: Weakly supervised whole slide image classification is usually formulated as a multiple instance learning (MIL) problem, where each slide is treated as a bag, and the patches cut out of it are treated as instances. Existing methods either train an instance classifier through pseudo-labeling or aggregate instance features into a bag feature through attention mechanisms and then train a bag classifie… ▽ More

    Submitted 11 May, 2024; v1 submitted 5 July, 2023; originally announced July 2023.

    Comments: Accepted by IEEE TCSVT

  23. arXiv:2306.13860  [pdf

    cond-mat.mes-hall physics.app-ph

    Multiple magnetoplasmon polaritons of magneto-optical graphene in near-field radiative heat transfer

    Authors: Ming-Jian He, Lei Qu, Ya-Tao Ren, Hong Qi, Mauro Antezza, He-** Tan

    Abstract: Graphene, as a two-dimensional magneto-optical material, supports magnetoplasmon polaritons (MPP) when exposed to an applied magnetic field. Recently, MPP of a single-layer graphene has shown an excellent capability in the modulation of near-field radiative heat transfer (NFRHT). In this study, we present a comprehensive theoretical analysis of NFRHT between two multilayered graphene structures, w… ▽ More

    Submitted 24 June, 2023; originally announced June 2023.

    Journal ref: Mat. Today Phys. 37, 101207 (2023)

  24. arXiv:2306.10532  [pdf, other

    cs.IR

    Personalized Elastic Embedding Learning for On-Device Recommendation

    Authors: Ruiqi Zheng, Liang Qu, Tong Chen, Kai Zheng, Yuhui Shi, Hongzhi Yin

    Abstract: To address privacy concerns and reduce network latency, there has been a recent trend of compressing cumbersome recommendation models trained on the cloud and deploying compact recommender models to resource-limited devices for the real-time recommendation. Existing solutions generally overlook device heterogeneity and user heterogeneity. They require devices with the same budget to share the same… ▽ More

    Submitted 16 November, 2023; v1 submitted 18 June, 2023; originally announced June 2023.

  25. arXiv:2305.17891  [pdf, other

    cs.CV

    The Rise of AI Language Pathologists: Exploring Two-level Prompt Learning for Few-shot Weakly-supervised Whole Slide Image Classification

    Authors: Linhao Qu, Xiaoyuan Luo, Kexue Fu, Manning Wang, Zhijian Song

    Abstract: This paper introduces the novel concept of few-shot weakly supervised learning for pathology Whole Slide Image (WSI) classification, denoted as FSWC. A solution is proposed based on prompt learning and the utilization of a large language model, GPT-4. Since a WSI is too large and needs to be divided into patches for processing, WSI classification is commonly approached as a Multiple Instance Learn… ▽ More

    Submitted 28 January, 2024; v1 submitted 29 May, 2023; originally announced May 2023.

    Comments: Accepted by NeurIPS 2023

  26. arXiv:2305.17497  [pdf, other

    cs.CL

    FACTUAL: A Benchmark for Faithful and Consistent Textual Scene Graph Parsing

    Authors: Zhuang Li, Yuyang Chai, Terry Yue Zhuo, Lizhen Qu, Gholamreza Haffari, Fei Li, Donghong Ji, Quan Hung Tran

    Abstract: Textual scene graph parsing has become increasingly important in various vision-language applications, including image caption evaluation and image retrieval. However, existing scene graph parsers that convert image captions into scene graphs often suffer from two types of errors. First, the generated scene graphs fail to capture the true semantics of the captions or the corresponding images, resu… ▽ More

    Submitted 1 June, 2023; v1 submitted 27 May, 2023; originally announced May 2023.

    Comments: 9 pages, ACL 2023 (findings)

  27. arXiv:2305.12737  [pdf, other

    cs.CL cs.AI

    The Best of Both Worlds: Combining Human and Machine Translations for Multilingual Semantic Parsing with Active Learning

    Authors: Zhuang Li, Lizhen Qu, Philip R. Cohen, Raj V. Tumuluri, Gholamreza Haffari

    Abstract: Multilingual semantic parsing aims to leverage the knowledge from the high-resource languages to improve low-resource semantic parsing, yet commonly suffers from the data imbalance problem. Prior works propose to utilize the translations by either humans or machines to alleviate such issues. However, human translations are expensive, while machine translations are cheap but prone to error and bias… ▽ More

    Submitted 22 May, 2023; originally announced May 2023.

    Comments: ACL 2023

  28. arXiv:2305.04460  [pdf, other

    cs.CL cs.AI

    Language Independent Neuro-Symbolic Semantic Parsing for Form Understanding

    Authors: Bhanu Prakash Voutharoja, Lizhen Qu, Fatemeh Shiri

    Abstract: Recent works on form understanding mostly employ multimodal transformers or large-scale pre-trained language models. These models need ample data for pre-training. In contrast, humans can usually identify key-value pairings from a form only by looking at layouts, even if they don't comprehend the language used. No prior research has been conducted to investigate how helpful layout information alon… ▽ More

    Submitted 8 May, 2023; originally announced May 2023.

    Comments: Accepted to ICDAR 2023

  29. arXiv:2305.01323  [pdf, other

    cs.CL

    Turning Flowchart into Dialog: Augmenting Flowchart-grounded Troubleshooting Dialogs via Synthetic Data Generation

    Authors: Haolan Zhan, Sameen Maruf, Lizhen Qu, Yufei Wang, Ingrid Zukerman, Gholamreza Haffari

    Abstract: Flowchart-grounded troubleshooting dialogue (FTD) systems, which follow the instructions of a flowchart to diagnose users' problems in specific domains (e.g., vehicle, laptop), have been gaining research interest in recent years. However, collecting sufficient dialogues that are naturally grounded on flowcharts is costly, thus FTD systems are impeded by scarce training data. To mitigate the data s… ▽ More

    Submitted 29 October, 2023; v1 submitted 2 May, 2023; originally announced May 2023.

    Comments: Accepted by ALTA 2023

  30. Learnable Pillar-based Re-ranking for Image-Text Retrieval

    Authors: Leigang Qu, Meng Liu, Wenjie Wang, Zhedong Zheng, Liqiang Nie, Tat-Seng Chua

    Abstract: Image-text retrieval aims to bridge the modality gap and retrieve cross-modal content based on semantic similarities. Prior work usually focuses on the pairwise relations (i.e., whether a data sample matches another) but ignores the higher-order neighbor relations (i.e., a matching structure among multiple data samples). Re-ranking, a popular post-processing practice, has revealed the superiority… ▽ More

    Submitted 25 April, 2023; originally announced April 2023.

    Comments: Accepted by SIGIR'2023

    Journal ref: Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR 2023)

  31. arXiv:2304.12026  [pdf, other

    cs.CL

    SocialDial: A Benchmark for Socially-Aware Dialogue Systems

    Authors: Haolan Zhan, Zhuang Li, Yufei Wang, Linhao Luo, Tao Feng, Xiaoxi Kang, Yuncheng Hua, Lizhen Qu, Lay-Ki Soon, Suraj Sharma, Ingrid Zukerman, Zhaleh Semnani-Azad, Gholamreza Haffari

    Abstract: Dialogue systems have been widely applied in many scenarios and are now more powerful and ubiquitous than ever before. With large neural models and massive available data, current dialogue systems have access to more knowledge than any people in their life. However, current dialogue systems still do not perform at a human level. One major gap between conversational agents and humans lies in their… ▽ More

    Submitted 24 April, 2023; originally announced April 2023.

    Comments: Accepted by SIGIR 2023

  32. arXiv:2304.04238  [pdf, other

    eess.IV cs.CV

    Towards Arbitrary-scale Histopathology Image Super-resolution: An Efficient Dual-branch Framework based on Implicit Self-texture Enhancement

    Authors: Linhao Qu, Minghong Duan, Zhiwei Yang, Manning Wang, Zhijian Song

    Abstract: Existing super-resolution models for pathology images can only work in fixed integer magnifications and have limited performance. Though implicit neural network-based methods have shown promising results in arbitrary-scale super-resolution of natural images, it is not effective to directly apply them in pathology images, because pathology images have special fine-grained image textures different f… ▽ More

    Submitted 9 April, 2023; originally announced April 2023.

  33. Time-varying $β$-model for dynamic directed networks

    Authors: Yuqing Du, Lianqiang Qu, Ting Yan, Yuan Zhang

    Abstract: We extend the well-known $β$-model for directed graphs to dynamic network setting, where we observe snapshots of adjacency matrices at different time points. We propose a kernel-smoothed likelihood approach for estimating $2n$ time-varying parameters in a network with $n$ nodes, from $N$ snapshots. We establish consistency and asymptotic normality properties of our kernel-smoothed estimators as ei… ▽ More

    Submitted 5 April, 2023; originally announced April 2023.

    Journal ref: Scandinavian Journal of Statistics, 2023

  34. arXiv:2303.01962  [pdf, other

    cs.CL cs.AI

    Less is More: Mitigate Spurious Correlations for Open-Domain Dialogue Response Generation Models by Causal Discovery

    Authors: Tao Feng, Lizhen Qu, Gholamreza Haffari

    Abstract: In this paper, we conduct the first study on spurious correlations for open-domain response generation models based on a corpus CGDIALOG curated in our work. The cur rent models indeed suffer from spurious correlations and have a tendency of generating irrelevant and generic responses. Inspired by causal discovery algorithms, we propose a novel model-agnostic method for training and inference of r… ▽ More

    Submitted 2 March, 2023; originally announced March 2023.

  35. A detailed view of low-frequency quasi-periodic oscillation in the broadband 0.2-200 keV with Insight-HXMT and NICER

    Authors: X. Ma, L. Zhang, L. Tao, Q. C. Bu, J. L. Qu, S. N. Zhang, D. K. Zhou, Y. Huang, S. M. Jia, L. M. Song, S. Zhang, M. Y. Ge, H. X. Liu, Z. X. Yang, W. Yu, E. S. Yorgancioglu

    Abstract: We report the X-ray timing results of the black hole candidate MAXI J1820+070 during its 2018 outburst using the Hard X-ray Modulation Telescope (Insight-HXMT) and Neutron Star Interior Composition Explorer Mission (NICER) observations. Low frequency quasi-periodic oscillations (LFQPOs) are detected in the low/hard state and the hard intermediate state, which lasted for about 90 days. Thanks to th… ▽ More

    Submitted 1 March, 2023; originally announced March 2023.

  36. arXiv:2303.00232  [pdf, other

    eess.IV cs.CV

    Towards more precise automatic analysis: a comprehensive survey of deep learning-based multi-organ segmentation

    Authors: Xiaoyu Liu, Linhao Qu, Ziyue Xie, Jiayue Zhao, Yonghong Shi, Zhijian Song

    Abstract: Accurate segmentation of multiple organs of the head, neck, chest, and abdomen from medical images is an essential step in computer-aided diagnosis, surgical navigation, and radiation therapy. In the past few years, with a data-driven feature extraction approach and end-to-end training, automatic deep learning-based multi-organ segmentation method has far outperformed traditional methods and becom… ▽ More

    Submitted 2 March, 2023; v1 submitted 28 February, 2023; originally announced March 2023.

    Comments: 25 pages, 9 figures, 16 tabels

  37. arXiv:2302.10900  [pdf, other

    cs.LG cs.AI cs.IR

    Semi-decentralized Federated Ego Graph Learning for Recommendation

    Authors: Liang Qu, Ningzhi Tang, Ruiqi Zheng, Quoc Viet Hung Nguyen, Zi Huang, Yuhui Shi, Hongzhi Yin

    Abstract: Collaborative filtering (CF) based recommender systems are typically trained based on personal interaction data (e.g., clicks and purchases) that could be naturally represented as ego graphs. However, most existing recommendation methods collect these ego graphs from all users to compose a global graph to obtain high-order collaborative information between users and items, and these centralized CF… ▽ More

    Submitted 9 February, 2023; originally announced February 2023.

  38. Emphasizing Unseen Words: New Vocabulary Acquisition for End-to-End Speech Recognition

    Authors: Leyuan Qu, Cornelius Weber, Stefan Wermter

    Abstract: Due to the dynamic nature of human language, automatic speech recognition (ASR) systems need to continuously acquire new vocabulary. Out-Of-Vocabulary (OOV) words, such as trending words and new named entities, pose problems to modern ASR systems that require long training times to adapt their large numbers of parameters. Different from most previous research focusing on language model post-proces… ▽ More

    Submitted 21 February, 2023; v1 submitted 19 February, 2023; originally announced February 2023.

    Comments: Neural Networks, Volume 161, April 2023, Pages 494-504

  39. arXiv:2302.08079  [pdf, other

    cs.CL

    Document Flattening: Beyond Concatenating Context for Document-Level Neural Machine Translation

    Authors: Minghao Wu, George Foster, Lizhen Qu, Gholamreza Haffari

    Abstract: Existing work in document-level neural machine translation commonly concatenates several consecutive sentences as a pseudo-document, and then learns inter-sentential dependencies. This strategy limits the model's ability to leverage information from distant context. We overcome this limitation with a novel Document Flattening (DocFlat) technique that integrates Flat-Batch Attention (FBA) and Neura… ▽ More

    Submitted 15 February, 2023; originally announced February 2023.

    Comments: 15 pages, 8 figures, accepted by EACL 2023

  40. Timing analysis of EXO 2030+375 during its 2021 giant outburst observed with Insight-HXMT

    Authors: Yu-Cong Fu, L. M. Song, G. Q. Ding, M. Y. Ge, Y. L. Tuo, S. Zhang, S. N. Zhang, X. Hou, J. L. Qu, J. Zhang, L. Zhang, Q. C. Bu, Y. Huang, X. Ma, X. Zhou, W. M. Yan, Z. X. Yang, X. F. Lu, T. M. Li, Y. C. Xu, P. J. Wang, S. H. Xiao, H. X. Liu, X. Q. Ren, Y. F. Du , et al. (2 additional authors not shown)

    Abstract: We report the evolution of the X-ray pulsations of EXO 2030+375 during its 2021 outburst using the observations from \textit{Insight}-HXMT. Based on the accretion torque model, we study the correlation between the spin frequency derivatives and the luminosity. Pulsations can be detected in the energy band of 1--160 keV. The pulse profile evolves significantly with luminosity during the outburst, l… ▽ More

    Submitted 25 February, 2023; v1 submitted 4 February, 2023; originally announced February 2023.

  41. Reanalysis of the X-ray burst associated FRB 200428 with Insight-HXMT observations

    Authors: M. Y. Ge, C. Z. Liu, S. N. Zhang, F. J. Lu, Z. Zhang, Z. Chang, Y. L. Tuo, X. B. Li, C. K. Li, S. L. Xiong, C. Cai, X. F. Li, R. Zhang, Z. G. Dai, J. L. Qu, L. M. Song, S. Zhang, L. J. Wang

    Abstract: A double-peak X-ray burst from the Galactic magnetar SGR J1935+2154 was discovered as associated with the two radio pulses of FRB 200428 separated by 28.97+-0.02 ms. Precise measurements of the timing and spectral properties of the X-ray bursts are helpful for understanding the physical origin of fast radio bursts (FRBs). In this paper, we have reconstructed some information about the hard X-ray e… ▽ More

    Submitted 31 January, 2023; originally announced February 2023.

    Comments: 11 pages, 7 figures

  42. arXiv:2301.04296  [pdf, other

    math.ST

    A degree-corrected Cox model for dynamic networks

    Authors: Yuguo Chen, Lianqiang Qu, **feng Xu, Ting Yan, Yunpeng Zhou

    Abstract: Continuous time network data have been successfully modeled by multivariate counting processes, in which the intensity function is characterized by covariate information. However, degree heterogeneity has not been incorporated into the model which may lead to large biases for the estimation of homophily effects. In this paper, we propose a degree-corrected Cox network model to simultaneously analy… ▽ More

    Submitted 10 January, 2023; originally announced January 2023.

    Comments: 60 pages, 10 figures

  43. arXiv:2212.10025  [pdf, other

    cs.LG cs.CL

    When Federated Learning Meets Pre-trained Language Models' Parameter-Efficient Tuning Methods

    Authors: Zhuo Zhang, Yuanhang Yang, Yong Dai, Lizhen Qu, Zenglin Xu

    Abstract: With increasing privacy concerns on data, recent studies have made significant progress using federated learning (FL) on privacy-sensitive natural language processing (NLP) tasks. Much literature suggests fully fine-tuning pre-trained language models (PLMs) in the FL paradigm can mitigate the data heterogeneity problem and close the performance gap with centralized training. However, large PLMs br… ▽ More

    Submitted 2 June, 2023; v1 submitted 20 December, 2022; originally announced December 2022.

  44. arXiv:2212.09072  [pdf, other

    cs.CL

    Let's Negotiate! A Survey of Negotiation Dialogue Systems

    Authors: Haolan Zhan, Yufei Wang, Tao Feng, Yuncheng Hua, Suraj Sharma, Zhuang Li, Lizhen Qu, Gholamreza Haffari

    Abstract: Negotiation is one of the crucial abilities in human communication, and there has been a resurgent research interest in negotiation dialogue systems recently, which goal is to empower intelligent agents with such ability that can efficiently help humans resolve conflicts or reach beneficial agreements. Although there have been many explorations in negotiation dialogue systems, a systematic review… ▽ More

    Submitted 18 December, 2022; originally announced December 2022.

    Comments: An early version, work in progress

  45. arXiv:2212.06972  [pdf, other

    cs.SD cs.CL eess.AS

    Disentangling Prosody Representations with Unsupervised Speech Reconstruction

    Authors: Leyuan Qu, Taihao Li, Cornelius Weber, Theresa Pekarek-Rosin, Fuji Ren, Stefan Wermter

    Abstract: Human speech can be characterized by different components, including semantic content, speaker identity and prosodic information. Significant progress has been made in disentangling representations for semantic content and speaker identity in Automatic Speech Recognition (ASR) and speaker verification tasks respectively. However, it is still an open challenging research question to extract prosodi… ▽ More

    Submitted 25 September, 2023; v1 submitted 13 December, 2022; originally announced December 2022.

    Comments: Accepted by IEEE/ACM Transactions on Audio, Speech, and Language Processing

  46. Trace the Accretion Geometry of H 1743--322 with Type C Quasi-periodic Oscillations in Multiple Outbursts

    Authors: Qing-Cang Shui, Shu Zhang, Yu-Peng P. Chen, Shuang-Nan Zhang, Ling-Da Kong, Peng-Ju Wang, Long Ji, Hong-Xing Yin, J. L. Qu, L. Tao, M. Y. Ge, **g-Qiang Peng, Zhi Chang, Jian Li, Peng Zhang

    Abstract: We present a systematic analysis of type C quasi-periodic oscillation (QPO) observations of H 1743--322 throughout the Rossi X-ray Timing Explorer (RXTE) era. We find that, while different outbursts have significant flux differences, they show consistent positive correlations between the QPO fractional root-mean-square (rms) amplitude and non-thermal fraction of the emission, which indicate an ind… ▽ More

    Submitted 29 November, 2022; originally announced November 2022.

    Comments: 21 pages, 12 figures

  47. arXiv:2211.15235  [pdf, other

    cs.CV

    Reducing Domain Gap in Frequency and Spatial domain for Cross-modality Domain Adaptation on Medical Image Segmentation

    Authors: Shaolei Liu, Siqi Yin, Linhao Qu, Manning Wang

    Abstract: Unsupervised domain adaptation (UDA) aims to learn a model trained on source domain and performs well on unlabeled target domain. In medical image segmentation field, most existing UDA methods depend on adversarial learning to address the domain gap between different image modalities, which is ineffective due to its complicated training process. In this paper, we propose a simple yet effective UDA… ▽ More

    Submitted 28 November, 2022; originally announced November 2022.

    Comments: accepted at Thirty-Seventh AAAI Conference on Artificial Intelligence (AAAI-23)

  48. arXiv:2211.14843  [pdf, other

    cs.CV

    Learning Object-Language Alignments for Open-Vocabulary Object Detection

    Authors: Chuang Lin, Peize Sun, Yi Jiang, ** Luo, Lizhen Qu, Gholamreza Haffari, Zehuan Yuan, Jianfei Cai

    Abstract: Existing object detection methods are bounded in a fixed-set vocabulary by costly labeled data. When dealing with novel categories, the model has to be retrained with more bounding box annotations. Natural language supervision is an attractive alternative for its annotation-free attributes and broader object concepts. However, learning open-vocabulary object detection from language is challenging… ▽ More

    Submitted 27 November, 2022; originally announced November 2022.

    Comments: Technical Report

  49. arXiv:2211.11176  [pdf, other

    cs.LG cs.AI eess.SP

    Modeling Multivariate Biosignals With Graph Neural Networks and Structured State Space Models

    Authors: Siyi Tang, Jared A. Dunnmon, Liangqiong Qu, Khaled K. Saab, Tina Baykaner, Christopher Lee-Messer, Daniel L. Rubin

    Abstract: Multivariate biosignals are prevalent in many medical domains, such as electroencephalography, polysomnography, and electrocardiography. Modeling spatiotemporal dependencies in multivariate biosignals is challenging due to (1) long-range temporal dependencies and (2) complex spatial correlations between the electrodes. To address these challenges, we propose representing multivariate biosignals as… ▽ More

    Submitted 29 April, 2023; v1 submitted 20 November, 2022; originally announced November 2022.

    Comments: Published as a conference paper at CHIL 2023

  50. arXiv:2211.08843  [pdf, other

    cs.SD cs.AI eess.AS

    Improving Speech Emotion Recognition with Unsupervised Speaking Style Transfer

    Authors: Leyuan Qu, Wei Wang, Cornelius Weber, Pengcheng Yue, Taihao Li, Stefan Wermter

    Abstract: Humans can effortlessly modify various prosodic attributes, such as the placement of stress and the intensity of sentiment, to convey a specific emotion while maintaining consistent linguistic content. Motivated by this capability, we propose EmoAug, a novel style transfer model designed to enhance emotional expression and tackle the data scarcity issue in speech emotion recognition tasks. EmoAug… ▽ More

    Submitted 28 December, 2023; v1 submitted 16 November, 2022; originally announced November 2022.

    Comments: Accepted by ICASSP2024