
Showing 1–50 of 59 results for author: Miao, Q

  1. arXiv:2406.02774  [pdf, other]

    cs.CV

    Diffusion-Refined VQA Annotations for Semi-Supervised Gaze Following

    Authors: Qiaomu Miao, Alexandros Graikos, Jingwei Zhang, Sounak Mondal, Minh Hoai, Dimitris Samaras

    Abstract: Training gaze following models requires a large number of images with gaze target coordinates annotated by human annotators, which is a laborious and inherently ambiguous process. We propose the first semi-supervised method for gaze following by introducing two novel priors to the task. We obtain the first prior using a large pretrained Visual Question Answering (VQA) model, where we compute Grad-…

    Submitted 4 June, 2024; originally announced June 2024.

  2. arXiv:2405.19957  [pdf, other]

    cs.CV cs.AI

    PLA4D: Pixel-Level Alignments for Text-to-4D Gaussian Splatting

    Authors: Qiaowei Miao, Yawei Luo, Yi Yang

    Abstract: As text-conditioned diffusion models (DMs) achieve breakthroughs in image, video, and 3D generation, the research community's focus has shifted to the more challenging task of text-to-4D synthesis, which introduces a temporal dimension to generate dynamic 3D objects. In this context, we identify Score Distillation Sampling (SDS), a widely used technique for text-to-3D synthesis, as a significant h…

    Submitted 5 June, 2024; v1 submitted 30 May, 2024; originally announced May 2024.

  3. arXiv:2405.17929  [pdf, other]

    cs.CV

    Towards Unified Robustness Against Both Backdoor and Adversarial Attacks

    Authors: Zhenxing Niu, Yuyao Sun, Qiguang Miao, Rong Jin, Gang Hua

    Abstract: Deep Neural Networks (DNNs) are known to be vulnerable to both backdoor and adversarial attacks. In the literature, these two types of attacks are commonly treated as distinct robustness problems and solved separately, since they belong to training-time and inference-time attacks respectively. However, this paper revealed that there is an intriguing connection between them: (1) planting a backdoor…

    Submitted 28 May, 2024; originally announced May 2024.

  4. arXiv:2405.14905  [pdf, other]

    eess.IV cs.AI cs.CL

    Structural Entities Extraction and Patient Indications Incorporation for Chest X-ray Report Generation

    Authors: Kang Liu, Zhuoqi Ma, Xiaolu Kang, Zhusi Zhong, Zhicheng Jiao, Grayson Baird, Harrison Bai, Qiguang Miao

    Abstract: The automated generation of imaging reports proves invaluable in alleviating the workload of radiologists. A clinically applicable reports generation algorithm should demonstrate its effectiveness in producing reports that accurately describe radiology findings and attend to patient-specific indications. In this paper, we introduce a novel method, \textbf{S}tructural \textbf{E}ntities extraction a…

    Submitted 22 May, 2024; originally announced May 2024.

    Comments: The code is available at https://github.com/mk-runner/SEI-Temp or https://github.com/mk-runner/SEI

  5. arXiv:2405.11151  [pdf, other]

    cs.CV cs.AI

    Multi-scale Information Sharing and Selection Network with Boundary Attention for Polyp Segmentation

    Authors: Xiaolu Kang, Zhuoqi Ma, Kang Liu, Yunan Li, Qiguang Miao

    Abstract: Polyp segmentation for colonoscopy images is of vital importance in clinical practice. It can provide valuable information for colorectal cancer diagnosis and surgery. While existing methods have achieved relatively good performance, polyp segmentation still faces the following challenges: (1) Varying lighting conditions in colonoscopy and differences in polyp locations, sizes, and morphologies. (…

    Submitted 17 May, 2024; originally announced May 2024.

  6. arXiv:2405.09586  [pdf, other]

    eess.IV cs.AI cs.CV

    Factual Serialization Enhancement: A Key Innovation for Chest X-ray Report Generation

    Authors: Kang Liu, Zhuoqi Ma, Mengmeng Liu, Zhicheng Jiao, Xiaolu Kang, Qiguang Miao, Kun Xie

    Abstract: The automation of writing imaging reports is a valuable tool for alleviating the workload of radiologists. Crucial steps in this process involve the cross-modal alignment between medical images and reports, as well as the retrieval of similar historical cases. However, the presence of presentation-style vocabulary (e.g., sentence structure and grammar) in reports poses challenges for cross-modal a…

    Submitted 15 May, 2024; originally announced May 2024.

  7. arXiv:2404.17510  [pdf, other]

    physics.optics

    Kerr Nonlinearity Induced Nonreciprocity in dissipatively coupled resonators

    Authors: Qingtian Miao, G. S. Agarwal

    Abstract: Nonlinearity induced nonreciprocity is studied in a system comprising two resonators coupled to a one-dimensional waveguide when the linear system does not exhibit nonreciprocity. The analysis is based on the Hamiltonian of the coupled system and includes the dissipative coupling between the waveguide and resonators, along with the input-output relations. We consider a large number of scenarios wh…

    Submitted 26 April, 2024; originally announced April 2024.

  8. arXiv:2404.10357  [pdf, other]

    cs.CV

    Optimization of Prompt Learning via Multi-Knowledge Representation for Vision-Language Models

    Authors: Enming Zhang, Bingke Zhu, Yingying Chen, Qinghai Miao, Ming Tang, Jinqiao Wang

    Abstract: Vision-Language Models (VLMs), such as CLIP, play a foundational role in various cross-modal applications. To fully leverage VLMs' potential in adapting to downstream tasks, context optimization methods like Prompt Tuning are essential. However, one key limitation is the lack of diversity in prompt templates, whether they are hand-crafted or learned through additional modules. This limitation rest…

    Submitted 16 April, 2024; v1 submitted 16 April, 2024; originally announced April 2024.

  9. arXiv:2403.02624  [pdf, other]

    cs.LG cs.AI

    Pareto-Optimal Estimation and Policy Learning on Short-term and Long-term Treatment Effects

    Authors: Yingrong Wang, Anpeng Wu, Haoxuan Li, Weiming Liu, Qiaowei Miao, Ruoxuan Xiong, Fei Wu, Kun Kuang

    Abstract: This paper focuses on developing Pareto-optimal estimation and policy learning to identify the most effective treatment that maximizes the total reward from both short-term and long-term effects, which might conflict with each other. For example, a higher dosage of medication might increase the speed of a patient's recovery (short-term) but could also result in severe long-term side effects. Altho…

    Submitted 12 March, 2024; v1 submitted 4 March, 2024; originally announced March 2024.

  10. arXiv:2402.17257  [pdf, other]

    cs.LG cs.AI cs.RO

    RIME: Robust Preference-based Reinforcement Learning with Noisy Preferences

    Authors: Jie Cheng, Gang Xiong, Xingyuan Dai, Qinghai Miao, Yisheng Lv, Fei-Yue Wang

    Abstract: Preference-based Reinforcement Learning (PbRL) circumvents the need for reward engineering by harnessing human preferences as the reward signal. However, current PbRL methods excessively depend on high-quality feedback from domain experts, which results in a lack of robustness. In this paper, we present RIME, a robust PbRL algorithm for effective reward learning from noisy preferences. Our method…

    Submitted 30 May, 2024; v1 submitted 27 February, 2024; originally announced February 2024.

    Comments: Accepted by ICML2024

  11. arXiv:2402.07883  [pdf, other]

    quant-ph

    Equivalence of cost concentration and gradient vanishing for quantum circuits: an elementary proof in the Riemannian formulation

    Authors: Qiang Miao, Thomas Barthel

    Abstract: The optimization of quantum circuits can be hampered by a decay of average gradient amplitudes with the system size. When the decay is exponential, this is called the barren plateau problem. Considering explicit circuit parametrizations (in terms of rotation angles), it has been shown in Arrasmith et al., Quantum Sci. Technol. 7, 045015 (2022) that barren plateaus are equivalent to an exponential…

    Submitted 12 February, 2024; originally announced February 2024.

    Comments: 8 pages, 3 figures

  12. arXiv:2402.01422  [pdf, other]

    cs.CV

    EmoSpeaker: One-shot Fine-grained Emotion-Controlled Talking Face Generation

    Authors: Guanwen Feng, Haoran Cheng, Yunan Li, Zhiyuan Ma, Chaoneng Li, Zhihao Qian, Qiguang Miao, Chi-Man Pun

    Abstract: Implementing fine-grained emotion control is crucial for emotion generation tasks because it enhances the expressive capability of the generative model, allowing it to accurately and comprehensively capture and express various nuanced emotional states, thereby improving the emotional quality and personalization of generated content. Generating fine-grained facial animations that accurately portray…

    Submitted 2 February, 2024; originally announced February 2024.

  13. arXiv:2312.06117  [pdf, other]

    cs.CV

    M3SOT: Multi-frame, Multi-field, Multi-space 3D Single Object Tracking

    Authors: Jiaming Liu, Yue Wu, Maoguo Gong, Qiguang Miao, Wenping Ma, Can Qin

    Abstract: 3D Single Object Tracking (SOT) stands a forefront task of computer vision, proving essential for applications like autonomous driving. Sparse and occluded data in scene point clouds introduce variations in the appearance of tracked objects, adding complexity to the task. In this research, we unveil M3SOT, a novel 3D SOT framework, which synergizes multiple input frames (template sets), multiple r…

    Submitted 10 December, 2023; originally announced December 2023.

    Comments: 12 pages, 10 figures, 10 tables, AAAI 2024

    Journal ref: AAAI 2024

  14. arXiv:2312.06063  [pdf, other]

    cs.CV cs.AI

    PCRDiffusion: Diffusion Probabilistic Models for Point Cloud Registration

    Authors: Yue Wu, Yongzhe Yuan, Xiaolong Fan, Xiaoshui Huang, Maoguo Gong, Qiguang Miao

    Abstract: We propose a new framework that formulates point cloud registration as a denoising diffusion process from noisy transformation to object transformation. During training stage, object transformation diffuses from ground-truth transformation to random distribution, and the model learns to reverse this noising process. In sampling stage, the model refines randomly generated transformation to the outp…

    Submitted 10 December, 2023; originally announced December 2023.

  15. arXiv:2311.04942  [pdf, other]

    eess.IV cs.CV

    CSAM: A 2.5D Cross-Slice Attention Module for Anisotropic Volumetric Medical Image Segmentation

    Authors: Alex Ling Yu Hung, Haoxin Zheng, Kai Zhao, Xiaoxi Du, Kaifeng Pang, Qi Miao, Steven S. Raman, Demetri Terzopoulos, Kyunghyun Sung

    Abstract: A large portion of volumetric medical data, especially magnetic resonance imaging (MRI) data, is anisotropic, as the through-plane resolution is typically much lower than the in-plane resolution. Both 3D and purely 2D deep learning-based segmentation methods are deficient in dealing with such volumetric data since the performance of 3D methods suffers when confronting anisotropic data, and 2D meth…

    Submitted 26 November, 2023; v1 submitted 7 November, 2023; originally announced November 2023.

  16. arXiv:2310.15533  [pdf, other]

    cs.CV

    Learning with Noisy Labels Using Collaborative Sample Selection and Contrastive Semi-Supervised Learning

    Authors: Qing Miao, Xiaohe Wu, Chao Xu, Yanli Ji, Wangmeng Zuo, Yiwen Guo, Zhaopeng Meng

    Abstract: Learning with noisy labels (LNL) has been extensively studied, with existing approaches typically following a framework that alternates between clean sample selection and semi-supervised learning (SSL). However, this approach has a limitation: the clean set selected by the Deep Neural Network (DNN) classifier, trained through self-training, inevitably contains noisy samples. This mixture of clean…

    Submitted 24 October, 2023; originally announced October 2023.

  17. arXiv:2308.15864  [pdf]

    cs.MA

    (Mis)align: A Simple Dynamic Framework for Modeling Interpersonal Coordination

    Authors: Grace Qiyuan Miao, Rick Dale, Alexia Galati

    Abstract: As people coordinate in daily interactions, they engage in different patterns of behavior to achieve successful outcomes. This includes both synchrony - the temporal coordination of the same behaviors at the same time - and complementarity - the coordination of the same or different behaviors that may occur at different relative times. Using computational methods, we develop a simple framework to…

    Submitted 30 August, 2023; originally announced August 2023.

    Comments: Code and data necessary to reproduce findings in this article can be found at the following GitHub repository: https://github.com/miaoqy0729/sim-syn-sims

  18. arXiv:2308.12831  [pdf, other]

    cs.CV

    EFormer: Enhanced Transformer towards Semantic-Contour Features of Foreground for Portraits Matting

    Authors: Zitao Wang, Qiguang Miao, Peipei Zhao, Yue Xi

    Abstract: The portrait matting task aims to extract an alpha matte with complete semantics and finely-detailed contours. In comparison to CNN-based approaches, transformers with self-attention module have a better capacity to capture long-range dependencies and low-frequency semantic information of a portrait. However, the recent research shows that self-attention mechanism struggles with modeling high-freq…

    Submitted 30 November, 2023; v1 submitted 24 August, 2023; originally announced August 2023.

    Comments: 10 pages, 5 figures

  19. arXiv:2308.09609  [pdf, ps, other]

    math.AP

    Global well-posedness and refined regularity criterion for the uni-directional Euler-alignment system

    Authors: Yatao Li, Qianyun Miao, Changhui Tan, Liutang Xue

    Abstract: We investigate global solutions to the Euler-alignment system in $d$ dimensions with unidirectional flows and strongly singular communication protocols $φ(x) = |x|^{-(d+α)}$ for $α\in (0,2)$. Our paper establishes global regularity results in both the subcritical regime $1<α<2$ and the critical regime $α=1$. Notably, when $α=1$, the system exhibits a critical scaling similar to the critical quasi-…

    Submitted 18 August, 2023; originally announced August 2023.

    Comments: 31 pages

    MSC Class: 35Q35; 76N10; 35B65; 35B40

  20. arXiv:2307.14019  [pdf, other]

    cs.CV cs.AI

    One-Nearest Neighborhood Guides Inlier Estimation for Unsupervised Point Cloud Registration

    Authors: Yongzhe Yuan, Yue Wu, Maoguo Gong, Qiguang Miao, A. K. Qin

    Abstract: The precision of unsupervised point cloud registration methods is typically limited by the lack of reliable inlier estimation and self-supervised signal, especially in partially overlapping scenarios. In this paper, we propose an effective inlier estimation method for unsupervised point cloud registration by capturing geometric structure consistency between the source point cloud and its correspon…

    Submitted 26 July, 2023; originally announced July 2023.

  21. Engineering bound states in continuum via nonlinearity induced extra dimension

    Authors: Qingtian Miao, Jayakrishnan M. P. Nair, Girish S. Agarwal

    Abstract: Bound states in continuum (BICs) are localized states of a system possessing significantly large life times with applications across various branches of science. In this work, we propose an expedient protocol to engineer BICs which involves the use of Kerr nonlinearities in the system. The generation of BICs is a direct artifact of the nonlinearity and the associated expansion in the dimensionalit…

    Submitted 10 July, 2023; originally announced July 2023.

    Comments: 7 pages, 4 figures

    Journal ref: Physical Review RESEARCH 5, 043053 (2023)

  22. arXiv:2304.14320  [pdf, other]

    quant-ph

    Isometric tensor network optimization for extensive Hamiltonians is free of barren plateaus

    Authors: Qiang Miao, Thomas Barthel

    Abstract: We explain why and numerically confirm that there are no barren plateaus in the energy optimization of isometric tensor network states (TNS) for extensive Hamiltonians with finite-range interactions which are, for example, typical in condensed matter physics. Specifically, we consider matrix product states (MPS) with open boundary conditions, tree tensor network states (TTNS), and the multiscale e…

    Submitted 11 March, 2024; v1 submitted 27 April, 2023; originally announced April 2023.

    Comments: 7 pages, 5 figures; improved and extended discussion; added analysis showing that power-law decay observed in [Liu et al., arXiv:1902.02663] for MPS is a finite-size effect

  23. arXiv:2304.00680  [pdf, other]

    quant-ph

    Polaritonic Ultrastrong Coupling: Quantum Entanglement in Ground State

    Authors: Qingtian Miao, G. S. Agarwal

    Abstract: The ultrastrong coupling between the elementary excitations of matter and microcavity modes is studied in a fully analytical quantum-mechanical theoretical framework. The elementary excitation could be phonons, excitons, plasmons, etc. From the diagonalization of the Hamiltonian, we obtain the ground state of the polariton Hamiltonian. The ground state belongs to the Gaussian class. Using the Gaus…

    Submitted 2 April, 2023; originally announced April 2023.

  24. arXiv:2304.00161  [pdf, other]

    quant-ph cond-mat.str-el physics.comp-ph

    Absence of barren plateaus and scaling of gradients in the energy optimization of isometric tensor network states

    Authors: Thomas Barthel, Qiang Miao

    Abstract: Vanishing gradients can pose substantial obstacles for high-dimensional optimization problems. Here we consider energy minimization problems for quantum many-body systems with extensive Hamiltonians, which can be studied on classical computers or in the form of variational quantum eigensolvers on quantum computers. Barren plateaus correspond to scenarios where the average amplitude of the energy g…

    Submitted 19 May, 2023; v1 submitted 31 March, 2023; originally announced April 2023.

    Comments: 29 pages main text, 11 pages appendix, 11 figures; added 6 figures concerning MERA and TTNS, added analysis for nonary 2D MERA and TTNS, additional references, further minor improvements

  25. arXiv:2303.08910  [pdf, other]

    quant-ph cond-mat.str-el physics.comp-ph

    Convergence and Quantum Advantage of Trotterized MERA for Strongly-Correlated Systems

    Authors: Qiang Miao, Thomas Barthel

    Abstract: Strongly-correlated quantum many-body systems are difficult to study and simulate classically. Our recent work [arXiv:2108.13401] proposed a variational quantum eigensolver (VQE) based on the multiscale entanglement renormalization ansatz (MERA) with tensors constrained to certain Trotter circuits. Here, we extend the theoretical analysis, testing different initialization and convergence schemes,…

    Submitted 15 March, 2023; originally announced March 2023.

    Comments: 7 pages, 6 figures

  26. arXiv:2302.12434  [pdf, other]

    cs.SD cs.AI eess.AS

    Catch You and I Can: Revealing Source Voiceprint Against Voice Conversion

    Authors: Jiangyi Deng, Yanjiao Chen, Yinan Zhong, Qianhao Miao, Xueluan Gong, Wenyuan Xu

    Abstract: Voice conversion (VC) techniques can be abused by malicious parties to transform their audios to sound like a target speaker, making it hard for a human being or a speaker verification/identification system to trace the source speaker. In this paper, we make the first attempt to restore the source voiceprint from audios synthesized by voice conversion methods with high credit. However, unveiling t…

    Submitted 23 February, 2023; originally announced February 2023.

    Comments: Accepted by USENIX Security Symposium 2023. Please cite this paper as "Jiangyi Deng, Yanjiao Chen, Yinan Zhong, Qianhao Miao, Xueluan Gong, Wenyuan Xu. Catch You and I Can: Revealing Source Voiceprint Against Voice Conversion. In 32nd USENIX Security Symposium (USENIX Security 23)."

  27. arXiv:2212.05679  [pdf, other]

    cs.CV

    Evolutionary Multitasking with Solution Space Cutting for Point Cloud Registration

    Authors: Wu Yue, Peiran Gong, Maoguo Gong, Hangqi Ding, Zedong Tang, Yibo Liu, Wenping Ma, Qiguang Miao

    Abstract: Point cloud registration (PCR) is a popular research topic in computer vision. Recently, the registration method in an evolutionary way has received continuous attention because of its robustness to the initial pose and flexibility in objective function design. However, most evolving registration methods cannot tackle the local optimum well and they have rarely investigated the success ratio, whic…

    Submitted 14 June, 2023; v1 submitted 11 December, 2022; originally announced December 2022.

  28. arXiv:2211.11062  [pdf, other]

    cs.CV

    Patch-level Gaze Distribution Prediction for Gaze Following

    Authors: Qiaomu Miao, Minh Hoai, Dimitris Samaras

    Abstract: Gaze following aims to predict where a person is looking in a scene, by predicting the target location, or indicating that the target is located outside the image. Recent works detect the gaze target by training a heatmap regression task with a pixel-wise mean-square error (MSE) loss, while formulating the in/out prediction task as a binary classification task. This training formulation puts a str…

    Submitted 20 November, 2022; originally announced November 2022.

    Comments: Accepted to WACV 2023

  29. arXiv:2211.00277  [pdf]

    cs.LG cs.AI cs.CR

    HFN: Heterogeneous Feature Network for Multivariate Time Series Anomaly Detection

    Authors: Jun Zhan, Chengkun Wu, Canqun Yang, Qiucheng Miao, Xiandong Ma

    Abstract: Network or physical attacks on industrial equipment or computer systems may cause massive losses. Therefore, a quick and accurate anomaly detection (AD) based on monitoring data, especially the multivariate time-series (MTS) data, is of great significance. As the key step of anomaly detection for MTS data, learning the relations among different variables has been explored by many approaches. Howev…

    Submitted 1 November, 2022; v1 submitted 1 November, 2022; originally announced November 2022.

  30. arXiv:2210.02655  [pdf, other]

    cs.CV

    Domain Generalization via Contrastive Causal Learning

    Authors: Qiaowei Miao, Junkun Yuan, Kun Kuang

    Abstract: Domain Generalization (DG) aims to learn a model that can generalize well to unseen target domains from a set of source domains. With the idea of invariant causal mechanism, a lot of efforts have been put into learning robust causal effects which are determined by the object yet insensitive to the domain changes. Despite the invariance of causal effects, they are difficult to be quantified and opt…

    Submitted 5 October, 2022; originally announced October 2022.

  31. arXiv:2208.09240  [pdf, other]

    cs.LG cs.AI

    An Unsupervised Short- and Long-Term Mask Representation for Multivariate Time Series Anomaly Detection

    Authors: Qiucheng Miao, Chuanfu Xu, Jun Zhan, Dong Zhu, Chengkun Wu

    Abstract: Anomaly detection of multivariate time series is meaningful for system behavior monitoring. This paper proposes an anomaly detection method based on unsupervised Short- and Long-term Mask Representation learning (SLMR). The main idea is to extract short-term local dependency patterns and long-term global trend patterns of the multivariate time series by using multi-scale residual dilated convoluti…

    Submitted 19 August, 2022; originally announced August 2022.

  32. arXiv:2208.03561  [pdf]

    cs.CV cs.AI

    Study of detecting behavioral signatures within DeepFake videos

    Authors: Qiaomu Miao, Sinhwa Kang, Stacy Marsella, Steve DiPaola, Chao Wang, Ari Shapiro

    Abstract: There is strong interest in the generation of synthetic video imagery of people talking for various purposes, including entertainment, communication, training, and advertisement. With the development of deep fake generation models, synthetic video imagery will soon be visually indistinguishable to the naked eye from a naturally capture video. In addition, many methods are continuing to improve to…

    Submitted 6 August, 2022; originally announced August 2022.

    Comments: 9 pages

  33. arXiv:2207.06143  [pdf, ps, other]

    math.AP

    A global second order Sobolev regularity for $p$-Laplacian type equations with variable coefficients in bounded domains

    Authors: Qianyun Miao, Fa Peng, Yuan Zhou

    Abstract: Let $Ω\subset R^n$ be a bounded convex domain with $n\ge2$. Suppose that $A$ is uniformly elliptic and belongs to $W^{1,n}$ when $n\ge 3$ or $W^{1,q}$ for some $q>2$ when $n=2$. For $1<p<\infty$, we build up a global second order regularity estimate $$\|D[|Du|^{p-2} Du]\|_{L^2(Ω)}+\|D[ |\sqrt{A}Du|^{p-2} A Du]\|_{L^2(Ω)} \le C \|f\|_{L^2(Ω)} $$ for inhomogeneous $p$-Laplace type equation \begin{eq…

    Submitted 13 July, 2022; originally announced July 2022.

  34. arXiv:2207.02429  [pdf, ps, other]

    math.AP

    Global well-posedness and asymptotic behavior in critical spaces for the compressible Euler system with velocity alignment

    Authors: Xiang Bai, Qianyun Miao, Changhui Tan, Liutang Xue

    Abstract: In this paper, we study the Cauchy problem of the compressible Euler system with strongly singular velocity alignment. We prove the existence and uniqueness of global solutions in critical Besov spaces to the considered system with small initial data. The local-in-time solvability is also addressed. Moreover, we show the large-time asymptotic behavior and optimal decay estimates of the solutions a…

    Submitted 6 July, 2022; originally announced July 2022.

    Comments: 39 pages

    MSC Class: 35Q31; 35R11; 76N10; 35B40

  35. arXiv:2205.12662  [pdf, other]

    cs.CL

    DFM: Dialogue Foundation Model for Universal Large-Scale Dialogue-Oriented Task Learning

    Authors: Zhi Chen, Jijia Bao, Lu Chen, Yuncong Liu, Da Ma, Bei Chen, Mengyue Wu, Su Zhu, Xin Dong, Fujiang Ge, Qingliang Miao, Jian-Guang Lou, Kai Yu

    Abstract: Building a universal conversational agent has been a long-standing goal of the dialogue research community. Most previous works only focus on a small set of dialogue tasks. In this work, we aim to build a unified dialogue foundation model (DFM) which can be used to solve massive diverse dialogue tasks. To achieve this goal, a large-scale well-annotated dialogue dataset with rich task diversity (Di…

    Submitted 9 October, 2022; v1 submitted 25 May, 2022; originally announced May 2022.

    Comments: Work in Progress

  36. arXiv:2205.02996  [pdf, other]

    cs.CV cs.NE

    Multi-view Point Cloud Registration based on Evolutionary Multitasking with Bi-Channel Knowledge Sharing Mechanism

    Authors: Yue Wu, Yibo Liu, Maoguo Gong, Peiran Gong, Hao Li, Zedong Tang, Qiguang Miao, Wenping Ma

    Abstract: Multi-view point cloud registration is fundamental in 3D reconstruction. Since there are close connections between point clouds captured from different viewpoints, registration performance can be enhanced if these connections be harnessed properly. Therefore, this paper models the registration problem as multi-task optimization, and proposes a novel bi-channel knowledge sharing mechanism for effec…

    Submitted 23 August, 2022; v1 submitted 5 May, 2022; originally announced May 2022.

  37. arXiv:2203.15163  [pdf, other]

    eess.IV cs.CV

    CAT-Net: A Cross-Slice Attention Transformer Model for Prostate Zonal Segmentation in MRI

    Authors: Alex Ling Yu Hung, Haoxin Zheng, Qi Miao, Steven S. Raman, Demetri Terzopoulos, Kyunghyun Sung

    Abstract: Prostate cancer is the second leading cause of cancer death among men in the United States. The diagnosis of prostate MRI often relies on the accurate prostate zonal segmentation. However, state-of-the-art automatic segmentation methods often fail to produce well-contained volumetric segmentation of the prostate zones since certain slices of prostate MRI, such as base and apex slices, are harder t…

    Submitted 16 June, 2022; v1 submitted 28 March, 2022; originally announced March 2022.

  38. arXiv:2201.00443  [pdf, other]

    cs.CV

    Scene Graph Generation: A Comprehensive Survey

    Authors: Guangming Zhu, Liang Zhang, Youliang Jiang, Yixuan Dang, Haoran Hou, Peiyi Shen, Mingtao Feng, Xia Zhao, Qiguang Miao, Syed Afaq Ali Shah, Mohammed Bennamoun

    Abstract: Deep learning techniques have led to remarkable breakthroughs in the field of generic object detection and have spawned a lot of scene-understanding tasks in recent years. Scene graph has been the focus of research because of its powerful semantic representation and applications to scene understanding. Scene Graph Generation (SGG) refers to the task of automatically mapping an image into a semanti…

    Submitted 22 June, 2022; v1 submitted 2 January, 2022; originally announced January 2022.

    Comments: Submitted to TPAMI

  39. Few-Shot NLU with Vector Projection Distance and Abstract Triangular CRF

    Authors: Su Zhu, Lu Chen, Ruisheng Cao, Zhi Chen, Qingliang Miao, Kai Yu

    Abstract: Data sparsity problem is a key challenge of Natural Language Understanding (NLU), especially for a new target domain. By training an NLU model in source domains and applying the model to an arbitrary target domain directly (even without fine-tuning), few-shot NLU becomes crucial to mitigate the data scarcity issue. In this paper, we propose to improve prototypical networks with vector projection d…

    Submitted 9 December, 2021; originally announced December 2021.

    Comments: Accepted by NLPCC 2021

  40. arXiv:2111.12235  [pdf, ps, other]

    math.AP

    Global well-posedness for 2D fractional inhomogeneous Navier-Stokes equations with rough density

    Authors: Yatao Li, Qianyun Miao, Liutang Xue

    Abstract: The paper concerns with the global well-posedness issue of the 2D incompressible inhomogeneous Navier-Stokes (INS) equations with fractional dissipation and rough density. We first establish the $L^q_t(L^p)$-maximal regularity estimate for the generalized Stokes system with fractional dissipation, and then we employ it to obtain the global existence of solution for the 2D fractional INS equations…

    Submitted 23 November, 2021; originally announced November 2021.

    Comments: 43 pages

    MSC Class: 35Q30; 76D05; 35B40

  41. arXiv:2110.06442  [pdf, ps, other]

    math.AP

    Global regularity of non-diffusive temperature fronts for the 2D viscous Boussinesq system

    Authors: Dongho Chae, Qianyun Miao, Liutang Xue

    Abstract: In this paper we address the temperature patch problem of the 2D viscous Boussinesq system without heat diffusion term. The temperature satisfies the transport equation and the initial data of temperature is given in the form of non-constant patch, usually called the temperature front initial data. Introducing a good unknown and applying the method of striated estimates, we prove that our partiall…

    Submitted 28 October, 2021; v1 submitted 12 October, 2021; originally announced October 2021.

    Comments: 50 pages. The striated estimates are simplified

    MSC Class: 76D03; 35Q35; 35Q86

  42. arXiv:2108.13401  [pdf, other]

    quant-ph cond-mat.quant-gas cond-mat.str-el

    Quantum-classical eigensolver using multiscale entanglement renormalization

    Authors: Qiang Miao, Thomas Barthel

    Abstract: We propose a variational quantum eigensolver (VQE) for the simulation of strongly-correlated quantum matter based on a multi-scale entanglement renormalization ansatz (MERA) and gradient-based optimization. This MERA quantum eigensolver can have substantially lower computation costs than corresponding classical algorithms. Due to its narrow causal cone, the algorithm can be implemented on noisy in…

    Submitted 31 August, 2023; v1 submitted 30 August, 2021; originally announced August 2021.

    Comments: 14 pages, 9 figures; additional discussions of the computational complexity, layer-transition maps for homogeneous MERA, mid-circuit qubit resets, and data on the quantum advantage; further minor improvements; published version

    Journal ref: Phys. Rev. Res. 5, 033141 (2023)

  43. arXiv:2104.09079  [pdf, other]

    cs.AI cs.LG eess.SP

    A novel time-frequency Transformer based on self-attention mechanism and its application in fault diagnosis of rolling bearings

    Authors: Yifei Ding, Minping Jia, Qiuhua Miao, Yudong Cao

    Abstract: The scope of data-driven fault diagnosis models has been greatly extended by deep learning (DL). However, classical convolutional and recurrent structures have defects in computational efficiency and feature representation, while the latest Transformer architecture based on the attention mechanism has not yet been applied in this field. To solve these problems, we propose a novel time-frequency…

    Submitted 4 December, 2021; v1 submitted 19 April, 2021; originally announced April 2021.

    Journal ref: Mech. Syst. Signal Process., vol. 168, p. 108616, Apr. 2022

  44. arXiv:2010.07265  [pdf, other]

    cond-mat.stat-mech cond-mat.str-el quant-ph

    Eigenstate entanglement scaling for critical interacting spin chains

    Authors: Qiang Miao, Thomas Barthel

    Abstract: With increasing subsystem size and energy, bipartite entanglement entropies of energy eigenstates cross over from the groundstate scaling to a volume law. In previous work, we pointed out that, when strong or weak eigenstate thermalization (ETH) applies, the entanglement entropies of all or, respectively, almost all eigenstates follow a single crossover function. The crossover functions are determ…

    Submitted 1 February, 2022; v1 submitted 14 October, 2020; originally announced October 2020.

    Comments: 8 pages, 7 figures. Data for larger systems, added discussion of substructures due to approximately conserved quantities, minor improvements. Published version. This complements arXiv:1905.07760 and arXiv:1912.10045

    Journal ref: Quantum 6, 642 (2022)

  45. arXiv:2010.03365  [pdf, other]

    cs.RO cs.AI

    "Drunk Man" Saves Our Lives: Route Planning by a Biased Random Walk Mode

    Authors: Xinyi Hu, Quchen Miao, Zexuan Zhao

    Abstract: Based on the hurricane striking Puerto Rico in 2017, we developed a transportable disaster response system "DroneGo" featuring a drone fleet capable of delivering the medical package and videoing roads. Covering with a genetic algorithm and a biased random walk model mimicking a drunk man to explore feasible routes on a field with altitude and road information. A proposal mechanism guaranteeing st…

    Submitted 4 October, 2020; originally announced October 2020.

  46. arXiv:2004.03652  [pdf, ps, other]

    math.AP

    Global regularity for a 1D Euler-alignment system with misalignment

    Authors: Qianyun Miao, Changhui Tan, Liutang Xue

    Abstract: We study one-dimensional Eulerian dynamics with nonlocal alignment interactions, featuring strong short-range alignment and long-range misalignment. Compared with the well-studied Euler-alignment system, the presence of misalignment leads to different solution behavior, including the possible creation of vacuum at infinite time, which destabilizes the solutions. We show that with a str…

    Submitted 7 April, 2020; originally announced April 2020.

    Comments: 38 pages, 1 figure

    MSC Class: 35Q35; 35R11; 92D25; 76N10

  47. arXiv:2001.03859  [pdf, ps, other]

    nucl-th astro-ph.HE astro-ph.SR

    Nucleon effective mass in hot dense matter

    Authors: X. L. Shang, A. Li, Z. Q. Miao, G. F. Burgio, H. -J. Schulze

    Abstract: Nucleon effective masses are studied in the framework of the Brueckner-Hartree-Fock many-body approach at finite temperature. Self-consistent calculations using the Argonne $V_{18}$ interaction including microscopic three-body forces are reported for varying temperature and proton fraction up to several times the nuclear saturation density. Our calculations are based on the exact treatment of the…

    Submitted 28 April, 2020; v1 submitted 12 January, 2020; originally announced January 2020.

    Comments: version accepted for publication in Physical Review C

    Journal ref: Phys. Rev. C 101, 065801 (2020)

  48. arXiv:1912.10045  [pdf, other]

    cond-mat.stat-mech hep-th quant-ph

    Scaling functions for eigenstate entanglement crossovers in harmonic lattices

    Authors: Thomas Barthel, Qiang Miao

    Abstract: For quantum matter, eigenstate entanglement entropies obey an area law or log-area law at low energies and small subsystem sizes and cross over to volume laws for high energies and large subsystems. This transition is captured by crossover functions, which assume a universal scaling form in quantum critical regimes. We demonstrate this for the harmonic lattice model, which describes quantized latt…

    Submitted 7 September, 2021; v1 submitted 20 December, 2019; originally announced December 2019.

    Comments: 12 pages, 5 figures. Added a large-deviation analysis, improved text; published version. See also arXiv:1905.07760 [PRL 127, 040603 (2021)], where the concept of scaling functions for eigenstate entanglement crossovers has been introduced and demonstrated for other models

    Journal ref: Phys. Rev. A 104, 022414 (2021)

  49. The Defocusing Energy-critical Klein-Gordon-Hartree Equation

    Authors: Qianyun Miao, Jiqiang Zheng

    Abstract: In this paper, we study the scattering theory for the defocusing energy-critical Klein-Gordon equation with a cubic convolution $u_{tt}-\Delta u+u+(|x|^{-4}\ast|u|^2)u=0$ in the spatial dimension $d \geq 5$. We utilize the strategy in [S. Ibrahim, N. Masmoudi and K. Nakanishi, Scattering threshold for the focusing nonlinear Klein-Gordon equation. Analysis and PDE, 4 (2011), 405-460.] derived from conce…

    Submitted 16 August, 2019; originally announced August 2019.

    Comments: 23 pages. arXiv admin note: substantial text overlap with arXiv:math/0612028

    Journal ref: Colloquium Mathematicum, 140(2015),31-58

  50. ChaLearn Looking at People: IsoGD and ConGD Large-scale RGB-D Gesture Recognition

    Authors: Jun Wan, Chi Lin, Longyin Wen, Yunan Li, Qiguang Miao, Sergio Escalera, Gholamreza Anbarjafari, Isabelle Guyon, Guodong Guo, Stan Z. Li

    Abstract: The ChaLearn large-scale gesture recognition challenge has been run twice, in two workshops held in conjunction with the International Conference on Pattern Recognition (ICPR) 2016 and the International Conference on Computer Vision (ICCV) 2017, attracting more than $200$ teams from around the world. This challenge has two tracks, focusing on isolated and continuous gesture recognition, respectively. This paper d…

    Submitted 28 July, 2019; originally announced July 2019.

    Comments: 14 pages, 8 figures, 6 tables

    Journal ref: IEEE Transactions on Cybernetics 2020