Skip to main content

Showing 51–100 of 1,131 results for author: Zhou, M

.
  1. arXiv:2403.16409  [pdf

    astro-ph.IM astro-ph.CO

    Large-scale Array for Radio Astronomy on the Farside

    Authors: Xuelei Chen, Feng Gao, Fengquan Wu, Yechi Zhang, Tong Wang, Weilin Liu, Dali Zou, Furen Deng, Yang Gong, Kai He, Jixia Li, Shijie Sun, Nanben Suo, Yougang Wang, Pengju Wu, Jiaqin Xu, Yidong Xu, Bin Yue, Cong Zhang, Jia Zhou, Minquan Zhou, Chenguang Zhu, Jiacong Zhu

    Abstract: At the Royal Society meeting in 2023, we have mainly presented our lunar orbit array concept called DSL, and also briefly introduced a concept of a lunar surface array, LARAF. As the DSL concept had been presented before, in this article we introduce the LARAF. We propose to build an array in the far side of the Moon, with a master station which handles the data collection and processing, and 20 s… ▽ More

    Submitted 24 March, 2024; originally announced March 2024.

    Comments: final submission version, 30 pages, 16 figures

    Journal ref: Phil. Trans. R. Soc. A.382,20230094(2024)

  2. arXiv:2403.15698  [pdf, other

    cs.CV cs.AI

    SceneX:Procedural Controllable Large-scale Scene Generation via Large-language Models

    Authors: Mengqi Zhou, Jun Hou, Chuanchen Luo, Yuxi Wang, Zhaoxiang Zhang, Junran Peng

    Abstract: Due to its great application potential, large-scale scene generation has drawn extensive attention in academia and industry. Recent research employs powerful generative models to create desired scenes and achieves promising results. However, most of these methods represent the scene using 3D primitives (e.g. point cloud or radiance field) incompatible with the industrial pipeline, which leads to a… ▽ More

    Submitted 22 March, 2024; originally announced March 2024.

  3. arXiv:2403.15483  [pdf

    eess.SP cs.LG

    Rolling bearing fault diagnosis method based on generative adversarial enhanced multi-scale convolutional neural network model

    Authors: Maoxuan Zhou, Wei Kang, Kun He

    Abstract: In order to solve the problem that current convolutional neural networks can not capture the correlation features between the time domain signals of rolling bearings effectively, and the model accuracy is limited by the number and quality of samples, a rolling bearing fault diagnosis method based on generative adversarial enhanced multi-scale convolutional neural network model is proposed. Firstly… ▽ More

    Submitted 21 March, 2024; originally announced March 2024.

  4. arXiv:2403.13583  [pdf, other

    cs.SE cs.CL cs.LG

    CoCoST: Automatic Complex Code Generation with Online Searching and Correctness Testing

    Authors: Xinyi He, Jiaru Zou, Yun Lin, Mengyu Zhou, Shi Han, Zejian Yuan, Dongmei Zhang

    Abstract: Large Language Models have revolutionized code generation ability by converting natural language descriptions into executable code. However, generating complex code within real-world scenarios remains challenging due to intricate structures, subtle bugs, understanding of advanced data types, and lack of supplementary contents. To address these challenges, we introduce the CoCoST framework, which e… ▽ More

    Submitted 1 July, 2024; v1 submitted 20 March, 2024; originally announced March 2024.

  5. arXiv:2403.12027  [pdf, other

    cs.CL cs.AI cs.CV

    From Pixels to Insights: A Survey on Automatic Chart Understanding in the Era of Large Foundation Models

    Authors: Kung-Hsiang Huang, Hou Pong Chan, Yi R. Fung, Haoyi Qiu, Mingyang Zhou, Shafiq Joty, Shih-Fu Chang, Heng Ji

    Abstract: Data visualization in the form of charts plays a pivotal role in data analysis, offering critical insights and aiding in informed decision-making. Automatic chart understanding has witnessed significant advancements with the rise of large foundation models in recent years. Foundation models, such as large language models, have revolutionized various natural language processing tasks and are increa… ▽ More

    Submitted 25 March, 2024; v1 submitted 18 March, 2024; originally announced March 2024.

  6. Measurements of All-Particle Energy Spectrum and Mean Logarithmic Mass of Cosmic Rays from 0.3 to 30 PeV with LHAASO-KM2A

    Authors: The LHAASO Collaboration, Zhen Cao, F. Aharonian, Q. An, A. Axikegu, Y. X. Bai, Y. W. Bao, D. Bastieri, X. J. Bi, Y. J. Bi, J. T. Cai, Q. Cao, W. Y. Cao, Zhe Cao, J. Chang, J. F. Chang, A. M. Chen, E. S. Chen, Liang Chen, Lin Chen, Long Chen, M. J. Chen, M. L. Chen, Q. H. Chen, S. H. Chen , et al. (256 additional authors not shown)

    Abstract: We present the measurements of all-particle energy spectrum and mean logarithmic mass of cosmic rays in the energy range of 0.3-30 PeV using data collected from LHAASO-KM2A between September 2021 and December 2022, which is based on a nearly composition-independent energy reconstruction method, achieving unprecedented accuracy. Our analysis reveals the position of the knee at… ▽ More

    Submitted 26 March, 2024; v1 submitted 15 March, 2024; originally announced March 2024.

    Comments: 8 pages, 3 figures

    Journal ref: Physical Review Letters 132, 131002 (2024)

  7. Likely detection of magnetic field related LFQPO in the soft X-ray re-brightening of GRS~1915+105

    Authors: Ling-Da Kong, Long Ji, Andrea Santangelo, Meng-Lei Zhou, Qing-Cang Shui, Shu Zhang

    Abstract: Utilizing NICER observations, we present an analysis of the soft X-ray re-brightening event of GRS 1915+105 observed in 2021. During this event, we observed the emergence of a stable, long-lasting low-frequency quasi-periodic oscillation (LFQPO) with frequencies ranging from 0.17 to 0.21 Hz. Through a careful spectral analysis, we demonstrate that a low-temperature Compton-thick gas model well cha… ▽ More

    Submitted 14 March, 2024; originally announced March 2024.

    Journal ref: A&A 686, A211 (2024)

  8. arXiv:2403.05567  [pdf, other

    cs.HC

    A Unified Framework for Underwater Metaverse with Optical Perception

    Authors: **gyang Cao, Mu Zhou, Jiacheng Wang, Guangyuan Liu, Dusit Niyato, Shiwen Mao, Zhu Han, Jiawen Kang

    Abstract: With the advancement of AI technology and increasing attention to deep-sea exploration, the underwater Metaverse is gradually emerging. This paper explores the concept of underwater Metaverse, emerging virtual reality systems and services aimed at simulating and enhancing virtual experience of marine environments. First, we discuss potential applications of underwater Metaverse in underwater scien… ▽ More

    Submitted 20 February, 2024; originally announced March 2024.

  9. arXiv:2403.05063  [pdf, other

    cs.IR cs.AI

    Aligning Large Language Models for Controllable Recommendations

    Authors: Wensheng Lu, Jianxun Lian, Wei Zhang, Guanghua Li, Mingyang Zhou, Hao Liao, Xing Xie

    Abstract: Inspired by the exceptional general intelligence of Large Language Models (LLMs), researchers have begun to explore their application in pioneering the next generation of recommender systems - systems that are conversational, explainable, and controllable. However, existing literature primarily concentrates on integrating domain-specific knowledge into LLMs to enhance accuracy, often neglecting th… ▽ More

    Submitted 8 March, 2024; originally announced March 2024.

    Comments: 13 pages

    MSC Class: 68T50

  10. arXiv:2403.04918  [pdf, other

    cs.CR

    Secure Information Embedding and Extraction in Forensic 3D Fingerprinting

    Authors: Canran Wang, **wen Wang, Mi Zhou, Vinh Pham, Senyue Hao, Chao Zhou, Ning Zhang, Netanel Raviv

    Abstract: The prevalence of 3D printing poses a significant risk to public safety, as any individual with internet access and a commodity printer is able to produce untraceable firearms, keys, counterfeit products, etc. To aid government authorities in combating these new security threats, several approaches have been taken to tag 3D-prints with identifying information. Known as fingerprints, this informati… ▽ More

    Submitted 12 June, 2024; v1 submitted 7 March, 2024; originally announced March 2024.

  11. arXiv:2403.02726  [pdf

    econ.GN cs.AI cs.CY

    Bias in Generative AI

    Authors: Mi Zhou, Vibhanshu Abhishek, Timothy Derdenger, Jaymo Kim, Kannan Srinivasan

    Abstract: This study analyzed images generated by three popular generative artificial intelligence (AI) tools - Midjourney, Stable Diffusion, and DALLE 2 - representing various occupations to investigate potential bias in AI generators. Our analysis revealed two overarching areas of concern in these AI generators, including (1) systematic gender and racial biases, and (2) subtle biases in facial expressions… ▽ More

    Submitted 5 March, 2024; originally announced March 2024.

  12. arXiv:2403.00987  [pdf, other

    cs.MA cs.RO eess.SY

    Composite Distributed Learning and Synchronization of Nonlinear Multi-Agent Systems with Complete Uncertain Dynamics

    Authors: Emadodin Jandaghi, Dalton L. Stein, Adam Hoburg, Paolo Stegagno, Mingxi Zhou, Chengzhi Yuan

    Abstract: This paper addresses the problem of composite synchronization and learning control in a network of multi-agent robotic manipulator systems with heterogeneous nonlinear uncertainties under a leader-follower framework. A novel two-layer distributed adaptive learning control strategy is introduced, comprising a first-layer distributed cooperative estimator and a second-layer decentralized determinist… ▽ More

    Submitted 9 May, 2024; v1 submitted 1 March, 2024; originally announced March 2024.

  13. arXiv:2402.17208  [pdf, other

    math.OC

    Solving Time-Continuous Stochastic Optimal Control Problems: Algorithm Design and Convergence Analysis of Actor-Critic Flow

    Authors: Mo Zhou, Jianfeng Lu

    Abstract: We propose an actor-critic framework to solve the time-continuous stochastic optimal control problem. A least square temporal difference method is applied to compute the value function for the critic. The policy gradient method is implemented as policy improvement for the actor. Our key contribution lies in establishing the global convergence property of our proposed actor-critic flow, demonstrati… ▽ More

    Submitted 26 February, 2024; originally announced February 2024.

    Comments: arXiv admin note: text overlap with arXiv:2302.05816

    MSC Class: 93E20 (Primary); 49L12 49M25 (secondary) ACM Class: G.1.6; G.1.8

  14. arXiv:2402.17207  [pdf, other

    cs.CV

    Deployment Prior Injection for Run-time Calibratable Object Detection

    Authors: Mo Zhou, Yiding Yang, Haoxiang Li, Vishal M. Patel, Gang Hua

    Abstract: With a strong alignment between the training and test distributions, object relation as a context prior facilitates object detection. Yet, it turns into a harmful but inevitable training set bias upon test distributions that shift differently across space and time. Nevertheless, the existing detectors cannot incorporate deployment context prior during the test phase without parameter update. Such… ▽ More

    Submitted 26 February, 2024; originally announced February 2024.

  15. arXiv:2402.14436  [pdf

    cond-mat.supr-con

    Structural and resistivity properties of Fe$_{1-x}$Co${_x}$Se single crystals grown by the molten salt method

    Authors: Qiaoyu Wang, Mingwei Ma, Binbin Ruan, Menghu Zhou, Yadong Gu, Qingsong Yang, Lewei Chen, Yunqing Shi, Junkun Yi, Genfu Chen, Zhian Ren

    Abstract: A series of tetragonal Fe$_{1-x}$Co${_x}$Se single crystals with a complete Co do** range (0$\leq$x$\leq$0.52) up to its solid solubility limit in FeSe have been grown by an eutectic AlCl${_3}$/KCl molten salt method. The typical lateral size of as-grown Fe$_{1-x}$Co${_x}$Se single crystals is 1$-$5 mm. The chemical composition and homogeneity of the crystals was examined by both inductively cou… ▽ More

    Submitted 22 February, 2024; originally announced February 2024.

  16. arXiv:2402.14270  [pdf, other

    cs.LG

    Take the Bull by the Horns: Hard Sample-Reweighted Continual Training Improves LLM Generalization

    Authors: Xuxi Chen, Zhendong Wang, Daouda Sow, Junjie Yang, Tianlong Chen, Yingbin Liang, Mingyuan Zhou, Zhangyang Wang

    Abstract: In the rapidly advancing arena of large language models (LLMs), a key challenge is to enhance their capabilities amid a looming shortage of high-quality training data. Our study starts from an empirical strategy for the light continual training of LLMs using their original pre-training data sets, with a specific focus on selective retention of samples that incur moderately high losses. These sampl… ▽ More

    Submitted 1 March, 2024; v1 submitted 21 February, 2024; originally announced February 2024.

    Comments: Preprint; updated reference and related works

  17. arXiv:2402.12192  [pdf, other

    cs.CV

    Pan-Mamba: Effective pan-sharpening with State Space Model

    Authors: Xuanhua He, Ke Cao, Keyu Yan, Rui Li, Chengjun Xie, Jie Zhang, Man Zhou

    Abstract: Pan-sharpening involves integrating information from low-resolution multi-spectral and high-resolution panchromatic images to generate high-resolution multi-spectral counterparts. While recent advancements in the state space model, particularly the efficient long-range dependency modeling achieved by Mamba, have revolutionized computer vision community, its untapped potential in pan-sharpening mot… ▽ More

    Submitted 8 March, 2024; v1 submitted 19 February, 2024; originally announced February 2024.

  18. arXiv:2402.10958  [pdf, other

    cs.CL cs.AI cs.LG

    Relative Preference Optimization: Enhancing LLM Alignment through Contrasting Responses across Identical and Diverse Prompts

    Authors: Yueqin Yin, Zhendong Wang, Yi Gu, Hai Huang, Weizhu Chen, Mingyuan Zhou

    Abstract: In the field of large language models (LLMs), aligning models with the diverse preferences of users is a critical challenge. Direct Preference Optimization (DPO) has played a key role in this area. It works by using pairs of preferences derived from the same prompts, and it functions without needing an additional reward model. However, DPO does not fully reflect the complex nature of human learnin… ▽ More

    Submitted 27 May, 2024; v1 submitted 12 February, 2024; originally announced February 2024.

  19. arXiv:2402.10315  [pdf, other

    astro-ph.HE

    A variable ionized disk wind in MAXI J1803-298 revealed by NICER

    Authors: Zuobin Zhang, Cosimo Bambi, Honghui Liu, Jiachen Jiang, Fangzheng Shi, Yuexin Zhang, Andrew J. Young, John A. Tomsick, Benjamin M. Coughenour, Menglei Zhou

    Abstract: We present the results from the NICER observation data of MAXI J1803-298 across the entire 2021 outburst. In the intermediate and soft state, we detect significant absorption lines at $\sim 7.0$ keV and $\sim 6.7$ keV, arising from the X-ray disk wind outflowing with a velocity of hundreds of km per second along our line of sight. The fitting results from photoionized model suggest that the wind i… ▽ More

    Submitted 15 February, 2024; originally announced February 2024.

  20. arXiv:2402.08265  [pdf, other

    cs.CV

    A Dense Reward View on Aligning Text-to-Image Diffusion with Preference

    Authors: Shentao Yang, Tianqi Chen, Mingyuan Zhou

    Abstract: Aligning text-to-image diffusion model (T2I) with preference has been gaining increasing research attention. While prior works exist on directly optimizing T2I by preference data, these methods are developed under the bandit assumption of a latent reward on the entire diffusion reverse chain, while ignoring the sequential nature of the generation process. This may harm the efficacy and efficiency… ▽ More

    Submitted 12 May, 2024; v1 submitted 13 February, 2024; originally announced February 2024.

    Comments: 41st International Conference on Machine Learning (ICML 2024)

  21. arXiv:2402.06859  [pdf, other

    cs.LG cs.AI cs.IR

    LiRank: Industrial Large Scale Ranking Models at LinkedIn

    Authors: Fedor Borisyuk, Mingzhou Zhou, Qingquan Song, Siyu Zhu, Birjodh Tiwana, Ganesh Parameswaran, Siddharth Dangi, Lars Hertel, Qiang Xiao, Xiaochen Hou, Yunbo Ouyang, Aman Gupta, Sheallika Singh, Dan Liu, Hailing Cheng, Lei Le, Jonathan Hung, Sathiya Keerthi, Ruoyan Wang, Fengyu Zhang, Mohit Kothari, Chen Zhu, Daqi Sun, Yun Dai, Xun Luan , et al. (9 additional authors not shown)

    Abstract: We present LiRank, a large-scale ranking framework at LinkedIn that brings to production state-of-the-art modeling architectures and optimization methods. We unveil several modeling improvements, including Residual DCN, which adds attention and residual connections to the famous DCNv2 architecture. We share insights into combining and tuning SOTA architectures to create a unified model, including… ▽ More

    Submitted 9 February, 2024; originally announced February 2024.

    ACM Class: H.3.3

  22. arXiv:2402.06190  [pdf, other

    cs.CV cs.LG

    Masked LoGoNet: Fast and Accurate 3D Image Analysis for Medical Domain

    Authors: Amin Karimi Monsefi, Payam Karisani, Mengxi Zhou, Stacey Choi, Nathan Doble, Heng Ji, Srinivasan Parthasarathy, Rajiv Ramnath

    Abstract: Standard modern machine-learning-based imaging methods have faced challenges in medical applications due to the high cost of dataset construction and, thereby, the limited labeled training data available. Additionally, upon deployment, these methods are usually used to process a large volume of data on a daily basis, imposing a high maintenance cost on medical facilities. In this paper, we introdu… ▽ More

    Submitted 9 February, 2024; originally announced February 2024.

  23. arXiv:2402.05493  [pdf, other

    cs.SE cs.AI cs.CR

    Investigating White-Box Attacks for On-Device Models

    Authors: Mingyi Zhou, Xiang Gao, **g Wu, Kui Liu, Hailong Sun, Li Li

    Abstract: Numerous mobile apps have leveraged deep learning capabilities. However, on-device models are vulnerable to attacks as they can be easily extracted from their corresponding mobile apps. Existing on-device attacking approaches only generate black-box attacks, which are far less effective and efficient than white-box strategies. This is because mobile deep learning frameworks like TFLite do not supp… ▽ More

    Submitted 1 March, 2024; v1 submitted 8 February, 2024; originally announced February 2024.

    Comments: Published in The International Conference on Software Engineering 2024 (ICSE'24)

  24. arXiv:2402.02263  [pdf, other

    cs.LG cs.AI cs.CV

    MixedNUTS: Training-Free Accuracy-Robustness Balance via Nonlinearly Mixed Classifiers

    Authors: Yatong Bai, Mo Zhou, Vishal M. Patel, Somayeh Sojoudi

    Abstract: Adversarial robustness often comes at the cost of degraded accuracy, impeding the real-life application of robust classification models. Training-based solutions for better trade-offs are limited by incompatibilities with already-trained high-performance large models, necessitating the exploration of training-free ensemble approaches. Observing that robust models are more confident in correct pred… ▽ More

    Submitted 12 April, 2024; v1 submitted 3 February, 2024; originally announced February 2024.

    MSC Class: 68T07

  25. arXiv:2401.13942  [pdf, other

    cs.CV

    StyleInject: Parameter Efficient Tuning of Text-to-Image Diffusion Models

    Authors: Mohan Zhou, Yalong Bai, Qing Yang, Tiejun Zhao

    Abstract: The ability to fine-tune generative models for text-to-image generation tasks is crucial, particularly facing the complexity involved in accurately interpreting and visualizing textual inputs. While LoRA is efficient for language model adaptation, it often falls short in text-to-image tasks due to the intricate demands of image generation, such as accommodating a broad spectrum of styles and nuanc… ▽ More

    Submitted 10 May, 2024; v1 submitted 24 January, 2024; originally announced January 2024.

    Comments: 11 pages, 11 figures

  26. How Are Paid and Volunteer Open Source Developers Different? A Study of the Rust Project

    Authors: Yuxia Zhang, Mian Qin, Klaas-Jan Stol, Minghui Zhou, Hui Liu

    Abstract: It is now commonplace for organizations to pay developers to work on specific open source software (OSS) projects to pursue their business goals. Such paid developers work alongside voluntary contributors, but given the different motivations of these two groups of developers, conflict may arise, which may pose a threat to a project's sustainability. This paper presents an empirical study of paid d… ▽ More

    Submitted 24 January, 2024; originally announced January 2024.

  27. arXiv:2401.11078  [pdf, other

    cs.CV

    UltrAvatar: A Realistic Animatable 3D Avatar Diffusion Model with Authenticity Guided Textures

    Authors: Mingyuan Zhou, Rakib Hyder, Ziwei Xuan, Guojun Qi

    Abstract: Recent advances in 3D avatar generation have gained significant attentions. These breakthroughs aim to produce more realistic animatable avatars, narrowing the gap between virtual and real-world experiences. Most of existing works employ Score Distillation Sampling (SDS) loss, combined with a differentiable renderer and text condition, to guide a diffusion model in generating 3D avatars. However,… ▽ More

    Submitted 19 January, 2024; originally announced January 2024.

    Comments: The project page is at http://usrc-sea.github.io/UltrAvatar/

  28. arXiv:2401.09547  [pdf, other

    math.OC

    A deep learning algorithm for computing mean field control problems via forward-backward score dynamics

    Authors: Mo Zhou, Stanley Osher, Wuchen Li

    Abstract: We propose a deep learning approach to compute mean field control problems with individual noises. The problem consists of the Fokker-Planck (FP) equation and the Hamilton-Jacobi-Bellman (HJB) equation. Using the differential of the entropy, namely the score function, we first formulate the deterministic forward-backward characteristics for the mean field control system, which is different from th… ▽ More

    Submitted 17 January, 2024; originally announced January 2024.

    MSC Class: 49N80 (Primary) 35Q89 (Secondary) ACM Class: G.1.6; G.1.8

  29. arXiv:2401.07039  [pdf, other

    quant-ph cs.LG

    Quantum Generative Diffusion Model: A Fully Quantum-Mechanical Model for Generating Quantum State Ensemble

    Authors: Chuangtao Chen, Qinglin Zhao, MengChu Zhou, Zhimin He, Zhili Sun, Haozhen Situ

    Abstract: Classical diffusion models have shown superior generative results and have been applied to many problems. Exploring these models in the quantum domain can advance the field of quantum generative learning. In this paper, we introduce the Quantum Generative Diffusion Model (QGDM), a simple and elegant quantum counterpart of classical diffusion models. The core idea of QGDM is that any target quant… ▽ More

    Submitted 3 June, 2024; v1 submitted 13 January, 2024; originally announced January 2024.

    Comments: Comments are welcome

  30. arXiv:2401.05212  [pdf, other

    astro-ph.GA

    Outflow-related radio emission in radio-quiet quasars

    Authors: Mai Liao, Junxian Wang, Wenke Ren, Minhua Zhou

    Abstract: In this work, we revisit the relationship between [O III] line width $w_{\rm 90}$ (as the indicator of AGN outflow velocity) and the radio emission in RQQs by employing a large sample of Type I quasars ($\sim 37,000$) selected from the Sloan Digital Sky Survey (SDSS) Data Release Sixteen. By median stacking the radio images (to include the dominant fraction of individually radio non-detected RQQs)… ▽ More

    Submitted 10 January, 2024; originally announced January 2024.

    Comments: 9 pages, 4 figures, accepted by MNRAS

  31. arXiv:2401.03788  [pdf, other

    cs.CV

    Low-light Image Enhancement via CLIP-Fourier Guided Wavelet Diffusion

    Authors: Minglong Xue, **hong He, Wenhai Wang, Mingliang Zhou

    Abstract: Low-light image enhancement techniques have significantly progressed, but unstable image quality recovery and unsatisfactory visual perception are still significant challenges. To solve these problems, we propose a novel and robust low-light image enhancement method via CLIP-Fourier Guided Wavelet Diffusion, abbreviated as CFWD. Specifically, CFWD leverages multimodal visual-language information i… ▽ More

    Submitted 17 April, 2024; v1 submitted 8 January, 2024; originally announced January 2024.

  32. arXiv:2401.03261  [pdf, other

    astro-ph.HE

    The X-ray high-energy cutoff in Compact Symmetric Object Mrk 348

    Authors: Mai Liao, Junxian Wang, Jialai Kang, Xiaofeng Li, Minhua Zhou

    Abstract: Compact radio AGN are thought to be young radio active galactic nuclei (AGN) at the early stage of AGN evolution, thus are ideal laboratory to study the high-energy emission throughout the evolution of radio AGN. In this work, we report for the first time the detection of the high-energy cutoff ($E_{\rm cut}$), a direct indicator of thermal coronal radiation, of X-ray emission in Mrk 348 ($z$ = 0.… ▽ More

    Submitted 10 January, 2024; v1 submitted 6 January, 2024; originally announced January 2024.

    Comments: 7 pages, 5 figures, 2 tables, accepted by MNRAS

  33. arXiv:2401.02539  [pdf, other

    cs.RO cs.CV

    Robot-Assisted Deep Venous Thrombosis Ultrasound Examination using Virtual Fixture

    Authors: Dianye Huang, Chenguang Yang, Mingchuan Zhou, Angelos Karlas, Nassir Navab, Zhongliang Jiang

    Abstract: Deep Venous Thrombosis (DVT) is a common vascular disease with blood clots inside deep veins, which may block blood flow or even cause a life-threatening pulmonary embolism. A typical exam for DVT using ultrasound (US) imaging is by pressing the target vein until its lumen is fully compressed. However, the compression exam is highly operator-dependent. To alleviate intra- and inter-variations, we… ▽ More

    Submitted 4 January, 2024; originally announced January 2024.

    Comments: Accepted Paper IEEE T-ASE

  34. arXiv:2401.02458  [pdf, other

    cs.LG cs.AI

    Data-Centric Foundation Models in Computational Healthcare: A Survey

    Authors: Yunkun Zhang, ** Gao, Zheling Tan, Lingfeng Zhou, Kexin Ding, Mu Zhou, Shaoting Zhang, Dequan Wang

    Abstract: The advent of foundation models (FMs) as an emerging suite of AI techniques has struck a wave of opportunities in computational healthcare. The interactive nature of these models, guided by pre-training data and human instructions, has ignited a data-centric AI paradigm that emphasizes better data characterization, quality, and scale. In healthcare AI, obtaining and processing high-quality clinica… ▽ More

    Submitted 4 January, 2024; originally announced January 2024.

  35. arXiv:2401.02309  [pdf, other

    cs.CV cs.MM

    TR-DETR: Task-Reciprocal Transformer for Joint Moment Retrieval and Highlight Detection

    Authors: Hao Sun, Mingyao Zhou, Wen**g Chen, Wei Xie

    Abstract: Video moment retrieval (MR) and highlight detection (HD) based on natural language queries are two highly related tasks, which aim to obtain relevant moments within videos and highlight scores of each video clip. Recently, several methods have been devoted to building DETR-based networks to solve both MR and HD jointly. These methods simply add two separate task heads after multi-modal feature ext… ▽ More

    Submitted 4 January, 2024; v1 submitted 4 January, 2024; originally announced January 2024.

    Comments: Accepted by AAAI-24

  36. arXiv:2401.02161  [pdf, other

    cs.CV

    Enhancing RAW-to-sRGB with Decoupled Style Structure in Fourier Domain

    Authors: Xuanhua He, Tao Hu, Guoli Wang, Ze** Wang, Run Wang, Qian Zhang, Keyu Yan, Ziyi Chen, Rui Li, Chenjun Xie, Jie Zhang, Man Zhou

    Abstract: RAW to sRGB map**, which aims to convert RAW images from smartphones into RGB form equivalent to that of Digital Single-Lens Reflex (DSLR) cameras, has become an important area of research. However, current methods often ignore the difference between cell phone RAW images and DSLR camera RGB images, a difference that goes beyond the color matrix and extends to spatial structure due to resolution… ▽ More

    Submitted 4 January, 2024; originally announced January 2024.

  37. arXiv:2401.02151  [pdf, other

    cs.CV

    Frequency-Adaptive Pan-Sharpening with Mixture of Experts

    Authors: Xuanhua He, Keyu Yan, Rui Li, Chengjun Xie, Jie Zhang, Man Zhou

    Abstract: Pan-sharpening involves reconstructing missing high-frequency information in multi-spectral images with low spatial resolution, using a higher-resolution panchromatic image as guidance. Although the inborn connection with frequency domain, existing pan-sharpening research has not almost investigated the potential solution upon frequency domain. To this end, we propose a novel Frequency Adaptive Mi… ▽ More

    Submitted 4 January, 2024; originally announced January 2024.

  38. arXiv:2401.00160  [pdf, other

    eess.SP

    Acceleration Estimation of Signal Propagation Path Length Changes for Wireless Sensing

    Authors: Jiacheng Wang, Hongyang Du, Dusit Niyato, Mu Zhou, Jiawen Kang, H. Vincent Poor

    Abstract: As indoor applications grow in diversity, wireless sensing, vital in areas like localization and activity recognition, is attracting renewed interest. Indoor wireless sensing relies on signal processing, particularly channel state information (CSI) based signal parameter estimation. Nonetheless, regarding reflected signals induced by dynamic human targets, no satisfactory algorithm yet exists for… ▽ More

    Submitted 30 December, 2023; originally announced January 2024.

  39. arXiv:2401.00006  [pdf, other

    cs.AI

    Building Open-Ended Embodied Agent via Language-Policy Bidirectional Adaptation

    Authors: Shaopeng Zhai, Jie Wang, Tianyi Zhang, Fuxian Huang, Qi Zhang, Ming Zhou, **g Hou, Yu Qiao, Yu Liu

    Abstract: Building embodied agents on integrating Large Language Models (LLMs) and Reinforcement Learning (RL) have revolutionized human-AI interaction: researchers can now leverage language instructions to plan decision-making for open-ended tasks. However, existing research faces challenges in meeting the requirement of open-endedness. They typically either train LLM/RL models to adapt to a fixed counterp… ▽ More

    Submitted 6 February, 2024; v1 submitted 12 December, 2023; originally announced January 2024.

  40. arXiv:2312.14013  [pdf, ps, other

    stat.ME

    Two-Stage Pseudo Maximum Likelihood Estimation of Semiparametric Copula-based Regression Models for Semi-Competing Risks Data

    Authors: Sakie J. Arachchige, Xinyuan Chen, Qian M. Zhou

    Abstract: We propose a two-stage estimation procedure for a copula-based model with semi-competing risks data, where the non-terminal event is subject to dependent censoring by the terminal event, and both events are subject to independent censoring. Under a copula-based model, the marginal survival functions of individual event times are specified by semiparametric transformation models, and the dependence… ▽ More

    Submitted 21 December, 2023; originally announced December 2023.

    Comments: 24 pages, 1 figure

  41. arXiv:2312.13671  [pdf, other

    cs.CL cs.LG

    Text2Analysis: A Benchmark of Table Question Answering with Advanced Data Analysis and Unclear Queries

    Authors: Xinyi He, Mengyu Zhou, Xinrun Xu, Xiaojun Ma, Rui Ding, Lun Du, Yan Gao, Ran Jia, Xu Chen, Shi Han, Zejian Yuan, Dongmei Zhang

    Abstract: Tabular data analysis is crucial in various fields, and large language models show promise in this area. However, current research mostly focuses on rudimentary tasks like Text2SQL and TableQA, neglecting advanced analysis like forecasting and chart generation. To address this gap, we developed the Text2Analysis benchmark, incorporating advanced analysis tasks that go beyond the SQL-compatible ope… ▽ More

    Submitted 21 December, 2023; originally announced December 2023.

    Comments: Accepted by AAAI'2024

  42. arXiv:2312.13351  [pdf, other

    astro-ph.GA

    A large jet narrow-line Seyfert 1 galaxy: observations from pc to 100 kpc scales

    Authors: Sina Chen, Preeti Kharb, Silpa Sasikumar, Sumana Nandi, Marco Berton, Emilia Jarvela, Ari Laor, Ehud Behar, Luigi Foschini, Amelia Vietri, Minfeng Gu, Giovanni La Mura, Luca Crepaldi, Minhua Zhou

    Abstract: We present new 1.5-8.5 GHz Very Long Baseline Array (VLBA) observations and 0.32-1.26 GHz Giant Meterwave Radio Telescope (GMRT) observations of J0354-1340, which is the only known radio-quiet (RQ) or radio-intermediate (RI) narrow-line Seyfert 1 galaxy with a 100-kpc two-sided radio jet. A pc-scale one-sided jet in the southeast direction from the core emission is found in the VLBA observations,… ▽ More

    Submitted 20 December, 2023; originally announced December 2023.

    Comments: Accepted for publication in ApJ

  43. arXiv:2312.10160  [pdf, other

    cs.CL

    Do LVLMs Understand Charts? Analyzing and Correcting Factual Errors in Chart Captioning

    Authors: Kung-Hsiang Huang, Mingyang Zhou, Hou Pong Chan, Yi R. Fung, Zhenhailong Wang, Lingyu Zhang, Shih-Fu Chang, Heng Ji

    Abstract: Recent advancements in large vision-language models (LVLMs) have led to significant progress in generating natural language descriptions for visual content and thus enhancing various applications. One issue with these powerful models is that they sometimes produce texts that are factually inconsistent with the visual input. While there has been some effort to mitigate such inconsistencies in natur… ▽ More

    Submitted 30 May, 2024; v1 submitted 15 December, 2023; originally announced December 2023.

    Comments: ACL 2024 Findings

  44. arXiv:2312.09576  [pdf, other

    eess.IV cs.CV

    SegRap2023: A Benchmark of Organs-at-Risk and Gross Tumor Volume Segmentation for Radiotherapy Planning of Nasopharyngeal Carcinoma

    Authors: Xiangde Luo, Jia Fu, Yunxin Zhong, Shuolin Liu, Bing Han, Mehdi Astaraki, Simone Bendazzoli, Iuliana Toma-Dasu, Yiwen Ye, Ziyang Chen, Yong Xia, Yanzhou Su, ** Ye, Junjun He, Zhaohu Xing, Hongqiu Wang, Lei Zhu, Kaixiang Yang, Xin Fang, Zhiwei Wang, Chan Woong Lee, Sang Joon Park, Jaehee Chun, Constantin Ulrich, Klaus H. Maier-Hein , et al. (17 additional authors not shown)

    Abstract: Radiation therapy is a primary and effective NasoPharyngeal Carcinoma (NPC) treatment strategy. The precise delineation of Gross Tumor Volumes (GTVs) and Organs-At-Risk (OARs) is crucial in radiation treatment, directly impacting patient prognosis. Previously, the delineation of GTVs and OARs was performed by experienced radiation oncologists. Recently, deep learning has achieved promising results… ▽ More

    Submitted 15 December, 2023; originally announced December 2023.

    Comments: A challenge report of SegRap2023 (organized in conjunction with MICCAI2023)

  45. arXiv:2312.09050  [pdf, other

    cs.AI

    A Sparse Cross Attention-based Graph Convolution Network with Auxiliary Information Awareness for Traffic Flow Prediction

    Authors: Lingqiang Chen, Qinglin Zhao, Guanghui Li, Mengchu Zhou, Chenglong Dai, Yiming Feng

    Abstract: Deep graph convolution networks (GCNs) have recently shown excellent performance in traffic prediction tasks. However, they face some challenges. First, few existing models consider the influence of auxiliary information, i.e., weather and holidays, which may result in a poor grasp of spatial-temporal dynamics of traffic data. Second, both the construction of a dynamic adjacent matrix and regular… ▽ More

    Submitted 14 December, 2023; originally announced December 2023.

  46. arXiv:2312.09039  [pdf, other

    cs.CL cs.AI

    TAP4LLM: Table Provider on Sampling, Augmenting, and Packing Semi-structured Data for Large Language Model Reasoning

    Authors: Yuan Sui, Jiaru Zou, Mengyu Zhou, Xinyi He, Lun Du, Shi Han, Dongmei Zhang

    Abstract: Table-based reasoning has shown remarkable progress in combining deep models with discrete reasoning, which requires reasoning over both free-form natural language (NL) questions and semi-structured tabular data. However, previous table reasoning solutions only consider small-sized tables and exhibit limitations in handling larger tables. In addition, most existing methods struggle to reason over… ▽ More

    Submitted 17 February, 2024; v1 submitted 14 December, 2023; originally announced December 2023.

  47. arXiv:2312.08716  [pdf, other

    cond-mat.mes-hall

    Induced magneto-conductivity in a two-node Weyl semimetal under Gaussian random disorder

    Authors: Chuan-Xiong Xu, Hao-** Yu, Mei Zhou, Xuanting Ji

    Abstract: Measuring the magnetoconductivity induced from impurities may help determine the impurity distribution and reveal the structure of a Weyl semimetal sample. To verify this, we utilized the Gaussian random disorder to simulate charged impurities in a two-node Weyl semimetal model and investigate the impact of charged impurities on magnetoconductivity in Weyl semimetals. We first compute the longitud… ▽ More

    Submitted 14 December, 2023; originally announced December 2023.

    Comments: 16 pages,7 figures

  48. arXiv:2312.05006  [pdf, other

    cs.CV

    Decoupling Degradation and Content Processing for Adverse Weather Image Restoration

    Authors: Xi Wang, Xueyang Fu, Peng-Tao Jiang, Jie Huang, Mi Zhou, Bo Li, Zheng-Jun Zha

    Abstract: Adverse weather image restoration strives to recover clear images from those affected by various weather types, such as rain, haze, and snow. Each weather type calls for a tailored degradation removal approach due to its unique impact on images. Conversely, content reconstruction can employ a uniform approach, as the underlying image content remains consistent. Although previous techniques can han… ▽ More

    Submitted 8 December, 2023; originally announced December 2023.

  49. arXiv:2312.04767  [pdf, other

    eess.SY

    Finite Horizon Reinforcement Learning in Solving Optimal Control of State-Dependent Switched Systems

    Authors: Mi Zhou

    Abstract: In this article, the deep deterministic policy gradient (DDPG) method is used to learn an optimal control policy of a multi-region state-dependent switched system. We observe good performance of this model-free method and explain it in a rigorous mathematical language. The performance of the learning-based methods is compared with the optimal solution given by vanilla differential dynamic programm… ▽ More

    Submitted 14 December, 2023; v1 submitted 7 December, 2023; originally announced December 2023.

  50. arXiv:2312.04257  [pdf, other

    cs.AR

    Proxima: Near-storage Acceleration for Graph-based Approximate Nearest Neighbor Search in 3D NAND

    Authors: Weihong Xu, Junwei Chen, Po-Kai Hsu, Jaeyoung Kang, Minxuan Zhou, Sumukh **e, Shimeng Yu, Tajana Rosing

    Abstract: Approximate nearest neighbor search (ANNS) plays an indispensable role in a wide variety of applications, including recommendation systems, information retrieval, and semantic search. Among the cutting-edge ANNS algorithms, graph-based approaches provide superior accuracy and scalability on massive datasets. However, the best-performing graph-based ANN search solutions incur tens of hundreds of me… ▽ More

    Submitted 7 December, 2023; originally announced December 2023.