Showing 101–150 of 3,457 results for author: Chen, M

  1. arXiv:2405.10128  [pdf, other]

    cs.CL cs.AI

    Red Teaming Language Models for Contradictory Dialogues

    Authors: Xiaofei Wen, Bangzheng Li, Tenghao Huang, Muhao Chen

    Abstract: Most language models currently available are prone to self-contradiction during dialogues. To mitigate this issue, this study explores a novel contradictory dialogue processing task that aims to detect and modify contradictory statements in a conversation. This task is inspired by research on context faithfulness and dialogue comprehension, which have demonstrated that the detection and understand…

    Submitted 16 May, 2024; v1 submitted 16 May, 2024; originally announced May 2024.

    Comments: 18 pages, 5 figures

  2. arXiv:2405.09910  [pdf, other]

    hep-ex astro-ph.IM physics.ins-det

    Performance testing of a novel short axis photomultiplier tube for the HUNT project

    Authors: Yijiang Peng, Zike Wang, Bo Gao, Yiyue Tang, Mingjun Chen, Kai Li, Ling Ren, Xiaohao You, Maoyuan Liu

    Abstract: Photomultiplier tubes (PMTs) with large-area cathodes are increasingly being used in cosmic-ray experiments to enhance detection efficiency. The optical modules (OMs) of the High-Energy Underwater Neutrino Telescope (HUNT) have employed a brand new N6205 20-inch microchannel plate photomultiplier tube (MCP-PMT) developed by the North Night Vision Science & Technology (Nanjing) Research Institute C…

    Submitted 16 May, 2024; originally announced May 2024.

  3. arXiv:2405.09066  [pdf, other]

    hep-ex

    Search for the leptonic decays $D^{*+}\to e^+ν_e$ and $D^{*+}\to μ^+ν_μ$

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, M. Albrecht, R. Aliberti, A. Amoroso, M. R. An, Q. An, Y. Bai, O. Bakina, R. Baldini Ferroli, I. Balossino, Y. Ban, V. Batozskaya, D. Becker, K. Begzsuren, N. Berger, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, J. Bloms, A. Bortone, I. Boyko, et al. (559 additional authors not shown)

    Abstract: We present the first search for the leptonic decays $D^{*+}\to e^+ν_e$ and $D^{*+}\to μ^+ν_μ$ by analyzing a data sample of electron-positron collisions recorded with the BESIII detector at center-of-mass energies between 4.178 and 4.226 GeV, corresponding to an integrated luminosity of 6.32~fb$^{-1}$. No significant signal is observed. The upper limits on the branching fractions for…

    Submitted 14 May, 2024; originally announced May 2024.

    Comments: 14 pages, 7 figures

  4. arXiv:2405.09050  [pdf, other]

    cs.CV

    3D Shape Augmentation with Content-Aware Shape Resizing

    Authors: Mingxiang Chen, Jian Zhang, Boli Zhou, Yang Song

    Abstract: Recent advancements in deep learning for 3D models have propelled breakthroughs in generation, detection, and scene understanding. However, the effectiveness of these algorithms hinges on large training datasets. We address the challenge by introducing Efficient 3D Seam Carving (E3SC), a novel 3D model augmentation method based on seam carving, which progressively deforms only part of the input mo…

    Submitted 14 May, 2024; originally announced May 2024.

  5. arXiv:2405.08984  [pdf]

    cond-mat.mes-hall physics.app-ph

    Charge-Transfer Hyperbolic Polaritons in $α$-MoO$_3$/graphene heterostructures

    Authors: J. Shen, M. Chen, V. Korostelev, H. Kim, P. Fathi-Hafshejani, M. Mahjouri-Samani, K. Klyukin, G-H. Lee, S. Dai

    Abstract: Charge transfer is a fundamental interface process that can be harnessed for light detection, photovoltaics, and photosynthesis. Recently, charge transfer was exploited in nanophotonics to alter plasmon polaritons by involving additional non-polaritonic materials to activate the charge transfer. Yet, direct charge transfer between polaritonic materials hasn't been demonstrated. We report the direc…

    Submitted 14 May, 2024; originally announced May 2024.

    Journal ref: Applied Physics Reviews 11, 021409 (2024)

  6. arXiv:2405.08748  [pdf, other]

    cs.CV

    Hunyuan-DiT: A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding

    Authors: Zhimin Li, Jianwei Zhang, Qin Lin, Jiangfeng Xiong, Yanxin Long, Xinchi Deng, Yingfang Zhang, Xingchao Liu, Minbin Huang, Zedong Xiao, Dayou Chen, Jiajun He, Jiahao Li, Wenyue Li, Chen Zhang, Rongwei Quan, Jianxiang Lu, Jiabin Huang, Xiaoyan Yuan, Xiaoxiao Zheng, Yixuan Li, Jihong Zhang, Chao Zhang, Meng Chen, Jie Liu, et al. (20 additional authors not shown)

    Abstract: We present Hunyuan-DiT, a text-to-image diffusion transformer with fine-grained understanding of both English and Chinese. To construct Hunyuan-DiT, we carefully design the transformer structure, text encoder, and positional encoding. We also build from scratch a whole data pipeline to update and evaluate data for iterative model optimization. For fine-grained language understanding, we train a Mu…

    Submitted 14 May, 2024; originally announced May 2024.

    Comments: Project Page: https://dit.hunyuan.tencent.com/

  7. arXiv:2405.07741  [pdf, other]

    hep-ex

    Search for the radiative transition $χ_{c1}(3872)\toγψ_2(3823)$

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, O. Afedulidis, X. C. Ai, R. Aliberti, A. Amoroso, M. R. An, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, et al. (635 additional authors not shown)

    Abstract: Using 9.0 $\rm fb^{-1}$ of $e^+e^-$ collision data collected at center-of-mass energies from 4.178 to 4.278 GeV with the BESIII detector at the BEPCII collider, we perform the first search for the radiative transition $χ_{c1}(3872)\toγψ_2(3823)$. No $χ_{c1}(3872)\toγψ_2(3823)$ signal is observed. The upper limit on the ratio of branching fractions…

    Submitted 13 May, 2024; originally announced May 2024.

    Comments: 8 pages, 2 figures

  8. arXiv:2405.07691  [pdf, other]

    astro-ph.HE

    Discovery of Very-high-energy Gamma-ray Emissions from the Low Luminosity AGN NGC 4278 by LHAASO

    Authors: Zhen Cao, F. Aharonian, Q. An, Axikegu, Y. X. Bai, Y. W. Bao, D. Bastieri, X. J. Bi, Y. J. Bi, J. T. Cai, Q. Cao, W. Y. Cao, Zhe Cao, J. Chang, J. F. Chang, A. M. Chen, E. S. Chen, Liang Chen, Lin Chen, Long Chen, M. J. Chen, M. L. Chen, Q. H. Chen, S. H. Chen, S. Z. Chen, et al. (255 additional authors not shown)

    Abstract: The first source catalog of the Large High Altitude Air Shower Observatory reported the detection of a very-high-energy gamma-ray source, 1LHAASO J1219+2915. In this paper, a further detailed study of the spectral and temporal behavior of this point-like source has been carried out. The best-fit position of the TeV source ($\rm{RA}=185.05^{\circ}\pm0.04^{\circ}$, $\rm{Dec}=29.25^{\circ}\pm0.03^{\circ}$) i…

    Submitted 13 May, 2024; originally announced May 2024.

    Comments: 11 pages, 5 figures

  9. arXiv:2405.06590  [pdf, other]

    physics.ao-ph cs.LG

    Decomposing weather forecasting into advection and convection with neural networks

    Authors: Mengxuan Chen, Ziqi Yuan, Jinxiao Zhang, Runmin Dong, Haohuan Fu

    Abstract: Operational weather forecasting models have advanced for decades on both the explicit numerical solvers and the empirical physical parameterization schemes. However, the high computational costs and uncertainties of these existing schemes call for improvements through alternative machine learning methods. Previous works use a unified model to learn the dynamics and physics…

    Submitted 10 May, 2024; originally announced May 2024.

  10. arXiv:2405.06393  [pdf, other]

    hep-ex

    Measurement of the ${e}^{+}{e}^{-}\to p \bar{p}π^{0}$ cross section at $\sqrt{s}=2.1000-3.0800$ GeV

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, O. Afedulidis, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere, et al. (639 additional authors not shown)

    Abstract: The process $e^{+}e^{-}\to p\bar{p}π^{0}$ is studied at 20 center-of-mass energies ranging from 2.1000 to 3.0800 GeV using 636.8 pb$^{-1}$ of data collected with the BESIII detector operating at the BEPCII collider. The Born cross sections for $e^{+}e^{-}\to p\bar{p}π^{0}$ are measured with high precision. Since the lowest center-of-mass energy, 2.1000 GeV, is less than 90 MeV above the…

    Submitted 10 May, 2024; originally announced May 2024.

  11. arXiv:2405.06345  [pdf, other]

    cs.CV

    Evaluating Adversarial Robustness in the Spatial Frequency Domain

    Authors: Keng-Hsin Liao, Chin-Yuan Yeh, Hsi-Wen Chen, Ming-Syan Chen

    Abstract: Convolutional Neural Networks (CNNs) have dominated the majority of computer vision tasks. However, CNNs' vulnerability to adversarial attacks has raised concerns about deploying these models to safety-critical applications. In contrast, the Human Visual System (HVS), which utilizes spatial frequency channels to process visual signals, is immune to adversarial attacks. As such, this paper presents…

    Submitted 10 May, 2024; originally announced May 2024.

    Comments: 14 pages

  12. arXiv:2405.05674  [pdf]

    cs.CV physics.med-ph

    TransAnaNet: Transformer-based Anatomy Change Prediction Network for Head and Neck Cancer Patient Radiotherapy

    Authors: Meixu Chen, Kai Wang, Michael Dohopolski, Howard Morgan, David Sher, Jing Wang

    Abstract: Early identification of head and neck cancer (HNC) patients who would experience significant anatomical change during radiotherapy (RT) is important to optimize patient clinical benefit and treatment resources. This study aims to assess the feasibility of using a vision-transformer (ViT) based neural network to predict RT-induced anatomic change in HNC patients. We retrospectively included 121 HNC…

    Submitted 22 May, 2024; v1 submitted 9 May, 2024; originally announced May 2024.

  13. arXiv:2405.05488  [pdf]

    cs.CV physics.med-ph

    Advancing Head and Neck Cancer Survival Prediction via Multi-Label Learning and Deep Model Interpretation

    Authors: Meixu Chen, Kai Wang, Jing Wang

    Abstract: A comprehensive and reliable survival prediction model is of great importance to assist in the personalized management of Head and Neck Cancer (HNC) patients treated with curative Radiation Therapy (RT). In this work, we propose IMLSP, an Interpretable Multi-Label multi-modal deep Survival Prediction framework for predicting multiple HNC survival outcomes simultaneously and provide time-event spec…

    Submitted 8 May, 2024; originally announced May 2024.

    Comments: 10 pages, 4 figures, 2 tables, 2 pages of supplementary material

  14. arXiv:2405.04803  [pdf, other]

    cs.CR cs.NI

    Blockchains for Internet of Things: Fundamentals, Applications, and Challenges

    Authors: Yusen Wu, Ye Hu, Mingzhe Chen, Yelena Yesha, Mérouane Debbah

    Abstract: Internet of Things (IoT) services necessitate the storage, transmission, and analysis of diverse data for inference, autonomy, and control. Blockchains, with their inherent properties of decentralization and security, offer efficient database solutions for these devices through consensus-based data sharing. However, it's essential to recognize that not every blockchain system is suitable for speci…

    Submitted 14 June, 2024; v1 submitted 8 May, 2024; originally announced May 2024.

  15. arXiv:2405.04765  [pdf, other]

    cs.LG cs.AI cs.DC

    When Foresight Pruning Meets Zeroth-Order Optimization: Efficient Federated Learning for Low-Memory Devices

    Authors: Pengyu Zhang, Yingjie Liu, Yingbo Zhou, Xiao Du, Xian Wei, Ting Wang, Mingsong Chen

    Abstract: Although Federated Learning (FL) enables collaborative learning in Artificial Intelligence of Things (AIoT) design, it fails to work on low-memory AIoT devices due to its heavy memory usage. To address this problem, various federated pruning methods are proposed to reduce memory usage during inference. However, few of them can substantially mitigate the memory burdens during pruning and training.…

    Submitted 7 May, 2024; originally announced May 2024.

  16. arXiv:2405.04029  [pdf, other]

    cs.CR

    Enabling Privacy-Preserving and Publicly Auditable Federated Learning

    Authors: Huang Zeng, Anjia Yang, Jian Weng, Min-Rong Chen, Fengjun Xiao, Yi Liu, Ye Yao

    Abstract: Federated learning (FL) has attracted widespread attention because it supports the joint training of models by multiple participants without moving private datasets. However, there are still many security issues in FL that deserve discussion. In this paper, we consider three major issues: 1) how to ensure that the training process can be publicly audited by any third party; 2) how to avoid the infl…

    Submitted 7 May, 2024; originally announced May 2024.

    Comments: ICC 2024 - 2024 IEEE International Conference on Communications Conference Program

    ACM Class: C.2.2; C.2.4; E.3

  17. arXiv:2405.02872  [pdf, ps, other]

    math.NA

    The weighted and shifted seven-step BDF method for parabolic equations

    Authors: Georgios Akrivis, Minghua Chen, Fan Yu

    Abstract: Stability of the BDF methods of order up to five for parabolic equations can be established by the energy technique via Nevanlinna--Odeh multipliers. The nonexistence of Nevanlinna--Odeh multipliers makes the six-step BDF method special; however, the energy technique was recently extended by the authors in [Akrivis et al., SIAM J. Numer. Anal. \textbf{59} (2021) 2449--2472] and covers all six stab…

    Submitted 5 May, 2024; originally announced May 2024.

    Comments: 23 pages

  18. arXiv:2405.02678  [pdf, other]

    cs.LG cs.AI cs.CV

    Position: Quo Vadis, Unsupervised Time Series Anomaly Detection?

    Authors: M. Saquib Sarfraz, Mei-Yen Chen, Lukas Layer, Kunyu Peng, Marios Koulakis

    Abstract: The current state of machine learning scholarship in Timeseries Anomaly Detection (TAD) is plagued by the persistent use of flawed evaluation metrics, inconsistent benchmarking practices, and a lack of proper justification for the choices made in novel deep learning-based model designs. Our paper presents a critical analysis of the status quo in TAD, revealing the misleading track of current resea…

    Submitted 5 June, 2024; v1 submitted 4 May, 2024; originally announced May 2024.

    Comments: ICML 2024

  19. arXiv:2405.02351  [pdf, other]

    cs.LG cs.AI cs.DC physics.optics

    Towards General Neural Surrogate Solvers with Specialized Neural Accelerators

    Authors: Chenkai Mao, Robert Lupoiu, Tianxiang Dai, Mingkun Chen, Jonathan A. Fan

    Abstract: Surrogate neural network-based partial differential equation (PDE) solvers have the potential to solve PDEs in an accelerated manner, but they are largely limited to systems featuring fixed domain sizes, geometric layouts, and boundary conditions. We propose Specialized Neural Accelerator-Powered Domain Decomposition Methods (SNAP-DDM), a DDM-based approach to PDE solving in which subdomain proble…

    Submitted 14 June, 2024; v1 submitted 2 May, 2024; originally announced May 2024.

    Comments: 9 pages, 7 Figures, to be published in ICML 2024

  20. arXiv:2405.01515  [pdf, other]

    cs.IT eess.SP

    Model-based Deep Learning for Rate Split Multiple Access in Vehicular Communications

    Authors: Hanwen Zhang, Mingzhe Chen, Alireza Vahid, Haijian Sun

    Abstract: Rate split multiple access (RSMA) has been proven as an effective communication scheme for 5G and beyond, especially in vehicular scenarios. However, RSMA requires complicated iterative algorithms for proper resource allocation, which cannot fulfill the stringent latency requirement in resource constrained vehicles. Although data driven approaches can alleviate this issue, they suffer from poor ge…

    Submitted 2 May, 2024; originally announced May 2024.

    Comments: submitted to IEEE conference

  21. arXiv:2405.01510  [pdf, other]

    cs.SI cs.DB

    Reverse Influential Community Search Over Social Networks (Technical Report)

    Authors: Qi Wen, Nan Zhang, Yutong Ye, Xiang Lian, Mingsong Chen

    Abstract: As an important fundamental task of numerous real-world applications such as social network analysis and online advertising/marketing, several prior works studied influential community search, which retrieves a community with high structural cohesiveness and maximum influences on other users in social networks. However, previous works usually considered the influences of the community on arbitrary…

    Submitted 7 May, 2024; v1 submitted 2 May, 2024; originally announced May 2024.

  22. arXiv:2405.01413  [pdf, other]

    cs.CV cs.AI cs.CL cs.LG

    MiniGPT-3D: Efficiently Aligning 3D Point Clouds with Large Language Models using 2D Priors

    Authors: Yuan Tang, Xu Han, Xianzhi Li, Qiao Yu, Yixue Hao, Long Hu, Min Chen

    Abstract: Large 2D vision-language models (2D-LLMs) have gained significant attention by bridging Large Language Models (LLMs) with images using a simple projector. Inspired by their success, large 3D point cloud-language models (3D-LLMs) also integrate point clouds into LLMs. However, directly aligning point clouds with LLM requires expensive training costs, typically in hundreds of GPU-hours on A100, whic…

    Submitted 2 May, 2024; originally announced May 2024.

    Comments: 17 pages, 9 figures

  23. arXiv:2405.00627  [pdf, other]

    eess.SY cs.LG

    Koopman-based Deep Learning for Nonlinear System Estimation

    Authors: Zexin Sun, Mingyu Chen, John Baillieul

    Abstract: Nonlinear differential equations are encountered as models of fluid flow, spiking neurons, and many other systems of interest in the real world. Common features of these systems are that their behaviors are difficult to describe exactly and invariably unmodeled dynamics present challenges in making precise predictions. In many cases the models exhibit extremely complicated behavior due to bifurcat…

    Submitted 1 May, 2024; originally announced May 2024.

    Comments: 11 pages

  24. arXiv:2405.00622  [pdf, other]

    cs.CL cs.AI cs.LG

    Causal Evaluation of Language Models

    Authors: Sirui Chen, Bo Peng, Meiqi Chen, Ruiqi Wang, Mengying Xu, Xingyu Zeng, Rui Zhao, Shengjie Zhao, Yu Qiao, Chaochao Lu

    Abstract: Causal reasoning is viewed as crucial for achieving human-level machine intelligence. Recent advances in language models have expanded the horizons of artificial intelligence across various domains, sparking inquiries into their potential for causal reasoning. In this work, we introduce Causal evaluation of Language Models (CaLM), which, to the best of our knowledge, is the first comprehensive ben…

    Submitted 1 May, 2024; originally announced May 2024.

    Comments: 315 pages, 230 figures, 21 tables. Project website: https://opencausalab.github.io/CaLM

  25. arXiv:2405.00483  [pdf, other]

    cs.CV cs.MM

    In Anticipation of Perfect Deepfake: Identity-anchored Artifact-agnostic Detection under Rebalanced Deepfake Detection Protocol

    Authors: Wei-Han Wang, Chin-Yuan Yeh, Hsi-Wen Chen, De-Nian Yang, Ming-Syan Chen

    Abstract: As deep generative models advance, we anticipate deepfakes achieving "perfection", generating no discernible artifacts or noise. However, current deepfake detectors, intentionally or inadvertently, rely on such artifacts for detection, as they are exclusive to deepfakes and absent in genuine examples. To bridge this gap, we introduce the Rebalanced Deepfake Detection Protocol (RDDP) to stress-test…

    Submitted 1 May, 2024; originally announced May 2024.

  26. arXiv:2404.19384  [pdf, other]

    cs.CV cs.AI

    Pseudo Label Refinery for Unsupervised Domain Adaptation on Cross-dataset 3D Object Detection

    Authors: Zhanwei Zhang, Minghao Chen, Shuai Xiao, Liang Peng, Hengjia Li, Binbin Lin, Ping Li, Wenxiao Wang, Boxi Wu, Deng Cai

    Abstract: Recent self-training techniques have shown notable improvements in unsupervised domain adaptation for 3D object detection (3D UDA). These techniques typically select pseudo labels, i.e., 3D boxes, to supervise models for the target domain. However, this selection process inevitably introduces unreliable 3D boxes, in which 3D points cannot be definitively assigned as foreground or background. Previ…

    Submitted 30 April, 2024; originally announced April 2024.

    Comments: Accepted by CVPR2024

  27. arXiv:2404.19330  [pdf, other]

    cs.CV cs.AI

    G2LTraj: A Global-to-Local Generation Approach for Trajectory Prediction

    Authors: Zhanwei Zhang, Zishuo Hua, Minghao Chen, Wei Lu, Binbin Lin, Deng Cai, Wenxiao Wang

    Abstract: Predicting future trajectories of traffic agents accurately holds substantial importance in various applications such as autonomous driving. Previous methods commonly infer all future steps of an agent either recursively or simultaneously. However, the recursive strategy suffers from the accumulated error, while the simultaneous strategy overlooks the constraints among future steps, resulting in k…

    Submitted 30 April, 2024; originally announced April 2024.

    Comments: Accepted by IJCAI 2024

  28. arXiv:2404.19193  [pdf]

    cond-mat.mtrl-sci physics.optics physics.plasm-ph

    Tunable Collective Excitations in Epitaxial Perovskite Nickelates

    Authors: Mengxia Sun, Xu He, Mingyao Chen, Chi Sin Tang, Xiongfang Liu, Liang Dai, Jishan Liu, Zhigang Zeng, Shuo Sun, Mark B. H. Breese, Chuanbing Cai, Yingge Du, Le Wang, Andrew T. S. Wee, Xinmao Yin

    Abstract: The formation of plasmons through the collective excitation of charge density has generated intense discussions, offering insights to fundamental sciences and potential applications. While the underlying physical principles have been well-established, the effects of many-body interactions and orbital hybridization on plasmonic dynamics remain understudied. In this work, we present the observation…

    Submitted 1 June, 2024; v1 submitted 29 April, 2024; originally announced April 2024.

  29. arXiv:2404.18929  [pdf, other]

    cs.CV

    DGE: Direct Gaussian 3D Editing by Consistent Multi-view Editing

    Authors: Minghao Chen, Iro Laina, Andrea Vedaldi

    Abstract: We consider the problem of editing 3D objects and scenes based on open-ended language instructions. The established paradigm to solve this problem is to use a 2D image generator or editor to guide the 3D editing process. However, this is often slow as it requires updating a computationally expensive 3D representation such as a neural radiance field, and to do so by using contradictory guidance f…

    Submitted 29 April, 2024; originally announced April 2024.

    Comments: Project Page: https://silent-chen.github.io/DGE/

  30. arXiv:2404.18412  [pdf]

    cond-mat.mtrl-sci cond-mat.str-el

    Uncovering an Interfacial Band Resulting from Orbital Hybridization in Nickelate Heterostructures

    Authors: Mingyao Chen, Huimin Liu, Xu He, Minjuan Li, Chi Sin Tang, Mengxia Sun, Krishna Prasad Koirala, Mark E. Bowden, Yangyang Li, Xiongfang Liu, Difan Zhou, Shuo Sun, Mark B. H. Breese, Chuanbing Cai, Yingge Du, Andrew T. S. Wee, Le Wang, Xinmao Yin

    Abstract: The interaction of atomic orbitals at the interface of perovskite oxide heterostructures has been investigated for its profound impact on the band structures and electronic properties, giving rise to unique electronic states and a variety of tunable functionalities. In this study, we conducted an extensive investigation of the optical and electronic properties of epitaxial NdNiO3 thin films grown…

    Submitted 29 April, 2024; originally announced April 2024.

    Comments: 26 pages, 4 figures

  31. arXiv:2404.17571  [pdf, other]

    cs.CV

    Tunnel Try-on: Excavating Spatial-temporal Tunnels for High-quality Virtual Try-on in Videos

    Authors: Zhengze Xu, Mengting Chen, Zhao Wang, Linyu Xing, Zhonghua Zhai, Nong Sang, Jinsong Lan, Shuai Xiao, Changxin Gao

    Abstract: Video try-on is a challenging task and has not been well tackled in previous works. The main obstacle lies in preserving the details of the clothing and modeling the coherent motions simultaneously. Faced with those difficulties, we address video try-on by proposing a diffusion-based framework named "Tunnel Try-on." The core idea is excavating a "focus tunnel" in the input video that gives close-u…

    Submitted 26 April, 2024; originally announced April 2024.

    Comments: Project Page: https://mengtingchen.github.io/tunnel-try-on-page/

  32. arXiv:2404.16743  [pdf, other]

    cs.CL cs.SD eess.AS

    Automatic Speech Recognition System-Independent Word Error Rate Estimation

    Authors: Chanho Park, Mingjie Chen, Thomas Hain

    Abstract: Word error rate (WER) is a metric used to evaluate the quality of transcriptions produced by Automatic Speech Recognition (ASR) systems. In many applications, it is of interest to estimate WER given a pair of a speech utterance and a transcript. Previous work on WER estimation focused on building models that are trained with a specific ASR system in mind (referred to as ASR system-dependent). Thes…

    Submitted 26 April, 2024; v1 submitted 25 April, 2024; originally announced April 2024.

    Comments: Accepted to LREC-COLING 2024 (long)

  33. arXiv:2404.16685  [pdf, other]

    cs.CV cs.AI

    Multi-scale HSV Color Feature Embedding for High-fidelity NIR-to-RGB Spectrum Translation

    Authors: Huiyu Zhai, Mo Chen, Xingxing Yang, Gusheng Kang

    Abstract: The NIR-to-RGB spectral domain translation is a formidable task due to the inherent spectral mapping ambiguities within NIR inputs and RGB outputs. Thus, existing methods fail to reconcile the tension between maintaining texture detail fidelity and achieving diverse color variations. In this paper, we propose a Multi-scale HSV Color Feature Embedding Network (MCFNet) that decomposes the mapping pr…

    Submitted 25 April, 2024; originally announced April 2024.

  34. arXiv:2404.15777  [pdf, other]

    cs.CL

    A Comprehensive Survey on Evaluating Large Language Model Applications in the Medical Industry

    Authors: Yining Huang, Keke Tang, Meilian Chen, Boyuan Wang

    Abstract: Since the inception of the Transformer architecture in 2017, Large Language Models (LLMs) such as GPT and BERT have evolved significantly, impacting various industries with their advanced capabilities in language understanding and generation. These models have shown potential to transform the medical field, highlighting the necessity for specialized evaluation frameworks to ensure their effective…

    Submitted 29 May, 2024; v1 submitted 24 April, 2024; originally announced April 2024.

    Comments: 42 pages, 1 figure

  35. arXiv:2404.15603  [pdf]

    quant-ph

    Development of Pattern Recognition Validation for Boson Sampling

    Authors: Yang Ji, Yongzheng Wu, Shi Wang, Jie Hou, Meiling Chen, Ming Ni

    Abstract: Boson sampling is one of the most attractive quantum computation models to demonstrate the quantum computational advantage. However, this aim may be hard to realize considering noise sources such as photon distinguishability. Inspired by the Bayesian validation developed to evaluate whether photon distinguishability is too high to demonstrate the quantum computational advantage, we develop the pat…

    Submitted 23 April, 2024; originally announced April 2024.

  36. arXiv:2404.14890  [pdf, other]

    cs.CV

    DENOISER: Rethinking the Robustness for Open-Vocabulary Action Recognition

    Authors: Haozhe Cheng, Cheng Ju, Haicheng Wang, Jinxiang Liu, Mengting Chen, Qiang Hu, Xiaoyun Zhang, Yanfeng Wang

    Abstract: As one of the fundamental video tasks in computer vision, Open-Vocabulary Action Recognition (OVAR) has recently gained increasing attention with the development of vision-language pre-training. To enable generalization to arbitrary classes, existing methods treat class labels as text descriptions, then formulate OVAR as evaluating embedding similarity between visual samples and textual classes. Howe…

    Submitted 23 April, 2024; originally announced April 2024.

  37. arXiv:2404.14743  [pdf, other]

    stat.ML cs.LG

    Gradient Guidance for Diffusion Models: An Optimization Perspective

    Authors: Yingqing Guo, Hui Yuan, Yukang Yang, Minshuo Chen, Mengdi Wang

    Abstract: Diffusion models have demonstrated empirical successes in various applications and can be adapted to task-specific needs via guidance. This paper introduces a form of gradient guidance for adapting or fine-tuning diffusion models towards user-specified optimization objectives. We study the theoretic aspects of a guided score-based sampling process, linking the gradient-guided diffusion model to fi…

    Submitted 23 April, 2024; originally announced April 2024.

  38. arXiv:2404.14690  [pdf]

    quant-ph

    Scalable cyclic transformation of orbital angular momentum modes based on a nonreciprocal Mach-Zehnder interferometer

    Authors: Y. F. Yang, M. Y. Chen, F. P. Li, Y. P. Ruan, Z. X. Li, M. Xiao, H. Zhang, K. Y. Xia

    Abstract: The orbital angular momentum (OAM) of photons provides a pivotal resource for carrying out high-dimensional classical and quantum information processing due to its unique discrete high-dimensional nature. The cyclic transformation of a set of orthogonal OAM modes is an essential building block for universal high-dimensional information processing. Its realization in the quantum domain is the unive…

    Submitted 22 April, 2024; originally announced April 2024.

  39. arXiv:2404.14673  [pdf, other]

    quant-ph physics.optics

    High-Dimensional Two-Photon Quantum Controlled Phase-Flip Gate

    Authors: Mingyuan Chen, Jiangshan Tang, Miao Cai, Franco Nori, Keyu Xia

    Abstract: High-dimensional quantum systems have been used to reveal interesting fundamental physics and to improve information capacity and noise resilience in quantum information processing. However, it remains a significant challenge to realize universal two-photon quantum gates in high dimensions with high success probability. Here, by considering an ion-cavity QED system, we theoretically propose, to th…

    Submitted 22 April, 2024; originally announced April 2024.

  40. arXiv:2404.14497  [pdf, other]

    cs.NI cs.LG eess.SP

    Mapping Wireless Networks into Digital Reality through Joint Vertical and Horizontal Learning

    Authors: Zifan Zhang, Mingzhe Chen, Zhaohui Yang, Yuchen Liu

    Abstract: In recent years, the complexity of 5G and beyond wireless networks has escalated, prompting a need for innovative frameworks to facilitate flexible management and efficient deployment. The concept of digital twins (DTs) has emerged as a solution to enable real-time monitoring, predictive configurations, and decision-making processes. While existing works primarily focus on leveraging DTs to optimi…

    Submitted 22 April, 2024; originally announced April 2024.

    Comments: Accepted by IFIP/IEEE Networking 2024

    ACM Class: C.2.1

  41. arXiv:2404.13941  [pdf, other]

    eess.SY cs.AI cs.LG

    Autoencoder-assisted Feature Ensemble Net for Incipient Faults

    Authors: Mingxuan Gao, Min Wang, Maoyin Chen

    Abstract: Deep learning has shown great power in the field of fault detection. However, for incipient faults with tiny amplitude, the detection performance of the current deep learning networks (DLNs) is not satisfactory. Even if prior information about the faults is utilized, DLNs can't successfully detect faults 3, 9 and 15 in the Tennessee Eastman process (TEP). These faults are notoriously difficult to…

    Submitted 22 April, 2024; originally announced April 2024.

  42. arXiv:2404.13922  [pdf]

    hep-ex physics.plasm-ph

    A Platform for All-optical Thomson/ Compton Scattering with Versatile Parameters

    Authors: Siyu Chen, Wenchao Yan, Mingyang Zhu, Yaojun Li, Xichen Hu, Hao Xu, Jie Feng, Xulei Ge, Wenzhao Wang, Guangwei Lu, Mingxuan Wei, Lin Lu, Xiaojun Huang, Boyuan Li, Xiaohui Yuan, Feng Liu, Min Chen, Liming Chen, Jie Zhang

    Abstract: A dual-beam platform for all-optical electron-photon scattering, or Thomson/Compton scattering, with adjustable collision-angle and parameter tuning ability has been developed, which, in principle, can be used for the verification of strong-field quantum electrodynamics effects. Combining this platform with a 200 TW Ti:Sapphire laser system, we demonstrated the generation of inverse Compton scatte…

    Submitted 22 April, 2024; originally announced April 2024.

  43. arXiv:2404.13840  [pdf, other]

    hep-ex

    Study of $e^+e^-\toωX(3872)$ and $γX(3872)$ from 4.66 to 4.95 GeV

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, O. Afedulidis, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere , et al. (634 additional authors not shown)

    Abstract: Using data samples with an integrated luminosity of $4.5~\text{fb}^{-1}$ collected by the BESIII detector at center-of-mass energies ranging from 4.66 to 4.95 GeV, we study the processes of $e^+e^-\toωX(3872)$ and $e^+e^-\toγX(3872)$. With the $e^+e^-\toωX(3872)$ process, the branching fraction ratio $R\equiv\frac{\mathcal{B}(X(3872)\toγJ/ψ)}{\mathcal{B}(X(3872)\toπ^+π^- J/ψ)}$ is measured to be…

    Submitted 21 April, 2024; originally announced April 2024.

    Comments: 19 pages, 10 figures

  44. arXiv:2404.13547  [pdf, other]

    cs.CL

    E-QGen: Educational Lecture Abstract-based Question Generation System

    Authors: Mao-Siang Chen, An-Zi Yen

    Abstract: To optimize the preparation process for educators in academic lectures and associated question-and-answer sessions, this paper presents E-QGen, a lecture abstract-based question generation system. Given a lecture abstract, E-QGen generates potential student inquiries. The questions suggested by our system are expected to not only facilitate teachers in preparing answers in advance but also enable…

    Submitted 21 April, 2024; originally announced April 2024.

    Comments: IJCAI 2024 Demo Paper

  45. arXiv:2404.12860  [pdf, other]

    quant-ph

    Nonreciprocal PT-symmetric phase transition in a non-Hermitian chiral quantum optical system

    Authors: Miao Cai, Jiang-Shan Tang, Ming-Yuan Chen, Keyu Xia

    Abstract: Phase transitions, non-Hermiticity and nonreciprocity play central roles in fundamental physics. However, the triple interplay of these three fields is of lack in the quantum domain. Here, we show nonreciprocal parity-time-symmetric phase transition in a non-Hermitian chiral quantum electrodynamical system, caused by the directional system dissipation. In remarkable contrast to previously reported…

    Submitted 21 April, 2024; v1 submitted 19 April, 2024; originally announced April 2024.

    Comments: 6 pages, 4 figures

  46. arXiv:2404.12850  [pdf, other]

    cs.LG cs.DC

    CaBaFL: Asynchronous Federated Learning via Hierarchical Cache and Feature Balance

    Authors: Zeke Xia, Ming Hu, Dengke Yan, Xiaofei Xie, Tianlin Li, Anran Li, Junlong Zhou, Mingsong Chen

    Abstract: Federated Learning (FL) as a promising distributed machine learning paradigm has been widely adopted in Artificial Intelligence of Things (AIoT) applications. However, the efficiency and inference capability of FL is seriously limited due to the presence of stragglers and data imbalance across massive AIoT devices, respectively. To address the above challenges, we present a novel asynchronous FL a…

    Submitted 19 April, 2024; originally announced April 2024.

  47. arXiv:2404.12846  [pdf, other]

    cs.LG

    KoReA-SFL: Knowledge Replay-based Split Federated Learning Against Catastrophic Forgetting

    Authors: Zeke Xia, Ming Hu, Dengke Yan, Ruixuan Liu, Anran Li, Xiaofei Xie, Mingsong Chen

    Abstract: Although Split Federated Learning (SFL) is good at enabling knowledge sharing among resource-constrained clients, it suffers from the problem of low training accuracy due to the neglect of data heterogeneity and catastrophic forgetting. To address this issue, we propose a novel SFL approach named KoReA-SFL, which adopts a multi-model aggregation mechanism to alleviate gradient divergence caused by…

    Submitted 19 April, 2024; originally announced April 2024.

  48. arXiv:2404.11525  [pdf, other]

    cs.CV eess.IV

    JointViT: Modeling Oxygen Saturation Levels with Joint Supervision on Long-Tailed OCTA

    Authors: Zeyu Zhang, Xuyin Qi, Mingxi Chen, Guangxi Li, Ryan Pham, Ayub Qassim, Ella Berry, Zhibin Liao, Owen Siggs, Robert Mclaughlin, Jamie Craig, Minh-Son To

    Abstract: The oxygen saturation level in the blood (SaO2) is crucial for health, particularly in relation to sleep-related breathing disorders. However, continuous monitoring of SaO2 is time-consuming and highly variable depending on patients' conditions. Recently, optical coherence tomography angiography (OCTA) has shown promising development in rapidly and effectively screening eye-related lesions, offeri…

    Submitted 18 April, 2024; v1 submitted 17 April, 2024; originally announced April 2024.

  49. arXiv:2404.11193  [pdf, other]

    quant-ph

    Photonic indistinguishability characterization and optimization for cavity-based single-photon source

    Authors: Miao Cai, Mingyuan Chen, Jiangshan Tang, Keyu Xia

    Abstract: Indistinguishability of single photons from independent sources is critically important for scalable quantum technologies. We provide a comprehensive comparison of single-photon indistinguishability of different kinds of cavity quantum electrodynamics (CQED) systems by numerically simulating Hong-Ou-Mandel (HOM) two-photon interference. We find that the CQED system using nature atoms exhibit super…

    Submitted 17 April, 2024; originally announced April 2024.

  50. arXiv:2404.11045  [pdf, other]

    cs.CL

    Offset Unlearning for Large Language Models

    Authors: James Y. Huang, Wenxuan Zhou, Fei Wang, Fred Morstatter, Sheng Zhang, Hoifung Poon, Muhao Chen

    Abstract: Despite the strong capabilities of Large Language Models (LLMs) to acquire knowledge from their training corpora, the memorization of sensitive information in the corpora such as copyrighted, harmful, and private content has led to ethical and legal concerns. In response to these challenges, unlearning has emerged as a potential remedy for LLMs affected by problematic training data. However, previ…

    Submitted 16 April, 2024; originally announced April 2024.