Skip to main content

Showing 201–250 of 3,416 results for author: Wu, Z

.
  1. arXiv:2404.04917  [pdf, ps, other

    hep-ex

    Search for $η_c(2S)\to 2(π^+π^-)$ and improved measurement of $χ_{cJ}\to 2(π^+π^-)$

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, O. Afedulidis, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere , et al. (638 additional authors not shown)

    Abstract: We search for the hadronic decay $η_c(2S)\to 2(π^+π^-)$ in the $ψ(3686)\toγη_c(2S)$ radiative decay using $(27.12\pm 0.14)\times 10^8$ $ψ(3686)$ events collected by the BESIII detector at the BEPCII collider. No significant signal is found, and the upper limit of $\mathcal{B}[ψ(3686)\toγη_c(2S)]\mathcal{B}[η_c(2S)\to 2(π^+π^-)]$ is determined to be $0.78\times 10^{-6}$ at the 90\% confidence level… ▽ More

    Submitted 7 April, 2024; originally announced April 2024.

  2. arXiv:2404.04836  [pdf, ps, other

    math.AP

    Global strong solution to the inviscid liquid-gas two-phase flow model in $L^p$ framework

    Authors: Zhigang Wu, Mengqian Liu, Juanzi Cai

    Abstract: This paper is dedicated to the study of the inviscid liquid-gas two-phase flow model in $\mathbb{R}^d\ (d\geq1)$. We establish the global existence of strong solutions to this system with small initial data in hybrid Besov spaces based on general $L^p$-norms. Additionally, we obtain the decay estimates of solutions rely on the constructed Lyapunov functional.

    Submitted 7 April, 2024; originally announced April 2024.

    MSC Class: 35A09; 35B40; 35Q35

  3. arXiv:2404.04772  [pdf, other

    cs.RO

    Efficient Reinforcement Learning of Task Planners for Robotic Palletization through Iterative Action Masking Learning

    Authors: Zheng Wu, Yichuan Li, Wei Zhan, Changliu Liu, Yun-Hui Liu, Masayoshi Tomizuka

    Abstract: The development of robotic systems for palletization in logistics scenarios is of paramount importance, addressing critical efficiency and precision demands in supply chain management. This paper investigates the application of Reinforcement Learning (RL) in enhancing task planning for such robotic systems. Confronted with the substantial challenge of a vast action space, which is a significant im… ▽ More

    Submitted 6 April, 2024; originally announced April 2024.

    Comments: 8 pages, 8 figures

  4. "Don't Step on My Toes": Resolving Editing Conflicts in Real-Time Collaboration in Computational Notebooks

    Authors: April Yi Wang, Zihan Wu, Christopher Brooks, Steve Oney

    Abstract: Real-time collaborative editing in computational notebooks can improve the efficiency of teamwork for data scientists. However, working together through synchronous editing of notebooks introduces new challenges. Data scientists may inadvertently interfere with each others' work by altering the shared codebase and runtime state if they do not set up a social protocol for working together and monit… ▽ More

    Submitted 6 April, 2024; originally announced April 2024.

  5. arXiv:2404.04640  [pdf, other

    hep-ex

    Search for di-photon decays of an axion-like particle in radiative decays of J/psi

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, O. Afedulidis, X. C. Ai, R. Aliberti, A. Amoroso, M. R. An, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko , et al. (604 additional authors not shown)

    Abstract: We search for the di-photon decay of a light pseudoscalar axion-like particle, $a$, in radiative decays of the $J/ψ$, using 10 billion $J/ψ$ events collected with the BESIII detector. We find no evidence of a narrow resonance and set upper limits at the $95\%$ confidence level on the product branching fraction $\mathcal{B}(J/ψ\to γa) \times \mathcal{B}(a \to γγ)$ and the axion-like particle photon… ▽ More

    Submitted 6 April, 2024; originally announced April 2024.

    Comments: 9 pages, 5 figures, Submitted to Phys. Rev. D (Letter)

    Report number: BESIII Analysis Memo - 671

  6. arXiv:2404.04562  [pdf, other

    cs.CV

    Diffusion Time-step Curriculum for One Image to 3D Generation

    Authors: Xuanyu Yi, Zike Wu, Qingshan Xu, Pan Zhou, Joo-Hwee Lim, Hanwang Zhang

    Abstract: Score distillation sampling~(SDS) has been widely adopted to overcome the absence of unseen views in reconstructing 3D objects from a \textbf{single} image. It leverages pre-trained 2D diffusion models as teacher to guide the reconstruction of student 3D models. Despite their remarkable success, SDS-based methods often encounter geometric artifacts and texture saturation. We find out the crux is t… ▽ More

    Submitted 2 May, 2024; v1 submitted 6 April, 2024; originally announced April 2024.

    Comments: Accepted to CVPR 2024

  7. arXiv:2404.03736  [pdf, other

    cs.CV

    SC4D: Sparse-Controlled Video-to-4D Generation and Motion Transfer

    Authors: Zijie Wu, Chaohui Yu, Yanqin Jiang, Chenjie Cao, Fan Wang, Xiang Bai

    Abstract: Recent advances in 2D/3D generative models enable the generation of dynamic 3D objects from a single-view video. Existing approaches utilize score distillation sampling to form the dynamic scene as dynamic NeRF or dense 3D Gaussians. However, these methods struggle to strike a balance among reference view alignment, spatio-temporal consistency, and motion fidelity under single-view conditions due… ▽ More

    Submitted 4 April, 2024; originally announced April 2024.

    Comments: Project Page: https://sc4d.github.io/

  8. arXiv:2404.03654  [pdf, other

    cs.CV

    RaFE: Generative Radiance Fields Restoration

    Authors: Zhongkai Wu, Ziyu Wan, **g Zhang, **g Liao, Dong Xu

    Abstract: NeRF (Neural Radiance Fields) has demonstrated tremendous potential in novel view synthesis and 3D reconstruction, but its performance is sensitive to input image quality, which struggles to achieve high-fidelity rendering when provided with low-quality sparse input viewpoints. Previous methods for NeRF restoration are tailored for specific degradation type, ignoring the generality of restoration.… ▽ More

    Submitted 7 April, 2024; v1 submitted 4 April, 2024; originally announced April 2024.

    Comments: Project Page: https://zkaiwu.github.io/RaFE

  9. arXiv:2404.03592  [pdf, other

    cs.CL cs.AI cs.LG

    ReFT: Representation Finetuning for Language Models

    Authors: Zhengxuan Wu, Aryaman Arora, Zheng Wang, Atticus Geiger, Dan Jurafsky, Christopher D. Manning, Christopher Potts

    Abstract: Parameter-efficient finetuning (PEFT) methods seek to adapt large neural models via updates to a small number of weights. However, much prior interpretability work has shown that representations encode rich semantic information, suggesting that editing representations might be a more powerful alternative. We pursue this hypothesis by develo** a family of Representation Finetuning (ReFT) methods.… ▽ More

    Submitted 22 May, 2024; v1 submitted 4 April, 2024; originally announced April 2024.

    Comments: preprint

  10. arXiv:2404.03217  [pdf, other

    hep-ex

    Evidence of the $h_c\to K_S^0 K^+π^-+c.c.$ decay

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, O. Afedulidis, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere , et al. (638 additional authors not shown)

    Abstract: Based on $(2.712\pm0.014)\times10^9$ $ψ(3686)$ events collected by the BESIII collaboration, evidence of the hadronic decay $h_c\to K_S^0K^+π^-+c.c.$ is found with a significance of $4.3σ$ in the $ψ(3686)\toπ^0 h_c$ process. The branching fraction of $h_c\to K_S^0 K^+π^- +c.c.$ is measured to be $(7.3\pm0.8\pm1.8)\times10^{-4}$, where the first and second uncertainties are statistical and systemat… ▽ More

    Submitted 4 April, 2024; originally announced April 2024.

  11. arXiv:2404.02977  [pdf, other

    hep-th gr-qc

    Superradiant Instability of Charged Extremal Black Holes in Einstein-Born-Infeld Gravity

    Authors: Ze-Hua Wu, H. Lu

    Abstract: We study charged scalar perturbations of charged extremal black holes in Einstein-Born-Infeld theory. Our numerical results indicate that these black holes all suffer from superradiant instability by the unstable quasi-bound states, regardless how small the coupling constant is. We therefore provide a new example that the superradiant stability of the Reissner-Nordström black hole is a fine-tuned… ▽ More

    Submitted 3 April, 2024; originally announced April 2024.

    Comments: Latex, 23 pages, 4 figures with 7 panels

  12. arXiv:2404.02490  [pdf, other

    cs.CL

    Enhancing Cross-lingual Sentence Embedding for Low-resource Languages with Word Alignment

    Authors: Zhongtao Miao, Qiyu Wu, Kaiyan Zhao, Zilong Wu, Yoshimasa Tsuruoka

    Abstract: The field of cross-lingual sentence embeddings has recently experienced significant advancements, but research concerning low-resource languages has lagged due to the scarcity of parallel corpora. This paper shows that cross-lingual word representation in low-resource languages is notably under-aligned with that in high-resource languages in current models. To address this, we introduce a novel fr… ▽ More

    Submitted 3 April, 2024; originally announced April 2024.

    Comments: NAACL 2024 findings

  13. arXiv:2404.02446  [pdf, other

    cs.LG stat.ML

    Masked Completion via Structured Diffusion with White-Box Transformers

    Authors: Druv Pai, Ziyang Wu, Sam Buchanan, Yaodong Yu, Yi Ma

    Abstract: Modern learning frameworks often train deep neural networks with massive amounts of unlabeled data to learn representations by solving simple pretext tasks, then use the representations as foundations for downstream tasks. These networks are empirically designed; as such, they are usually not interpretable, their representations are not structured, and their designs are potentially redundant. Whit… ▽ More

    Submitted 3 April, 2024; originally announced April 2024.

    Comments: To be published at ICLR 2024; 44 pages. arXiv admin note: substantial text overlap with arXiv:2311.13110

  14. arXiv:2404.02033  [pdf, other

    hep-ex hep-ph

    Search for $C$-even states decaying to $D_{s}^{\pm}D_{s}^{*\mp}$ with masses between $4.08$ and $4.32$ $\rm GeV/{\it c}^{2}$

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, O. Afedulidis, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere , et al. (638 additional authors not shown)

    Abstract: Six $C$-even states, denoted as $X$, with quantum numbers $J^{PC}=0^{-+}$, $1^{\pm+}$, or $2^{\pm+}$, are searched for via the $e^+e^-\toγD_{s}^{\pm}D_{s}^{*\mp}$ process using $(1667.39\pm8.84)~\mathrm{pb}^{-1}$ of $e^+e^-$ collision data collected with the BESIII detector operating at the BEPCII storage ring at center-of-mass energy of $\sqrt{s}=(4681.92\pm0.30)~\mathrm{MeV}$. No statistically s… ▽ More

    Submitted 2 April, 2024; originally announced April 2024.

  15. arXiv:2404.01862  [pdf, other

    cs.CV cs.HC cs.MM

    Co-Speech Gesture Video Generation via Motion-Decoupled Diffusion Model

    Authors: Xu He, Qiaochu Huang, Zhensong Zhang, Zhiwei Lin, Zhiyong Wu, Sicheng Yang, Minglei Li, Zhiyi Chen, Songcen Xu, Xiaofei Wu

    Abstract: Co-speech gestures, if presented in the lively form of videos, can achieve superior visual effects in human-machine interaction. While previous works mostly generate structural human skeletons, resulting in the omission of appearance information, we focus on the direct generation of audio-driven co-speech gesture videos in this work. There are two main challenges: 1) A suitable motion feature is n… ▽ More

    Submitted 2 April, 2024; originally announced April 2024.

    Comments: 22 pages, 8 figures, CVPR 2024

  16. arXiv:2404.01768  [pdf, other

    cs.CL cs.AI

    Auditing Large Language Models for Enhanced Text-Based Stereotype Detection and Probing-Based Bias Evaluation

    Authors: Zekun Wu, Sahan Bulathwela, Maria Perez-Ortiz, Adriano Soares Koshiyama

    Abstract: Recent advancements in Large Language Models (LLMs) have significantly increased their presence in human-facing Artificial Intelligence (AI) applications. However, LLMs could reproduce and even exacerbate stereotypical outputs from training data. This work introduces the Multi-Grain Stereotype (MGS) dataset, encompassing 51,867 instances across gender, race, profession, religion, and stereotypical… ▽ More

    Submitted 2 April, 2024; originally announced April 2024.

    Comments: Under reviewed as a conference paper at COLM 2024

  17. arXiv:2404.01656  [pdf, other

    cs.CV

    Supporting Mitosis Detection AI Training with Inter-Observer Eye-Gaze Consistencies

    Authors: Hongyan Gu, Zihan Yan, Ayesha Alvi, Brandon Day, Chunxu Yang, Zida Wu, Shino Magaki, Mohammad Haeri, Xiang 'Anthony' Chen

    Abstract: The expansion of artificial intelligence (AI) in pathology tasks has intensified the demand for doctors' annotations in AI development. However, collecting high-quality annotations from doctors is costly and time-consuming, creating a bottleneck in AI progress. This study investigates eye-tracking as a cost-effective technology to collect doctors' behavioral data for AI training with a focus on th… ▽ More

    Submitted 2 April, 2024; originally announced April 2024.

    Comments: Accepted by IEEE International Conference on Healthcare Informatics 2024

  18. arXiv:2404.01359  [pdf

    quant-ph cs.AI cs.NE

    Parallel Proportional Fusion of Spiking Quantum Neural Network for Optimizing Image Classification

    Authors: Zuyu Xu, Kang Shen, Pengnian Cai, Tao Yang, Yuanming Hu, Shixian Chen, Yunlai Zhu, Zuheng Wu, Yuehua Dai, Jun Wang, Fei Yang

    Abstract: The recent emergence of the hybrid quantum-classical neural network (HQCNN) architecture has garnered considerable attention due to the potential advantages associated with integrating quantum principles to enhance various facets of machine learning algorithms and computations. However, the current investigated serial structure of HQCNN, wherein information sequentially passes from one network to… ▽ More

    Submitted 1 April, 2024; originally announced April 2024.

  19. arXiv:2404.01268  [pdf, other

    cs.CL cs.AI cs.DL cs.LG cs.SI

    Map** the Increasing Use of LLMs in Scientific Papers

    Authors: Weixin Liang, Yaohui Zhang, Zhengxuan Wu, Haley Lepp, Wenlong Ji, Xuandong Zhao, Hancheng Cao, Sheng Liu, Siyu He, Zhi Huang, Diyi Yang, Christopher Potts, Christopher D Manning, James Y. Zou

    Abstract: Scientific publishing lays the foundation of science by disseminating research findings, fostering collaboration, encouraging reproducibility, and ensuring that scientific knowledge is accessible, verifiable, and built upon over time. Recently, there has been immense speculation about how many people are using large language models (LLMs) like ChatGPT in their academic writing, and to what extent… ▽ More

    Submitted 1 April, 2024; originally announced April 2024.

  20. arXiv:2404.00848  [pdf, other

    cs.LG cs.CY stat.ME

    Predictive Performance Comparison of Decision Policies Under Confounding

    Authors: Luke Guerdan, Amanda Coston, Kenneth Holstein, Zhiwei Steven Wu

    Abstract: Predictive models are often introduced to decision-making tasks under the rationale that they improve performance over an existing decision-making policy. However, it is challenging to compare predictive performance against an existing decision-making policy that is generally under-specified and dependent on unobservable factors. These sources of uncertainty are often addressed in practice by maki… ▽ More

    Submitted 11 June, 2024; v1 submitted 31 March, 2024; originally announced April 2024.

    Comments: ICML 2024

  21. arXiv:2404.00680  [pdf, other

    cs.CV

    Learning to Rank Patches for Unbiased Image Redundancy Reduction

    Authors: Yang Luo, Zhineng Chen, Peng Zhou, Zuxuan Wu, ** Gao, Yu-Gang Jiang

    Abstract: Images suffer from heavy spatial redundancy because pixels in neighboring regions are spatially correlated. Existing approaches strive to overcome this limitation by reducing less meaningful image regions. However, current leading methods rely on supervisory signals. They may compel models to preserve content that aligns with labeled categories and discard content belonging to unlabeled categories… ▽ More

    Submitted 25 April, 2024; v1 submitted 31 March, 2024; originally announced April 2024.

    Comments: Accepted by CVPR 2024

  22. arXiv:2404.00320  [pdf, other

    cs.AI

    Advancing Multimodal Data Fusion in Pain Recognition: A Strategy Leveraging Statistical Correlation and Human-Centered Perspectives

    Authors: Xingrui Gu, Zhixuan Wang, Irisa **, Zekun Wu

    Abstract: This research tackles the challenge of integrating heterogeneous data for specific behavior recognition within the domain of Pain Recognition, presenting a novel methodology that harmonizes statistical correlations with a human-centered approach. By leveraging a diverse range of deep learning architectures, we highlight the adaptability and efficacy of our approach in improving model performance a… ▽ More

    Submitted 30 March, 2024; originally announced April 2024.

    Comments: Under reviewed by ACII 2024

  23. arXiv:2404.00014  [pdf

    physics.chem-ph cs.AI q-bio.BM

    Deep Geometry Handling and Fragment-wise Molecular 3D Graph Generation

    Authors: Odin Zhang, Yufei Huang, Shichen Cheng, Mengyao Yu, Xujun Zhang, Haitao Lin, Yundian Zeng, Mingyang Wang, Zhenxing Wu, Huifeng Zhao, Zaixi Zhang, Chenqing Hua, Yu Kang, Sunliang Cui, Peichen Pan, Chang-Yu Hsieh, Tingjun Hou

    Abstract: Most earlier 3D structure-based molecular generation approaches follow an atom-wise paradigm, incrementally adding atoms to a partially built molecular fragment within protein pockets. These methods, while effective in designing tightly bound ligands, often overlook other essential properties such as synthesizability. The fragment-wise generation paradigm offers a promising solution. However, a co… ▽ More

    Submitted 15 March, 2024; originally announced April 2024.

  24. arXiv:2403.20289  [pdf, other

    cs.CL cs.SD eess.AS

    Emotion-Anchored Contrastive Learning Framework for Emotion Recognition in Conversation

    Authors: Fangxu Yu, Junjie Guo, Zhen Wu, Xinyu Dai

    Abstract: Emotion Recognition in Conversation (ERC) involves detecting the underlying emotion behind each utterance within a conversation. Effectively generating representations for utterances remains a significant challenge in this task. Recent works propose various models to address this issue, but they still struggle with differentiating similar emotions such as excitement and happiness. To alleviate thi… ▽ More

    Submitted 29 March, 2024; originally announced March 2024.

    Comments: Accepted by Findings of NAACL 2024

  25. arXiv:2403.19829  [pdf, ps, other

    quant-ph math.NA

    An Efficient Quantum Algorithm for Linear System Problem in Tensor Format

    Authors: Zeguan Wu, Sidhant Misra, Tamás Terlaky, Xiu Yang, Marc Vuffray

    Abstract: Solving linear systems is at the foundation of many algorithms. Recently, quantum linear system algorithms (QLSAs) have attracted great attention since they converge to a solution exponentially faster than classical algorithms in terms of the problem dimension. However, low-complexity circuit implementations of the oracles assumed in these QLSAs constitute the major bottleneck for practical quantu… ▽ More

    Submitted 28 March, 2024; originally announced March 2024.

    MSC Class: 65F05; 68Q12; 81P68

  26. arXiv:2403.19336  [pdf, other

    cs.CV cs.AI

    IVLMap: Instance-Aware Visual Language Grounding for Consumer Robot Navigation

    Authors: Jiacui Huang, Hongtao Zhang, Mingbo Zhao, Zhou Wu

    Abstract: Vision-and-Language Navigation (VLN) is a challenging task that requires a robot to navigate in photo-realistic environments with human natural language promptings. Recent studies aim to handle this task by constructing the semantic spatial map representation of the environment, and then leveraging the strong ability of reasoning in large language models for generalizing code for guiding the robot… ▽ More

    Submitted 28 March, 2024; originally announced March 2024.

  27. arXiv:2403.19256  [pdf, other

    hep-ex

    Measurement of absolute branching fractions of $D_s^+$ hadronic decays

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, O. Afedulidis, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere , et al. (632 additional authors not shown)

    Abstract: Using $e^+ e^-$ collision data collected at the BESIII detector at center-of-mass energies between 4.128 and 4.226 GeV, corresponding to an integrated luminosity of $7.33~{\rm fb}^{-1}$, we determine the absolute branching fractions of fifteen hadronic $D_s^{+}$ decays with a double-tag technique. In particular, we make precise measurements of the branching fractions… ▽ More

    Submitted 30 May, 2024; v1 submitted 28 March, 2024; originally announced March 2024.

  28. arXiv:2403.19223  [pdf, ps, other

    math.NA

    Computing large deviation rate functions of entropy production for diffusion processes by an interacting particle method

    Authors: Zhizhang Wu, Renaud Raquépas, Jack Xin, Zhiwen Zhang

    Abstract: We study an interacting particle method (IPM) for computing the large deviation rate function of entropy production for diffusion processes, with emphasis on the vanishing-noise limit and high dimensions. The crucial ingredient to obtain the rate function is the computation of the principal eigenvalue $λ$ of elliptic, non-self-adjoint operators. We show that this principal eigenvalue can be approx… ▽ More

    Submitted 6 June, 2024; v1 submitted 28 March, 2024; originally announced March 2024.

    MSC Class: 37M25; 47D08; 60F10; 82C31

  29. arXiv:2403.19121  [pdf, other

    cs.CL

    Code Comparison Tuning for Code Large Language Models

    Authors: Yufan Jiang, Qiaozhi He, Xiaomin Zhuang, Zhihua Wu

    Abstract: We present Code Comparison Tuning (CCT), a simple and effective tuning method for code large language models (Code LLMs) to better handle subtle code errors. Specifically, we integrate the concept of comparison into instruction tuning, both at the token and sequence levels, enabling the model to discern even the slightest deviations in code. To compare the original code with an erroneous version c… ▽ More

    Submitted 5 June, 2024; v1 submitted 27 March, 2024; originally announced March 2024.

    Comments: Preprint

  30. arXiv:2403.19091  [pdf, other

    hep-ex

    Observation of the semileptonic decays $D^0\rightarrow K_S^0π^-π^0 e^+ ν_e$ and $D^+\rightarrow K_S^0π^+π^- e^+ ν_e$

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, X. C. Ai, R. Aliberti, A. Amoroso, M. R. An, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere, A. Brueggemann , et al. (600 additional authors not shown)

    Abstract: By analyzing $e^+e^-$ annihilation data corresponding to an integrated luminosity of 2.93 $\rm fb^{-1}$ collected at a center-of-mass energy of 3.773 GeV with the \text{BESIII} detector, the first observation of the semileptonic decays $D^0\rightarrow K_S^0π^-π^0 e^+ ν_e$ and $D^+\rightarrow K_S^0π^+π^- e^+ ν_e$ is reported. With a dominant hadronic contribution from $K_1(1270)$, the branching fra… ▽ More

    Submitted 27 March, 2024; originally announced March 2024.

    Comments: 19pages

  31. arXiv:2403.18795  [pdf, other

    cs.CV cs.AI

    Gamba: Marry Gaussian Splatting with Mamba for single view 3D reconstruction

    Authors: Qiuhong Shen, Zike Wu, Xuanyu Yi, Pan Zhou, Hanwang Zhang, Shuicheng Yan, Xinchao Wang

    Abstract: We tackle the challenge of efficiently reconstructing a 3D asset from a single image at millisecond speed. Existing methods for single-image 3D reconstruction are primarily based on Score Distillation Sampling (SDS) with Neural 3D representations. Despite promising results, these approaches encounter practical limitations due to lengthy optimizations and significant memory consumption. In this wor… ▽ More

    Submitted 24 May, 2024; v1 submitted 27 March, 2024; originally announced March 2024.

    Comments: project page: https://florinshen.github.io/gamba-project

  32. arXiv:2403.18730  [pdf, other

    cs.CV

    Towards Image Ambient Lighting Normalization

    Authors: Florin-Alexandru Vasluianu, Tim Seizinger, Zongwei Wu, Rakesh Ranjan, Radu Timofte

    Abstract: Lighting normalization is a crucial but underexplored restoration task with broad applications. However, existing works often simplify this task within the context of shadow removal, limiting the light sources to one and oversimplifying the scene, thus excluding complex self-shadows and restricting surface classes to smooth ones. Although promising, such simplifications hinder generalizability to… ▽ More

    Submitted 27 March, 2024; originally announced March 2024.

  33. arXiv:2403.18365  [pdf, other

    cs.CL

    BLADE: Enhancing Black-box Large Language Models with Small Domain-Specific Models

    Authors: Haitao Li, Qingyao Ai, Jia Chen, Qian Dong, Zhi**g Wu, Yiqun Liu, Chong Chen, Qi Tian

    Abstract: Large Language Models (LLMs) like ChatGPT and GPT-4 are versatile and capable of addressing a diverse range of tasks. However, general LLMs, which are developed on open-domain data, may lack the domain-specific knowledge essential for tasks in vertical domains, such as legal, medical, etc. To address this issue, previous approaches either conduct continuous pre-training with domain-specific data o… ▽ More

    Submitted 27 March, 2024; originally announced March 2024.

    Comments: 11pages

  34. arXiv:2403.18241  [pdf, other

    cs.CV cs.AI cs.GR cs.LG

    NeuSDFusion: A Spatial-Aware Generative Model for 3D Shape Completion, Reconstruction, and Generation

    Authors: Ruikai Cui, Weizhe Liu, Weixuan Sun, Senbo Wang, Taizhang Shang, Yang Li, Xibin Song, Han Yan, Zhennan Wu, Shenzhou Chen, Hongdong Li, Pan Ji

    Abstract: 3D shape generation aims to produce innovative 3D content adhering to specific conditions and constraints. Existing methods often decompose 3D shapes into a sequence of localized components, treating each element in isolation without considering spatial consistency. As a result, these approaches exhibit limited versatility in 3D data representation and shape generation, hindering their ability to… ▽ More

    Submitted 27 March, 2024; originally announced March 2024.

  35. arXiv:2403.17935  [pdf, other

    cs.CV

    OmniVid: A Generative Framework for Universal Video Understanding

    Authors: Junke Wang, Dongdong Chen, Chong Luo, Bo He, Lu Yuan, Zuxuan Wu, Yu-Gang Jiang

    Abstract: The core of video understanding tasks, such as recognition, captioning, and tracking, is to automatically detect objects or actions in a video and analyze their temporal evolution. Despite sharing a common goal, different tasks often rely on distinct model architectures and annotation formats. In contrast, natural language processing benefits from a unified output space, i.e., text sequences, whic… ▽ More

    Submitted 26 March, 2024; originally announced March 2024.

    Comments: Accepted by CVPR 2024

  36. arXiv:2403.17916  [pdf, other

    cs.RO cs.AI cs.CV cs.LG cs.MA

    CMP: Cooperative Motion Prediction with Multi-Agent Communication

    Authors: Zhuoyuan Wu, Yu** Wang, Hengbo Ma, Zhaowei Li, Hang Qiu, Jiachen Li

    Abstract: The confluence of the advancement of Autonomous Vehicles (AVs) and the maturity of Vehicle-to-Everything (V2X) communication has enabled the capability of cooperative connected and automated vehicles (CAVs). Building on top of cooperative perception, this paper explores the feasibility and effectiveness of cooperative motion prediction. Our method, CMP, takes LiDAR signals as input to enhance trac… ▽ More

    Submitted 26 March, 2024; originally announced March 2024.

  37. arXiv:2403.17008  [pdf, other

    cs.CV

    FlashFace: Human Image Personalization with High-fidelity Identity Preservation

    Authors: Shilong Zhang, Lianghua Huang, Xi Chen, Yifei Zhang, Zhi-Fan Wu, Yutong Feng, Wei Wang, Yujun Shen, Yu Liu, ** Luo

    Abstract: This work presents FlashFace, a practical tool with which users can easily personalize their own photos on the fly by providing one or a few reference face images and a text prompt. Our approach is distinguishable from existing human photo customization methods by higher-fidelity identity preservation and better instruction following, benefiting from two subtle designs. First, we encode the face i… ▽ More

    Submitted 25 March, 2024; originally announced March 2024.

    Comments: Project Page:https://jshilong.github.io/flashface-page

  38. arXiv:2403.16811  [pdf, ps, other

    hep-ex

    Cross section measurement of $e^+e^-\to ηψ(2S)$ and search for $e^+e^-\toη\tilde{X}(3872)$

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, O. Afedulidis, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere , et al. (639 additional authors not shown)

    Abstract: The energy-dependent cross section for $e^+e^-\to ηψ(2S)$ is measured at eighteen center of mass energies from 4.288 GeV to 4.951 GeV using the BESIII detector. Using the same data samples, we also perform the first search for the reaction $e^+e^-\toη\tilde{X}(3872)$, but no evidence is found for the $\tilde{X}(3872)$ in the $π^+π^- J/ψ$ mass distribution. At each of the eighteen center of mass en… ▽ More

    Submitted 25 March, 2024; originally announced March 2024.

  39. DBPF: A Framework for Efficient and Robust Dynamic Bin-Picking

    Authors: Yichuan Li, Junkai Zhao, Yixiao Li, Zheng Wu, Rui Cao, Masayoshi Tomizuka, Yunhui Liu

    Abstract: Efficiency and reliability are critical in robotic bin-picking as they directly impact the productivity of automated industrial processes. However, traditional approaches, demanding static objects and fixed collisions, lead to deployment limitations, operational inefficiencies, and process unreliability. This paper introduces a Dynamic Bin-Picking Framework (DBPF) that challenges traditional stati… ▽ More

    Submitted 25 March, 2024; originally announced March 2024.

    Comments: 8 pages, 5 figures. This paper has been accepted by IEEE RA-L on 2024-03-24. See the supplementary video at youtube: https://youtu.be/n5af2VsKhkg

  40. arXiv:2403.16339  [pdf, other

    quant-ph

    What is Entanglement?

    Authors: Chon-Fai Kam, Zhong-Tang Wu

    Abstract: Entanglement, a puzzle since Einstein's time, has become increasingly crucial with the rise of quantum computation. But what exactly is it? Historically , entanglement can be precisely defined, but only negatively. In this article, we explore four interconnected definitions of entangled states.

    Submitted 24 March, 2024; originally announced March 2024.

  41. arXiv:2403.16210  [pdf, other

    cs.CV cs.AI cs.GR

    Frankenstein: Generating Semantic-Compositional 3D Scenes in One Tri-Plane

    Authors: Han Yan, Yang Li, Zhennan Wu, Shenzhou Chen, Weixuan Sun, Taizhang Shang, Weizhe Liu, Tian Chen, Xiaqiang Dai, Chao Ma, Hongdong Li, Pan Ji

    Abstract: We present Frankenstein, a diffusion-based framework that can generate semantic-compositional 3D scenes in a single pass. Unlike existing methods that output a single, unified 3D shape, Frankenstein simultaneously generates multiple separated shapes, each corresponding to a semantically meaningful part. The 3D scene information is encoded in one single tri-plane tensor, from which multiple Singed… ▽ More

    Submitted 24 March, 2024; originally announced March 2024.

    Comments: Video: https://youtu.be/lRn-HqyCrLI

  42. arXiv:2403.16037  [pdf, other

    cs.IR

    Knowledge-aware Dual-side Attribute-enhanced Recommendation

    Authors: Taotian Pang, Xingyu Lou, Fei Zhao, Zhen Wu, Kuiyao Dong, Qiuying Peng, Yue Qi, Xinyu Dai

    Abstract: \textit{Knowledge-aware} recommendation methods (KGR) based on \textit{graph neural networks} (GNNs) and \textit{contrastive learning} (CL) have achieved promising performance. However, they fall short in modeling fine-grained user preferences and further fail to leverage the \textit{preference-attribute connection} to make predictions, leading to sub-optimal performance. To address the issue, we… ▽ More

    Submitted 24 March, 2024; originally announced March 2024.

  43. arXiv:2403.14998  [pdf, other

    hep-ex

    Precise measurement of the $e^+e^-\to D_s^+D_s^-$ cross sections at center-of-mass energies from threshold to 4.95 GeV

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, O. Afedulidis, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere , et al. (638 additional authors not shown)

    Abstract: Using the $e^+e^-$ collision data collected with the BESIII detector operating at the BEPCII collider, at center-of-mass energies from the threshold to $4.95$~GeV, we present precise measurements of the cross sections for the process $e^+e^-\to D_s^+D_s^-$ using a single tag method. The resulting cross section lineshape exhibits several new structures, thereby offering an input for coupled channel… ▽ More

    Submitted 22 March, 2024; originally announced March 2024.

    Comments: 9 pages, 4 figures, published to PRL

  44. arXiv:2403.14734  [pdf, other

    cs.SE cs.AI cs.CL cs.PL

    A Survey of Neural Code Intelligence: Paradigms, Advances and Beyond

    Authors: Qiushi Sun, Zhirui Chen, Fangzhi Xu, Kanzhi Cheng, Chang Ma, Zhangyue Yin, Jianing Wang, Chengcheng Han, Renyu Zhu, Shuai Yuan, Qipeng Guo, Xipeng Qiu, Pengcheng Yin, Xiaoli Li, Fei Yuan, Lingpeng Kong, Xiang Li, Zhiyong Wu

    Abstract: Neural Code Intelligence -- leveraging deep learning to understand, generate, and optimize code -- holds immense potential for transformative impacts on the whole society. Bridging the gap between Natural Language and Programming Language, this domain has drawn significant attention from researchers in both research communities over the past few years. This survey presents a systematic and chronol… ▽ More

    Submitted 23 June, 2024; v1 submitted 21 March, 2024; originally announced March 2024.

    Comments: 64 pages, 6 figures, 10 tables, 692 references

  45. arXiv:2403.14133  [pdf, other

    cs.CV

    3D Object Detection from Point Cloud via Voting Step Diffusion

    Authors: Haoran Hou, Mingtao Feng, Zijie Wu, Weisheng Dong, Qing Zhu, Yaonan Wang, Ajmal Mian

    Abstract: 3D object detection is a fundamental task in scene understanding. Numerous research efforts have been dedicated to better incorporate Hough voting into the 3D object detection pipeline. However, due to the noisy, cluttered, and partial nature of real 3D scans, existing voting-based methods often receive votes from the partial surfaces of individual objects together with severe noises, leading to s… ▽ More

    Submitted 21 March, 2024; originally announced March 2024.

  46. arXiv:2403.14121  [pdf, other

    cs.CV

    External Knowledge Enhanced 3D Scene Generation from Sketch

    Authors: Zijie Wu, Mingtao Feng, Yaonan Wang, He Xie, Weisheng Dong, Bo Miao, Ajmal Mian

    Abstract: Generating realistic 3D scenes is challenging due to the complexity of room layouts and object geometries.We propose a sketch based knowledge enhanced diffusion architecture (SEK) for generating customized, diverse, and plausible 3D scenes. SEK conditions the denoising process with a hand-drawn sketch of the target scene and cues from an object relationship knowledge base. We first construct an ex… ▽ More

    Submitted 21 March, 2024; originally announced March 2024.

  47. arXiv:2403.14072  [pdf, other

    cs.CL

    A Taxonomy of Ambiguity Types for NLP

    Authors: Margaret Y. Li, Alisa Liu, Zhaofeng Wu, Noah A. Smith

    Abstract: Ambiguity is an critical component of language that allows for more effective communication between speakers, but is often ignored in NLP. Recent work suggests that NLP systems may struggle to grasp certain elements of human language understanding because they may not handle ambiguities at the level that humans naturally do in communication. Additionally, different types of ambiguity may serve dif… ▽ More

    Submitted 20 March, 2024; originally announced March 2024.

    Comments: To appear at the UnImplicit workshop at EACL 2024

  48. arXiv:2403.13437  [pdf, other

    hep-ex

    Search for $ΔS=2$ nonleptonic hyperon decays $Ω^-\toΣ^{0}π^{-}$ and $Ω^-\to nK^{-}$

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, O. Afedulidis, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere , et al. (638 additional authors not shown)

    Abstract: Using $(27.12 \pm 0.14) \times 10^{8}$ $ψ(3686)$ events collected by the BESIII detector at the center-of-mass energy of $\sqrt{s} = 3.686$ GeV, we search for the first time for two nonleptonic hyperon decays that change strangeness by two units, $Ω^-\toΣ^{0}π^-$ and $Ω^-\to nK^{-}$. No significant signal is observed. The upper limits on their decay branching fractions are determined to be… ▽ More

    Submitted 14 April, 2024; v1 submitted 20 March, 2024; originally announced March 2024.

  49. arXiv:2403.13238  [pdf, other

    cs.CV

    Beyond Skeletons: Integrative Latent Map** for Coherent 4D Sequence Generation

    Authors: Qitong Yang, Mingtao Feng, Zijie Wu, Shijie Sun, Weisheng Dong, Yaonan Wang, Ajmal Mian

    Abstract: Directly learning to model 4D content, including shape, color and motion, is challenging. Existing methods depend on skeleton-based motion control and offer limited continuity in detail. To address this, we propose a novel framework that generates coherent 4D sequences with animation of 3D shapes under given conditions with dynamic evolution of shape and color over time through integrative latent… ▽ More

    Submitted 19 March, 2024; originally announced March 2024.

  50. arXiv:2403.13043  [pdf, other

    cs.CV

    When Do We Not Need Larger Vision Models?

    Authors: Baifeng Shi, Ziyang Wu, Maolin Mao, Xin Wang, Trevor Darrell

    Abstract: Scaling up the size of vision models has been the de facto standard to obtain more powerful visual representations. In this work, we discuss the point beyond which larger vision models are not necessary. First, we demonstrate the power of Scaling on Scales (S$^2$), whereby a pre-trained and frozen smaller vision model (e.g., ViT-B or ViT-L), run over multiple image scales, can outperform larger mo… ▽ More

    Submitted 19 March, 2024; originally announced March 2024.

    Comments: Code: https://github.com/bfshi/scaling_on_scales