Skip to main content

Showing 51–100 of 1,317 results for author: Ren, Z

.
  1. arXiv:2404.09263  [pdf, other

    cs.CV cs.AI

    Task-Driven Exploration: Decoupling and Inter-Task Feedback for Joint Moment Retrieval and Highlight Detection

    Authors: ** Yang, ** Wei, Huan Li, Ziyang Ren

    Abstract: Video moment retrieval and highlight detection are two highly valuable tasks in video understanding, but until recently they have been jointly studied. Although existing studies have made impressive advancement recently, they predominantly follow the data-driven bottom-up paradigm. Such paradigm overlooks task-specific and inter-task effects, resulting in poor model performance. In this paper, we… ▽ More

    Submitted 14 April, 2024; originally announced April 2024.

  2. arXiv:2404.09116  [pdf, other

    nucl-th nucl-ex

    Spin entanglement of multinucleons: experimental prospects

    Authors: Dong Bai, Zhongzhou Ren

    Abstract: Multiprotons and multineutrons are among the most exotic and mysterious things ever produced on earth. They provide an exceptional opportunity to understand nuclear forces and nuclear dynamics at extreme conditions, as well as neutron stars in the heaven. Quantum entanglement, referred to as ``spooky action at a distance'' by Einstein, is a ubiquitous yet deep property of quantum systems. It not o… ▽ More

    Submitted 13 April, 2024; originally announced April 2024.

    Comments: 6 pages, 4 figures

  3. arXiv:2404.07991  [pdf, other

    cs.CV

    GoMAvatar: Efficient Animatable Human Modeling from Monocular Video Using Gaussians-on-Mesh

    Authors: **g Wen, Xiaoming Zhao, Zhongzheng Ren, Alexander G. Schwing, Shenlong Wang

    Abstract: We introduce GoMAvatar, a novel approach for real-time, memory-efficient, high-quality animatable human modeling. GoMAvatar takes as input a single monocular video to create a digital avatar capable of re-articulation in new poses and real-time rendering from novel viewpoints, while seamlessly integrating with rasterization-based graphics pipelines. Central to our method is the Gaussians-on-Mesh r… ▽ More

    Submitted 11 April, 2024; originally announced April 2024.

    Comments: CVPR 2024; project page: https://wenj.github.io/GoMAvatar/

  4. arXiv:2404.07131  [pdf, other

    hep-ex

    Search for prompt production of pentaquarks in charm hadron final states

    Authors: LHCb collaboration, R. Aaij, A. S. W. Abdelmotteleb, C. Abellan Beteta, F. Abudinén, T. Ackernley, B. Adeva, M. Adinolfi, P. Adlarson, H. Afsharnia, C. Agapopoulou, C. A. Aidala, Z. Ajaltouni, S. Akar, K. Akiba, P. Albicocco, J. Albrecht, F. Alessio, M. Alexander, A. Alfonso Albero, Z. Aliouche, P. Alvarez Cartelle, R. Amalric, S. Amato, J. L. Amey , et al. (1090 additional authors not shown)

    Abstract: A search for hidden-charm pentaquark states decaying to a range of $Σ_{c}\bar{D}$ and $Λ_{c}\bar{D}$ final states, as well as doubly-charmed pentaquark states to $Σ_{c}D$ and $Λ_{c}^{+}D$, is made using samples of proton-proton collision data corresponding to an integrated luminosity of $5.7fb^{-1}$ recorded by the LHCb detector at $\sqrt{s} = 13Te\kern -0.1em V$. Since no significant signals are… ▽ More

    Submitted 2 June, 2024; v1 submitted 10 April, 2024; originally announced April 2024.

    Comments: All figures and tables, along with any supplementary material and additional information, are available at https://cern.ch/lhcbproject/Publications/p/LHCb-PAPER-2023-018.html (LHCb public pages)

    Report number: LHCb-PAPER-2023-018, CERN-EP-2024-071

  5. arXiv:2404.05051  [pdf, other

    cs.LG cs.RO

    Skill Transfer and Discovery for Sim-to-Real Learning: A Representation-Based Viewpoint

    Authors: Haitong Ma, Zhaolin Ren, Bo Dai, Na Li

    Abstract: We study sim-to-real skill transfer and discovery in the context of robotics control using representation learning. We draw inspiration from spectral decomposition of Markov decision processes. The spectral decomposition brings about representation that can linearly represent the state-action value function induced by any policies, thus can be regarded as skills. The skill representations are tran… ▽ More

    Submitted 7 April, 2024; originally announced April 2024.

    Comments: 9 pages, 6 figures. Project page: https://congharvard.github.io/steady-sim-to-real/

  6. arXiv:2404.04490  [pdf, other

    cs.LG cs.CR

    Hyperparameter Optimization for SecureBoost via Constrained Multi-Objective Federated Learning

    Authors: Yan Kang, Ziyao Ren, Lixin Fan, Linghua Yang, Yongxin Tong, Qiang Yang

    Abstract: SecureBoost is a tree-boosting algorithm that leverages homomorphic encryption (HE) to protect data privacy in vertical federated learning. SecureBoost and its variants have been widely adopted in fields such as finance and healthcare. However, the hyperparameters of SecureBoost are typically configured heuristically for optimizing model performance (i.e., utility) solely, assuming that privacy is… ▽ More

    Submitted 5 April, 2024; originally announced April 2024.

  7. arXiv:2404.03375  [pdf, other

    hep-ex

    Search for the $B_s^0 \rightarrow μ^+μ^-γ$ decay

    Authors: LHCb collaboration, R. Aaij, A. S. W. Abdelmotteleb, C. Abellan Beteta, F. Abudinén, T. Ackernley, A. A. Adefisoye, B. Adeva, M. Adinolfi, P. Adlarson, C. Agapopoulou, C. A. Aidala, Z. Ajaltouni, S. Akar, K. Akiba, P. Albicocco, J. Albrecht, F. Alessio, M. Alexander, Z. Aliouche, P. Alvarez Cartelle, R. Amalric, S. Amato, J. L. Amey, Y. Amhis , et al. (1068 additional authors not shown)

    Abstract: A search for the fully reconstructed $B_s^0 \rightarrow μ^+μ^-γ$ decay is performed at the LHCb experiment using proton-proton collisions at $\sqrt{s}=13$\,TeV corresponding to an integrated luminosity of $5.4\,\mathrm{fb^{-1}}$. No significant signal is found and upper limits on the branching fraction in intervals of the dimuon mass are set \begin{align} {\cal B}(B_s^0 \rightarrow μ^+μ^-γ) <… ▽ More

    Submitted 4 April, 2024; originally announced April 2024.

    Comments: All figures and tables, along with any supplementary material and additional information, are available at https://cern.ch/lhcbproject/Publications/p/LHCb-PAPER-2023-045.html

    Report number: LHCb-PAPER-2023-045, CERN-EP-2024-065

  8. arXiv:2404.03085  [pdf, other

    cs.HC cs.AI cs.LG

    Talaria: Interactively Optimizing Machine Learning Models for Efficient Inference

    Authors: Fred Hohman, Chaoqun Wang, **mook Lee, Jochen Görtler, Dominik Moritz, Jeffrey P Bigham, Zhile Ren, Cecile Foret, Qi Shan, Xiaoyi Zhang

    Abstract: On-device machine learning (ML) moves computation from the cloud to personal devices, protecting user privacy and enabling intelligent user experiences. However, fitting models on devices with limited resources presents a major technical challenge: practitioners need to optimize models and balance hardware metrics such as model size, latency, and power. To help practitioners create efficient ML mo… ▽ More

    Submitted 3 April, 2024; originally announced April 2024.

    Comments: Proceedings of the 2024 ACM CHI Conference on Human Factors in Computing Systems

  9. arXiv:2404.00684  [pdf, other

    cs.IR cs.AI

    Generative Retrieval as Multi-Vector Dense Retrieval

    Authors: Shiguang Wu, Wenda Wei, Mengqi Zhang, Zhumin Chen, Jun Ma, Zhaochun Ren, Maarten de Rijke, Pengjie Ren

    Abstract: Generative retrieval generates identifiers of relevant documents in an end-to-end manner using a sequence-to-sequence architecture for a given query. The relation between generative retrieval and other retrieval methods, especially those based on matching within dense retrieval models, is not yet fully comprehended. Prior work has demonstrated that generative retrieval with atomic identifiers is e… ▽ More

    Submitted 31 March, 2024; originally announced April 2024.

    Comments: 12 pages, 5 figures, 8 tables, accepted at SIGIR 2024

  10. arXiv:2404.00673  [pdf, other

    cs.CR cs.AI cs.CY cs.LG

    A Survey of Privacy-Preserving Model Explanations: Privacy Risks, Attacks, and Countermeasures

    Authors: Thanh Tam Nguyen, Thanh Trung Huynh, Zhao Ren, Thanh Toan Nguyen, Phi Le Nguyen, Hongzhi Yin, Quoc Viet Hung Nguyen

    Abstract: As the adoption of explainable AI (XAI) continues to expand, the urgency to address its privacy implications intensifies. Despite a growing corpus of research in AI privacy and explainability, there is little attention on privacy-preserving model explanations. This article presents the first thorough survey about privacy attacks on model explanations and their countermeasures. Our contribution to… ▽ More

    Submitted 26 June, 2024; v1 submitted 31 March, 2024; originally announced April 2024.

    Comments: Revision

  11. arXiv:2403.19056  [pdf, other

    cs.CL

    CAUSE: Counterfactual Assessment of User Satisfaction Estimation in Task-Oriented Dialogue Systems

    Authors: Amin Abolghasemi, Zhaochun Ren, Arian Askari, Mohammad Aliannejadi, Maarten de Rijke, Suzan Verberne

    Abstract: An important unexplored aspect in previous work on user satisfaction estimation for Task-Oriented Dialogue (TOD) systems is their evaluation in terms of robustness for the identification of user dissatisfaction: current benchmarks for user satisfaction estimation in TOD systems are highly skewed towards dialogues for which the user is satisfied. The effect of having a more balanced set of satisfac… ▽ More

    Submitted 27 March, 2024; originally announced March 2024.

  12. arXiv:2403.18480  [pdf, other

    cs.IR

    Enhanced Generative Recommendation via Content and Collaboration Integration

    Authors: Yidan Wang, Zhaochun Ren, Weiwei Sun, Jiyuan Yang, Zhixiang Liang, Xin Chen, Ruobing Xie, Su Yan, Xu Zhang, Pengjie Ren, Zhumin Chen, Xin Xin

    Abstract: Generative recommendation has emerged as a promising paradigm aimed at augmenting recommender systems with recent advancements in generative artificial intelligence. This task has been formulated as a sequence-to-sequence generation process, wherein the input sequence encompasses data pertaining to the user's previously interacted items, and the output sequence denotes the generative identifier fo… ▽ More

    Submitted 27 March, 2024; originally announced March 2024.

  13. arXiv:2403.16371  [pdf, other

    cs.IR

    Uncovering Selective State Space Model's Capabilities in Lifelong Sequential Recommendation

    Authors: Jiyuan Yang, Yuanzi Li, **gyu Zhao, Hanbing Wang, Muyang Ma, Jun Ma, Zhaochun Ren, Mengqi Zhang, Xin Xin, Zhumin Chen, Pengjie Ren

    Abstract: Sequential Recommenders have been widely applied in various online services, aiming to model users' dynamic interests from their sequential interactions. With users increasingly engaging with online platforms, vast amounts of lifelong user behavioral sequences have been generated. However, existing sequential recommender models often struggle to handle such lifelong sequences. The primary challeng… ▽ More

    Submitted 24 March, 2024; originally announced March 2024.

  14. arXiv:2403.15941  [pdf, other

    cs.RO cs.AI cs.CV cs.LG

    Explore until Confident: Efficient Exploration for Embodied Question Answering

    Authors: Allen Z. Ren, Jaden Clark, Anushri Dixit, Masha Itkina, Anirudha Majumdar, Dorsa Sadigh

    Abstract: We consider the problem of Embodied Question Answering (EQA), which refers to settings where an embodied agent such as a robot needs to actively explore an environment to gather information until it is confident about the answer to a question. In this work, we leverage the strong semantic reasoning capabilities of large vision-language models (VLMs) to efficiently explore and answer such questions… ▽ More

    Submitted 26 May, 2024; v1 submitted 23 March, 2024; originally announced March 2024.

    Comments: Robotics: Science and Systems (RSS) 2024

  15. arXiv:2403.14221  [pdf, other

    cs.CL

    Improving the Robustness of Large Language Models via Consistency Alignment

    Authors: Yukun Zhao, Lingyong Yan, Weiwei Sun, Guoliang Xing, Shuaiqiang Wang, Chong Meng, Zhicong Cheng, Zhaochun Ren, Dawei Yin

    Abstract: Large language models (LLMs) have shown tremendous success in following user instructions and generating helpful responses. Nevertheless, their robustness is still far from optimal, as they may generate significantly inconsistent responses due to minor changes in the verbalized instructions. Recent literature has explored this inconsistency issue, highlighting the importance of continued improveme… ▽ More

    Submitted 22 March, 2024; v1 submitted 21 March, 2024; originally announced March 2024.

    Comments: Accepted by LREC-COLING 2024

  16. arXiv:2403.09483  [pdf, other

    hep-ex

    Tracking of charged particles with nanosecond lifetimes at LHCb

    Authors: LHCb collaboration, R. Aaij, A. S. W. Abdelmotteleb, C. Abellan Beteta, F. Abudinén, T. Ackernley, J. A. Adams, A. A. Adefisoye, B. Adeva, M. Adinolfi, P. Adlarson, C. Agapopoulou, C. A. Aidala, Z. Ajaltouni, S. Akar, K. Akiba, P. Albicocco, J. Albrecht, F. Alessio, M. Alexander, Z. Aliouche, P. Alvarez Cartelle, R. Amalric, S. Amato, J. L. Amey , et al. (1060 additional authors not shown)

    Abstract: A method is presented to reconstruct charged particles with lifetimes between 10 ps and 10 ns, which considers a combination of their decay products and the partial tracks created by the initial charged particle. Using the $Ξ^-$ baryon as a benchmark, the method is demonstrated with simulated events and proton-proton collision data at $\sqrt{s}=13$ TeV, corresponding to an integrated luminosity of… ▽ More

    Submitted 14 March, 2024; originally announced March 2024.

    Comments: All figures and tables, along with any supplementary material and additional information, are available at https://cern.ch/lhcbproject/Publications/p/LHCb-DP-2023-004.html (LHCb public pages)

    Report number: CERN-EP-2024-077, LHCb-DP-2023-004

  17. arXiv:2403.08185  [pdf, other

    cs.RO eess.SY

    Perceive With Confidence: Statistical Safety Assurances for Navigation with Learning-Based Perception

    Authors: Anushri Dixit, Zhiting Mei, Meghan Booker, Mariko Storey-Matsutani, Allen Z. Ren, Anirudha Majumdar

    Abstract: Rapid advances in perception have enabled large pre-trained models to be used out of the box for processing high-dimensional, noisy, and partial observations of the world into rich geometric representations (e.g., occupancy predictions). However, safe integration of these models onto robots remains challenging due to a lack of reliable performance in unfamiliar environments. In this work, we prese… ▽ More

    Submitted 12 March, 2024; originally announced March 2024.

    Comments: Videos and code can be found at https://perceive-with-confidence.github.io

  18. arXiv:2403.06189  [pdf, other

    cs.CV

    Harmonious Group Choreography with Trajectory-Controllable Diffusion

    Authors: Yuqin Dai, Wanlu Zhu, Ronghui Li, Ze** Ren, Xiangzheng Zhou, Xiu Li, Jun Li, Jian Yang

    Abstract: Creating group choreography from music has gained attention in cultural entertainment and virtual reality, aiming to coordinate visually cohesive and diverse group movements. Despite increasing interest, recent works face challenges in achieving aesthetically appealing choreography, primarily for two key issues: multi-dancer collision and single-dancer foot slide. To address these issues, we propo… ▽ More

    Submitted 6 June, 2024; v1 submitted 10 March, 2024; originally announced March 2024.

  19. arXiv:2403.05156  [pdf, other

    cs.CR

    On Protecting the Data Privacy of Large Language Models (LLMs): A Survey

    Authors: Biwei Yan, Kun Li, Minghui Xu, Yueyan Dong, Yue Zhang, Zhaochun Ren, Xiuzhen Cheng

    Abstract: Large language models (LLMs) are complex artificial intelligence systems capable of understanding, generating and translating human language. They learn language patterns by analyzing large amounts of text data, allowing them to perform writing, conversation, summarizing and other language tasks. When LLMs process and generate large amounts of data, there is a risk of leaking sensitive information… ▽ More

    Submitted 14 March, 2024; v1 submitted 8 March, 2024; originally announced March 2024.

    Comments: 18 pages, 4 figures

  20. arXiv:2403.04917  [pdf, ps, other

    cs.RO cs.AI cs.DS

    A Mixed-Integer Conic Program for the Moving-Target Traveling Salesman Problem based on a Graph of Convex Sets

    Authors: Allen George Philip, Zhongqiang Ren, Sivakumar Rathinam, Howie Choset

    Abstract: This paper introduces a new formulation that finds the optimum for the Moving-Target Traveling Salesman Problem (MT-TSP), which seeks to find a shortest path for an agent, that starts at a depot, visits a set of moving targets exactly once within their assigned time-windows, and returns to the depot. The formulation relies on the key idea that when the targets move along lines, their trajectories… ▽ More

    Submitted 10 March, 2024; v1 submitted 7 March, 2024; originally announced March 2024.

    Comments: 7 pages, 4 figures

  21. arXiv:2403.04764  [pdf, other

    cs.LG math.OC stat.ML

    TS-RSR: A provably efficient approach for batch bayesian optimization

    Authors: Zhaolin Ren, Na Li

    Abstract: This paper presents a new approach for batch Bayesian Optimization (BO) called Thompson Sampling-Regret to Sigma Ratio directed sampling (TS-RSR), where we sample a new batch of actions by minimizing a Thompson Sampling approximation of a regret to uncertainty ratio. Our sampling objective is able to coordinate the actions chosen in each batch in a way that minimizes redundancy between points whil… ▽ More

    Submitted 2 May, 2024; v1 submitted 7 March, 2024; originally announced March 2024.

    Comments: Revised presentation and organization of theoretical results

  22. arXiv:2403.03868  [pdf, other

    stat.ME math.ST stat.ML

    Confidence on the Focal: Conformal Prediction with Selection-Conditional Coverage

    Authors: Ying **, Zhimei Ren

    Abstract: Conformal prediction builds marginally valid prediction intervals that cover the unknown outcome of a randomly drawn new test point with a prescribed probability. However, a common scenario in practice is that, after seeing the data, practitioners decide which test unit(s) to focus on in a data-driven manner and seek for uncertainty quantification of the focal unit(s). In such cases, marginally va… ▽ More

    Submitted 24 March, 2024; v1 submitted 6 March, 2024; originally announced March 2024.

  23. Amplitude analysis of the $Λ_b^0\to pK^-γ$ decay

    Authors: LHCb collaboration, R. Aaij, A. S. W. Abdelmotteleb, C. Abellan Beteta, F. Abudinén, T. Ackernley, B. Adeva, M. Adinolfi, P. Adlarson, C. Agapopoulou, C. A. Aidala, Z. Ajaltouni, S. Akar, K. Akiba, P. Albicocco, J. Albrecht, F. Alessio, M. Alexander, A. Alfonso Albero, Z. Aliouche, P. Alvarez Cartelle, R. Amalric, S. Amato, J. L. Amey, Y. Amhis , et al. (1084 additional authors not shown)

    Abstract: The resonant structure of the radiative decay $Λ_b^0\to pK^-γ$ in the region of proton-kaon invariant-mass up to 2.5 GeV$/c^2$ is studied using proton-proton collision data recorded at centre-of-mass energies of 7, 8, and 13 TeV collected with the LHCb detector, corresponding to a total integrated luminosity of 9 fb$^{-1}$. Results are given in terms of fit and interference fractions between the d… ▽ More

    Submitted 21 June, 2024; v1 submitted 6 March, 2024; originally announced March 2024.

    Comments: All figures and tables, along with machine-readable versions and any supplementary material and additional information, are available at https://cern.ch/lhcbproject/Publications/p/LHCb-PAPER-2023-036.html (LHCb public pages)

    Report number: LHCb-PAPER-2023-036, CERN-EP-2023-253

    Journal ref: JHEP 06 (2024) 098

  24. arXiv:2403.03586  [pdf, other

    hep-ex

    First observation of the $Λ^0_b \to D^+ D^- Λ$ decay

    Authors: LHCb collaboration, R. Aaij, A. S. W. Abdelmotteleb, C. Abellan Beteta, F. Abudinén, T. Ackernley, J. A. Adams, A. A. Adefisoye, B. Adeva, M. Adinolfi, P. Adlarson, C. Agapopoulou, C. A. Aidala, Z. Ajaltouni, S. Akar, K. Akiba, P. Albicocco, J. Albrecht, F. Alessio, M. Alexander, Z. Aliouche, P. Alvarez Cartelle, R. Amalric, S. Amato, J. L. Amey , et al. (1068 additional authors not shown)

    Abstract: The $Λ^0_b \to D^+ D^- Λ$ decay is observed for the first time using proton-proton collision data collected by the LHCb experiment at a center-of-mass energy of $13 \mathrm{TeV}$, corresponding to an integrated luminosity of $5.3 \mathrm{fb}^{-1}$. Using the $B^0 \to D^+ D^- K_{\mathrm{S}}^0$ decay as a reference channel, the product of the relative production cross-section and decay branching fra… ▽ More

    Submitted 6 March, 2024; originally announced March 2024.

    Comments: All figures and tables, along with machine-readable versions and any supplementary material and additional information, are available at https://cern.ch/lhcbproject/Publications/p/LHCb-PAPER-2023-042.html (LHCb public pages)

    Report number: LHCb-PAPER-2023-042, CERN-EP-2024-041

  25. arXiv:2403.03424  [pdf, other

    cs.IR

    Generative News Recommendation

    Authors: Shen Gao, Jiabao Fang, Quan Tu, Zhitao Yao, Zhumin Chen, Pengjie Ren, Zhaochun Ren

    Abstract: Most existing news recommendation methods tackle this task by conducting semantic matching between candidate news and user representation produced by historical clicked news. However, they overlook the high-level connections among different news articles and also ignore the profound relationship between these news articles and users. And the definition of these methods dictates that they can only… ▽ More

    Submitted 5 March, 2024; originally announced March 2024.

    Comments: Accepted by WWW 2024

  26. arXiv:2403.03031  [pdf, other

    cs.CL

    Learning to Use Tools via Cooperative and Interactive Agents

    Authors: Zhengliang Shi, Shen Gao, Xiuyi Chen, Yue Feng, Lingyong Yan, Haibo Shi, Dawei Yin, Pengjie Ren, Suzan Verberne, Zhaochun Ren

    Abstract: Tool learning empowers large language models (LLMs) as agents to use external tools and extend their utility. Existing methods employ one single LLM-based agent to iteratively select and execute tools, thereafter incorporating execution results into the next action prediction. Despite their progress, these methods suffer from performance degradation when addressing practical tasks due to: (1) the… ▽ More

    Submitted 22 June, 2024; v1 submitted 5 March, 2024; originally announced March 2024.

    Comments: working in process

  27. arXiv:2403.01821  [pdf

    quant-ph physics.app-ph

    Complete Interband Transitions for Non-Hermitian Spin-Orbit-Coupled Cold-Atom Systems

    Authors: Dong Liu, Zejian Ren, Wai Chun Wong, Entong Zhao, Chengdong He, Ka Kwan Pak, Gyu-Boong Jo, Jensen Li

    Abstract: Recently, synthetic spin-orbit coupling has been introduced into cold-atom systems for more flexible control of the Hamiltonian, which was further made time-varying through two-photon detuning to achieve dynamic control of the cold-atom state. While an intraband transition can be adiabatically obtained, a complete interband transition, rather than a superposition of different bands, obtained throu… ▽ More

    Submitted 4 March, 2024; originally announced March 2024.

    Comments: 21 pages, 4 figures

  28. arXiv:2402.17959  [pdf, other

    cs.CL cs.HC

    An Iterative Associative Memory Model for Empathetic Response Generation

    Authors: Zhou Yang, Zhaochun Ren, Yufeng Wang, Chao Chen, Haizhou Sun, Xiaofei Zhu, Xiangwen Liao

    Abstract: Empathetic response generation aims to comprehend the cognitive and emotional states in dialogue utterances and generate proper responses. Psychological theories posit that comprehending emotional and cognitive states necessitates iteratively capturing and understanding associated words across dialogue utterances. However, existing approaches regard dialogue utterances as either a long sequence or… ▽ More

    Submitted 2 June, 2024; v1 submitted 27 February, 2024; originally announced February 2024.

    Comments: 12 pages, 4 figures

  29. arXiv:2402.17531  [pdf, other

    cs.SE cs.AI cs.CL

    Nissist: An Incident Mitigation Copilot based on Troubleshooting Guides

    Authors: Kaikai An, Fangkai Yang, Junting Lu, Liqun Li, Zhixing Ren, Hao Huang, Lu Wang, Pu Zhao, Yu Kang, Hua Ding, Qingwei Lin, Saravan Rajmohan, Dongmei Zhang, Qi Zhang

    Abstract: Effective incident management is pivotal for the smooth operation of enterprises-level cloud services. In order to expedite incident mitigation, service teams compile troubleshooting knowledge into Troubleshooting Guides (TSGs) accessible to on-call engineers (OCEs). While automated pipelines are enabled to resolve the most frequent and easy incidents, there still exist complex incidents that requ… ▽ More

    Submitted 10 May, 2024; v1 submitted 27 February, 2024; originally announced February 2024.

    Comments: Work in progress

  30. arXiv:2402.17437  [pdf, other

    cs.CL cs.AI

    Exploiting Emotion-Semantic Correlations for Empathetic Response Generation

    Authors: Zhou Yang, Zhaochun Ren, Yufeng Wang, Xiaofei Zhu, Zhihao Chen, Tiecheng Cai, Yunbing Wu, Yisong Su, Sibo Ju, Xiangwen Liao

    Abstract: Empathetic response generation aims to generate empathetic responses by understanding the speaker's emotional feelings from the language of dialogue. Recent methods capture emotional words in the language of communicators and construct them as static vectors to perceive nuanced emotions. However, linguistic research has shown that emotional words in language are dynamic and have correlations with… ▽ More

    Submitted 27 February, 2024; originally announced February 2024.

    Comments: 12 pages, 3 figures, Findings of EMNLP 2023

  31. arXiv:2402.17315  [pdf

    cond-mat.supr-con

    Superconducting-transition-temperature dependence of superfluid density and conductivity in pressurized cuprate superconductors

    Authors: **yu Zhao, Shu Cai, Yiwen Chen, Genda Gu, Hongtao Yan, **g Guo, **yu Han, Pengyu Wang, Yazhou Zhou, Yanchun Li, Xiaodong Li, Zhian Ren, Qi Wu, Xingjiang Zhou, Yang Ding, Tao Xiang, Ho-kwang Mao, Liling Sun

    Abstract: What factors fundamentally determine the value of superconducting transition temperature (Tc) in high temperature superconductors has been the subject of intense debate. Following the establishment of an empirical law known as Homes'law, there is a growing consensus in the community that the Tc value of the cuprate superconductors is closely linked to its superfluid density and conductivity. Howev… ▽ More

    Submitted 28 February, 2024; v1 submitted 27 February, 2024; originally announced February 2024.

    Comments: 11 pages, 3 figures

  32. arXiv:2402.17263  [pdf, other

    cs.CL

    MELoRA: Mini-Ensemble Low-Rank Adapters for Parameter-Efficient Fine-Tuning

    Authors: Pengjie Ren, Chengshun Shi, Shiguang Wu, Mengqi Zhang, Zhaochun Ren, Maarten de Rijke, Zhumin Chen, Jiahuan Pei

    Abstract: Parameter-efficient fine-tuning (PEFT) is a popular method for tailoring pre-trained large language models (LLMs), especially as the models' scale and the diversity of tasks increase. Low-rank adaptation (LoRA) is based on the idea that the adaptation process is intrinsically low-dimensional, i.e., significant model changes can be represented with relatively few parameters. However, decreasing the… ▽ More

    Submitted 24 June, 2024; v1 submitted 27 February, 2024; originally announced February 2024.

    Comments: ACL2024

    MSC Class: 68T50 ACM Class: I.2.7

  33. arXiv:2402.16061  [pdf, other

    cs.CL

    How Large Language Models Encode Context Knowledge? A Layer-Wise Probing Study

    Authors: Tianjie Ju, Weiwei Sun, Wei Du, Xinwei Yuan, Zhaochun Ren, Gongshen Liu

    Abstract: Previous work has showcased the intriguing capability of large language models (LLMs) in retrieving facts and processing context knowledge. However, only limited research exists on the layer-wise capability of LLMs to encode knowledge, which challenges our understanding of their internal mechanisms. In this paper, we devote the first attempt to investigate the layer-wise capability of LLMs through… ▽ More

    Submitted 4 March, 2024; v1 submitted 25 February, 2024; originally announced February 2024.

    Comments: Accepted at LREC-COLING 2024 (Long Paper)

  34. Modification of $χ_{c1}$(3872) and $ψ$(2$S$) production in $p$Pb collisions at $\sqrt{s_{NN}} = 8.16$ TeV

    Authors: LHCb collaboration, R. Aaij, A. S. W. Abdelmotteleb, C. Abellan Beteta, F. Abudinén, T. Ackernley, B. Adeva, M. Adinolfi, P. Adlarson, C. Agapopoulou, C. A. Aidala, Z. Ajaltouni, S. Akar, K. Akiba, P. Albicocco, J. Albrecht, F. Alessio, M. Alexander, A. Alfonso Albero, Z. Aliouche, P. Alvarez Cartelle, R. Amalric, S. Amato, J. L. Amey, Y. Amhis , et al. (1082 additional authors not shown)

    Abstract: The LHCb collaboration measures production of the exotic hadron $χ_{c1}$(3872) in proton-nucleus collisions for the first time. Comparison with the charmonium state $ψ$(2$S$) suggests that the exotic $χ_{c1}$(3872) experiences different dynamics in the nuclear medium than conventional hadrons, and comparison with data from proton-proton collisions indicates that the presence of the nucleus may mod… ▽ More

    Submitted 19 June, 2024; v1 submitted 22 February, 2024; originally announced February 2024.

    Comments: All figures and tables, along with any supplementary material and additional information, are available at https://cern.ch/lhcbproject/Publications/p/LHCb-PAPER-2023-026.html (LHCb public pages)

    Report number: LHCb-PAPER-2023-026, CERN-EP-2024-033

    Journal ref: Phys. Rev. Lett. 132 (2024) 242301

  35. arXiv:2402.14436  [pdf

    cond-mat.supr-con

    Structural and resistivity properties of Fe$_{1-x}$Co${_x}$Se single crystals grown by the molten salt method

    Authors: Qiaoyu Wang, Mingwei Ma, Binbin Ruan, Menghu Zhou, Yadong Gu, Qingsong Yang, Lewei Chen, Yunqing Shi, Junkun Yi, Genfu Chen, Zhian Ren

    Abstract: A series of tetragonal Fe$_{1-x}$Co${_x}$Se single crystals with a complete Co do** range (0$\leq$x$\leq$0.52) up to its solid solubility limit in FeSe have been grown by an eutectic AlCl${_3}$/KCl molten salt method. The typical lateral size of as-grown Fe$_{1-x}$Co${_x}$Se single crystals is 1$-$5 mm. The chemical composition and homogeneity of the crystals was examined by both inductively cou… ▽ More

    Submitted 22 February, 2024; originally announced February 2024.

  36. VN Network: Embedding Newly Emerging Entities with Virtual Neighbors

    Authors: Yongquan He, Zihan Wang, Peng Zhang, Zhaopeng Tu, Zhaochun Ren

    Abstract: Embedding entities and relations into continuous vector spaces has attracted a surge of interest in recent years. Most embedding methods assume that all test entities are available during training, which makes it time-consuming to retrain embeddings for newly emerging entities. To address this issue, recent works apply the graph neural network on the existing neighbors of the unseen entities. In t… ▽ More

    Submitted 20 February, 2024; originally announced February 2024.

    Comments: 10 pages, 5 figures

    ACM Class: I.2.4; I.2.6

    Journal ref: CIKM (2020) 505-514

  37. arXiv:2402.13408  [pdf, other

    cs.CL

    Healthcare Copilot: Eliciting the Power of General LLMs for Medical Consultation

    Authors: Zhiyao Ren, Yibing Zhan, Baosheng Yu, Liang Ding, Dacheng Tao

    Abstract: The copilot framework, which aims to enhance and tailor large language models (LLMs) for specific complex tasks without requiring fine-tuning, is gaining increasing attention from the community. In this paper, we introduce the construction of a Healthcare Copilot designed for medical consultation. The proposed Healthcare Copilot comprises three main components: 1) the Dialogue component, responsib… ▽ More

    Submitted 20 February, 2024; originally announced February 2024.

  38. arXiv:2402.13042  [pdf, other

    stat.ME

    Not all distributional shifts are equal: Fine-grained robust conformal inference

    Authors: Jiahao Ai, Zhimei Ren

    Abstract: We introduce a fine-grained framework for uncertainty quantification of predictive models under distributional shifts. This framework distinguishes the shift in covariate distributions from that in the conditional relationship between the outcome ($Y$) and the covariates ($X$). We propose to reweight the training samples to adjust for an identifiable covariate shift while protecting against worst-… ▽ More

    Submitted 25 February, 2024; v1 submitted 20 February, 2024; originally announced February 2024.

    Comments: 25 pages, 5 figures

  39. arXiv:2402.11801  [pdf, other

    cs.HC

    Enhancing Empathetic Response Generation by Augmenting LLMs with Small-scale Empathetic Models

    Authors: Zhou Yang, Zhaochun Ren, Wang Yufeng, Shizhong Peng, Haizhou Sun, Xiaofei Zhu, Xiangwen Liao

    Abstract: Empathetic response generation is increasingly significant in AI, necessitating nuanced emotional and cognitive understanding coupled with articulate response expression. Current large language models (LLMs) excel in response expression; however, they lack the ability to deeply understand emotional and cognitive nuances, particularly in pinpointing fine-grained emotions and their triggers. Convers… ▽ More

    Submitted 18 February, 2024; originally announced February 2024.

    Comments: 12 pages, 4 figures

  40. arXiv:2402.11176  [pdf, other

    cs.CL cs.AI

    KnowTuning: Knowledge-aware Fine-tuning for Large Language Models

    Authors: Yougang Lyu, Lingyong Yan, Shuaiqiang Wang, Haibo Shi, Dawei Yin, Pengjie Ren, Zhumin Chen, Maarten de Rijke, Zhaochun Ren

    Abstract: Despite their success at many natural language processing (NLP) tasks, large language models still struggle to effectively leverage knowledge for knowledge-intensive tasks, manifesting limitations such as generating incomplete, non-factual, or illogical answers. These limitations stem from inadequate knowledge awareness of LLMs during vanilla fine-tuning. To address these problems, we propose a kn… ▽ More

    Submitted 17 April, 2024; v1 submitted 16 February, 2024; originally announced February 2024.

  41. arXiv:2402.11111  [pdf, other

    cs.CL

    Language Models as Science Tutors

    Authors: Alexis Chevalier, Jiayi Geng, Alexander Wettig, Howard Chen, Sebastian Mizera, Toni Annala, Max Jameson Aragon, Arturo Rodríguez Fanlo, Simon Frieder, Simon Machado, Akshara Prabhakar, Ellie Thieu, Jiachen T. Wang, Zirui Wang, Xindi Wu, Mengzhou Xia, Wenhan Jia, Jiatong Yu, Jun-Jie Zhu, Zhiyong Jason Ren, Sanjeev Arora, Danqi Chen

    Abstract: NLP has recently made exciting progress toward training language models (LMs) with strong scientific problem-solving skills. However, model development has not focused on real-life use-cases of LMs for science, including applications in education that require processing long scientific documents. To address this, we introduce TutorEval and TutorChat. TutorEval is a diverse question-answering bench… ▽ More

    Submitted 16 February, 2024; originally announced February 2024.

    Comments: 8 pages without bibliography and appendix, 26 pages total

  42. arXiv:2402.09959  [pdf, other

    cs.IR

    LLM-based Federated Recommendation

    Authors: Jujia Zhao, Wenjie Wang, Chen Xu, Zhaochun Ren, See-Kiong Ng, Tat-Seng Chua

    Abstract: Large Language Models (LLMs), with their advanced contextual understanding abilities, have demonstrated considerable potential in enhancing recommendation systems via fine-tuning methods. However, fine-tuning requires users' behavior data, which poses considerable privacy risks due to the incorporation of sensitive user information. The unintended disclosure of such data could infringe upon data p… ▽ More

    Submitted 16 February, 2024; v1 submitted 15 February, 2024; originally announced February 2024.

  43. arXiv:2402.07962  [pdf, other

    nucl-th

    Alpha-like correlations in $^{20}$Ne, comparison of quartetting wave function and THSR approaches

    Authors: G. Röpke, C. Xu, B. Zhou, Z. Z. Ren, Y. Funaki, H. Horiuchi, M. Lyu, A. Tohsaki, T. Yamada

    Abstract: $^{20}$Ne can be considered as a double-magic $^{16}$O core nucleus surrounded by four nucleons, the constituents of an $α$-like quartet. Similar to other nuclei ($^{212}$Po, $^{104}$Ti, etc.) with a quartet on top of a double-magic core nucleus, significant $α$-like correlations are expected. Correlations in the ground state of $^{20}… ▽ More

    Submitted 11 February, 2024; originally announced February 2024.

    Comments: 15 pages, 15 figures. arXiv admin note: text overlap with arXiv:1810.01274

  44. arXiv:2402.07249  [pdf, other

    cs.LG cs.CE q-bio.BM

    Impact of Domain Knowledge and Multi-Modality on Intelligent Molecular Property Prediction: A Systematic Survey

    Authors: Taojie Kuang, Pengfei Liu, Zhixiang Ren

    Abstract: The precise prediction of molecular properties is essential for advancements in drug development, particularly in virtual screening and compound optimization. The recent introduction of numerous deep learning-based methods has shown remarkable potential in enhancing molecular property prediction (MPP), especially improving accuracy and insights into molecular structures. Yet, two critical question… ▽ More

    Submitted 27 June, 2024; v1 submitted 11 February, 2024; originally announced February 2024.

  45. Measurement of the Branching Fraction of $B^{0} \rightarrow J/ψπ^{0}$ Decays

    Authors: LHCb collaboration, R. Aaij, A. S. W. Abdelmotteleb, C. Abellan Beteta, F. Abudinén, T. Ackernley, J. A. Adams, A. A. Adefisoye, B. Adeva, M. Adinolfi, P. Adlarson, C. Agapopoulou, C. A. Aidala, Z. Ajaltouni, S. Akar, K. Akiba, P. Albicocco, J. Albrecht, F. Alessio, M. Alexander, Z. Aliouche, P. Alvarez Cartelle, R. Amalric, S. Amato, J. L. Amey , et al. (1067 additional authors not shown)

    Abstract: The ratio of branching fractions between $B^{0} \rightarrow J/ψπ^{0}$ and $B^{+} \rightarrow J/ψK^{*+}$ decays is measured with proton-proton collision data collected by the LHCb experiment, corresponding to an integrated luminosity of 9 fb$^{-1}$. The measured value is… ▽ More

    Submitted 23 May, 2024; v1 submitted 8 February, 2024; originally announced February 2024.

    Comments: All figures and tables, along with any supplementary material and additional information, are available at https://cern.ch/lhcbproject/Publications/p/LHCb-PAPER-2023-041.html (LHCb public pages)

    Report number: LHCb-PAPER-2023-041, CERN-EP-2024-009

    Journal ref: J. High Energ. Phys. 2024, 65 (2024)

  46. Observation of the $B_c^+ \to J/ψπ^+ π^0$ decay

    Authors: LHCb collaboration, R. Aaij, A. S. W. Abdelmotteleb, C. Abellan Beteta, F. Abudinén, T. Ackernley, J. A. Adams, A. A. Adefisoye, B. Adeva, M. Adinolfi, P. Adlarson, C. Agapopoulou, C. A. Aidala, Z. Ajaltouni, S. Akar, K. Akiba, P. Albicocco, J. Albrecht, F. Alessio, M. Alexander, Z. Aliouche, P. Alvarez Cartelle, R. Amalric, S. Amato, J. L. Amey , et al. (1064 additional authors not shown)

    Abstract: The first observation of the $B_c^+ \to J/ψπ^+ π^0$ decay is reported with high significance using proton-proton collision data, corresponding to an integrated luminosity of 9fb$^{-1}$, collected with the LHCb detector at centre-of-mass energies of 7, 8, and 13 TeV. The ratio of its branching fraction relative to the $B_c^+ \to J/ψπ^+$ channel is measured to be… ▽ More

    Submitted 15 May, 2024; v1 submitted 8 February, 2024; originally announced February 2024.

    Comments: 30 pages, 6 figures. All figures and tables, along with any supplementary material and additional information, are available at https://cern.ch/lhcbproject/Publications/p/LHCb-PAPER-2023-046.html (LHCb public pages)

    Report number: LHCb-PAPER-2023-046, CERN-EP-2024-019

    Journal ref: JHEP04 (2024) 151

  47. arXiv:2402.04405  [pdf

    cs.CE

    Interpretable domain knowledge enhanced machine learning framework on axial capacity prediction of circular CFST columns

    Authors: Dian Wang, Zhigang Ren, Gen Kondo

    Abstract: This study introduces a novel machine learning framework, integrating domain knowledge, to accurately predict the bearing capacity of CFSTs, bridging the gap between traditional engineering and machine learning techniques. Utilizing a comprehensive database of 2621 experimental data points on CFSTs, we developed a Domain Knowledge Enhanced Neural Network (DKNN) model. This model incorporates advan… ▽ More

    Submitted 5 March, 2024; v1 submitted 6 February, 2024; originally announced February 2024.

    Comments: Journal Research Article

  48. arXiv:2402.04119  [pdf, other

    cs.LG cs.CE

    Scientific Language Modeling: A Quantitative Review of Large Language Models in Molecular Science

    Authors: Pengfei Liu, Jun Tao, Zhixiang Ren

    Abstract: Efficient molecular modeling and design are crucial for the discovery and exploration of novel molecules, and the incorporation of deep learning methods has revolutionized this field. In particular, large language models (LLMs) offer a fresh approach to tackle scientific problems from a natural language processing (NLP) perspective, introducing a research paradigm called scientific language modeli… ▽ More

    Submitted 6 February, 2024; originally announced February 2024.

  49. arXiv:2402.02825  [pdf, other

    gr-qc astro-ph.IM

    GWAI: Harnessing Artificial Intelligence for Enhancing Gravitational Wave Data Analysis

    Authors: Tianyu Zhao, Yue Zhou, Ruijun Shi, Zhoujian Cao, Zhixiang Ren

    Abstract: Gravitational wave (GW) astronomy has opened new frontiers in understanding the cosmos, while the integration of artificial intelligence (AI) in science promises to revolutionize data analysis methodologies. However, a significant gap exists, as there is currently no dedicated platform that enables scientists to develop, test, and evaluate AI algorithms efficiently. To address this gap, we introdu… ▽ More

    Submitted 5 February, 2024; originally announced February 2024.

    Comments: 10 pages, 5 figures

  50. arXiv:2402.01336  [pdf, other

    hep-ex

    Measurements of the branching fraction ratio $\cal{B}(φ\to μ^+μ^-)/\cal{B}(φ\to e^+e^-)$ with charm meson decays

    Authors: LHCb collaboration, R. Aaij, A. S. W. Abdelmotteleb, C. Abellan Beteta, F. Abudinén, T. Ackernley, B. Adeva, M. Adinolfi, P. Adlarson, C. Agapopoulou, C. A. Aidala, Z. Ajaltouni, S. Akar, K. Akiba, P. Albicocco, J. Albrecht, F. Alessio, M. Alexander, A. Alfonso Albero, Z. Aliouche, P. Alvarez Cartelle, R. Amalric, S. Amato, J. L. Amey, Y. Amhis , et al. (1080 additional authors not shown)

    Abstract: Measurements of the branching fraction ratio ${\cal{B}(φ\to μ^+ μ^-)/\cal{B}(φ\to e^+e^-)}$ with ${D_{s}^{+} \to π^{+} φ}$ and ${D^{+} \to π^{+} φ}$ decays, denoted $R^{s}_{φπ}$ and $R^{d}_{φπ}$, are presented. The analysis is performed using a dataset corresponding to an integrated luminosity of 5.4$\,\rm{fb}^{-1}$ of $pp$ collision data collected with the LHCb experiment. The branching fractions… ▽ More

    Submitted 1 May, 2024; v1 submitted 2 February, 2024; originally announced February 2024.

    Comments: All figures and tables, along with any supplementary material and additional information, are available at https://cern.ch/lhcbproject/Publications/p/LHCb-PAPER-2023-038.html (LHCb public pages)

    Report number: LHCb-PAPER-2023-038, CERN-EP-2024-001