Skip to main content

Showing 301–350 of 4,248 results for author: Liu, L

.
  1. arXiv:2402.13598  [pdf, other

    cs.CL cs.AI cs.LG

    User-LLM: Efficient LLM Contextualization with User Embeddings

    Authors: Lin Ning, Luyang Liu, Jiaxing Wu, Neo Wu, Devora Berlowitz, Sushant Prakash, Bradley Green, Shawn O'Banion, Jun Xie

    Abstract: Large language models (LLMs) have revolutionized natural language processing. However, effectively incorporating complex and potentially noisy user interaction data remains a challenge. To address this, we propose User-LLM, a novel framework that leverages user embeddings to contextualize LLMs. These embeddings, distilled from diverse user interactions using self-supervised pretraining, capture la… ▽ More

    Submitted 21 February, 2024; originally announced February 2024.

  2. arXiv:2402.13577  [pdf, other

    cs.CL

    BBA: Bi-Modal Behavioral Alignment for Reasoning with Large Vision-Language Models

    Authors: Xueliang Zhao, Xinting Huang, Tingchen Fu, Qintong Li, Shansan Gong, Lemao Liu, Wei Bi, Lingpeng Kong

    Abstract: Multimodal reasoning stands as a pivotal capability for large vision-language models (LVLMs). The integration with Domain-Specific Languages (DSL), offering precise visual representations, equips these models with the opportunity to execute more accurate reasoning in complex and professional domains. However, the vanilla Chain-of-Thought (CoT) prompting method faces challenges in effectively lever… ▽ More

    Submitted 21 February, 2024; originally announced February 2024.

    Comments: Preprint

  3. arXiv:2402.13443  [pdf, other

    cs.RO eess.SY

    Autonomous Mapless Navigation on Uneven Terrains

    Authors: Hassan Jardali, Mahmoud Ali, Lantao Liu

    Abstract: We propose a new method for autonomous navigation in uneven terrains by utilizing a sparse Gaussian Process (SGP) based local perception model. The SGP local perception model is trained on local ranging observation (pointcloud) to learn the terrain elevation profile and extract the feasible navigation subgoals around the robot. Subsequently, a cost function, which prioritizes the safety of the rob… ▽ More

    Submitted 20 February, 2024; originally announced February 2024.

    Comments: This paper has 7 pages, 7 figures, 2 tables. It has been accepted for publication at the 2024 IEEE International Conference on Robotics and Automation (ICRA2024)

  4. arXiv:2402.13270  [pdf, other

    physics.ao-ph cs.AI cs.LG physics.data-an

    Global Tropical Cyclone Intensity Forecasting with Multi-modal Multi-scale Causal Autoregressive Model

    Authors: Xinyu Wang, Kang Chen, Lei Liu, Tao Han, Bin Li, Lei Bai

    Abstract: Accurate forecasting of Tropical cyclone (TC) intensity is crucial for formulating disaster risk reduction strategies. Current methods predominantly rely on limited spatiotemporal information from ERA5 data and neglect the causal relationships between these physical variables, failing to fully capture the spatial and temporal patterns required for intensity forecasting. To address this issue, we p… ▽ More

    Submitted 16 February, 2024; originally announced February 2024.

  5. arXiv:2402.12762  [pdf, ps, other

    stat.ML cs.LG

    Learning under Singularity: An Information Criterion improving WBIC and sBIC

    Authors: Lirui Liu, Joe Suzuki

    Abstract: We introduce a novel Information Criterion (IC), termed Learning under Singularity (LS), designed to enhance the functionality of the Widely Applicable Bayes Information Criterion (WBIC) and the Singular Bayesian Information Criterion (sBIC). LS is effective without regularity constraints and demonstrates stability. Watanabe defined a statistical model or a learning machine as regular if the mappi… ▽ More

    Submitted 22 February, 2024; v1 submitted 20 February, 2024; originally announced February 2024.

  6. HIP Network: Historical Information Passing Network for Extrapolation Reasoning on Temporal Knowledge Graph

    Authors: Yongquan He, Peng Zhang, Luchen Liu, Qi Liang, Wenyuan Zhang, Chuang Zhang

    Abstract: In recent years, temporal knowledge graph (TKG) reasoning has received significant attention. Most existing methods assume that all timestamps and corresponding graphs are available during training, which makes it difficult to predict future events. To address this issue, recent works learn to infer future events based on historical information. However, these methods do not comprehensively consid… ▽ More

    Submitted 19 February, 2024; originally announced February 2024.

    Comments: 7 pages, 3 figures

    ACM Class: I.2.4; I.2.6; I.2.7

    Journal ref: IJCAI (2021) 1915-1921

  7. arXiv:2402.11623  [pdf

    quant-ph physics.optics

    Filter-free high-performance single photon emission from a quantum dot in a Fabry-Perot microcavity

    Authors: Zhixuan Rao, Jiawei Yang, Changkun Song, Mujie Rao, Ziyang Zheng, Luyu Liu, Xuebin Peng, Ying Yu, Siyuan Yu

    Abstract: Combining resonant excitation with Purcell-enhanced single quantum dots (QDs) stands out as a prominent strategy for realizing high performance solid-state single photon sources. However, optimizing photon efficiency requires addressing challenges associated with effectively separating the excitation laser from QDs' emission. Traditionally, this involves polarization filtering, which limits the ac… ▽ More

    Submitted 17 March, 2024; v1 submitted 18 February, 2024; originally announced February 2024.

    Comments: 9 pages, 4 figures

  8. arXiv:2402.11497  [pdf, other

    cs.CV

    Thyroid ultrasound diagnosis improvement via multi-view self-supervised learning and two-stage pre-training

    Authors: Jian Wang, Xin Yang, Xiaohong Jia, Wufeng Xue, Rusi Chen, Yanlin Chen, Xiliang Zhu, Lian Liu, Yan Cao, Jianqiao Zhou, Dong Ni, Ning Gu

    Abstract: Thyroid nodule classification and segmentation in ultrasound images are crucial for computer-aided diagnosis; however, they face limitations owing to insufficient labeled data. In this study, we proposed a multi-view contrastive self-supervised method to improve thyroid nodule classification and segmentation performance with limited manual labels. Our method aligns the transverse and longitudinal… ▽ More

    Submitted 18 February, 2024; originally announced February 2024.

    Comments: The article has been accepted by the journal of Computers in Biology and Medicine

  9. arXiv:2402.11484  [pdf, ps, other

    quant-ph

    Optimal Quantum State Tomography via Weak Value

    Authors: Xuanmin Zhu, Dezheng Zhang, Run** Gao, Qun wei, Lixia Liu, Zijiang Luo

    Abstract: To improve the efficiency of the state tomography strategy via weak value, we have searched the optimal coupling strength between the system and measuring device. For an arbitrary d-dimensional quantum system, the optimal strengths being used in measuring the real and imaginary parts of the density matrix are obtained. The optimal efficiency of the state tomography has also been studied by using m… ▽ More

    Submitted 22 February, 2024; v1 submitted 18 February, 2024; originally announced February 2024.

    Comments: 11page, 3figures

  10. arXiv:2402.11207  [pdf, ps, other

    hep-ex

    Search for the production of deuterons and antideuterons in e^+e^- annihilation at center-of-mass energies between 4.13 and 4.70 GeV

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, O. Afedulidis, R. Aliberti, A. Amoroso, M. R. An, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere, A. Brueggemann , et al. (593 additional authors not shown)

    Abstract: Using a data sample of $e^+e^-$ collision data corresponding to an integrated luminosity of 19 fb$^{-1}$ collected with the BESIII detector at the BEPCII collider, we search for the production of deuterons and antideuterons via $e^+e^-\to ppπ^-\bar{d}+c.c.$ for the first time at center-of-mass energies between 4.13 and 4.70 GeV. No significant signal is observed and the upper limit of the… ▽ More

    Submitted 17 February, 2024; originally announced February 2024.

  11. arXiv:2402.11192  [pdf, other

    cs.CL cs.AI

    I Learn Better If You Speak My Language: Understanding the Superior Performance of Fine-Tuning Large Language Models with LLM-Generated Responses

    Authors: Xuan Ren, Biao Wu, Lingqiao Liu

    Abstract: This paper explores an intriguing observation: fine-tuning a large language model (LLM) with responses generated by a LLM often yields better results than using responses generated by humans. We conduct an in-depth investigation to understand why this occurs. Contrary to the common belief that these instances is simply due to the more detailed nature of LLM-generated content, our study identifies… ▽ More

    Submitted 31 May, 2024; v1 submitted 17 February, 2024; originally announced February 2024.

  12. arXiv:2402.09456  [pdf, other

    cs.LG cs.AI cs.GT stat.ML

    Optimistic Thompson Sampling for No-Regret Learning in Unknown Games

    Authors: Yingru Li, Liangqi Liu, Wenqiang Pu, Hao Liang, Zhi-Quan Luo

    Abstract: This work tackles the complexities of multi-player scenarios in \emph{unknown games}, where the primary challenge lies in navigating the uncertainty of the environment through bandit feedback alongside strategic decision-making. We introduce Thompson Sampling (TS)-based algorithms that exploit the information of opponents' actions and reward structures, leading to a substantial reduction in experi… ▽ More

    Submitted 24 February, 2024; v1 submitted 7 February, 2024; originally announced February 2024.

  13. arXiv:2402.07610  [pdf, other

    cs.CL cs.AI

    Step-On-Feet Tuning: Scaling Self-Alignment of LLMs via Bootstrap**

    Authors: Haoyu Wang, Guozheng Ma, Ziqiao Meng, Zeyu Qin, Li Shen, Zhong Zhang, Bingzhe Wu, Liu Liu, Yatao Bian, Tingyang Xu, Xueqian Wang, Peilin Zhao

    Abstract: Self-alignment is an effective way to reduce the cost of human annotation while ensuring promising model capability. However, most current methods complete the data collection and training steps in a single round, which may overlook the continuously improving ability of self-aligned models. This gives rise to a key query: What if we do multi-time bootstrap** self-alignment? Does this strategy en… ▽ More

    Submitted 27 June, 2024; v1 submitted 12 February, 2024; originally announced February 2024.

  14. arXiv:2402.07060  [pdf, other

    math.NA math.AP

    Spectral convergence of a semi-discretized numerical system for the spatially homogeneous Boltzmann equation with uncertainties

    Authors: Liu Liu, Kunlun Qi

    Abstract: In this paper, we study the Boltzmann equation with uncertainties and prove that the spectral convergence of the semi-discretized numerical system holds in a combined velocity and random space, where the Fourier-spectral method is applied for approximation in the velocity space whereas the generalized polynomial chaos (gPC)-based stochastic Galerkin (SG) method is employed to discretize the random… ▽ More

    Submitted 6 May, 2024; v1 submitted 10 February, 2024; originally announced February 2024.

    Comments: Revised version. To appear in SIAM/ASA Journal on Uncertainty Quantification

    MSC Class: Primary 35Q20; 65M12; Secondary 65M70; 45G10

  15. arXiv:2402.05869  [pdf, other

    cs.CV

    Adaptive Surface Normal Constraint for Geometric Estimation from Monocular Images

    Authors: Xiaoxiao Long, Yuhang Zheng, Yupeng Zheng, Beiwen Tian, Cheng Lin, Lingjie Liu, Hao Zhao, Guyue Zhou, Wen** Wang

    Abstract: We introduce a novel approach to learn geometries such as depth and surface normal from images while incorporating geometric context. The difficulty of reliably capturing geometric context in existing methods impedes their ability to accurately enforce the consistency between the different geometric properties, thereby leading to a bottleneck of geometric estimation quality. We therefore propose t… ▽ More

    Submitted 31 March, 2024; v1 submitted 8 February, 2024; originally announced February 2024.

    Comments: Accepted by TPAMI. arXiv admin note: substantial text overlap with arXiv:2103.15483

  16. arXiv:2402.05827  [pdf, other

    cs.CL

    Is it Possible to Edit Large Language Models Robustly?

    Authors: Xinbei Ma, Tianjie Ju, Jiyang Qiu, Zhuosheng Zhang, Hai Zhao, Lifeng Liu, Yulong Wang

    Abstract: Large language models (LLMs) have played a pivotal role in building communicative AI to imitate human behaviors but face the challenge of efficient customization. To tackle this challenge, recent studies have delved into the realm of model editing, which manipulates specific memories of language models and changes the related language generation. However, the robustness of model editing remains an… ▽ More

    Submitted 8 February, 2024; originally announced February 2024.

    Comments: Working in progress

  17. arXiv:2402.05650  [pdf, other

    cs.SE cs.AI

    Rocks Coding, Not Development--A Human-Centric, Experimental Evaluation of LLM-Supported SE Tasks

    Authors: Wei Wang, Huilong Ning, Gaowei Zhang, Libo Liu, Yi Wang

    Abstract: Recently, large language models (LLM) based generative AI has been gaining momentum for their impressive high-quality performances in multiple domains, particularly after the release of the ChatGPT. Many believe that they have the potential to perform general-purpose problem-solving in software development and replace human software developers. Nevertheless, there are in a lack of serious investig… ▽ More

    Submitted 21 February, 2024; v1 submitted 8 February, 2024; originally announced February 2024.

    Comments: The paper has been accepted by FSE

    MSC Class: 65-XX ACM Class: D.2; I.2

  18. arXiv:2402.05636  [pdf

    cs.SE cs.AI

    The Impact of AI Tool on Engineering at ANZ Bank An Empirical Study on GitHub Copilot within Corporate Environment

    Authors: Sayan Chatterjee, Ching Louis Liu, Gareth Rowland, Tim Hogarth

    Abstract: The increasing popularity of AI, particularly Large Language Models (LLMs), has significantly impacted various domains, including Software Engineering. This study explores the integration of AI tools in software engineering practices within a large organization. We focus on ANZ Bank, which employs over 5000 engineers covering all aspects of the software development life cycle. This paper details a… ▽ More

    Submitted 17 April, 2024; v1 submitted 8 February, 2024; originally announced February 2024.

    Comments: 16 pages, 4 figures. in proceeding for 10th International Conference on Software Engineering (SEC 2024)

  19. Measurement of the Branching Fraction of $B^{0} \rightarrow J/ψπ^{0}$ Decays

    Authors: LHCb collaboration, R. Aaij, A. S. W. Abdelmotteleb, C. Abellan Beteta, F. Abudinén, T. Ackernley, J. A. Adams, A. A. Adefisoye, B. Adeva, M. Adinolfi, P. Adlarson, C. Agapopoulou, C. A. Aidala, Z. Ajaltouni, S. Akar, K. Akiba, P. Albicocco, J. Albrecht, F. Alessio, M. Alexander, Z. Aliouche, P. Alvarez Cartelle, R. Amalric, S. Amato, J. L. Amey , et al. (1067 additional authors not shown)

    Abstract: The ratio of branching fractions between $B^{0} \rightarrow J/ψπ^{0}$ and $B^{+} \rightarrow J/ψK^{*+}$ decays is measured with proton-proton collision data collected by the LHCb experiment, corresponding to an integrated luminosity of 9 fb$^{-1}$. The measured value is… ▽ More

    Submitted 23 May, 2024; v1 submitted 8 February, 2024; originally announced February 2024.

    Comments: All figures and tables, along with any supplementary material and additional information, are available at https://cern.ch/lhcbproject/Publications/p/LHCb-PAPER-2023-041.html (LHCb public pages)

    Report number: LHCb-PAPER-2023-041, CERN-EP-2024-009

    Journal ref: J. High Energ. Phys. 2024, 65 (2024)

  20. Observation of the $B_c^+ \to J/ψπ^+ π^0$ decay

    Authors: LHCb collaboration, R. Aaij, A. S. W. Abdelmotteleb, C. Abellan Beteta, F. Abudinén, T. Ackernley, J. A. Adams, A. A. Adefisoye, B. Adeva, M. Adinolfi, P. Adlarson, C. Agapopoulou, C. A. Aidala, Z. Ajaltouni, S. Akar, K. Akiba, P. Albicocco, J. Albrecht, F. Alessio, M. Alexander, Z. Aliouche, P. Alvarez Cartelle, R. Amalric, S. Amato, J. L. Amey , et al. (1064 additional authors not shown)

    Abstract: The first observation of the $B_c^+ \to J/ψπ^+ π^0$ decay is reported with high significance using proton-proton collision data, corresponding to an integrated luminosity of 9fb$^{-1}$, collected with the LHCb detector at centre-of-mass energies of 7, 8, and 13 TeV. The ratio of its branching fraction relative to the $B_c^+ \to J/ψπ^+$ channel is measured to be… ▽ More

    Submitted 15 May, 2024; v1 submitted 8 February, 2024; originally announced February 2024.

    Comments: 30 pages, 6 figures. All figures and tables, along with any supplementary material and additional information, are available at https://cern.ch/lhcbproject/Publications/p/LHCb-PAPER-2023-046.html (LHCb public pages)

    Report number: LHCb-PAPER-2023-046, CERN-EP-2024-019

    Journal ref: JHEP04 (2024) 151

  21. arXiv:2402.05383  [pdf, other

    nucl-ex hep-ex

    First measurement of the yield of $^8$He isotopes produced in liquid scintillator by cosmic-ray muons at Daya Bay

    Authors: Daya Bay Collaboration, F. P. An, W. D. Bai, A. B. Balantekin, M. Bishai, S. Blyth, G. F. Cao, J. Cao, J. F. Chang, Y. Chang, H. S. Chen, H. Y. Chen, S. M. Chen, Y. Chen, Y. X. Chen, Z. Y. Chen, J. Cheng, Y. C. Cheng, Z. K. Cheng, J. J. Cherwinka, M. C. Chu, J. P. Cummings, O. Dalager, F. S. Deng, X. Y. Ding , et al. (177 additional authors not shown)

    Abstract: Daya Bay presents the first measurement of cosmogenic $^8$He isotope production in liquid scintillator, using an innovative method for identifying cascade decays of $^8$He and its child isotope, $^8$Li. We also measure the production yield of $^9$Li isotopes using well-established methodology. The results, in units of 10$^{-8}μ^{-1}$g$^{-1}$cm$^{2}$, are 0.307$\pm$0.042, 0.341$\pm$0.040, and 0.546… ▽ More

    Submitted 7 February, 2024; originally announced February 2024.

  22. arXiv:2402.04644  [pdf, other

    cs.LG cs.AI

    LEVI: Generalizable Fine-tuning via Layer-wise Ensemble of Different Views

    Authors: Yuji Roh, Qingyun Liu, Huan Gui, Zhe Yuan, Yu** Tang, Steven Euijong Whang, Liang Liu, Shuchao Bi, Lichan Hong, Ed H. Chi, Zhe Zhao

    Abstract: Fine-tuning is becoming widely used for leveraging the power of pre-trained foundation models in new downstream tasks. While there are many successes of fine-tuning on various tasks, recent studies have observed challenges in the generalization of fine-tuned models to unseen distributions (i.e., out-of-distribution; OOD). To improve OOD generalization, some previous studies identify the limitation… ▽ More

    Submitted 18 June, 2024; v1 submitted 7 February, 2024; originally announced February 2024.

    Comments: In Proceedings of the 41st International Conference on Machine Learning (ICML), 2024

  23. arXiv:2402.04325  [pdf, other

    cs.LG cs.AI cs.CR

    Enhance DNN Adversarial Robustness and Efficiency via Injecting Noise to Non-Essential Neurons

    Authors: Zhenyu Liu, Garrett Gagnon, Swagath Venkataramani, Liu Liu

    Abstract: Deep Neural Networks (DNNs) have revolutionized a wide range of industries, from healthcare and finance to automotive, by offering unparalleled capabilities in data analysis and decision-making. Despite their transforming impact, DNNs face two critical challenges: the vulnerability to adversarial attacks and the increasing computational costs associated with more complex and larger models. In this… ▽ More

    Submitted 6 February, 2024; originally announced February 2024.

  24. arXiv:2402.03829  [pdf, ps, other

    hep-ex

    Precise Measurement of Born Cross Sections for $e^+e^-\to D\bar{D}$ and Observation of One Structure between $\sqrt{s} = 3.80-4.95$ GeV

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, X. C. Ai, R. Aliberti, A. Amoroso, M. R. An, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere, A. Brueggemann , et al. (604 additional authors not shown)

    Abstract: Using data samples collected with the BESIII detector at the BEPCII collider at center-of-mass energies ranging from 3.80 to 4.95 GeV, corresponding to an integrated luminosity of 20 fb$^{-1}$, a measurement of Born cross sections for the $e^+e^-\to D^{0}\bar{D}^{0}$ and $D^{+}D^{-}$ processes is presented with unprecedented precision. By performing a simultaneous fit to the dressed cross sections… ▽ More

    Submitted 6 February, 2024; originally announced February 2024.

    Comments: 9 pages, 4 figures, 1 tables, 1 Supplemental_Material

  25. arXiv:2402.03774  [pdf, other

    cs.LG cs.AI cs.CL

    Learning a Decision Tree Algorithm with Transformers

    Authors: Yufan Zhuang, Liyuan Liu, Chandan Singh, **gbo Shang, Jianfeng Gao

    Abstract: Decision trees are renowned for their interpretability capability to achieve high predictive performance, especially on tabular data. Traditionally, they are constructed through recursive algorithms, where they partition the data at every node in a tree. However, identifying the best partition is challenging, as decision trees optimized for local segments may not bring global generalization. To ad… ▽ More

    Submitted 6 February, 2024; originally announced February 2024.

  26. arXiv:2402.03021  [pdf, other

    cs.LG math.NA

    Data-induced multiscale losses and efficient multirate gradient descent schemes

    Authors: Juncai He, Liangchen Liu, Yen-Hsi Richard Tsai

    Abstract: This paper investigates the impact of multiscale data on machine learning algorithms, particularly in the context of deep learning. A dataset is multiscale if its distribution shows large variations in scale across different directions. This paper reveals multiscale structures in the loss landscape, including its gradients and Hessians inherited from the data. Correspondingly, it introduces a nove… ▽ More

    Submitted 6 February, 2024; v1 submitted 5 February, 2024; originally announced February 2024.

    Comments: 28 pages, 4 figures, submitted under review

    MSC Class: 65F10; 65F45; 68T07 ACM Class: G.1.6; I.2.6

  27. arXiv:2402.02935  [pdf, other

    nucl-th astro-ph.SR nucl-ex

    Nuclear mass table in deformed relativistic Hartree-Bogoliubov theory in continuum, II: Even-$Z$ nuclei

    Authors: DRHBc Mass Table Collaboration, Peng Guo, Xiaojie Cao, Kangmin Chen, Zhihui Chen, Myung-Ki Cheoun, Yong-Beom Choi, Pak Chung Lam, Wenmin Deng, Jianmin Dong, Pengxiang Du, Xiaokai Du, Kangda Duan, Xiaohua Fan, Wei Gao, Lisheng Geng, Eunja Ha, Xiao-Tao He, **niu Hu, **gke Huang, Kun Huang, Yanan Huang, Zidan Huang, Kim Da Hyung, Hoi Yat Chan , et al. (58 additional authors not shown)

    Abstract: The mass table in the deformed relativistic Hartree-Bogoliubov theory in continuum (DRHBc) with the PC-PK1 density functional has been established for even-$Z$ nuclei with $8\le Z\le120$, extended from the previous work for even-even nuclei [Zhang $\it{et.~al.}$ (DRHBc Mass Table Collaboration), At. Data Nucl. Data Tables 144, 101488 (2022)]. The calculated binding energies, two-nucleon and one-ne… ▽ More

    Submitted 10 June, 2024; v1 submitted 5 February, 2024; originally announced February 2024.

    Comments: 394 pages, 17 figures, 2 tables, published in Atomic Data and Nuclear Data Tables, data file in the TXT form is available for download under "Ancillary files"

    Journal ref: Peng Guo, et. al. (DRHBc Mass Table Collaboration), Atomic Data and Nuclear Data Tables 158 (2024) 101661

  28. arXiv:2402.02065  [pdf, other

    cs.LG

    Training Implicit Networks for Image Deblurring using Jacobian-Free Backpropagation

    Authors: Linghai Liu, Shuaicheng Tong, Lisa Zhao

    Abstract: Recent efforts in applying implicit networks to solve inverse problems in imaging have achieved competitive or even superior results when compared to feedforward networks. These implicit networks only require constant memory during backpropagation, regardless of the number of layers. However, they are not necessarily easy to train. Gradient calculations are computationally expensive because they r… ▽ More

    Submitted 3 February, 2024; originally announced February 2024.

  29. arXiv:2402.01993  [pdf, other

    hep-ex

    Measurement of the Electromagnetic Transition Form-factors in the decays $η'\rightarrowπ^+π^-l^+l^-$

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, O. Afedulidis, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere , et al. (618 additional authors not shown)

    Abstract: With a sample of $(10087\pm44)\times10^{6}$ $J/ψ$ events accumulated with the BESIII detector, we analyze the decays $η'\rightarrowπ^+π^-l^+l^-(l=e,$ $μ)$ via the process $J/ψ\rightarrowγη'$. The branching fractions are measured to be $\mathcal{B}(η'\rightarrowπ^+π^-e^+e^-)=(2.45\pm0.02(\rm{stat.})\pm0.08(\rm{syst.})) \times10^{-3}$ and… ▽ More

    Submitted 2 February, 2024; originally announced February 2024.

  30. arXiv:2402.01336  [pdf, other

    hep-ex

    Measurements of the branching fraction ratio $\cal{B}(φ\to μ^+μ^-)/\cal{B}(φ\to e^+e^-)$ with charm meson decays

    Authors: LHCb collaboration, R. Aaij, A. S. W. Abdelmotteleb, C. Abellan Beteta, F. Abudinén, T. Ackernley, B. Adeva, M. Adinolfi, P. Adlarson, C. Agapopoulou, C. A. Aidala, Z. Ajaltouni, S. Akar, K. Akiba, P. Albicocco, J. Albrecht, F. Alessio, M. Alexander, A. Alfonso Albero, Z. Aliouche, P. Alvarez Cartelle, R. Amalric, S. Amato, J. L. Amey, Y. Amhis , et al. (1080 additional authors not shown)

    Abstract: Measurements of the branching fraction ratio ${\cal{B}(φ\to μ^+ μ^-)/\cal{B}(φ\to e^+e^-)}$ with ${D_{s}^{+} \to π^{+} φ}$ and ${D^{+} \to π^{+} φ}$ decays, denoted $R^{s}_{φπ}$ and $R^{d}_{φπ}$, are presented. The analysis is performed using a dataset corresponding to an integrated luminosity of 5.4$\,\rm{fb}^{-1}$ of $pp$ collision data collected with the LHCb experiment. The branching fractions… ▽ More

    Submitted 1 May, 2024; v1 submitted 2 February, 2024; originally announced February 2024.

    Comments: All figures and tables, along with any supplementary material and additional information, are available at https://cern.ch/lhcbproject/Publications/p/LHCb-PAPER-2023-038.html (LHCb public pages)

    Report number: LHCb-PAPER-2023-038, CERN-EP-2024-001

  31. arXiv:2402.01157  [pdf, other

    cs.CV

    Source-Free Unsupervised Domain Adaptation with Hypothesis Consolidation of Prediction Rationale

    Authors: Yangyang Shu, Xiaofeng Cao, Qi Chen, Bowen Zhang, Ziqin Zhou, Anton van den Hengel, Lingqiao Liu

    Abstract: Source-Free Unsupervised Domain Adaptation (SFUDA) is a challenging task where a model needs to be adapted to a new domain without access to target domain labels or source domain data. The primary difficulty in this task is that the model's predictions may be inaccurate, and using these inaccurate predictions for model adaptation can lead to misleading results. To address this issue, this paper pr… ▽ More

    Submitted 2 February, 2024; originally announced February 2024.

  32. arXiv:2402.01118  [pdf, other

    cs.AI cs.CL

    PokeLLMon: A Human-Parity Agent for Pokemon Battles with Large Language Models

    Authors: Sihao Hu, Tiansheng Huang, Ling Liu

    Abstract: We introduce PokeLLMon, the first LLM-embodied agent that achieves human-parity performance in tactical battle games, as demonstrated in Pokemon battles. The design of PokeLLMon incorporates three key strategies: (i) In-context reinforcement learning that instantly consumes text-based feedback derived from battles to iteratively refine the policy; (ii) Knowledge-augmented generation that retrieves… ▽ More

    Submitted 2 April, 2024; v1 submitted 1 February, 2024; originally announced February 2024.

    Comments: 10 pages

  33. arXiv:2402.01109  [pdf, other

    cs.LG cs.CL

    Vaccine: Perturbation-aware Alignment for Large Language Model

    Authors: Tiansheng Huang, Sihao Hu, Ling Liu

    Abstract: The new paradigm of finetuning-as-a-service introduces a new attack surface for Large Language Models (LLMs): a few harmful data uploaded by users can easily trick the finetuning to produce an alignment-broken model. We conduct an empirical analysis and uncover a \textit{harmful embedding drift} phenomenon, showing a probable cause of the alignment-broken effect. Inspired by our findings, we propo… ▽ More

    Submitted 29 February, 2024; v1 submitted 1 February, 2024; originally announced February 2024.

  34. arXiv:2402.01096  [pdf, other

    cs.LG cs.AI cs.CR cs.DC

    Trustworthy Distributed AI Systems: Robustness, Privacy, and Governance

    Authors: Wenqi Wei, Ling Liu

    Abstract: Emerging Distributed AI systems are revolutionizing big data computing and data processing capabilities with growing economic and societal impact. However, recent studies have identified new attack surfaces and risks caused by security, privacy, and fairness issues in AI systems. In this paper, we review representative techniques, algorithms, and theoretical foundations for trustworthy distributed… ▽ More

    Submitted 1 February, 2024; originally announced February 2024.

    Comments: Manuscript accepted to ACM Computing Surveys

  35. Lightweight Pixel Difference Networks for Efficient Visual Representation Learning

    Authors: Zhuo Su, Jiehua Zhang, Longguang Wang, Hua Zhang, Zhen Liu, Matti Pietikäinen, Li Liu

    Abstract: Recently, there have been tremendous efforts in develo** lightweight Deep Neural Networks (DNNs) with satisfactory accuracy, which can enable the ubiquitous deployment of DNNs in edge devices. The core challenge of develo** compact and efficient DNNs lies in how to balance the competing goals of achieving high accuracy and high efficiency. In this paper we propose two novel types of convolutio… ▽ More

    Submitted 1 February, 2024; originally announced February 2024.

    Comments: We design a novel lightweight convolutional operator for computer vision tasks. Both full-precision networks and BNNs are developed. Accepted by TPAMI

  36. arXiv:2402.00416  [pdf, ps, other

    math.CO math.SP

    A characterization of extremal non-transmission-regular graphs by the distance (signless Laplacian) spectral radius

    Authors: **gfen Lan, Lele Liu

    Abstract: Let $G$ be a simple connected graph of order $n$ and $\partial(G)$ is the spectral radius of the distance matrix $D(G)$ of $G$. The transmission $D_i$ of vertex $i$ is the $i$-th row sum of $D(G)$. Denote by $D_{\max}(G)$ the maximum of transmissions over all vertices of $G$, and $\partial^Q(G)$ is the spectral radius of the distance signless Laplacian matrix… ▽ More

    Submitted 1 February, 2024; originally announced February 2024.

    MSC Class: 05C50

  37. arXiv:2402.00388  [pdf, other

    cs.LG cs.AI stat.ML

    Cumulative Distribution Function based General Temporal Point Processes

    Authors: Maolin Wang, Yu Pan, Zenglin Xu, Ruocheng Guo, Xiangyu Zhao, Wanyu Wang, Yiqi Wang, Zitao Liu, Langming Liu

    Abstract: Temporal Point Processes (TPPs) hold a pivotal role in modeling event sequences across diverse domains, including social networking and e-commerce, and have significantly contributed to the advancement of recommendation systems and information retrieval strategies. Through the analysis of events such as user interactions and transactions, TPPs offer valuable insights into behavioral patterns, faci… ▽ More

    Submitted 1 February, 2024; originally announced February 2024.

  38. Enhanced contact flexibility from nanoparticles in capillary suspensions

    Authors: Lingyue Liu, Jens Allard, Erin Koos

    Abstract: Hypothesis: Sample-spanning particle networks are used to induce structure and a yield stress, necessary for 3D printing of porous ceramics and paints. In capillary suspensions, a small quantity of immiscible secondary fluid is incorporated into a suspension. By further adding nanoparticles with a range of hydrophobicities, the structure of the bridges and microparticle-microparticle contacts shou… ▽ More

    Submitted 14 March, 2024; v1 submitted 31 January, 2024; originally announced January 2024.

    Comments: SI included

    Journal ref: Journal of Colloid and Interface Science, Volume 665, July 2024, Pages 643-654

  39. Study of $CP$ violation in $B^0_{(s)} \to D K^{*}(892)^0$ decays with $D \to K π( ππ)$, $ ππ( ππ)$, and $KK$ final states

    Authors: LHCb collaboration, R. Aaij, A. S. W. Abdelmotteleb, C. Abellan Beteta, F. Abudinén, T. Ackernley, B. Adeva, M. Adinolfi, P. Adlarson, C. Agapopoulou, C. A. Aidala, Z. Ajaltouni, S. Akar, K. Akiba, P. Albicocco, J. Albrecht, F. Alessio, M. Alexander, A. Alfonso Albero, Z. Aliouche, P. Alvarez Cartelle, R. Amalric, S. Amato, J. L. Amey, Y. Amhis , et al. (1072 additional authors not shown)

    Abstract: A measurement of $CP$-violating observables associated with the interference of $B^0\to D^0 K^{*}(892)^0$ and $B^0\to \bar{D}^0 K^*(892)^0$ decay amplitudes is performed in the $D^0 \to K^{\mp}π^{\pm}(π^+π^-),$ $D^0 \to π^+π^-(π^+π^-)$, and $D^0\to K^+K^-$ final states using data collected by the LHCb experiment corresponding to an integrated luminosity of $9$ $\text{fb}^{-1}$. $CP$-violating obse… ▽ More

    Submitted 13 May, 2024; v1 submitted 31 January, 2024; originally announced January 2024.

    Comments: All figures and tables, along with any supplementary material and additional information, are available at https://cern.ch/lhcbproject/Publications/p/LHCb-PAPER-2023-040.html (LHCb public pages)

    Report number: LHCb-PAPER-2023-040, CERN-EP-2024-007

    Journal ref: JHEP 05(2024) 025

  40. arXiv:2401.17873  [pdf, other

    hep-ex

    Measurements of Normalized Differential Cross Sections of Inclusive $η$ Production in $e^{+}e^{-}$ Annihilation at Energy from 2.0000 to 3.6710 GeV

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, O. Afedulidis, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, D. Anderle, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko , et al. (641 additional authors not shown)

    Abstract: Using data samples collected with the BESIII detector operating at the BEPCII storage ring, the cross section of the inclusive process $e^{+}e^{-} \to η+ X$, normalized by the total cross section of $e^{+}e^{-} \to \text{hadrons}$, is measured at eight center-of-mass energy points from 2.0000 GeV to 3.6710 GeV. These are the first measurements with momentum dependence in this energy region. Our me… ▽ More

    Submitted 31 January, 2024; originally announced January 2024.

    Comments: 9 pages, 2 figures

  41. arXiv:2401.17608  [pdf

    cond-mat.mtrl-sci

    Electrical 180o switching of Néel vector in spin-splitting antiferromagnet

    Authors: Lei Han, Xizhi Fu, Rui Peng, Xingkai Cheng, Jiankun Dai, Liangyang Liu, Yidian Li, Yichi Zhang, Wenxuan Zhu, Hua Bai, Yongjian Zhou, Shixuan Liang, Chong Chen, Qian Wang, Xianzhe Chen, Luyi Yang, Yang Zhang, Cheng Song, Junwei Liu, Feng Pan

    Abstract: Antiferromagnetic spintronics have attracted wide attention due to its great potential in constructing ultra-dense and ultra-fast antiferromagnetic memory that suits modern high-performance information technology. The electrical 180o switching of Néel vector is a long-term goal for develo** electrical-controllable antiferromagnetic memory with opposite Néel vectors as binary "0" and "1". However… ▽ More

    Submitted 31 January, 2024; originally announced January 2024.

    Comments: 19 pages, 4 figures

    Journal ref: Sci. Adv. 10, eadn0479 (2024)

  42. arXiv:2401.17604  [pdf, other

    cs.CV cs.SD eess.AS

    Computation and Parameter Efficient Multi-Modal Fusion Transformer for Cued Speech Recognition

    Authors: Lei Liu, Li Liu, Haizhou Li

    Abstract: Cued Speech (CS) is a pure visual coding method used by hearing-impaired people that combines lip reading with several specific hand shapes to make the spoken language visible. Automatic CS recognition (ACSR) seeks to transcribe visual cues of speech into text, which can help hearing-impaired people to communicate effectively. The visual information of CS contains lip reading and hand cueing, thus… ▽ More

    Submitted 8 February, 2024; v1 submitted 31 January, 2024; originally announced January 2024.

    Comments: Accepted by TASLP

  43. arXiv:2401.17086  [pdf, other

    cs.CV

    Active Generation Network of Human Skeleton for Action Recognition

    Authors: Long Liu, Xin Wang, Fangming Li, Jiayu Chen

    Abstract: Data generation is a data augmentation technique for enhancing the generalization ability for skeleton-based human action recognition. Most existing data generation methods face challenges to ensure the temporal consistency of the dynamic information for action. In addition, the data generated by these methods lack diversity when only a few training samples are available. To solve those problems,… ▽ More

    Submitted 30 January, 2024; originally announced January 2024.

  44. arXiv:2401.17038  [pdf, other

    cs.CV

    Towards Assessing the Synthetic-to-Measured Adversarial Vulnerability of SAR ATR

    Authors: Bowen Peng, Bo Peng, **gyuan Xia, Tianpeng Liu, Yongxiang Liu, Li Liu

    Abstract: Recently, there has been increasing concern about the vulnerability of deep neural network (DNN)-based synthetic aperture radar (SAR) automatic target recognition (ATR) to adversarial attacks, where a DNN could be easily deceived by clean input with imperceptible but aggressive perturbations. This paper studies the synthetic-to-measured (S2M) transfer setting, where an attacker generates adversari… ▽ More

    Submitted 30 January, 2024; originally announced January 2024.

  45. arXiv:2401.16465  [pdf, other

    cs.CV cs.GR

    DressCode: Autoregressively Sewing and Generating Garments from Text Guidance

    Authors: Kai He, Kaixin Yao, Qixuan Zhang, **gyi Yu, Lingjie Liu, Lan Xu

    Abstract: Apparel's significant role in human appearance underscores the importance of garment digitalization for digital human creation. Recent advances in 3D content creation are pivotal for digital human creation. Nonetheless, garment generation from text guidance is still nascent. We introduce a text-driven 3D garment generation framework, DressCode, which aims to democratize design for novices and offe… ▽ More

    Submitted 14 June, 2024; v1 submitted 29 January, 2024; originally announced January 2024.

    Comments: Project page: https://IHe-KaiI.github.io/DressCode/

  46. Superexchange interactions and magnetic anisotropy in MnPSe$_3$ monolayer

    Authors: Guangyu Wang, Ke Yang, Yaozhenghang Ma, Lu Liu, Di Lu, Yuxuan Zhou, Hua Wu

    Abstract: Two-dimensional van der Waals magnetic materials are of great current interest for their promising applications in spintronics. In this work, using density functional theory calculations in combination with the maximally localized Wannier functions method and the magnetic anisotropy analyses, we study the electronic and magnetic properties of MnPSe$_3$ monolayer. Our results show that it is a char… ▽ More

    Submitted 29 January, 2024; originally announced January 2024.

    Comments: 8 pages, 9 figures

    Journal ref: Chinese Phys. Lett. 40 077301 (2023)

  47. arXiv:2401.15803  [pdf, other

    cs.RO cs.AI cs.CV eess.SY

    GarchingSim: An Autonomous Driving Simulator with Photorealistic Scenes and Minimalist Workflow

    Authors: Liguo Zhou, Yinglei Song, Yichao Gao, Zhou Yu, Michael Sodamin, Hongshen Liu, Liang Ma, Lian Liu, Hao Liu, Yang Liu, Haichuan Li, Guang Chen, Alois Knoll

    Abstract: Conducting real road testing for autonomous driving algorithms can be expensive and sometimes impractical, particularly for small startups and research institutes. Thus, simulation becomes an important method for evaluating these algorithms. However, the availability of free and open-source simulators is limited, and the installation and configuration process can be daunting for beginners and inte… ▽ More

    Submitted 30 January, 2024; v1 submitted 28 January, 2024; originally announced January 2024.

  48. arXiv:2401.15626  [pdf, other

    cs.CL cs.AI

    TA&AT: Enhancing Task-Oriented Dialog with Turn-Level Auxiliary Tasks and Action-Tree Based Scheduled Sampling

    Authors: Longxiang Liu, Xiuxing Li, Yang Feng

    Abstract: Task-oriented dialog systems have witnessed substantial progress due to conversational pre-training techniques. Yet, two significant challenges persist. First, most systems primarily utilize the latest turn's state label for the generator. This practice overlooks the comprehensive value of state labels in boosting the model's understanding for future generations. Second, an overreliance on generat… ▽ More

    Submitted 28 January, 2024; originally announced January 2024.

    Comments: Accepted by AAAI 2024

  49. arXiv:2401.15002  [pdf, other

    cs.CV

    BackdoorBench: A Comprehensive Benchmark and Analysis of Backdoor Learning

    Authors: Baoyuan Wu, Hongrui Chen, Mingda Zhang, Zihao Zhu, Shaokui Wei, Danni Yuan, Mingli Zhu, Ruotong Wang, Li Liu, Chao Shen

    Abstract: As an emerging and vital topic for studying deep neural networks' vulnerability (DNNs), backdoor learning has attracted increasing interest in recent years, and many seminal backdoor attack and defense algorithms are being developed successively or concurrently, in the status of a rapid arms race. However, mainly due to the diverse settings, and the difficulties of implementation and reproducibili… ▽ More

    Submitted 26 January, 2024; originally announced January 2024.

  50. arXiv:2401.14720  [pdf, ps, other

    hep-ex

    Observation of structures in the processes $e^+e^-\rightarrowωχ_{c1}$ and $ωχ_{c2}$

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, X. C. Ai, R. Aliberti, A. Amoroso, M. R. An, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere , et al. (608 additional authors not shown)

    Abstract: We present measurements of the Born cross sections for the processes $e^+e^-\rightarrowωχ_{c1}$ and $ωχ_{c2}$ at center-of-mass energies $\sqrt{s}$ from 4.308 to 4.951 GeV. The measurements are performed with data samples corresponding to an integrated luminosity of 11.0 $\rm{fb}^{-1}$ collected with the BESIII detector operating at the BEPCII storage ring. Assuming the $e^+e^-\rightarrowωχ_{c2}$… ▽ More

    Submitted 24 March, 2024; v1 submitted 26 January, 2024; originally announced January 2024.

    Comments: 11 pages, 8 figures, with Supplemental Material