Showing 251–300 of 4,050 results for author: Liu, T

  1. arXiv:2403.07459  [pdf, other]

    cond-mat.mes-hall

    Localization-Delocalization Transitions in Non-Hermitian Aharonov-Bohm Cages

    Authors: Xiang Li, Jin Liu, Tao Liu

    Abstract: A unique feature of non-Hermitian systems is the extreme sensitivity of the eigenspectrum to boundary conditions with the emergence of the non-Hermitian skin effect (NHSE). A NHSE originates from the point-gap topology of complex eigenspectrum, where an extensive number of eigenstates are anomalously localized at the boundary driven by nonreciprocal dissipation. Two different approaches to create…

    Submitted 12 March, 2024; originally announced March 2024.

    Comments: 10 pages, 6 figures

  2. arXiv:2403.07022  [pdf, other]

    cs.LG cs.AI

    A Unified Model for Spatio-Temporal Prediction Queries with Arbitrary Modifiable Areal Units

    Authors: Liyue Chen, Jiangyi Fang, Tengfei Liu, Shaosheng Cao, Leye Wang

    Abstract: Spatio-Temporal (ST) prediction is crucial for making informed decisions in urban location-based applications like ride-sharing. However, existing ST models often require region partition as a prerequisite, resulting in two main pitfalls. Firstly, location-based services necessitate ad-hoc regions for various purposes, requiring multiple ST models with varying scales and zones, which can be costly…

    Submitted 9 March, 2024; originally announced March 2024.

    Comments: Accepted by ICDE 2024

  3. arXiv:2403.06940  [pdf, other]

    eess.IV cs.LG q-bio.QM

    Conditional Score-Based Diffusion Model for Cortical Thickness Trajectory Prediction

    Authors: Qing Xiao, Siyeop Yoon, Hui Ren, Matthew Tivnan, Lichao Sun, Quanzheng Li, Tianming Liu, Yu Zhang, Xiang Li

    Abstract: Alzheimer's Disease (AD) is a neurodegenerative condition characterized by diverse progression rates among individuals, with changes in cortical thickness (CTh) closely linked to its progression. Accurately forecasting CTh trajectories can significantly enhance early diagnosis and intervention strategies, providing timely care. However, the longitudinal data essential for these studies often suffe…

    Submitted 11 March, 2024; originally announced March 2024.

  4. arXiv:2403.06766  [pdf, other]

    hep-ex

    Determination of the number of $ψ(3686)$ events taken at BESIII

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, O. Afedulidis, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere , et al. (639 additional authors not shown)

    Abstract: The number of $ψ(3686)$ events collected by the BESIII detector during the 2021 run period is determined to be $(2259.3\pm 11.1)\times 10^6$ by counting inclusive $ψ(3686)$ hadronic events. The uncertainty is systematic and the statistical uncertainty is negligible. Meanwhile, the numbers of $ψ(3686)$ events collected during the 2009 and 2012 run periods are updated to be…

    Submitted 28 May, 2024; v1 submitted 11 March, 2024; originally announced March 2024.

  5. arXiv:2403.06764  [pdf, other]

    cs.CV cs.AI cs.CL

    An Image is Worth 1/2 Tokens After Layer 2: Plug-and-Play Inference Acceleration for Large Vision-Language Models

    Authors: Liang Chen, Haozhe Zhao, Tianyu Liu, Shuai Bai, Junyang Lin, Chang Zhou, Baobao Chang

    Abstract: In this study, we identify the inefficient attention phenomena in Large Vision-Language Models (LVLMs), notably within prominent models like LLaVA-1.5, QwenVL-Chat and Video-LLaVA. We find out that the attention computation over visual tokens is of extreme inefficiency in the deep layers of popular LVLMs, suggesting a need for a sparser approach compared to textual data handling. To this end, we i…

    Submitted 25 March, 2024; v1 submitted 11 March, 2024; originally announced March 2024.

    Comments: 21 pages, 8 figures, code is released at https://github.com/pkunlp-icler/FastV

  6. arXiv:2403.05530  [pdf, other]

    cs.CL cs.AI

    Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context

    Authors: Gemini Team, Petko Georgiev, Ving Ian Lei, Ryan Burnell, Libin Bai, Anmol Gulati, Garrett Tanzer, Damien Vincent, Zhufeng Pan, Shibo Wang, Soroosh Mariooryad, Yifan Ding, Xinyang Geng, Fred Alcober, Roy Frostig, Mark Omernick, Lexi Walker, Cosmin Paduraru, Christina Sorokin, Andrea Tacchetti, Colin Gaffney, Samira Daruki, Olcan Sercinoglu, Zach Gleicher, Juliette Love , et al. (1092 additional authors not shown)

    Abstract: In this report, we introduce the Gemini 1.5 family of models, representing the next generation of highly compute-efficient multimodal models capable of recalling and reasoning over fine-grained information from millions of tokens of context, including multiple long documents and hours of video and audio. The family includes two new models: (1) an updated Gemini 1.5 Pro, which exceeds the February…

    Submitted 14 June, 2024; v1 submitted 8 March, 2024; originally announced March 2024.

  7. arXiv:2403.03561  [pdf, ps, other]

    cs.CV

    HMD-Poser: On-Device Real-time Human Motion Tracking from Scalable Sparse Observations

    Authors: Peng Dai, Yang Zhang, Tao Liu, Zhen Fan, Tianyuan Du, Zhuo Su, Xiaozheng Zheng, Zeming Li

    Abstract: It is especially challenging to achieve real-time human motion tracking on a standalone VR Head-Mounted Display (HMD) such as Meta Quest and PICO. In this paper, we propose HMD-Poser, the first unified approach to recover full-body motions using scalable sparse observations from HMD and body-worn IMUs. In particular, it can support a variety of input scenarios, such as HMD, HMD+2IMUs, HMD+3IMUs, e…

    Submitted 6 March, 2024; originally announced March 2024.

    Comments: CVPR2024 Accepted

  8. arXiv:2403.03527  [pdf]

    eess.IV

    LDSF: Lightweight Dual-Stream Framework for SAR Target Recognition by Coupling Local Electromagnetic Scattering Features and Global Visual Features

    Authors: Xuying Xiong, Xinyu Zhang, Weidong Jiang, Tianpeng Liu

    Abstract: Mainstream DNN-based SAR-ATR methods still face issues such as easy overfitting of a few training data, high computational overhead, and poor interpretability of the black-box model. Integrating physical knowledge into DNNs to improve performance and achieve a higher level of physical interpretability becomes the key to solving the above problems. This paper begins by focusing on the electromagnet…

    Submitted 6 March, 2024; originally announced March 2024.

  9. arXiv:2403.03500  [pdf, other]

    hep-ex

    Observation of the decay $h_{c}\to3(π^{+}π^{-})π^{0}$

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, O. Afedulidis, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere , et al. (639 additional authors not shown)

    Abstract: Based on $(2712.4\pm14.1)\times10^{6}$ $ψ(3686)$ events collected with the BESIII detector, we study the decays $h_{c}\to3(π^{+}π^{-})π^{0}$, $h_{c}\to2(π^{+}π^{-})ω$, $h_{c}\to2(π^{+}π^{-})π^{0}η$, $h_{c}\to2(π^{+}π^{-})η$, and $h_{c}\to p\bar{p}$ via $ψ(3686)\toπ^{0}h_{c}$. The decay channel $h_{c}\to3(π^{+}π^{-})π^{0}$ is observed for the first time, and its branching fraction is determined to…

    Submitted 6 March, 2024; originally announced March 2024.

    Comments: 11 pages, 3 figures

  10. arXiv:2403.03460  [pdf, ps, other]

    cs.RO

    Foot Shape-Dependent Resistive Force Model for Bipedal Walkers on Granular Terrains

    Authors: Xunjie Chen, Aditya Anikode, Jingang Yi, Tao Liu

    Abstract: Legged robots have demonstrated high efficiency and effectiveness in unstructured and dynamic environments. However, it is still challenging for legged robots to achieve rapid and efficient locomotion on deformable, yielding substrates, such as granular terrains. We present an enhanced resistive force model for bipedal walkers on soft granular terrains by introducing effective intrusion depth corr…

    Submitted 5 March, 2024; originally announced March 2024.

    Comments: ICRA 2024

  11. arXiv:2403.03051  [pdf, other]

    physics.flu-dyn

    Prediction of turbulent channel flow using Fourier neural operator-based machine-learning strategy

    Authors: Yunpeng Wang, Zhijie Li, Zelong Yuan, Wenhui Peng, Tianyuan Liu, Jianchun Wang

    Abstract: Fast and accurate predictions of turbulent flows are of great importance in the science and engineering field. In this paper, we investigate the implicit U-Net enhanced Fourier neural operator (IUFNO) in the stable prediction of long-time dynamics of three-dimensional (3D) turbulent channel flows. The trained IUFNO models are tested in the large-eddy simulations (LES) at coarse grids for three fri…

    Submitted 5 March, 2024; originally announced March 2024.

  12. arXiv:2403.02883  [pdf, other]

    physics.plasm-ph

    Canonical Hamiltonian Guiding Center Dynamics and Its Intrinsic Magnetic Moment

    Authors: Ruili Zhang, Jian Liu, Tong Liu, Wenxiang Li, Xiaogang Wang, Yifa Tang

    Abstract: The concept of guiding center is potent in astrophysics, space plasmas, fusion researches, and arc plasmas to solve the multi-scale dynamics of magnetized plasmas. In this letter, we rigorously prove that the guiding center dynamics can generally be described as a constrained canonical Hamiltonian system with two constraints in six dimensional phase space, and that the solution flow of the guiding…

    Submitted 5 March, 2024; originally announced March 2024.

    Comments: 14 pages, 4 figures

  13. arXiv:2403.01969  [pdf, other]

    cs.CL

    AS-ES Learning: Towards Efficient CoT Learning in Small Models

    Authors: Nuwa Xi, Yuhan Chen, Sendong Zhao, Haochun Wang, Bing Qin, Ting Liu

    Abstract: Chain-of-Thought (CoT) serves as a critical emerging ability in LLMs, especially when it comes to logical reasoning. Attempts have been made to induce such ability in small models as well by distilling from the data with CoT generated by Large Language Models (LLMs). However, existing methods often simply generate and incorporate more data from LLMs and fail to note the importance of efficiently u…

    Submitted 4 March, 2024; originally announced March 2024.

  14. arXiv:2403.01942  [pdf, other]

    cs.LG

    Mitigating Label Noise on Graph via Topological Sample Selection

    Authors: Yuhao Wu, Jiangchao Yao, Xiaobo Xia, Jun Yu, Ruxin Wang, Bo Han, Tongliang Liu

    Abstract: Despite the success of the carefully-annotated benchmarks, the effectiveness of existing graph neural networks (GNNs) can be considerably impaired in practice when the real-world graph data is noisily labeled. Previous explorations in sample selection have been demonstrated as an effective way for robust learning with noisy labels, however, the conventional studies focus on i.i.d data, and when mo…

    Submitted 4 June, 2024; v1 submitted 4 March, 2024; originally announced March 2024.

    Comments: ICML 2024

  15. arXiv:2403.01761  [pdf, other]

    hep-ex

    Observation of $ψ(3686)\to 3φ$

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, O. Afedulidis, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere , et al. (645 additional authors not shown)

    Abstract: Using $(2.712\pm0.014)\times 10^9$ $ψ(3686)$ events collected by the BESIII detector operating at the BEPCII collider, we report the first observation of $ψ(3686)\to 3φ$ decay with a significance larger than 10$σ$. The branching fraction of this decay is determined to be $(1.46\pm0.05\pm0.17)\times10^{-5}$, where the first uncertainty is statistical and the second is systematic. No significant str…

    Submitted 4 March, 2024; originally announced March 2024.

  16. arXiv:2403.01698  [pdf, other]

    cs.CL cs.AI

    Hypertext Entity Extraction in Webpage

    Authors: Yifei Yang, Tianqiao Liu, Bo Shao, Hai Zhao, Linjun Shou, Ming Gong, Daxin Jiang

    Abstract: Webpage entity extraction is a fundamental natural language processing task in both research and applications. Nowadays, the majority of webpage entity extraction models are trained on structured datasets which strive to retain textual content and its structure information. However, existing datasets all overlook the rich hypertext features (e.g., font color, font size) which show their effectiven…

    Submitted 3 March, 2024; originally announced March 2024.

  17. arXiv:2403.01254  [pdf, other]

    cs.RO

    RKHS-BA: A Semantic Correspondence-Free Multi-View Registration Framework with Global Tracking

    Authors: Ray Zhang, Jingwei Song, Xiang Gao, Junzhe Wu, Tianyi Liu, Jinyuan Zhang, Ryan Eustice, Maani Ghaffari

    Abstract: This work reports a novel Bundle Adjustment (BA) formulation using a Reproducing Kernel Hilbert Space (RKHS) representation called RKHS-BA. The proposed formulation is correspondence-free, enables the BA to use RGB-D/LiDAR and semantic labels in the optimization directly, and provides a generalization for the photometric loss function commonly used in direct methods. RKHS-BA can incorporate appear…

    Submitted 2 March, 2024; originally announced March 2024.

    Comments: 16 pages, 12 figures, technical report under review

  18. arXiv:2403.00308  [pdf]

    physics.acc-ph

    Longitudinal beam dynamics design for Super Tau-Charm Facility

    Authors: Linhao Zhang, Tao Liu, Sangya Li, Jingyu Tang, Qing Luo

    Abstract: The project of Super Tau-Charm Facility (STCF) proposed in China, as a new-generation high-luminosity $e^+e^-$ collider in the low-energy region with the center-of-mass energy of 2-7 GeV, is well underway. The luminosity is targeted at $1.0\times10^{35} cm^{-2}s^{-1}$ at the optimized beam energy of 2 GeV. Longitudinal beam dynamics becomes of great importance for the STCF due to the constraints f…

    Submitted 1 March, 2024; originally announced March 2024.

  19. arXiv:2403.00271  [pdf]

    physics.med-ph

    Assessing Bilateral Neurovascular Bundles Function with Pulsed Wave Doppler Ultrasound: Implications for Reducing Erectile Dysfunction Following Prostate Radiotherapy

    Authors: Jing Wang, Xiaofeng Yang, Boran Zhou, James Sohn, Richard Qiu, Pretesh Patel, Ashesh B. Jani, Tian Liu

    Abstract: This study aims to evaluate the functional status of bilateral neurovascular bundles (NVBs) using pulsed wave Doppler ultrasound in patients undergoing prostate radiotherapy (RT). Sixty-two patients (mean age: 66.1 +/- 7.2 years) underwent transrectal ultrasound scan using a conventional ultrasound scanner, a 7.5 MHz bi-plane probe and a mechanical stepper. The ultrasound protocol comprised 3 step…

    Submitted 29 February, 2024; originally announced March 2024.

    Comments: 14 pages, 4 figures

    MSC Class: 68U10

  20. arXiv:2402.19452  [pdf, other]

    hep-th

    The Giant Graviton Expansion from Bubbling Geometry

    Authors: Evan Deddo, James T. Liu, Leopoldo A. Pando Zayas, Robert J. Saskowski

    Abstract: The superconformal index of half-BPS states in ${\cal N}=4$ supersymmetric Yang-Mills with gauge group $U(N)$ admits an expansion in terms of giant gravitons, ${\cal I}_N(q)={\cal I}_\infty(q) \sum\limits_{m=0}^\infty q^{mN}\hat{\mathcal I}_m(q)$, where $m$ is the number of giant gravitons. We derive this expansion directly in supergravity from the class of half-BPS solutions due to Lin, Lunin, an…

    Submitted 29 February, 2024; originally announced February 2024.

    Comments: 17 pages, 4 figures

    Report number: LCTP-24-04

  21. arXiv:2402.19173  [pdf, other]

    cs.SE cs.AI

    StarCoder 2 and The Stack v2: The Next Generation

    Authors: Anton Lozhkov, Raymond Li, Loubna Ben Allal, Federico Cassano, Joel Lamy-Poirier, Nouamane Tazi, Ao Tang, Dmytro Pykhtar, Jiawei Liu, Yuxiang Wei, Tianyang Liu, Max Tian, Denis Kocetkov, Arthur Zucker, Younes Belkada, Zijian Wang, Qian Liu, Dmitry Abulkhanov, Indraneil Paul, Zhuang Li, Wen-Ding Li, Megan Risdal, Jia Li, Jian Zhu, Terry Yue Zhuo , et al. (41 additional authors not shown)

    Abstract: The BigCode project, an open-scientific collaboration focused on the responsible development of Large Language Models for Code (Code LLMs), introduces StarCoder2. In partnership with Software Heritage (SWH), we build The Stack v2 on top of the digital commons of their source code archive. Alongside the SWH repositories spanning 619 programming languages, we carefully select other high-quality data…

    Submitted 29 February, 2024; originally announced February 2024.

  22. arXiv:2402.18625  [pdf, other]

    hep-ph

    Light quark mediated Higgs boson production in association with a jet at the next-to-next-leading order and beyond

    Authors: Tao Liu, Alexander A. Penin, Abdur Rehman

    Abstract: We study the light quark effect on the Higgs boson production in association with a jet at the LHC in the intermediate transverse momentum region between the quark and the Higgs boson mass scales. Though the effect is suppressed by the small Yukawa coupling, it is enhanced by large logarithms of the quark mass ratio to the Higgs boson mass or transverse momentum. Following a remarkable success of…

    Submitted 28 February, 2024; originally announced February 2024.

    Comments: 18 pages, 5 figures

    Report number: ALBERTA-THY-2-24

  23. arXiv:2402.18158  [pdf, other]

    cs.CL cs.AI

    Evaluating Quantized Large Language Models

    Authors: Shiyao Li, Xuefei Ning, Luning Wang, Tengxuan Liu, Xiangsheng Shi, Shengen Yan, Guohao Dai, Huazhong Yang, Yu Wang

    Abstract: Post-training quantization (PTQ) has emerged as a promising technique to reduce the cost of large language models (LLMs). Specifically, PTQ can effectively mitigate memory consumption and reduce computational overhead in LLMs. To meet the requirements of both high efficiency and performance across diverse scenarios, a comprehensive evaluation of quantized LLMs is essential to guide the selection o…

    Submitted 6 June, 2024; v1 submitted 28 February, 2024; originally announced February 2024.

  24. arXiv:2402.18104  [pdf, other]

    cs.CR cs.AI

    Making Them Ask and Answer: Jailbreaking Large Language Models in Few Queries via Disguise and Reconstruction

    Authors: Tong Liu, Yingjie Zhang, Zhe Zhao, Yinpeng Dong, Guozhu Meng, Kai Chen

    Abstract: In recent years, large language models (LLMs) have demonstrated notable success across various tasks, but the trustworthiness of LLMs is still an open problem. One specific threat is the potential to generate toxic or harmful responses. Attackers can craft adversarial prompts that induce harmful responses from LLMs. In this work, we pioneer a theoretical foundation in LLMs security by identifying…

    Submitted 10 June, 2024; v1 submitted 28 February, 2024; originally announced February 2024.

  25. arXiv:2402.17241  [pdf, other]

    cs.CR cs.PL cs.SE

    HardTaint: Production-Run Dynamic Taint Analysis via Selective Hardware Tracing

    Authors: Yiyu Zhang, Tianyi Liu, Yueyang Wang, Yun Qi, Kai Ji, Jian Tang, Xiaoliang Wang, Xuandong Li, Zhiqiang Zuo

    Abstract: Dynamic taint analysis (DTA), as a fundamental analysis technique, is widely used in security, privacy, and diagnosis, etc. As DTA demands to collect and analyze massive taint data online, it suffers extremely high runtime overhead. Over the past decades, numerous attempts have been made to lower the overhead of DTA. Unfortunately, the reductions they achieved are marginal, causing DTA only applic…

    Submitted 27 February, 2024; originally announced February 2024.

  26. arXiv:2402.16943  [pdf, other]

    astro-ph.HE astro-ph.GA

    The LOFAR-eFEDS survey: The incidence of radio and X-ray AGN and the disk-jet connection

    Authors: Z. Igo, A. Merloni, D. Hoang, J. Buchner, T. Liu, M. Salvato, R. Arcodia, S. Bellstedt, M. Brüggen, J. H. Croston, F. de Gasperin, A. Georgakakis, M. J. Hardcastle, K. Nandra, Q. Ni, T. Pasini, T. Shimwell, J. Wolf

    Abstract: Radio jets are present in a diverse sample of AGN. However, the mechanisms of jet powering are not fully understood, and it is yet unclear to what extent they obey mass-invariant scaling relations, similar to those found for the triggering and fuelling of X-ray selected AGN. We study the incidence of eROSITA/eFEDS X-ray and LOFAR radio AGN as a function of several stellar mass normalised AGN power…

    Submitted 26 February, 2024; originally announced February 2024.

    Comments: 30 pages, 32 figures, accepted for publication in A&A

    Journal ref: A&A 686, A43 (2024)

  27. arXiv:2402.16398  [pdf, other]

    cs.RO

    Efficient Continuous-Time Ego-Motion Estimation for Asynchronous Event-based Data Associations

    Authors: Zhixiang Wang, Xudong Li, Tianle Liu, Yizhai Zhang, Panfeng Huang

    Abstract: Event cameras are bio-inspired vision sensors that asynchronously measure per-pixel brightness changes. The high temporal resolution and asynchronicity of event cameras offer great potential for estimating the robot motion state. Recent works have adopted the continuous-time ego-motion estimation methods to exploit the inherent nature of event cameras. However, most of the adopted methods have poo…

    Submitted 26 February, 2024; originally announced February 2024.

    Comments: 8 pages, 7 figures

  28. arXiv:2402.16070  [pdf, other]

    quant-ph

    High-order topological pumping on a superconducting quantum processor

    Authors: Cheng-Lin Deng, Yu Liu, Yu-Ran Zhang, Xue-Gang Li, Tao Liu, Chi-Tong Chen, Tong Liu, Cong-Wei Lu, Yong-Yi Wang, Tian-Ming Li, Cai-Ping Fang, Si-Yun Zhou, Jia-Cheng Song, Yue-Shan Xu, Yang He, Zheng-He Liu, Kai-Xuan Huang, Zhong-Cheng Xiang, Jie-Ci Wang, Dong-Ning Zheng, Guang-Ming Xue, Kai Xu, H. F. Yu, Heng Fan

    Abstract: High-order topological phases of matter refer to the systems of $n$-dimensional bulk with the topology of $m$-th order, exhibiting $(n-m)$-dimensional boundary modes and can be characterized by topological pumping. Here, we experimentally demonstrate two types of second-order topological pumps, forming four 0-dimensional corner localized states on a 4$\times$4 square lattice array of 16 supercondu…

    Submitted 25 February, 2024; originally announced February 2024.

  29. From COBIT to ISO 42001: Evaluating Cybersecurity Frameworks for Opportunities, Risks, and Regulatory Compliance in Commercializing Large Language Models

    Authors: Timothy R. McIntosh, Teo Susnjak, Tong Liu, Paul Watters, Raza Nowrozy, Malka N. Halgamuge

    Abstract: This study investigated the integration readiness of four predominant cybersecurity Governance, Risk and Compliance (GRC) frameworks - NIST CSF 2.0, COBIT 2019, ISO 27001:2022, and the latest ISO 42001:2023 - for the opportunities, risks, and regulatory compliance when adopting Large Language Models (LLMs), using qualitative content analysis and expert validation. Our analysis, with both LLMs and…

    Submitted 24 February, 2024; originally announced February 2024.

  30. arXiv:2402.15678  [pdf, other]

    cs.DC

    Minions: Accelerating Large Language Model Inference with Adaptive and Collective Speculative Decoding

    Authors: Siqi Wang, Hailong Yang, Xuezhu Wang, Tongxuan Liu, Pengbo Wang, Xuning Liang, Kejie Ma, Tianyu Feng, Xin You, Yongjun Bao, Yi Liu, Zhongzhi Luan, Depei Qian

    Abstract: Large language models (LLM) have recently attracted surging interest due to their outstanding capabilities across various domains. However, enabling efficient LLM inference is challenging due to its autoregressive decoding that generates tokens only one at a time. Although research works apply pruning or quantization to speed up LLM inference, they typically require fine-tuning the LLM, incurring…

    Submitted 23 February, 2024; originally announced February 2024.

  31. arXiv:2402.15527  [pdf, other]

    cs.CL cs.AI cs.CV

    PCA-Bench: Evaluating Multimodal Large Language Models in Perception-Cognition-Action Chain

    Authors: Liang Chen, Yichi Zhang, Shuhuai Ren, Haozhe Zhao, Zefan Cai, Yuchi Wang, Peiyi Wang, Xiangdi Meng, Tianyu Liu, Baobao Chang

    Abstract: We present PCA-Bench, a multimodal decision-making benchmark for evaluating the integrated capabilities of Multimodal Large Language Models (MLLMs). Departing from previous benchmarks focusing on simplistic tasks and individual model capability, PCA-Bench introduces three complex scenarios: autonomous driving, domestic robotics, and open-world games. Given task instructions and diverse contexts, t…

    Submitted 21 February, 2024; originally announced February 2024.

    Comments: Code and Data released at https://github.com/pkunlp-icler/PCA-EVAL. Leaderboard at: https://docs.qq.com/sheet/DVUd4WUpGRHRqUnNV. This article supersedes its workshop version arxiv: 2310.02071. arXiv admin note: text overlap with arXiv:2310.02071

  32. arXiv:2402.15070  [pdf, other]

    cs.LG

    Enhancing One-Shot Federated Learning Through Data and Ensemble Co-Boosting

    Authors: Rong Dai, Yonggang Zhang, Ang Li, Tongliang Liu, Xun Yang, Bo Han

    Abstract: One-shot Federated Learning (OFL) has become a promising learning paradigm, enabling the training of a global server model via a single communication round. In OFL, the server model is aggregated by distilling knowledge from all client models (the ensemble), which are also responsible for synthesizing samples for distillation. In this regard, advanced works show that the performance of the server…

    Submitted 22 February, 2024; originally announced February 2024.

    Comments: To be published in ICLR2024

  33. arXiv:2402.14430  [pdf, other]

    cs.LG

    Robust Training of Federated Models with Extremely Label Deficiency

    Authors: Yonggang Zhang, Zhiqin Yang, Xinmei Tian, Nannan Wang, Tongliang Liu, Bo Han

    Abstract: Federated semi-supervised learning (FSSL) has emerged as a powerful paradigm for collaboratively training machine learning models using distributed data with label deficiency. Advanced FSSL methods predominantly focus on training a single model on each client. However, this approach could lead to a discrepancy between the objective functions of labeled and unlabeled data, resulting in gradient con…

    Submitted 22 February, 2024; originally announced February 2024.

    Comments: ICLR 2024, 22 pages

  34. arXiv:2402.14135  [pdf, other]

    astro-ph.GA astro-ph.HE

    Steep-spectrum AGN in eROSITA Final Equatorial-Depth Survey (eFEDS): Their host galaxies and multi-wavelength properties

    Authors: K. Iwasawa, T. Liu, Th. Boller, J. Buchner, J. Li, T. Kawaguchi, T. Nagao, Y. Terashima, Y. Toba, J. D. Silverman, R. Arcodia, Th. Dauser, M. Krumpe, K. Nandra, J. Wilms

    Abstract: We selected sources with a steep soft-X-ray-band spectrum with a photon index larger than 2.5 -- measured by eROSITA on board the Spectrum-Roentgen-Gamma (SRG) -- from the eFEDS AGN catalogue as candidates of highly accreting supermassive black holes, and investigated their multi-wavelength properties. Among 601 bright AGN with 0.2-5 keV counts of greater than 100, 83 sources (~14%) are classified…

    Submitted 21 February, 2024; originally announced February 2024.

    Comments: 8 pages, accepted for publication in A&A

  35. arXiv:2402.13588  [pdf, other]

    eess.SY

    PI-CoF: A Bilevel Optimization Framework for Solving Active Learning Problems using Physics-Information

    Authors: Liqiu Dong, Marta Zagorowska, Tong Liu, Alex Durkin, Mehmet Mercangöz

    Abstract: Physics informed neural networks (PINNs) have recently been proposed as surrogate models for solving process optimization problems. However, in an active learning setting collecting enough data for reliably training PINNs poses a challenge. This study proposes a broadly applicable method for incorporating physics information into existing machine learning (ML) models of any type. The proposed meth…

    Submitted 21 February, 2024; originally announced February 2024.

    Comments: Submitted to The 8th IEEE Conference on Control Technology and Applications (CCTA) 2024, 6 pages

  36. arXiv:2402.13471  [pdf]

    cond-mat.mtrl-sci physics.app-ph

    Thermal transport in a 2D amorphous material

    Authors: Yuxi Wang, Xingxing Zhang, Wujuan Yan, Nianjie Liang, Haiyu He, Xinwei Tao, Ang Li, Fuwei Yang, Buxuan Li, Te-Huan Liu, Jia Zhu, Wu Zhou, Wei Wang, Lin Zhou, Bai Song

    Abstract: Two-dimensional (2D) crystals proved revolutionary soon after graphene was discovered in 2004. However, 2D amorphous materials only became accessible in 2020 and remain largely unexplored. In particular, the thermophysical properties of amorphous materials are of great interest upon transition from 3D to 2D. Here, we probe thermal transport in 2D amorphous carbon. A cross-plane thermal conductivit…

    Submitted 22 March, 2024; v1 submitted 20 February, 2024; originally announced February 2024.

  37. arXiv:2402.13241  [pdf, other]

    cs.LG cs.AI

    Federated Causal Discovery from Heterogeneous Data

    Authors: Loka Li, Ignavier Ng, Gongxu Luo, Biwei Huang, Guangyi Chen, Tongliang Liu, Bin Gu, Kun Zhang

    Abstract: Conventional causal discovery methods rely on centralized data, which is inconsistent with the decentralized nature of data in many real-world situations. This discrepancy has motivated the development of federated causal discovery (FCD) approaches. However, existing FCD methods may be limited by their potentially restrictive assumptions of identifiable functional causal models or homogeneous data…

    Submitted 26 February, 2024; v1 submitted 20 February, 2024; originally announced February 2024.

    Comments: ICLR 2024

  38. arXiv:2402.13217  [pdf, other]

    cs.CV cs.AI

    VideoPrism: A Foundational Visual Encoder for Video Understanding

    Authors: Long Zhao, Nitesh B. Gundavarapu, Liangzhe Yuan, Hao Zhou, Shen Yan, Jennifer J. Sun, Luke Friedman, Rui Qian, Tobias Weyand, Yue Zhao, Rachel Hornung, Florian Schroff, Ming-Hsuan Yang, David A. Ross, Huisheng Wang, Hartwig Adam, Mikhail Sirotenko, Ting Liu, Boqing Gong

    Abstract: We introduce VideoPrism, a general-purpose video encoder that tackles diverse video understanding tasks with a single frozen model. We pretrain VideoPrism on a heterogeneous corpus containing 36M high-quality video-caption pairs and 582M video clips with noisy parallel text (e.g., ASR transcripts). The pretraining approach improves upon masked autoencoding by global-local distillation of semantic…

    Submitted 15 June, 2024; v1 submitted 20 February, 2024; originally announced February 2024.

    Comments: Accepted to ICML 2024. v2: added retrieval results on MSRVTT (1K-A), more data analyses, and ablation studies

  39. arXiv:2402.11537  [pdf, other]

    cs.CL cs.AI

    Deciphering the Impact of Pretraining Data on Large Language Models through Machine Unlearning

    Authors: Yang Zhao, Li Du, Xiao Ding, Kai Xiong, Zhouhao Sun, Jun Shi, Ting Liu, Bing Qin

    Abstract: Through pretraining on a corpus with various sources, Large Language Models (LLMs) have gained impressive performance. However, the impact of each component of the pretraining corpus remains opaque. As a result, the organization of the pretraining corpus is still empirical and may deviate from the optimal. To address this issue, we systematically analyze the impact of 48 datasets from 5 major cate…

    Submitted 26 March, 2024; v1 submitted 18 February, 2024; originally announced February 2024.

  40. arXiv:2402.11398  [pdf, other]

    cs.CL cs.AI

    Reasoning before Comparison: LLM-Enhanced Semantic Similarity Metrics for Domain Specialized Text Analysis

    Authors: Shaochen Xu, Zihao Wu, Huaqin Zhao, Peng Shu, Zhengliang Liu, Wenxiong Liao, Sheng Li, Andrea Sikora, Tianming Liu, Xiang Li

    Abstract: In this study, we leverage LLM to enhance the semantic analysis and develop similarity metrics for texts, addressing the limitations of traditional unsupervised NLP metrics like ROUGE and BLEU. We develop a framework where LLMs such as GPT-4 are employed for zero-shot text identification and label generation for radiology reports, where the labels are then used as measurements for text similarity.…

    Submitted 20 February, 2024; v1 submitted 17 February, 2024; originally announced February 2024.

  41. arXiv:2402.11207  [pdf, ps, other]

    hep-ex

    Search for the production of deuterons and antideuterons in e^+e^- annihilation at center-of-mass energies between 4.13 and 4.70 GeV

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, O. Afedulidis, R. Aliberti, A. Amoroso, M. R. An, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere, A. Brueggemann , et al. (593 additional authors not shown)

    Abstract: Using a data sample of $e^+e^-$ collision data corresponding to an integrated luminosity of 19 fb$^{-1}$ collected with the BESIII detector at the BEPCII collider, we search for the production of deuterons and antideuterons via $e^+e^-\to pp\pi^-\bar{d}+c.c.$ for the first time at center-of-mass energies between 4.13 and 4.70 GeV. No significant signal is observed and the upper limit of the…

    Submitted 17 February, 2024; originally announced February 2024.

  42. arXiv:2402.10962  [pdf, other]

    cs.CL cs.AI cs.LG

    Measuring and Controlling Instruction (In)Stability in Language Model Dialogs

    Authors: Kenneth Li, Tianle Liu, Naomi Bashkansky, David Bau, Fernanda Viégas, Hanspeter Pfister, Martin Wattenberg

    Abstract: System-prompting is a standard tool for customizing language-model chatbots, enabling them to follow a specific instruction. An implicit assumption in the use of system prompts is that they will be stable, so the chatbot will continue to generate text according to the stipulated instructions for the duration of a conversation. We propose a quantitative benchmark to test this assumption, evaluating…

    Submitted 1 May, 2024; v1 submitted 13 February, 2024; originally announced February 2024.

    Comments: Code: https://github.com/likenneth/persona_drift

  43. arXiv:2402.10812  [pdf, other]

    cs.CL

    Exploring Hybrid Question Answering via Program-based Prompting

    Authors: Qi Shi, Han Cui, Haofeng Wang, Qingfu Zhu, Wanxiang Che, Ting Liu

    Abstract: Question answering over heterogeneous data requires reasoning over diverse sources of data, which is challenging due to the large scale of information and organic coupling of heterogeneous data. Various approaches have been proposed to address these challenges. One approach involves training specialized retrievers to select relevant information, thereby reducing the input length. Another approach…

    Submitted 16 February, 2024; originally announced February 2024.

  44. arXiv:2402.09880  [pdf, ps, other]

    cs.AI cs.CL cs.CY cs.HC

    Inadequacies of Large Language Model Benchmarks in the Era of Generative Artificial Intelligence

    Authors: Timothy R. McIntosh, Teo Susnjak, Tong Liu, Paul Watters, Malka N. Halgamuge

    Abstract: The rapid rise in popularity of Large Language Models (LLMs) with emerging capabilities has spurred public curiosity to evaluate and compare different LLMs, leading many researchers to propose their LLM benchmarks. Noticing preliminary inadequacies in those benchmarks, we embarked on a study to critically assess 23 state-of-the-art LLM benchmarks, using our novel unified evaluation framework throu…

    Submitted 15 February, 2024; originally announced February 2024.

  45. arXiv:2402.09685  [pdf, other]

    cs.RO

    Pheno-Robot: An Auto-Digital Modelling System for In-Situ Phenotyping in the Field

    Authors: Yaoqiang Pan, Kewei Hu, Tianhao Liu, Chao Chen, Hanwen Kang

    Abstract: Accurate reconstruction of plant models for phenotyping analysis is critical for optimising sustainable agricultural practices in precision agriculture. Traditional laboratory-based phenotyping, while valuable, falls short of understanding how plants grow under uncontrolled conditions. Robotic technologies offer a promising avenue for large-scale, direct phenotyping in real-world environments. Thi…

    Submitted 14 February, 2024; originally announced February 2024.

  46. arXiv:2402.08960  [pdf, other]

    cs.CV cs.AI

    Open-Vocabulary Segmentation with Unpaired Mask-Text Supervision

    Authors: Zhaoqing Wang, Xiaobo Xia, Ziye Chen, Xiao He, Yandong Guo, Mingming Gong, Tongliang Liu

    Abstract: Current state-of-the-art open-vocabulary segmentation methods typically rely on image-mask-text triplet annotations for supervision. However, acquiring such detailed annotations is labour-intensive and poses scalability challenges in complex real-world scenarios. While existing weakly-supervised approaches leverage image-text pairs to reduce the expansive annotation cost, the lack of mask supervis…

    Submitted 11 June, 2024; v1 submitted 14 February, 2024; originally announced February 2024.

    Comments: 27 pages, 18 figures, 10 tables

  47. arXiv:2402.08919  [pdf, other]

    cs.CV cs.LG

    Interpretable Measures of Conceptual Similarity by Complexity-Constrained Descriptive Auto-Encoding

    Authors: Alessandro Achille, Greg Ver Steeg, Tian Yu Liu, Matthew Trager, Carson Klingenberg, Stefano Soatto

    Abstract: Quantifying the degree of similarity between images is a key copyright issue for image-based machine learning. In legal doctrine however, determining the degree of similarity between works requires subjective analysis, and fact-finders (judges and juries) can demonstrate considerable variability in these subjective judgement calls. Images that are structurally similar can be deemed dissimilar, whe…

    Submitted 13 February, 2024; originally announced February 2024.

  48. arXiv:2402.08901  [pdf]

    physics.ins-det

    Characterization of the ATLAS Liquid Argon Front-End ASIC ALFE2 for the HL-LHC upgrade

    Authors: D. Matakias, G. Carini, H. Chen, M. Dabrowski, G. Deptuch, L. Duflot, J. Kierstead, T. Liu, H. Ma, N. Morange, S. Rescia, S. Tang, H. Xu

    Abstract: ALFE2 is an ATLAS Liquid Argon Calorimeter (LAr) Front-End ASIC designed for the HL-LHC upgrade. ALFE2 comprises four channels of pre-amplifiers and CR-(RC)2 shapers with adjustable input impedance. ALFE2 features two separate gain outputs to provide 16-bit dynamic-range coverage and an optimum resolution. ALFE2 is characterized using a Front-End Test Board (FETB) based on a Zynq UltraScale+ MPSoC…

    Submitted 13 February, 2024; originally announced February 2024.

    Comments: 7 pages, 4 figures

  49. arXiv:2402.08458  [pdf, other]

    astro-ph.CO gr-qc

    The SRG/eROSITA All-Sky Survey: Cosmology Constraints from Cluster Abundances in the Western Galactic Hemisphere

    Authors: V. Ghirardini, E. Bulbul, E. Artis, N. Clerc, C. Garrel, S. Grandis, M. Kluge, A. Liu, Y. E. Bahar, F. Balzer, I. Chiu, J. Comparat, D. Gruen, F. Kleinebreil, S. Krippendorf, A. Merloni, K. Nandra, N. Okabe, F. Pacaud, P. Predehl, M. E. Ramos-Ceja, T. H. Reiprich, J. S. Sanders, T. Schrabback, R. Seppi , et al. (24 additional authors not shown)

    Abstract: The cluster mass function traces the growth of linear density perturbations and provides valuable insights into the growth of structures, the nature of dark matter, and the cosmological parameters governing the Universe. The primary science goal of eROSITA, on board the Spectrum Roentgen Gamma (SRG) mission, launched in 2019, is to constrain cosmology through the evolution of cluster mass fu…

    Submitted 13 February, 2024; originally announced February 2024.

    Comments: 43 pages, 22 figures, submitted to A&A

  50. Particle Filter SLAM for Vehicle Localization

    Authors: Tianrui Liu, Changxin Xu, Yuxin Qiao, Chufeng Jiang, Jiqiang Yu

    Abstract: Simultaneous Localization and Mapping (SLAM) presents a formidable challenge in robotics, involving the dynamic construction of a map while concurrently determining the precise location of the robotic agent within an unfamiliar environment. This intricate task is further compounded by the inherent "chicken-and-egg" dilemma, where accurate mapping relies on a dependable estimation of the robot's lo…

    Submitted 19 February, 2024; v1 submitted 12 February, 2024; originally announced February 2024.

    Comments: 6 pages, Journal of Industrial Engineering and Applied Science

    Journal ref: Journal of Industrial Engineering and Applied Science 2024