Showing 101–150 of 16,652 results for author: Li, Y

  1. arXiv:2406.15925 [pdf, other]

    cs.CV

    Federated Adversarial Learning for Robust Autonomous Landing Runway Detection

    Authors: Yi Li, Plamen Angelov, Zhengxin Yu, Alvaro Lopez Pellicer, Neeraj Suri

    Abstract: As the development of deep learning techniques in autonomous landing systems continues to grow, one of the major challenges is trust and security in the face of possible adversarial attacks. In this paper, we propose a federated adversarial learning-based framework to detect landing runways using paired data comprising clean local data and its adversarial version. Firstly, the local model is pr…

    Submitted 22 June, 2024; originally announced June 2024.

    Comments: ICANN2024

    Journal ref: ICANN2024

  2. arXiv:2406.15921 [pdf, other]

    cs.CV

    PUDD: Towards Robust Multi-modal Prototype-based Deepfake Detection

    Authors: Alvaro Lopez Pellicer, Yi Li, Plamen Angelov

    Abstract: Deepfake techniques generate highly realistic data, making it challenging for humans to discern between actual and artificially generated images. Recent advancements in deep learning-based deepfake detection methods, particularly with diffusion models, have shown remarkable progress. However, there is a growing demand for real-world applications to detect unseen individuals, deepfake techniques, a…

    Submitted 30 June, 2024; v1 submitted 22 June, 2024; originally announced June 2024.

    Comments: CVPR2024

    Journal ref: CVPR2024

  3. arXiv:2406.15859 [pdf, other]

    cs.IR cs.AI

    LLM-Powered Explanations: Unraveling Recommendations Through Subgraph Reasoning

    Authors: Guangsi Shi, Xiaofeng Deng, Linhao Luo, Lijuan Xia, Lei Bao, Bei Ye, Fei Du, Shirui Pan, Yuxiao Li

    Abstract: Recommender systems are pivotal in enhancing user experiences across various web applications by analyzing the complicated relationships between users and items. Knowledge graphs (KGs) have been widely used to enhance the performance of recommender systems. However, KGs are known to be noisy and incomplete, which makes it hard to provide reliable explanations for recommendation results. An explainable r…

    Submitted 29 June, 2024; v1 submitted 22 June, 2024; originally announced June 2024.

  4. arXiv:2406.15758 [pdf, other]

    cs.LG cs.DC

    EDGE-LLM: Enabling Efficient Large Language Model Adaptation on Edge Devices via Layerwise Unified Compression and Adaptive Layer Tuning and Voting

    Authors: Zhongzhi Yu, Zheng Wang, Yuhan Li, Haoran You, Ruijie Gao, Xiaoya Zhou, Sreenidhi Reedy Bommu, Yang Katie Zhao, Yingyan Celine Lin

    Abstract: Efficient adaptation of large language models (LLMs) on edge devices is essential for applications requiring continuous and privacy-preserving adaptation and inference. However, existing tuning techniques fall short because of the high computation and memory overheads. To this end, we introduce a computation- and memory-efficient LLM tuning framework, called Edge-LLM, to facilitate affordable and ef…

    Submitted 22 June, 2024; originally announced June 2024.

  5. arXiv:2406.15757 [pdf, other]

    quant-ph cond-mat.stat-mech hep-lat

    Perturbative stability and error correction thresholds of quantum codes

    Authors: Yaodong Li, Nicholas O'Dea, Vedika Khemani

    Abstract: Topologically-ordered phases are stable to local perturbations, and topological quantum error-correcting codes enjoy thresholds to local errors. We connect the two notions of stability by constructing classical statistical mechanics models for decoding general CSS codes and classical linear codes. Our construction encodes correction success probabilities under uncorrelated bit-flip and phase-flip…

    Submitted 22 June, 2024; originally announced June 2024.

    Comments: 17 pages + appendices

  6. arXiv:2406.15738 [pdf]

    physics.app-ph

    Observation of Heat Pumping Effect by Radiative Shuttling

    Authors: Yuxuan Li, Yongdi Dang, Sen Zhang, Xinran Li, Tianle Chen, Pankaj K. Choudhury, Yi **, Jianbin Xu, Philippe Ben-Abdallah, Bing-Feng Ju, Yungui Ma

    Abstract: The heat shuttling phenomenon is characterized by the presence of a non-zero heat flow between two bodies without net thermal bias on average. It was initially predicted in the context of nonlinear heat conduction within atomic lattices coupled to two time-oscillating thermostats. Recent theoretical works revealed an analog of this effect for heat exchanges mediated by thermal photons between two soli…

    Submitted 22 June, 2024; originally announced June 2024.

  7. arXiv:2406.15707 [pdf, other]

    cs.CV

    psPRF: Pansharpening Planar Neural Radiance Field for Generalized 3D Reconstruction Satellite Imagery

    Authors: Tongtong Zhang, Yuanxiang Li

    Abstract: Most current NeRF variants for satellites are designed for one specific scene and fall short of generalization to new geometry. Additionally, the RGB images require pan-sharpening as an independent preprocessing step. This paper introduces psPRF, a Planar Neural Radiance Field designed for paired low-resolution RGB (LR-RGB) and high-resolution panchromatic (HR-PAN) images from satellite sensors wi…

    Submitted 21 June, 2024; originally announced June 2024.

  8. arXiv:2406.15501 [pdf]

    cs.CR

    Secure Combination of Untrusted Time information Based on Optimized Dempster-Shafer Theory

    Authors: Yang Li, Yujie Luo, Yichen Zhang, Ao Sun, Wei Huang, Shuai Zhang, Tao Zhang, Chuang Zhou, Li Ma, Jie Yang, Mei Wu, Heng Wang, Yan Pan, Yun Shao, Xing Chen, Ziyang Chen, Song Yu, Hong Guo, Bingjie Xu

    Abstract: Secure precision time synchronization is important for applications of Cyber-Physical Systems. However, several attacks, especially the Time Delay Attack (TDA), seriously degrade the performance of time synchronization systems. A multi-path scheme is considered an effective security countermeasure to decrease the influence of TDA. However, the effective secure combination algorithm is still…

    Submitted 19 June, 2024; originally announced June 2024.

  9. arXiv:2406.15407 [pdf]

    physics.ins-det

    Preliminary Design of a General Electronics Platform for Accelerator Facilities

    Authors: **fu Zhu, Hongli Ding, Haokui Li, Qiaoye Ran, Xiwen Dai, Wei Li, Jiawei Han, Yue Li, Zhiyuan Zhang, Weixin Qiu, Weiqing Zhang

    Abstract: Many accelerators require considerable electronic systems for tests, verification, and operation. In Shenzhen Superconducting Soft X-ray Free Electron Laser (S3FEL), to meet the early tests and verification of various systems, save development expenses, and improve the reusability of hardware, firmware, and software systems, we have considered the needs of each system and preliminarily designed a…

    Submitted 11 May, 2024; originally announced June 2024.

    Comments: 3 pages, 4 figures, 2024 IEEE Real-Time Conference

  10. arXiv:2406.15339 [pdf, other]

    cs.CV cs.AI cs.MM

    Image Conductor: Precision Control for Interactive Video Synthesis

    Authors: Yaowei Li, Xintao Wang, Zhaoyang Zhang, Zhouxia Wang, Ziyang Yuan, Liangbin Xie, Yuexian Zou, Ying Shan

    Abstract: Filmmaking and animation production often require sophisticated techniques for coordinating camera transitions and object movements, typically involving labor-intensive real-world capturing. Despite advancements in generative AI for video creation, achieving precise control over motion for interactive video asset generation remains challenging. To this end, we propose Image Conductor, a method for…

    Submitted 21 June, 2024; originally announced June 2024.

    Comments: Project webpage available at https://liyaowei-stu.github.io/project/ImageConductor/

  11. arXiv:2406.15269 [pdf, other]

    cs.CV

    You Only Acquire Sparse-channel (YOAS): A Unified Framework for Dense-channel EEG Generation

    Authors: Hongyu Chen, Weiming Zeng, Luhui Cai, Yueyang Li, Lei Wang, Jia Lu, Hongjie Yan, Wai Ting Siok, Nizhuan Wang

    Abstract: High-precision acquisition of dense-channel electroencephalogram (EEG) signals is often impeded by the costliness and lack of portability of equipment. In contrast, generating dense-channel EEG signals effectively from sparse channels shows promise and economic viability. However, sparse-channel EEG poses challenges such as reduced spatial resolution, information loss, signal mixing, and heightene…

    Submitted 21 June, 2024; originally announced June 2024.

  12. arXiv:2406.15073 [pdf, other]

    cs.AI cs.DB

    KnobTree: Intelligent Database Parameter Configuration via Explainable Reinforcement Learning

    Authors: Jiahan Chen, Shuhan Qi, Yifan Li, Zeyu Dong, Mingfeng Ding, Yulin Wu, Xuan Wang

    Abstract: Databases are fundamental to contemporary information systems, yet traditional rule-based configuration methods struggle to manage the complexity of real-world applications with hundreds of tunable parameters. Deep reinforcement learning (DRL), which combines perception and decision-making, presents a potential solution for intelligent database configuration tuning. However, due to black-box prope…

    Submitted 21 June, 2024; originally announced June 2024.

  13. arXiv:2406.15067 [pdf]

    cond-mat.str-el

    Time-Domain Signatures of Distinct Correlated Insulators in a Moiré Superlattice

    Authors: Eric A. Arsenault, Yiliu Li, Birui Yang, Takashi Taniguchi, Kenji Watanabe, James C. Hone, Cory R. Dean, Xiaodong Xu, X. -Y. Zhu

    Abstract: Among expanding discoveries of quantum phases in moiré superlattices, correlated insulators stand out as both the most stable and most commonly observed. Despite the central importance of these states in moiré physics, little is known about their underlying nature. Here, we use pump-probe spectroscopy to show distinct time-domain signatures of correlated insulators at fillings of one (v = -1) and…

    Submitted 21 June, 2024; originally announced June 2024.

    Comments: 18 pages, 4 figures, +10 Supporting figures. arXiv admin note: text overlap with arXiv:2307.16563

  14. arXiv:2406.15030 [pdf, ps, other]

    hep-ex

    Search for the $e^+e^- \to φχ_{c1}(3872)$ process at BESIII

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, O. Afedulidis, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere, et al. (639 additional authors not shown)

    Abstract: Based on 368.5 pb$^{-1}$ of $e^+e^-$ collision data collected at center-of-mass energies 4.914 and 4.946 GeV by the BESIII detector, the $e^+e^- \to φχ_{c1}(3872)$ process is searched for the first time. No significant signal is observed and the upper limits at the 90\% confidence level on the product of the Born cross section $σ(e^+e^- \to φχ_{c1}(3872))$ and the branching fraction…

    Submitted 21 June, 2024; originally announced June 2024.

    Comments: 11 pages, 3 figures

  15. arXiv:2406.14892 [pdf, other]

    astro-ph.IM physics.ins-det

    CCAT: Detector Noise Limited Performance of the RFSoC-based Readout Electronics for mm/sub-mm/far-IR KIDs

    Authors: Adrian K. Sinclair, James Burgoyne, Anthony I. Huber, Colin Murphy, Steve K. Choi, Cody J. Duell, Zachary B. Huber, Yaqiong Li, Scott C. Chapman, Michael D. Niemack, Thomas Nikola, Eve M. Vavagiakis, Samantha Walker, Jordan D. Wheeler, Jason Austermann, Lawrence Lin, Ruixuan Xie, Bugao Zou, Philip D. Mauskopf

    Abstract: The Fred Young Submillimeter Telescope (FYST), on Cerro Chajnantor in the Atacama desert of Chile, will conduct wide-field and small deep-field surveys of the sky with more than 100,000 detectors on the Prime-Cam instrument. Kinetic inductance detectors (KIDs) were chosen as the primary sensor technology for their high density focal plane packing. Additionally, they benefit from low cost, ease of…

    Submitted 21 June, 2024; originally announced June 2024.

    Comments: draft submitted to SPIE

  16. arXiv:2406.14887 [pdf, other]

    cs.CL

    InternLM-Law: An Open Source Chinese Legal Large Language Model

    Authors: Zhiwei Fei, Songyang Zhang, Xiaoyu Shen, Dawei Zhu, Xiao Wang, Maosong Cao, Fengzhe Zhou, Yining Li, Wenwei Zhang, Dahua Lin, Kai Chen, Jidong Ge

    Abstract: While large language models (LLMs) have showcased impressive capabilities, they struggle with addressing legal queries due to the intricate complexities and specialized expertise required in the legal field. In this paper, we introduce InternLM-Law, a specialized LLM tailored for addressing diverse legal queries related to Chinese laws, spanning from responding to standard legal questions (e.g., l…

    Submitted 21 June, 2024; originally announced June 2024.

    Comments: Our dataset, code and models will be released at https://github.com/InternLM/InternLM-Law

  17. arXiv:2406.14884 [pdf, other]

    cs.CL

    FlowBench: Revisiting and Benchmarking Workflow-Guided Planning for LLM-based Agents

    Authors: Ruixuan Xiao, Wentao Ma, Ke Wang, Yuchuan Wu, Junbo Zhao, Haobo Wang, Fei Huang, Yongbin Li

    Abstract: LLM-based agents have emerged as promising tools, which are crafted to fulfill complex tasks by iterative planning and action. However, these agents are susceptible to undesired planning hallucinations when lacking specific knowledge for expertise-intensive tasks. To address this, preliminary attempts are made to enhance planning reliability by incorporating external workflow-related knowledge. De…

    Submitted 21 June, 2024; originally announced June 2024.

  18. arXiv:2406.14844 [pdf, other]

    cs.LG cs.AI

    DN-CL: Deep Symbolic Regression against Noise via Contrastive Learning

    Authors: Jingyi Liu, Yanjie Li, Lina Yu, Min Wu, Weijun Li, Wenqiang Li, Meilan Hao, Yusong Deng, Shu Wei

    Abstract: Noise ubiquitously exists in signals due to numerous factors including physical, electronic, and environmental effects. Traditional methods of symbolic regression, such as genetic programming or deep learning models, aim to find the most fitting expressions for these signals. However, these methods often overlook the noise present in real-world data, leading to reduced fitting accuracy. To tackle…

    Submitted 20 June, 2024; originally announced June 2024.

  19. arXiv:2406.14828 [pdf, other]

    cs.CL

    Word Matters: What Influences Domain Adaptation in Summarization?

    Authors: Yinghao Li, Siyu Miao, Heyan Huang, Yang Gao

    Abstract: Domain adaptation aims to enable Large Language Models (LLMs) to generalize effectively to domain datasets unseen during the training phase. However, factors such as the size of the model parameters and the scale of training data are general influencers and do not reflect the nuances of domain adaptation performance. This paper investigates the fine-grained factors affecting domain adaptation perform…

    Submitted 20 June, 2024; originally announced June 2024.

  20. arXiv:2406.14787 [pdf, other]

    cs.PL

    Story of Your Lazy Function's Life: A Bidirectional Demand Semantics for Mechanized Cost Analysis of Lazy Programs

    Authors: Li-yao Xia, Laura Israel, Maite Kramarz, Nicholas Coltharp, Koen Claessen, Stephanie Weirich, Yao Li

    Abstract: Lazy evaluation is a powerful tool that enables better compositionality and potentially better performance in functional programming, but it is challenging to analyze its computation cost. Existing works either require manually annotating sharing, or rely on separation logic to reason about heaps of mutable cells. In this paper, we propose a bidirectional demand semantics that allows for extrinsic…

    Submitted 20 June, 2024; originally announced June 2024.

    Comments: Accepted by ICFP 2024

  21. arXiv:2406.14721 [pdf, other]

    cs.CL

    1+1>2: Can Large Language Models Serve as Cross-Lingual Knowledge Aggregators?

    Authors: Yue Huang, Chenrui Fan, Yuan Li, Siyuan Wu, Tianyi Zhou, Xiangliang Zhang, Lichao Sun

    Abstract: Large Language Models (LLMs) have garnered significant attention due to their remarkable ability to process information across various languages. Despite their capabilities, they exhibit inconsistencies in handling identical queries in different languages, presenting challenges for further advancement. This paper introduces a method to enhance the multilingual performance of LLMs by aggregating kn…

    Submitted 20 June, 2024; originally announced June 2024.

  22. arXiv:2406.14664 [pdf, ps, other]

    eess.SP

    Experimental Validation of Cooperative RSS-based Localization with Unknown Transmit Power, Path Loss Exponent, and Precise Anchor Location

    Authors: Yingquan Li, Bodhibrata Mukhopadhyay, Jiajie Xu, Mohamed-Slim Alouini

    Abstract: Received signal strength (RSS)-based cooperative localization has gained significant attention due to its straightforward system architectures and cost-effectiveness. In this paper, we propose Cooperative Localization Techniques (with Unknown Parameters), referred to as CTUP(s), which consider uncertainty in anchor nodes' locations and assume the transmit power and path loss expo…

    Submitted 20 June, 2024; originally announced June 2024.

  23. arXiv:2406.14644 [pdf, other]

    cs.CL

    Unveiling the Spectrum of Data Contamination in Language Models: A Survey from Detection to Remediation

    Authors: Chunyuan Deng, Yilun Zhao, Yuzhao Heng, Yitong Li, Jiannan Cao, Xiangru Tang, Arman Cohan

    Abstract: Data contamination has garnered increased attention in the era of large language models (LLMs) due to the reliance on extensive internet-derived training corpora. The issue of training corpus overlap with evaluation benchmarks--referred to as contamination--has been the focus of significant recent research. This body of work aims to identify contamination, understand its impacts, and explore mitig…

    Submitted 20 June, 2024; originally announced June 2024.

    Comments: ACL 2024 Camera-Ready Version

  24. arXiv:2406.14550 [pdf, other]

    cs.CL cs.AI

    GraphReader: Building Graph-based Agent to Enhance Long-Context Abilities of Large Language Models

    Authors: Shilong Li, Yancheng He, Hangyu Guo, Xingyuan Bu, Ge Bai, Jie Liu, Jiaheng Liu, Xingwei Qu, Yangguang Li, Wanli Ouyang, Wenbo Su, Bo Zheng

    Abstract: Long-context capabilities are essential for large language models (LLMs) to tackle complex and long-input tasks. Despite numerous efforts made to optimize LLMs for long contexts, challenges persist in robustly processing long inputs. In this paper, we introduce GraphReader, a graph-based agent system designed to handle long texts by structuring them into a graph and employing an agent to explore t…

    Submitted 20 June, 2024; originally announced June 2024.

    Comments: The first four authors contributed equally, 27 pages

  25. arXiv:2406.14515 [pdf, other]

    cs.CV cs.MM

    MMBench-Video: A Long-Form Multi-Shot Benchmark for Holistic Video Understanding

    Authors: Xinyu Fang, Kangrui Mao, Haodong Duan, Xiangyu Zhao, Yining Li, Dahua Lin, Kai Chen

    Abstract: The advent of large vision-language models (LVLMs) has spurred research into their applications in multi-modal contexts, particularly in video understanding. Traditional VideoQA benchmarks, despite providing quantitative metrics, often fail to encompass the full spectrum of video content and inadequately assess models' temporal comprehension. To address these limitations, we introduce MMBench-Vide…

    Submitted 20 June, 2024; originally announced June 2024.

  26. arXiv:2406.14457 [pdf, other]

    cs.AI

    Rewarding What Matters: Step-by-Step Reinforcement Learning for Task-Oriented Dialogue

    Authors: Huifang Du, Shuqin Li, Minghao Wu, Xuejing Feng, Yuan-Fang Li, Haofen Wang

    Abstract: Reinforcement learning (RL) is a powerful approach to enhance task-oriented dialogue (TOD) systems. However, existing RL methods tend to mainly focus on generation tasks, such as dialogue policy learning (DPL) or response generation (RG), while neglecting dialogue state tracking (DST) for understanding. This narrow focus limits the systems' ability to achieve globally optimal performance by overlooking the…

    Submitted 20 June, 2024; originally announced June 2024.

  27. arXiv:2406.14455 [pdf, other]

    cs.CV

    MM-GTUNets: Unified Multi-Modal Graph Deep Learning for Brain Disorders Prediction

    Authors: Luhui Cai, Weiming Zeng, Hongyu Chen, Hua Zhang, Yueyang Li, Hongjie Yan, Lingbin Bian, Nizhuan Wang

    Abstract: Graph deep learning (GDL) has demonstrated impressive performance in predicting population-based brain disorders (BDs) through the integration of both imaging and non-imaging data. However, the effectiveness of GDL-based methods heavily depends on the quality of modeling the multi-modal population graphs and tends to degrade as the graph scale increases. Furthermore, these methods often constrain…

    Submitted 20 June, 2024; originally announced June 2024.

  28. arXiv:2406.14395 [pdf, other]

    quant-ph

    Communication with Quantum Catalysts

    Authors: Yuqi Li, Junjing Xing, Dengke Qu, Lei Xiao, Zhaobing Fan, Zhu-Jun Zheng, Haitao Ma, Peng Xue, Kishor Bharti, Dax Enshan Koh, Yunlong Xiao

    Abstract: Communication is essential for advancing science and technology. Quantum communication, in particular, benefits from the use of catalysts. During the communication process, these catalysts enhance performance while remaining unchanged. Although chemical catalysts that undergo deactivation typically perform worse than those that remain unaffected, quantum catalysts, referred to as embezzling cataly…

    Submitted 20 June, 2024; originally announced June 2024.

    Comments: 16 pages, 9 figures. Comments welcome!

  29. arXiv:2406.14386 [pdf, other]

    quant-ph

    Teleportation with Embezzling Catalysts

    Authors: Junjing Xing, Yuqi Li, Dengke Qu, Lei Xiao, Zhaobing Fan, Haitao Ma, Peng Xue, Kishor Bharti, Dax Enshan Koh, Yunlong Xiao

    Abstract: Quantum teleportation is the process of transferring quantum information using classical communication and pre-shared entanglement. This process can benefit from the use of catalysts, which are ancillary entangled states that can enhance teleportation without being consumed. While chemical catalysts undergoing deactivation invariably exhibit inferior performance compared to those unaffected by dea…

    Submitted 20 June, 2024; originally announced June 2024.

    Comments: 19 pages, 11 figures. Comments welcome!

  30. arXiv:2406.14358 [pdf]

    q-bio.NC cs.AI cs.CL

    The neural correlates of logical-mathematical symbol systems processing resemble that of spatial cognition more than natural language processing

    Authors: Yuannan Li, Shan Xu, Jia Liu

    Abstract: The ability to manipulate logical-mathematical symbols (LMS), encompassing tasks such as calculation, reasoning, and programming, is a cognitive skill arguably unique to humans. Considering the relatively recent emergence of this ability in human evolutionary history, it has been suggested that LMS processing may build upon more fundamental cognitive systems, possibly through neuronal recycling. P…

    Submitted 20 June, 2024; originally announced June 2024.

  31. arXiv:2406.14189 [pdf, other]

    cs.CL

    In Tree Structure Should Sentence Be Generated

    Authors: Yaguang Li, Xin Chen

    Abstract: Generative models reliant on sequential autoregression have been at the forefront of language generation for an extensive period, particularly following the introduction of widely acclaimed transformers. Despite its excellent performance, there are always some issues that we face today. For example, problems such as hallucinations and getting trapped in a logic loop may occur. To enhance the perfo…

    Submitted 20 June, 2024; originally announced June 2024.

  32. arXiv:2406.14130 [pdf, other]

    cs.CV

    ExVideo: Extending Video Diffusion Models via Parameter-Efficient Post-Tuning

    Authors: Zhongjie Duan, Wenmeng Zhou, Cen Chen, Yaliang Li, Weining Qian

    Abstract: Recently, advancements in video synthesis have attracted significant attention. Video synthesis models such as AnimateDiff and Stable Video Diffusion have demonstrated the practical applicability of diffusion models in creating dynamic visual content. The emergence of SORA has further spotlighted the potential of video generation technologies. Nonetheless, the extension of video lengths has been c…

    Submitted 20 June, 2024; originally announced June 2024.

    Comments: 8 pages, 5 figures

  33. arXiv:2406.14129 [pdf, other]

    cs.CV cs.CL cs.MM

    Towards Event-oriented Long Video Understanding

    Authors: Yifan Du, Kun Zhou, Yuqi Huo, Yifan Li, Wayne Xin Zhao, Haoyu Lu, Zijia Zhao, Bingning Wang, Weipeng Chen, Ji-Rong Wen

    Abstract: With the rapid development of video Multimodal Large Language Models (MLLMs), numerous benchmarks have been proposed to assess their video understanding capability. However, due to the lack of rich events in the videos, these datasets may suffer from the short-cut bias that the answers can be deduced from a few frames, without the need to watch the entire video. To address this issue, we introduce…

    Submitted 20 June, 2024; originally announced June 2024.

    Comments: Work in progress

  34. arXiv:2406.14106 [pdf, other]

    cs.AI cs.CL

    EasyECR: A Library for Easy Implementation and Evaluation of Event Coreference Resolution Models

    Authors: Yuncong Li, Tianhua Xu, Sheng-hua Zhong, Haiqin Yang

    Abstract: Event Coreference Resolution (ECR) is the task of clustering event mentions that refer to the same real-world event. Despite significant advancements, ECR research faces two main challenges: limited generalizability across domains due to narrow dataset evaluations, and difficulties in comparing models within diverse ECR pipelines. To address these issues, we develop EasyECR, the first open-source…

    Submitted 20 June, 2024; originally announced June 2024.

    Comments: 14 pages, 4 figures, 12 tables

  35. arXiv:2406.14100 [pdf, other]

    q-bio.NC

    Self-Attention in Transformer Networks Explains Monkeys' Gaze Pattern in Pac-Man Game

    Authors: Zhongqiao Lin, Yunwei Li, Tianming Yang

    Abstract: We proactively direct our eyes and attention to collect information during problem solving and decision making. Understanding gaze patterns is crucial for gaining insights into the computation underlying the problem-solving process. However, there is a lack of interpretable models that can account for how the brain directs the eyes to collect information and utilize it, especially in the context o…

    Submitted 20 June, 2024; originally announced June 2024.

  36. arXiv:2406.14054 [pdf, other]

    cs.LG

    Urban-Focused Multi-Task Offline Reinforcement Learning with Contrastive Data Sharing

    Authors: Xinbo Zhao, Yingxue Zhang, Xin Zhang, Yu Yang, Yiqun Xie, Yanhua Li, Jun Luo

    Abstract: Enhancing diverse human decision-making processes in an urban environment is a critical issue across various applications, including ride-sharing vehicle dispatching, public transportation management, and autonomous driving. Offline reinforcement learning (RL) is a promising approach to learn and optimize human urban strategies (or policies) from pre-collected human-generated spatial-temporal urba…

    Submitted 20 June, 2024; originally announced June 2024.

    Comments: KDD 2024

  37. arXiv:2406.13958 [pdf]

    physics.app-ph

    Symmetry engineering in 2D bioelectronics facilitating augmented biosensing interfaces

    Authors: Yizhang Wu, Yihan Liu, Yuan Li, Ziquan Wei, Sicheng Xing, Yunlang Wang, Dashuai Zhu, Ziheng Guo, Anran Zhang, Gongkai Yuan, Zhibo Zhang, Ke Huang, Yong Wang, Guorong Wu, Ke Cheng, Wubin Bai

    Abstract: Symmetry lies at the heart of 2D bioelectronics, determining material properties at the fundamental level. Breaking the symmetry allows emergent functionalities and effects. However, symmetry modulation in 2D bioelectronics and the resultant applications have been largely overlooked. Here we devise an oxidized architectural MXene, referred to as OXene, that couples orbit symmetric breaking with inver…

    Submitted 19 June, 2024; originally announced June 2024.

  38. arXiv:2406.13956 [pdf]

    physics.app-ph

    Orbit symmetry breaking in MXene implements enhanced soft bioelectronic implants

    Authors: Yizhang Wu, Yuan Li, Yihan Liu, Dashuai Zhu, Sicheng Xing, Noah Lambert, Hannah Weisbecker, Siyuan Liu, Brayden Davis, Lin Zhang, Meixiang Wang, Gongkai Yuan, Chris Zhoufan You, Anran Zhang, Cate Duncan, Wanrong Xie, Yihang Wang, Yong Wang, Sreya Kanamurlapudi, Garcia-Guzman Evert, Arjun Putcha, Michael D. Dickey, Ke Huang, Wubin Bai

    Abstract: Bioelectronic implants with soft mechanics, biocompatibility, and excellent electrical performance enable biomedical implants to record electrophysiological signals and execute interventions within internal organs, promising to revolutionize the diagnosing, monitoring, and treatment of various pathological conditions. However, challenges remain in improving excessive impedance at the bioelectronic…

    Submitted 19 June, 2024; originally announced June 2024.

  39. arXiv:2406.13948 [pdf, other]

    cs.AI cs.CL cs.LG

    CityGPT: Empowering Urban Spatial Cognition of Large Language Models

    Authors: Jie Feng, Yuwei Du, Tianhui Liu, Siqi Guo, Yuming Lin, Yong Li

    Abstract: Large language models (LLMs) with powerful language generation and reasoning capabilities have already achieved success in many domains, e.g., math and code generation. However, due to the lack of a physical-world corpus and knowledge during training, they usually fail to solve many real-life tasks in the urban space. In this paper, we propose CityGPT, a systematic framework for enhancing the ca…

    Submitted 19 June, 2024; originally announced June 2024.

  40. arXiv:2406.13947 [pdf, other]

    cs.AI cs.CL

    AspirinSum: an Aspect-based utility-preserved de-identification Summarization framework

    Authors: Ya-Lun Li

    Abstract: Due to the rapid advancement of Large Language Models (LLMs), the whole community eagerly consumes any available text data in order to train LLMs. Currently, a large portion of the available text data is collected from the internet, which has been thought of as a cheap source of training data. However, when people try to extend the LLM's capability to the personal related domain, such as healthcare o…

    Submitted 19 June, 2024; originally announced June 2024.

  41. arXiv:2406.13945 [pdf, other]

    cs.AI cs.CL cs.LG

    CityBench: Evaluating the Capabilities of Large Language Model as World Model

    Authors: Jie Feng, Jun Zhang, Junbo Yan, Xin Zhang, Tianjian Ouyang, Tianhui Liu, Yuwei Du, Siqi Guo, Yong Li

    Abstract: Large language models (LLMs) with powerful generalization ability have been widely used in many domains. A systematic and reliable evaluation of LLMs is a crucial step in their development and applications, especially for specific professional fields. In the urban domain, there have been some early explorations about the usability of LLMs, but a systematic and scalable evaluation benchmark is still…

    Submitted 19 June, 2024; originally announced June 2024.

  42. arXiv:2406.13941 [pdf, other]

    cs.IR cs.AI

    UpDLRM: Accelerating Personalized Recommendation using Real-World PIM Architecture

    Authors: Sitian Chen, Haobin Tan, Amelie Chi Zhou, Yusen Li, Pavan Balaji

    Abstract: Deep Learning Recommendation Models (DLRMs) have gained popularity in recommendation systems due to their effectiveness in handling large-scale recommendation tasks. The embedding layers of DLRMs have become the performance bottleneck due to their intensive demands on memory capacity and memory bandwidth. In this paper, we propose UpDLRM, which utilizes real-world processing-in-memory (PIM) hardware,…

    Submitted 19 June, 2024; originally announced June 2024.

  43. arXiv:2406.13705  [pdf, other]

    eess.IV cs.AI cs.CV

    EndoUIC: Promptable Diffusion Transformer for Unified Illumination Correction in Capsule Endoscopy

    Authors: Long Bai, Qiaozhi Tan, Tong Chen, Wan Jun Nah, Yanheng Li, Zhicheng He, Sishen Yuan, Zhen Chen, Jinlin Wu, Mobarakol Islam, Zhen Li, Hongbin Liu, Hongliang Ren

    Abstract: Wireless Capsule Endoscopy (WCE) is highly valued for its non-invasive and painless approach, though its effectiveness is compromised by uneven illumination from hardware constraints and complex internal dynamics, leading to overexposed or underexposed images. While researchers have discussed the challenges of low-light enhancement in WCE, the issue of correcting for different exposure levels rema…

    Submitted 19 June, 2024; originally announced June 2024.

    Comments: To appear in MICCAI 2024. Code and dataset availability: https://github.com/longbai1006/EndoUIC

  44. arXiv:2406.13443  [pdf, other]

    cs.CL

    Dual-Phase Accelerated Prompt Optimization

    Authors: Muchen Yang, Moxin Li, Yongle Li, Zijun Chen, Chongming Gao, Junqi Zhang, Yangyang Li, Fuli Feng

    Abstract: Gradient-free prompt optimization methods have made significant strides in enhancing the performance of closed-source Large Language Models (LLMs) across a wide range of tasks. However, existing approaches make light of the importance of high-quality prompt initialization and the identification of effective optimization directions, thus resulting in substantial optimization steps to obtain satisfa…

    Submitted 19 June, 2024; originally announced June 2024.

  45. arXiv:2406.13426  [pdf, other]

    astro-ph.HE

    Multi-messenger modeling of the Monogem pulsar halo

    Authors: Youyou Li, Oscar Macias, Shinichiro Ando, Jacco Vink

    Abstract: The High-Altitude Water Cherenkov Telescope (HAWC) has detected TeV halos associated with two nearby pulsars/pulsar wind nebulae (PWN) -- Geminga and B0656+14. These TeV halos extend up to tens of pc from the central accelerators, indicating that the diffusion of ultrarelativistic electrons and positrons in the interstellar medium has been suppressed by two orders of magnitude. Although Geminga an…

    Submitted 19 June, 2024; originally announced June 2024.

  46. arXiv:2406.13404  [pdf, other]

    cs.DC

    Low-Latency Layer-Aware Proactive and Passive Container Migration in Meta Computing

    Authors: Mengjie Liu, Yihua Li, Fangyi Mou, Zhiqing Tang, Jiong Lou, Jianxiong Guo, Weijia Jia

    Abstract: Meta computing is a new computing paradigm that aims to efficiently utilize all network computing resources to provide fault-tolerant, personalized services with strong security and privacy guarantees. It also seeks to virtualize the Internet as many meta computers. In meta computing, tasks can be assigned to containers at edge nodes for processing, based on container images with multiple layers.…

    Submitted 19 June, 2024; originally announced June 2024.

    Comments: to be published in IEEE ICMC 2024

  47. arXiv:2406.13284  [pdf]

    physics.med-ph q-bio.QM

    The association of domain-specific physical activity and sedentary activity with stroke: A prospective cohort study

    Authors: Xinyi He, Shidi Wang, Yi Li, Jiucun Wang, Guangrui Yang, Jun Chen, Zixin Hu

    Abstract: Background: The incidence of stroke places a heavy burden on both society and individuals. Activity is closely related to cardiovascular health. This study aimed to investigate the relationship between the varying domains of PA, like occupation-related Physical Activity (OPA), transportation-related Physical Activity (TPA), leisure-time Physical Activity (LTPA), and Sedentary Activity (SA) with str…

    Submitted 19 June, 2024; originally announced June 2024.

  48. arXiv:2406.13201  [pdf, other]

    cs.LG cs.SI

    Toward Structure Fairness in Dynamic Graph Embedding: A Trend-aware Dual Debiasing Approach

    Authors: Yicong Li, Yu Yang, Jiannong Cao, Shuaiqi Liu, Haoran Tang, Guandong Xu

    Abstract: Recent studies successfully learned static graph embeddings that are structurally fair by preventing the effectiveness disparity of high- and low-degree vertex groups in downstream graph mining tasks. However, achieving structure fairness in dynamic graph embedding remains an open problem. Neglecting degree changes in dynamic graphs will significantly impair embedding effectiveness without notably…

    Submitted 19 June, 2024; originally announced June 2024.

  49. arXiv:2406.13193  [pdf, other]

    cs.LG cs.AI cs.CL physics.chem-ph

    PRESTO: Progressive Pretraining Enhances Synthetic Chemistry Outcomes

    Authors: He Cao, Yanjun Shao, Zhiyuan Liu, Zijing Liu, Xiangru Tang, Yuan Yao, Yu Li

    Abstract: Multimodal Large Language Models (MLLMs) have seen growing adoption across various scientific disciplines. These advancements encourage the investigation of molecule-text modeling within synthetic chemistry, a field dedicated to designing and conducting chemical reactions to synthesize new compounds with desired properties and applications. Current approaches, however, often neglect the critical r…

    Submitted 18 June, 2024; originally announced June 2024.

  50. arXiv:2406.13176  [pdf, other]

    math.CO

    A spectral Erdős-Faudree-Rousseau theorem

    Authors: Yongtao Li, Lihua Feng, Yuejian Peng

    Abstract: A well-known theorem of Mantel states that every $n$-vertex graph with more than $\lfloor n^2/4\rfloor $ edges contains a triangle. An interesting problem in extremal graph theory studies the minimum number of edges contained in triangles among graphs with a prescribed number of vertices and edges. Erdős, Faudree and Rousseau (1992) showed that a graph on $n$ vertices with more than…

    Submitted 18 June, 2024; originally announced June 2024.

    Comments: 30 pages. At the end of the paper, we proposed many spectral extremal graph problems for readers. Any comments are welcome

    MSC Class: 05C35; 05C50