Showing 1–50 of 1,030 results for author: Guo, S

  1. arXiv:2407.00014  [pdf]

    cs.RO eess.SY

    Simplifying Kinematic Parameter Estimation in sEMG Prosthetic Hands: A Two-Point Approach

    Authors: Gang Liu, Zhenxiang Wang, Ziyang He, Shanshan Guo, Rui Zhang, Dezhong Yao

    Abstract: Regression-based sEMG prosthetic hands are widely used for their ability to provide continuous kinematic parameters. However, establishing these models traditionally requires complex kinematic sensor systems to collect corresponding kinematic data in synchronization with EMG, which is cumbersome and user-unfriendly. This paper presents a simplified approach utilizing only two data points to depict…

    Submitted 1 May, 2024; originally announced July 2024.

    Comments: 13 pages

  2. arXiv:2406.19939  [pdf, other]

    physics.flu-dyn

    Data-driven methods for flow and transport in porous media: a review

    Authors: Guang Yang, Ran Xu, Yusong Tian, Songyuan Guo, Jingyi Wu, Xu Chu

    Abstract: This review examined the current advancements in data-driven methods for analyzing flow and transport in porous media, which has various applications in energy, chemical engineering, environmental science, and beyond. Although there has been progress in recent years, the challenges of current experimental and high-fidelity numerical simulations, such as high computational costs and difficulties in…

    Submitted 28 June, 2024; originally announced June 2024.

  3. arXiv:2406.19486  [pdf, other]

    cs.CL cs.AI cs.ET cs.LG eess.SP

    LoPT: Low-Rank Prompt Tuning for Parameter Efficient Language Models

    Authors: Shouchang Guo, Sonam Damani, Keng-hao Chang

    Abstract: In prompt tuning, a prefix or suffix text is added to the prompt, and the embeddings (soft prompts) or token indices (hard prompts) of the prefix/suffix are optimized to gain more control over language models for specific tasks. This approach eliminates the need for hand-crafted prompt engineering or explicit model fine-tuning. Prompt tuning is significantly more parameter-efficient than model fin…

    Submitted 27 June, 2024; originally announced June 2024.

  4. arXiv:2406.19246  [pdf, other]

    eess.SP

    An Interpretable and Efficient Sleep Staging Algorithm: DetectsleepNet

    Authors: Shengwei Guo

    Abstract: Sleep quality directly impacts human health and quality of life, so accurate sleep staging is essential for assessing sleep quality. However, most traditional methods are inefficient and time-consuming due to segmenting different sleep cycles by manual labeling. In contrast, automated sleep staging technology not only directly assesses sleep quality but also helps sleep specialists analyze sleep s…

    Submitted 27 June, 2024; originally announced June 2024.

    Comments: 25 pages, 11 figures

  5. arXiv:2406.17142  [pdf, other]

    quant-ph

    Continuous drive heterodyne microwave sensing with spin qubits in hexagonal boron nitride

    Authors: Charlie J. Patrickson, Valentin Haemmerli, Shi Guo, Andrew J. Ramsay, Isaac J. Luxmoore

    Abstract: Quantum sensors that use solid state spin defects have emerged as effective probes of weak alternating magnetic signals. By recording the phase of a signal relative to an external clock, these devices can resolve signal frequencies to a precision orders of magnitude longer than the spin state lifetime. However, these quantum heterodyne protocols suffer from sub-optimal sensitivity, as they are cur…

    Submitted 24 June, 2024; originally announced June 2024.

  6. arXiv:2406.16878  [pdf, ps, other]

    eess.SP cs.AI cs.IT

    Benchmarking Semantic Communications for Image Transmission Over MIMO Interference Channels

    Authors: Yanhu Wang, Shuaishuai Guo, Anming Dong, Hui Zhao

    Abstract: Semantic communications offer promising prospects for enhancing data transmission efficiency. However, existing schemes have predominantly concentrated on point-to-point transmissions. In this paper, we aim to investigate the validity of this claim in interference scenarios compared to baseline approaches. Specifically, our focus is on general multiple-input multiple-output (MIMO) interference cha…

    Submitted 10 April, 2024; originally announced June 2024.

  7. arXiv:2406.14675  [pdf, other]

    cs.CV cs.AI cs.LG

    This Looks Better than That: Better Interpretable Models with ProtoPNeXt

    Authors: Frank Willard, Luke Moffett, Emmanuel Mokel, Jon Donnelly, Stark Guo, Julia Yang, Giyoung Kim, Alina Jade Barnett, Cynthia Rudin

    Abstract: Prototypical-part models are a popular interpretable alternative to black-box deep learning models for computer vision. However, they are difficult to train, with high sensitivity to hyperparameter tuning, inhibiting their application to new datasets and our understanding of which methods truly improve their performance. To facilitate the careful study of prototypical-part networks (ProtoPNets), w…

    Submitted 20 June, 2024; originally announced June 2024.

  8. arXiv:2406.14577  [pdf, ps, other]

    math.RA

    Non-abelian extensions of Lie triple systems and Wells exact sequences

    Authors: Qinxiu Sun, Shuangjian Guo

    Abstract: In this paper, we investigate non-abelian extensions and inducibility of pairs of automorphisms of Lie triple systems. First, we introduce non-abelian cohomology groups and classify the non-abelian extensions in terms of non-abelian cohomology groups. Next, we characterize the non-abelian extensions using Maurer-Cartan elements. Furthermore, we explore the inducibility of pairs of automorphism…

    Submitted 17 June, 2024; originally announced June 2024.

    Comments: 23 pages. arXiv admin note: substantial text overlap with arXiv:2401.15333, arXiv:2404.02752

    MSC Class: 17A30; 17B62; 17B38

  9. arXiv:2406.14540  [pdf, other]

    cs.RO cs.AI cs.CV

    IRASim: Learning Interactive Real-Robot Action Simulators

    Authors: Fangqi Zhu, Hongtao Wu, Song Guo, Yuxiao Liu, Chilam Cheang, Tao Kong

    Abstract: Scalable robot learning in the real world is limited by the cost and safety issues of real robots. In addition, rolling out robot trajectories in the real world can be time-consuming and labor-intensive. In this paper, we propose to learn an interactive real-robot action simulator as an alternative. We introduce a novel method, IRASim, which leverages the power of generative models to generate ext…

    Submitted 20 June, 2024; originally announced June 2024.

    Comments: Opensource, project website: https://gen-irasim.github.io

  10. arXiv:2406.14399  [pdf, other]

    cs.LG cs.CV physics.ao-ph stat.ML

    WEATHER-5K: A Large-scale Global Station Weather Dataset Towards Comprehensive Time-series Forecasting Benchmark

    Authors: Tao Han, Song Guo, Zhenghao Chen, Wanghan Xu, Lei Bai

    Abstract: Global Station Weather Forecasting (GSWF) is crucial for various sectors, including aviation, agriculture, energy, and disaster preparedness. Recent advancements in deep learning have significantly improved the accuracy of weather predictions by optimizing models based on public meteorological data. However, existing public datasets for GSWF optimization and benchmarking still suffer from signific…

    Submitted 20 June, 2024; originally announced June 2024.

    Comments: 26 pages, 13 figures

  11. arXiv:2406.14302  [pdf, ps, other]

    stat.ML cs.AI cs.LG

    Identifiable Exchangeable Mechanisms for Causal Structure and Representation Learning

    Authors: Patrik Reizinger, Siyuan Guo, Ferenc Huszár, Bernhard Schölkopf, Wieland Brendel

    Abstract: Identifying latent representations or causal structures is important for good generalization and downstream task performance. However, both fields have been developed rather independently. We observe that several methods in both representation and causal structure learning rely on the same data-generating process (DGP), namely, exchangeable but not i.i.d. (independent and identically distributed)…

    Submitted 20 June, 2024; originally announced June 2024.

  12. arXiv:2406.14067  [pdf]

    physics.optics eess.SP

    A microwave photonic prototype for concurrent radar detection and spectrum sensing over an 8 to 40 GHz bandwidth

    Authors: Taixia Shi, Dingding Liang, Lu Wang, Lin Li, Shaogang Guo, Jiawei Gao, Xiaowei Li, Chulun Lin, Lei Shi, Baogang Ding, Shiyang Liu, Fangyi Yang, Chi Jiang, Yang Chen

    Abstract: In this work, a microwave photonic prototype for concurrent radar detection and spectrum sensing is proposed, designed, built, and investigated. A direct digital synthesizer and an analog electronic circuit are integrated to generate an intermediate frequency (IF) linearly frequency-modulated (LFM) signal with a tunable center frequency from 2.5 to 9.5 GHz and an instantaneous bandwidth of 1 GHz.…

    Submitted 20 June, 2024; originally announced June 2024.

    Comments: 18 pages, 12 figures, 1 table

  13. arXiv:2406.14056  [pdf, other]

    cs.CV

    VGA: Vision GUI Assistant -- Minimizing Hallucinations through Image-Centric Fine-Tuning

    Authors: Ziyang Meng, Yu Dai, Zezheng Gong, Shaoxiong Guo, Minglong Tang, Tongquan Wei

    Abstract: Recent advances in Large Vision-Language Models (LVLMs) have significantly improved performance in image comprehension tasks, such as formatted charts and rich-content images. Yet, Graphical User Interfaces (GUIs) pose a greater challenge due to their structured format and detailed textual information. Existing LVLMs often overly depend on internal knowledge and neglect image content, resulting in ha…

    Submitted 21 June, 2024; v1 submitted 20 June, 2024; originally announced June 2024.

    Comments: 18 pages

    MSC Class: 68-04 68-04 ACM Class: I.2.7; I.2.10

  14. arXiv:2406.13999  [pdf, other]

    quant-ph

    Individually Addressed Entangling Gates in a Two-Dimensional Ion Crystal

    Authors: Y. -H. Hou, Y. -J. Yi, Y. -K. Wu, Y. -Y. Chen, L. Zhang, Y. Wang, Y. -L. Xu, C. Zhang, Q. -X. Mei, H. -X. Yang, J. -Y. Ma, S. -A. Guo, J. Ye, B. -X. Qi, Z. -C. Zhou, P. -Y. Hou, L. -M. Duan

    Abstract: Two-dimensional (2D) ion crystals have become a promising way to scale up qubit numbers for ion trap quantum information processing. However, to realize universal quantum computing in this system, individually addressed high-fidelity two-qubit entangling gates still remain challenging due to the inevitable micromotion of ions in a 2D crystal as well as the technical difficulty in 2D addressing. He…

    Submitted 20 June, 2024; originally announced June 2024.

  15. arXiv:2406.13986  [pdf, other]

    astro-ph.SR astro-ph.GA

    Novae: An Important Source of Lithium in the Galaxy

    Authors: Jun Gao, Chunhua Zhu, Guoliang Lü, Jinlong Yu, Lin Li, Helei Liu, Sufen Guo

    Abstract: The source of the Galactic Lithium (Li) has long been a puzzle. With the discovery of Li in novae, extensive research has been conducted. However, there still exists a significant disparity between the observed abundance of lithium in novae and the existing theoretical predictions. Using the Modules for Experiments in Stellar Astrophysics (MESA), we simulate the evolution of nova with element diff…

    Submitted 20 June, 2024; originally announced June 2024.

    Comments: 12 pages, 4 figures. Accepted for publication in Astrophysical Journal

  16. arXiv:2406.13970  [pdf]

    physics.optics

    Pixel-scale NIR-VIS Spectral Routers Based on 2D Mie-type Metagratings

    Authors: Yifan Shao, Shuhan Guo, Rui Chen, Yongdi Dang, Yi Zhou, Yubo Wang, Junjie Zhan, Jiaqi Yu, Bing-Feng Ju, Yungui Ma

    Abstract: The out-of-band energy loss caused by in-built color filters significantly degrades the signal-to-noise ratio and the dynamic range of conventional image sensors, which has restricted the attempt to develop ultrahigh-density imaging devices by merely shrinking the pixel size. This issue will be more serious for security cameras which need to collect visible (VIS) light and near-infrared (NIR) phot…

    Submitted 24 June, 2024; v1 submitted 19 June, 2024; originally announced June 2024.

    Journal ref: Laser and Photonics Reviews 17, 2300027 (2023)

  17. arXiv:2406.13950  [pdf, ps, other]

    cond-mat.mtrl-sci

    Valley polarization in twisted altermagnetism

    Authors: San-Dong Guo, Yichen Liu, Cheng-Cheng Liu

    Abstract: The combination of altermagnetism, twistronics and valleytronics is of great significance for potential applications in advanced electronic devices. Twisted magnetic van der Waals bilayers have been identified as an ideal platform for altermagnetism of any type, such as $d$-wave, $g$-wave, and $i$-wave, by choosing the constituent monolayer with specific symmetry [arXiv:2404.17146 (2024)]. Here, w…

    Submitted 19 June, 2024; originally announced June 2024.

    Comments: 5 pages, 5 figures

  18. arXiv:2406.13948  [pdf, other]

    cs.AI cs.CL cs.LG

    CityGPT: Empowering Urban Spatial Cognition of Large Language Models

    Authors: Jie Feng, Yuwei Du, Tianhui Liu, Siqi Guo, Yuming Lin, Yong Li

    Abstract: Large language models (LLMs) with powerful language generation and reasoning capabilities have already achieved success in many domains, e.g., math and code generation. However, due to the lack of physical-world corpora and knowledge during training, they usually fail to solve many real-life tasks in the urban space. In this paper, we propose CityGPT, a systematic framework for enhancing the ca…

    Submitted 19 June, 2024; originally announced June 2024.

  19. arXiv:2406.13945  [pdf, other]

    cs.AI cs.CL cs.LG

    CityBench: Evaluating the Capabilities of Large Language Model as World Model

    Authors: Jie Feng, Jun Zhang, Junbo Yan, Xin Zhang, Tianjian Ouyang, Tianhui Liu, Yuwei Du, Siqi Guo, Yong Li

    Abstract: Large language models (LLMs) with powerful generalization ability have been widely used in many domains. A systematic and reliable evaluation of LLMs is a crucial step in their development and applications, especially for specific professional fields. In the urban domain, there have been some early explorations about the usability of LLMs, but a systematic and scalable evaluation benchmark is still…

    Submitted 19 June, 2024; originally announced June 2024.

  20. arXiv:2406.13368  [pdf]

    cond-mat.mtrl-sci

    Lewis Acidity and Basicity Diagnostics of Molten Salt for its Properties and Structure Online Monitoring

    Authors: Changzu Zhu, Jia Song, Xiaorui Xu, Chengyu Wang, Yang Tong, Lve Lin, Shaoqiang Guo, Wentao Zhou, Adrien Couet, Yafei Wang

    Abstract: Analogous to the aqueous solution where the pH of the solvent affects its multiple behaviors, the Lewis acidity-basicity of molten salts also greatly influences their thermophysical and thermochemical properties. In the study, we develop ion probes to quantitatively determine the acidity-basicity scale of molten NaCl-xAlCl3 (x = 1.5-2.1) salt using in-situ ultra-violet visible (UV-Vis) spectroscop…

    Submitted 19 June, 2024; originally announced June 2024.

  21. arXiv:2406.12859  [pdf, ps, other]

    math.RA

    Cohomologies of Reynolds Lie-Yamaguti algebras of any weight and applications

    Authors: Wen Teng, Shuangjian Guo

    Abstract: The purpose of the present paper is to investigate cohomologies of Reynolds Lie-Yamaguti algebras of any weight and provide some applications. First, we introduce the notion of Reynolds Lie-Yamaguti algebras and give some new examples. Moreover, cohomologies of Reynolds operators and Reynolds Lie-Yamaguti algebras with coefficients in a suitable representation are established. Finally, formal defo…

    Submitted 6 March, 2024; originally announced June 2024.

    MSC Class: 17B38; 17B60; 17B56; 17D99

  22. arXiv:2406.12466  [pdf, other]

    gr-qc

    Rastall gravity: accretion disk image in radiation fields context and visual transformations compared to Reissner-Nordstrom black holes

    Authors: Yu-Xiang Huang, Sen Guo, Yu Liang, Yu-Hao Cui, Qing-Quan Jiang, Kai Lin

    Abstract: Our study investigates the astronomical implications of Rastall gravity, particularly its behavior amidst a radiation field compared to Reissner-Nordstrom (RN) black holes. Our research delineates a crucial correlation between the dynamics of the accretion disk and the parameters Q and N_{\rm r}, which aptly reflect the influence of spacetime metrics on the disk's appearance. Elevated electric cha…

    Submitted 18 June, 2024; originally announced June 2024.

  23. arXiv:2406.12074  [pdf, other]

    cs.CL

    COMMUNITY-CROSS-INSTRUCT: Unsupervised Instruction Generation for Aligning Large Language Models to Online Communities

    Authors: Zihao He, Rebecca Dorn, Siyi Guo, Minh Duc Chu, Kristina Lerman

    Abstract: Social scientists use surveys to probe the opinions and beliefs of populations, but these methods are slow, costly, and prone to biases. Recent advances in large language models (LLMs) enable creating computational representations or "digital twins" of populations that generate human-like responses mimicking the population's language, styles, and attitudes. We introduce Community-Cross-Instruct, a…

    Submitted 17 June, 2024; originally announced June 2024.

  24. arXiv:2406.10506  [pdf, ps, other]

    cs.CY

    Validating an Instrument for Teachers' Acceptance of Artificial Intelligence in Education

    Authors: Shuchen Guo, Lehong Shi, Xiaoming Zhai

    Abstract: As artificial intelligence (AI) receives wider attention in education, examining teachers' acceptance of AI (TAAI) becomes essential. However, existing instruments measuring TAAI reported limited reliability and validity evidence and faced some design challenges, such as missing informed definitions of AI to participants. This study aimed to develop and validate a TAAI instrument, with providing s…

    Submitted 15 June, 2024; originally announced June 2024.

  25. arXiv:2406.08909  [pdf, other]

    cs.CV

    A Label-Free and Non-Monotonic Metric for Evaluating Denoising in Event Cameras

    Authors: Chenyang Shi, Shasha Guo, Boyi Wei, Hanxiao Liu, Yibo Zhang, Ningfang Song, Jing Jin

    Abstract: Event cameras are renowned for their high efficiency due to outputting a sparse, asynchronous stream of events. However, they are plagued by noisy events, especially in low light conditions. Denoising is an essential task for event cameras, but evaluating denoising performance is challenging. Label-dependent denoising metrics involve artificially adding noise to clean sequences, complicating evalu…

    Submitted 13 June, 2024; originally announced June 2024.

  26. arXiv:2406.08090  [pdf, other]

    cs.CV

    From Sim-to-Real: Toward General Event-based Low-light Frame Interpolation with Per-scene Optimization

    Authors: Ziran Zhang, Yongrui Ma, Yueting Chen, Feng Zhang, Jinwei Gu, Tianfan Xue, Shi Guo

    Abstract: Video Frame Interpolation (VFI) is important for video enhancement, frame rate up-conversion, and slow-motion generation. The introduction of event cameras, which capture per-pixel brightness changes asynchronously, has significantly enhanced VFI capabilities, particularly for high-speed, nonlinear motions. However, these event-based methods encounter challenges in low-light conditions, notably tr…

    Submitted 12 June, 2024; originally announced June 2024.

  27. arXiv:2406.06937  [pdf, other]

    cs.CL cs.AI cs.SD eess.AS

    A Non-autoregressive Generation Framework for End-to-End Simultaneous Speech-to-Any Translation

    Authors: Zhengrui Ma, Qingkai Fang, Shaolei Zhang, Shoutao Guo, Yang Feng, Min Zhang

    Abstract: Simultaneous translation models play a crucial role in facilitating communication. However, existing research primarily focuses on text-to-text or speech-to-text models, necessitating additional cascade components to achieve speech-to-speech translation. These pipeline methods suffer from error propagation and accumulate delays in each cascade component, resulting in reduced synchronization betwee…

    Submitted 11 June, 2024; originally announced June 2024.

    Comments: ACL 2024; Codes and demos are at https://github.com/ictnlp/NAST-S2x

  28. arXiv:2406.06910  [pdf, other]

    cs.CL

    Agent-SiMT: Agent-assisted Simultaneous Machine Translation with Large Language Models

    Authors: Shoutao Guo, Shaolei Zhang, Zhengrui Ma, Min Zhang, Yang Feng

    Abstract: Simultaneous Machine Translation (SiMT) generates target translations while reading the source sentence. It relies on a policy to determine the optimal timing for reading sentences and generating translations. Existing SiMT methods generally adopt the traditional Transformer architecture, which concurrently determines the policy and generates translations. While they excel at determining policies,…

    Submitted 12 June, 2024; v1 submitted 10 June, 2024; originally announced June 2024.

    Comments: 18 pages, 8 figures, 7 tables. v2 of arXiv:2402.13036

  29. arXiv:2406.05449  [pdf, ps, other]

    math.SP

    Anderson localization for CMV matrices with Verblunsky coefficients defined by the hyperbolic toral automorphism

    Authors: Yanxue Lin, Shuzheng Guo, Daxiong Piao

    Abstract: In this paper, we prove the large deviation estimates and Anderson localization for CMV matrices on $\ell^2(\mathbb{Z}_+)$ with Verblunsky coefficients defined dynamically by the hyperbolic toral automorphism. Part of positivity results on the Lyapunov exponents of Chulaevsky-Spencer and Anderson localization results of Bourgain-Schlag on Schrödinger operators with strongly mixing potentials are e…

    Submitted 8 June, 2024; originally announced June 2024.

    MSC Class: 37A30; 42C05; 70G60

  30. arXiv:2406.03878  [pdf, other]

    cs.CL

    Decoder-only Streaming Transformer for Simultaneous Translation

    Authors: Shoutao Guo, Shaolei Zhang, Yang Feng

    Abstract: Simultaneous Machine Translation (SiMT) generates translation while reading source tokens, essentially producing the target prefix based on the source prefix. To achieve good performance, it leverages the relationship between source and target prefixes to extract a policy to guide the generation of translations. Although existing SiMT methods primarily focus on the Encoder-Decoder architecture, we e…

    Submitted 6 June, 2024; originally announced June 2024.

    Comments: Accepted to ACL 2024. 14 pages, 10 Tables, 5 Figures

  31. arXiv:2406.03049  [pdf, other]

    cs.CL cs.AI cs.SD eess.AS

    StreamSpeech: Simultaneous Speech-to-Speech Translation with Multi-task Learning

    Authors: Shaolei Zhang, Qingkai Fang, Shoutao Guo, Zhengrui Ma, Min Zhang, Yang Feng

    Abstract: Simultaneous speech-to-speech translation (Simul-S2ST, a.k.a streaming speech translation) outputs target speech while receiving streaming speech inputs, which is critical for real-time communication. Beyond accomplishing translation between speech, Simul-S2ST requires a policy to control the model to generate corresponding target speech at the opportune moment within speech inputs, thereby posing…

    Submitted 5 June, 2024; originally announced June 2024.

    Comments: Accepted to ACL 2024 main conference, Project Page: https://ictnlp.github.io/StreamSpeech-site/

  32. arXiv:2406.02903  [pdf, other]

    cs.CL

    Open Grounded Planning: Challenges and Benchmark Construction

    Authors: Shiguang Guo, Ziliang Deng, Hongyu Lin, Yaojie Lu, Xianpei Han, Le Sun

    Abstract: The emergence of large language models (LLMs) has increasingly drawn attention to the use of LLMs for human-like planning. Existing work on LLM-based planning either focuses on leveraging the inherent language generation capabilities of LLMs to produce free-style plans, or employs reinforcement learning approaches to learn decision-making for a limited set of actions within restricted environments…

    Submitted 4 June, 2024; originally announced June 2024.

    Comments: Accepted to ACL 2024 main conference

  33. arXiv:2406.02260  [pdf]

    cond-mat.mes-hall cond-mat.mtrl-sci

    Near-Room-Temperature Field-Controllable Exchange Bias in 2D van der Waals Ferromagnet Fe3GaTe2

    Authors: Jifeng Shao, Xiaolong Yin, Chunhao Bao, Sirong Lu, Xiaoming Ma, Shu Guo, Le Wang, Xi Zhang, Zhiyue Li, Longxiang Li, Yue Zhao, Tingyong Chen

    Abstract: Exchange bias (EB) is a cornerstone of modern magnetic memory and sensing technologies. Its extension to the realm of two-dimensional (2D) van der Waals (vdW) magnets holds promise for revolutionary advancements in miniaturized and efficient atomic spintronic devices. However, the blocking temperature of EB in 2D vdW magnets is currently well below room temperature ~130 K. This study reports a rob…

    Submitted 4 June, 2024; originally announced June 2024.

    Comments: 14 pages, 5 figures

  34. arXiv:2406.01866  [pdf, other]

    cs.CL cs.CY cs.SI

    #EpiTwitter: Public Health Messaging During the COVID-19 Pandemic

    Authors: Ashwin Rao, Nazanin Sabri, Siyi Guo, Louiqa Raschid, Kristina Lerman

    Abstract: Effective communication during health crises is critical, with social media serving as a key platform for public health experts (PHEs) to engage with the public. However, it also amplifies pseudo-experts promoting contrarian views. Despite its importance, the role of emotional and moral language in PHEs' communication during COVID-19 remains under explored. This study examines how PHEs and pseudo-…

    Submitted 10 June, 2024; v1 submitted 3 June, 2024; originally announced June 2024.

  35. arXiv:2406.01574  [pdf, other]

    cs.CL

    MMLU-Pro: A More Robust and Challenging Multi-Task Language Understanding Benchmark

    Authors: Yubo Wang, Xueguang Ma, Ge Zhang, Yuansheng Ni, Abhranil Chandra, Shiguang Guo, Weiming Ren, Aaran Arulraj, Xuan He, Ziyan Jiang, Tianle Li, Max Ku, Kai Wang, Alex Zhuang, Rongqi Fan, Xiang Yue, Wenhu Chen

    Abstract: In the age of large-scale language models, benchmarks like the Massive Multitask Language Understanding (MMLU) have been pivotal in pushing the boundaries of what AI can achieve in language comprehension and reasoning across diverse domains. However, as models continue to improve, their performance on these benchmarks has begun to plateau, making it increasingly difficult to discern differences in…

    Submitted 23 June, 2024; v1 submitted 3 June, 2024; originally announced June 2024.

  36. arXiv:2406.00894  [pdf, other]

    cs.LG cs.AI cs.CL

    Pretrained Hybrids with MAD Skills

    Authors: Nicholas Roberts, Samuel Guo, Zhiqi Gao, Satya Sai Srinath Namburi GNVV, Sonia Cromp, Chengjun Wu, Chengyu Duan, Frederic Sala

    Abstract: While Transformers underpin modern large language models (LMs), there is a growing list of alternative architectures with new capabilities, promises, and tradeoffs. This makes choosing the right LM architecture challenging. Recently-proposed $\textit{hybrid architectures}$ seek a best-of-all-worlds approach that reaps the benefits of all architectures. Hybrid design is difficult for two reasons: i…

    Submitted 2 June, 2024; originally announced June 2024.

  37. arXiv:2405.19327  [pdf, other]

    cs.CL cs.AI cs.LG

    MAP-Neo: Highly Capable and Transparent Bilingual Large Language Model Series

    Authors: Ge Zhang, Scott Qu, Jiaheng Liu, Chenchen Zhang, Chenghua Lin, Chou Leuang Yu, Danny Pan, Esther Cheng, Jie Liu, Qunshu Lin, Raven Yuan, Tuney Zheng, Wei Pang, Xinrun Du, Yiming Liang, Yinghao Ma, Yizhi Li, Ziyang Ma, Bill Lin, Emmanouil Benetos, Huan Yang, Junting Zhou, Kaijing Ma, Minghao Liu, Morry Niu, et al. (20 additional authors not shown)

    Abstract: Large Language Models (LLMs) have made great strides in recent years to achieve unprecedented performance across different tasks. However, due to commercial interest, the most competitive models like GPT, Gemini, and Claude have been gated behind proprietary interfaces without disclosing the training details. Recently, many institutions have open-sourced several strong LLMs like LLaMA-3, comparabl…

    Submitted 2 June, 2024; v1 submitted 29 May, 2024; originally announced May 2024.

    Comments: https://map-neo.github.io/

  38. arXiv:2405.18836  [pdf, other]

    stat.ME cs.LG

    Do Finetti: On Causal Effects for Exchangeable Data

    Authors: Siyuan Guo, Chi Zhang, Karthika Mohan, Ferenc Huszár, Bernhard Schölkopf

    Abstract: We study causal effect estimation in a setting where the data are not i.i.d. (independent and identically distributed). We focus on exchangeable data satisfying an assumption of independent causal mechanisms. Traditional causal effect estimation frameworks, e.g., relying on structural causal models and do-calculus, are typically limited to i.i.d. data and do not extend to more general exchangeable…

    Submitted 29 May, 2024; originally announced May 2024.

  39. arXiv:2405.18826  [pdf, ps, other]

    cond-mat.mtrl-sci

    Isovalent alloying assisted anomalous valley Hall effect in hexagonal antiferromagnetic monolayer

    Authors: San-Dong Guo, Liguo Zhang, Xiao-Shu Guo, Gangqiang Zhu

    Abstract: Exploring combination of antiferromagnetic (AFM) spintronics and anomalous valley Hall effect (AVHE) is one of the most important questions for valleytronic applications. The key to address this issue is to achieve spin splitting around the valleys in AFM systems. Here, we propose a possible way for achieving AVHE in hexagonal AFM monolayer, which involves the isovalent alloying. This can break th…

    Submitted 29 May, 2024; originally announced May 2024.

    Comments: 6 pages, 7 figures

  40. arXiv:2405.17546  [pdf, other]

    hep-th quant-ph

    Complexity is not Enough for Randomness

    Authors: Shiyong Guo, Martin Sasieta, Brian Swingle

    Abstract: We study the dynamical generation of randomness in Brownian systems as a function of the degree of locality of the Hamiltonian. We first express the trace distance to a unitary design for these systems in terms of an effective equilibrium thermal partition function, and provide a set of conditions that guarantee a linear time to design. We relate the trace distance to design to spectral properties…

    Submitted 27 May, 2024; originally announced May 2024.

    Comments: 40 pages + appendices

  41. arXiv:2405.16152  [pdf, other]

    cs.CV cs.HC

    SuDA: Support-based Domain Adaptation for Sim2Real Motion Capture with Flexible Sensors

    Authors: Jiawei Fang, Haishan Song, Chengxu Zuo, Xiaoxia Gao, Xiaowei Chen, Shihui Guo, Yipeng Qin

    Abstract: Flexible sensors hold promise for human motion capture (MoCap), offering advantages such as wearability, privacy preservation, and minimal constraints on natural movement. However, existing flexible sensor-based MoCap methods rely on deep learning and necessitate large and diverse labeled datasets for training. These data typically need to be collected in MoCap studios with specialized equipment a…

    Submitted 25 May, 2024; originally announced May 2024.

    Comments: 20 pages conference, accepted ICML paper

  42. arXiv:2405.16011  [pdf, ps, other]

    eess.SP

    Semantic Importance-Aware Communications with Semantic Correction Using Large Language Models

    Authors: Shuaishuai Guo, Yanhu Wang, Jia Ye, Anbang Zhang, Kun Xu

    Abstract: Semantic communications, a promising approach for agent-human and agent-agent interactions, typically operate at a feature level, lacking true semantic understanding. This paper explores understanding-level semantic communications (ULSC), transforming visual data into human-intelligible semantic content. We employ an image caption neural network (ICNN) to derive semantic representations from visua…

    Submitted 24 May, 2024; originally announced May 2024.

  43. arXiv:2405.15485  [pdf, other]

    cs.AI cs.CL cs.LG

    Learning Beyond Pattern Matching? Assaying Mathematical Understanding in LLMs

    Authors: Siyuan Guo, Aniket Didolkar, Nan Rosemary Ke, Anirudh Goyal, Ferenc Huszár, Bernhard Schölkopf

    Abstract: We are beginning to see progress in language model assisted scientific discovery. Motivated by the use of LLMs as a general scientific assistant, this paper assesses the domain knowledge of LLMs through its understanding of different mathematical skills required to solve problems. In particular, we look at not just what the pre-trained model already knows, but how it learned to learn from informat…

    Submitted 24 May, 2024; originally announced May 2024.

  44. arXiv:2405.14744  [pdf, other]

    cs.CY

    Exploring Prosocial Irrationality for LLM Agents: A Social Cognition View

    Authors: Xuan Liu, Jie Zhang, Song Guo, Haoyang Shang, Chengxu Yang, Quanyan Zhu

    Abstract: Large language models (LLMs) have been shown to face hallucination issues due to the data they trained on often containing human bias; whether this is reflected in the decision-making process of LLM agents remains under-explored. As LLM Agents are increasingly employed in intricate social environments, a pressing and natural question emerges: Can LLM Agents leverage hallucinations to mirror human…

    Submitted 23 May, 2024; originally announced May 2024.

  45. arXiv:2405.13999  [pdf, other]

    cs.CV

    Computer-Vision-Enabled Worker Video Analysis for Motion Amount Quantification

    Authors: Hari Iyer, Neel Macwan, Shenghan Guo, Heejin Jeong

    Abstract: The performance of physical workers is significantly influenced by the quantity of their motions. However, monitoring and assessing these motions is challenging due to the complexities of motion sensing, tracking, and quantification. Recent advancements have utilized in-situ video analysis for real-time observation of worker behaviors, enabling data-driven quantification of motion amounts. Neverth…

    Submitted 22 May, 2024; originally announced May 2024.

  46. arXiv:2405.13080  [pdf, other]

    cs.CR cs.LG

    EmInspector: Combating Backdoor Attacks in Federated Self-Supervised Learning Through Embedding Inspection

    Authors: Yuwen Qian, Shuchi Wu, Kang Wei, Ming Ding, Di Xiao, Tao Xiang, Chuan Ma, Song Guo

    Abstract: Federated self-supervised learning (FSSL) has recently emerged as a promising paradigm that enables the exploitation of clients' vast amounts of unlabeled data while preserving data privacy. While FSSL offers advantages, its susceptibility to backdoor attacks, a concern identified in traditional federated supervised learning (FSL), has not been investigated. To fill the research gap, we undertake…

    Submitted 21 May, 2024; originally announced May 2024.

    Comments: 18 pages, 12 figures

  47. arXiv:2405.12808  [pdf, other]

    gr-qc astro-ph.HE

    Influence of quantum correction on the Schwarzschild black hole polarized image

    Authors: Sen Guo, Yu-Xiang Huang, Kuan Liu, En-Wei Liang, Kai Lin

    Abstract: Using a model of an accretion disk around a Schwarzschild black hole, the analytic estimates for image polarization were derived by Narayan $et~al.$ [Astrophys. J, 102, 912 (2021)]. Recently, the EHT team also obtained polarization images of the Sgr A$^{*}$ and measured both linear and circular polarization [Astrophys. J. Lett, 964, L25 (2024)]. We find that quantum correction effects can also in…

    Submitted 21 May, 2024; originally announced May 2024.

    Comments: 20 pages, 8 figures

    Report number: Accepted European Physical Journal C (EPJC) 2024

  48. arXiv:2405.12459  [pdf, other]

    cs.LG

    PLM4Traj: Cognizing Movement Patterns and Travel Purposes from Trajectories with Pre-trained Language Models

    Authors: Zeyu Zhou, Yan Lin, Haomin Wen, Shengnan Guo, Jilin Hu, Youfang Lin, Huaiyu Wan

    Abstract: Spatio-temporal trajectories play a vital role in various spatio-temporal data mining tasks. Developing a versatile trajectory learning approach that can adapt to different tasks while ensuring high accuracy is crucial. This requires effectively extracting movement patterns and travel purposes embedded in trajectories. However, this task is challenging due to limitations in the size and quality of…

    Submitted 20 May, 2024; originally announced May 2024.

  49. arXiv:2405.12205  [pdf, other]

    cs.AI cs.LG

    Metacognitive Capabilities of LLMs: An Exploration in Mathematical Problem Solving

    Authors: Aniket Didolkar, Anirudh Goyal, Nan Rosemary Ke, Siyuan Guo, Michal Valko, Timothy Lillicrap, Danilo Rezende, Yoshua Bengio, Michael Mozer, Sanjeev Arora

    Abstract: Metacognitive knowledge refers to humans' intuitive knowledge of their own thinking and reasoning processes. Today's best LLMs clearly possess some reasoning processes. The paper gives evidence that they also have metacognitive knowledge, including ability to name skills and procedures to apply given a task. We explore this primarily in context of math reasoning, developing a prompt-guided interac…

    Submitted 20 May, 2024; originally announced May 2024.

    Comments: Preprint. Under review

  50. arXiv:2405.11571  [pdf, other]

    astro-ph.SR astro-ph.GA

    The population synthesis of Wolf-Rayet stars involving binary merger channels

    Authors: Zhuowen Li, Chunhua Zhu, Guoliang Lü, Lin Li, Helei Liu, Sufen Guo, Jinlong Yu, Xizhen Lu

    Abstract: Wolf-Rayet stars (WRs) are very important massive stars. However, their origin and the observed binary fraction within the entire WR population are still debated. We investigate some possible merger channels for the formation of WRs, including main sequence (MS)/ Hertzsprung Gap (HG) + MS, He + HG/ Giant Branch (GB). We find that many products produced via binary merger can evolve into WRs, the MS…

    Submitted 19 May, 2024; originally announced May 2024.

    Comments: 14 pages, 8 figures, accepted to ApJ