Showing 201–250 of 4,505 results for author: Chen, W

  1. Single-Atom Verification of the Optimal Trade-Off between Speed and Cost in Shortcuts to Adiabaticity

    Authors: J. -W. Zhang, J. -T. Bu, J. C. Li, Weiquan Meng, W. -Q. Ding, B. Wang, W. -F. Yuan, H. -J. Du, G. -Y. Ding, W. -J. Chen, L. Chen, F. Zhou, Zhenyu Xu, M. Feng

    Abstract: The approach of shortcuts to adiabaticity enables the effective execution of adiabatic dynamics in quantum information processing with enhanced speed. Owing to the inherent trade-off between dynamical speed and the cost associated with the transitionless driving field, executing arbitrarily fast operations becomes impractical. To understand the accurate interplay between speed and energetic cost i… ▽ More

    Submitted 6 June, 2024; v1 submitted 24 April, 2024; originally announced April 2024.

    Comments: 6+5 pages, 3+3 figures

    Journal ref: Phys. Rev. Lett. 132, 213602 (2024)

  2. arXiv:2404.15805  [pdf, other]

    q-bio.BM cs.LG

    Beyond ESM2: Graph-Enhanced Protein Sequence Modeling with Efficient Clustering

    Authors: Shujian Jiao, Bingxuan Li, Lei Wang, Xiao** Zhang, Wei Chen, Jiajie Peng, Zhongyu Wei

    Abstract: Proteins are essential to life's processes, underpinning evolution and diversity. Advances in sequencing technology have revealed millions of proteins, underscoring the need for sophisticated pre-trained protein models for biological analysis and AI development. Facebook's ESM2, the most advanced protein language model to date, leverages a masked prediction task for unsupervised learning, crafting… ▽ More

    Submitted 24 April, 2024; originally announced April 2024.

  3. arXiv:2404.15594  [pdf, ps, other]

    math.CO math.DG math.SP

    Curvature, diameter and signs of graphs

    Authors: Wei Chen, Shiping Liu

    Abstract: We prove a Li-Yau type eigenvalue-diameter estimate for signed graphs. That is, the nonzero eigenvalues of the Laplacian of a non-negatively curved signed graph are lower bounded by $1/D^2$ up to a constant, where $D$ stands for the diameter. This leads to several interesting applications, including a volume estimate for non-negatively curved signed graphs in terms of frustration index and diamete… ▽ More

    Submitted 23 April, 2024; originally announced April 2024.

    Comments: 28 pages, 2 figures. All comments are welcome

  4. arXiv:2404.15449  [pdf, other]

    cs.CV cs.AI

    ID-Aligner: Enhancing Identity-Preserving Text-to-Image Generation with Reward Feedback Learning

    Authors: Weifeng Chen, Jiacheng Zhang, Jie Wu, Hefeng Wu, Xuefeng Xiao, Liang Lin

    Abstract: The rapid development of diffusion models has triggered diverse applications. Identity-preserving text-to-image generation (ID-T2I) particularly has received significant attention due to its wide range of application scenarios like AI portrait and advertising. While existing ID-T2I methods have demonstrated impressive results, several key challenges remain: (1) It is hard to maintain the identity… ▽ More

    Submitted 23 April, 2024; originally announced April 2024.

  5. arXiv:2404.15380  [pdf, other]

    cs.LG cs.AI

    ControlTraj: Controllable Trajectory Generation with Topology-Constrained Diffusion Model

    Authors: Yuanshao Zhu, James Jianqiao Yu, Xiangyu Zhao, Qidong Liu, Yongchao Ye, Wei Chen, Zijian Zhang, Xuetao Wei, Yuxuan Liang

    Abstract: Generating trajectory data is among promising solutions to addressing privacy concerns, collection costs, and proprietary restrictions usually associated with human mobility analyses. However, existing trajectory generation methods are still in their infancy due to the inherent diversity and unpredictability of human activities, grappling with issues such as fidelity, flexibility, and generalizabi… ▽ More

    Submitted 23 April, 2024; originally announced April 2024.

  6. arXiv:2404.15207  [pdf, other]

    cs.CE cond-mat.mtrl-sci cs.LG stat.AP

    Simulation-Free Determination of Microstructure Representative Volume Element Size via Fisher Scores

    Authors: Wei Liu, Satyajit Mojumder, Wing Kam Liu, Wei Chen, Daniel W. Apley

    Abstract: A representative volume element (RVE) is a reasonably small unit of microstructure that can be simulated to obtain the same effective properties as the entire microstructure sample. Finite element (FE) simulation of RVEs, as opposed to much larger samples, saves computational expense, especially in multiscale modeling. Therefore, it is desirable to have a framework that determines RVE size prior t… ▽ More

    Submitted 7 April, 2024; originally announced April 2024.

    Journal ref: APL Mach. Learn. 2(2): 026101 (2024)

  7. arXiv:2404.14219  [pdf, other]

    cs.CL cs.AI

    Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone

    Authors: Marah Abdin, Sam Ade Jacobs, Ammar Ahmad Awan, Jyoti Aneja, Ahmed Awadallah, Hany Awadalla, Nguyen Bach, Amit Bahree, Arash Bakhtiari, Jianmin Bao, Harkirat Behl, Alon Benhaim, Misha Bilenko, Johan Bjorck, Sébastien Bubeck, Qin Cai, Martin Cai, Caio César Teodoro Mendes, Weizhu Chen, Vishrav Chaudhary, Dong Chen, Dongdong Chen, Yen-Chun Chen, Yi-Ling Chen, Parul Chopra, et al. (90 additional authors not shown)

    Abstract: We introduce phi-3-mini, a 3.8 billion parameter language model trained on 3.3 trillion tokens, whose overall performance, as measured by both academic benchmarks and internal testing, rivals that of models such as Mixtral 8x7B and GPT-3.5 (e.g., phi-3-mini achieves 69% on MMLU and 8.38 on MT-bench), despite being small enough to be deployed on a phone. The innovation lies entirely in our dataset… ▽ More

    Submitted 23 May, 2024; v1 submitted 22 April, 2024; originally announced April 2024.

    Comments: 19 pages

  8. arXiv:2404.14073  [pdf, other]

    cs.LG cs.AI

    Towards Robust Trajectory Representations: Isolating Environmental Confounders with Causal Learning

    Authors: Kang Luo, Yuanshao Zhu, Wei Chen, Kun Wang, Zhengyang Zhou, Sijie Ruan, Yuxuan Liang

    Abstract: Trajectory modeling refers to characterizing human movement behavior, serving as a pivotal step in understanding mobility patterns. Nevertheless, existing studies typically ignore the confounding effects of geospatial context, leading to the acquisition of spurious correlations and limited generalization capabilities. To bridge this gap, we initially formulate a Structural Causal Model (SCM) to de… ▽ More

    Submitted 22 April, 2024; originally announced April 2024.

    Comments: The paper has been accepted by IJCAI 2024

  9. arXiv:2404.12782  [pdf, other]

    cs.CV cs.AI

    Sentiment-oriented Transformer-based Variational Autoencoder Network for Live Video Commenting

    Authors: Fengyi Fu, Shancheng Fang, Weidong Chen, Zhendong Mao

    Abstract: Automatic live video commenting is with increasing attention due to its significance in narration generation, topic explanation, etc. However, the diverse sentiment consideration of the generated comments is missing from the current methods. Sentimental factors are critical in interactive commenting, and lack of research so far. Thus, in this paper, we propose a Sentiment-oriented Transformer-base… ▽ More

    Submitted 19 April, 2024; originally announced April 2024.

    Comments: 27 pages, 10 figures, ACM Transactions on Multimedia Computing, Communications and Applications, 2024

  10. arXiv:2404.12759  [pdf, other]

    cs.LG

    decoupleQ: Towards 2-bit Post-Training Uniform Quantization via decoupling Parameters into Integer and Floating Points

    Authors: Yi Guo, Fanliu Kong, Xiaoyang Li, Hui Li, Wei Chen, Xiaogang Tian, **** Cai, Yang Zhang, Shouda Liu

    Abstract: Quantization emerges as one of the most promising compression technologies for deploying efficient large models for various real time application in recent years. Considering that the storage and IO of weights take up the vast majority of the overhead inside a large model, weight only quantization can lead to large gains. However, existing quantization schemes suffer from significant accuracy degr… ▽ More

    Submitted 19 April, 2024; originally announced April 2024.

    Comments: quantization for deep models

  11. arXiv:2404.12445  [pdf]

    cs.LG cs.CE physics.chem-ph

    Adaptive Catalyst Discovery Using Multicriteria Bayesian Optimization with Representation Learning

    Authors: Jie Chen, Pengfei Ou, Yuxin Chang, Hengrui Zhang, Xiao-Yan Li, Edward H. Sargent, Wei Chen

    Abstract: High-performance catalysts are crucial for sustainable energy conversion and human health. However, the discovery of catalysts faces challenges due to the absence of efficient approaches to navigating vast and high-dimensional structure and composition spaces. In this study, we propose a high-throughput computational catalyst screening approach integrating density functional theory (DFT) and Bayes… ▽ More

    Submitted 18 April, 2024; originally announced April 2024.

  12. arXiv:2404.11943  [pdf, other]

    cs.HC

    AgentCoord: Visually Exploring Coordination Strategy for LLM-based Multi-Agent Collaboration

    Authors: Bo Pan, Jiaying Lu, Ke Wang, Li Zheng, Zhen Wen, Yingchaojie Feng, Minfeng Zhu, Wei Chen

    Abstract: The potential of automatic task-solving through Large Language Model (LLM)-based multi-agent collaboration has recently garnered widespread attention from both the research community and industry. While utilizing natural language to coordinate multiple agents presents a promising avenue for democratizing agent technology for general users, designing coordination strategies remains challenging with… ▽ More

    Submitted 18 April, 2024; originally announced April 2024.

  13. arXiv:2404.11881  [pdf, other]

    cs.IT eess.SP

    Joint Transmitter and Receiver Design for Movable Antenna Enhanced Multicast Communications

    Authors: Ying Gao, Qingqing Wu, Wen Chen

    Abstract: Movable antenna (MA) is an emerging technology that utilizes localized antenna movement to pursue better channel conditions for enhancing communication performance. In this paper, we study the MA-enhanced multicast transmission from a base station equipped with multiple MAs to multiple groups of single-MA users. Our goal is to maximize the minimum weighted signal-to-interference-plus-noise ratio (… ▽ More

    Submitted 9 May, 2024; v1 submitted 17 April, 2024; originally announced April 2024.

    Comments: 13 pages, 9 figures, submitted to IEEE journal for possible publication

  14. arXiv:2404.11858  [pdf, other]

    eess.SP

    Graph Neural Networks for Wireless Networks: Graph Representation, Architecture and Evaluation

    Authors: Yang Lu, Yuhang Li, Ruichen Zhang, Wei Chen, Bo Ai, Dusit Niyato

    Abstract: Graph neural networks (GNNs) have been regarded as the basic model to facilitate deep learning (DL) to revolutionize resource allocation in wireless networks. GNN-based models are shown to be able to learn the structural information about graphs representing the wireless networks to adapt to the time-varying channel state information and dynamics of network topology. This article aims to provide a… ▽ More

    Submitted 17 April, 2024; originally announced April 2024.

  15. arXiv:2404.11797  [pdf, other]

    cs.CV cs.AI cs.LG

    When are Foundation Models Effective? Understanding the Suitability for Pixel-Level Classification Using Multispectral Imagery

    Authors: Yiqun Xie, Zhihao Wang, Weiye Chen, Zhili Li, Xiaowei Jia, Yanhua Li, Ruichen Wang, Kangyang Chai, Ruohan Li, Sergii Skakun

    Abstract: Foundation models, i.e., very large deep learning models, have demonstrated impressive performances in various language and vision tasks that are otherwise difficult to reach using smaller-size models. The major success of GPT-type of language models is particularly exciting and raises expectations on the potential of foundation models in other domains including satellite remote sensing. In this c… ▽ More

    Submitted 17 April, 2024; originally announced April 2024.

  16. arXiv:2404.11459  [pdf, other]

    cs.CL cs.CV

    Octopus v3: Technical Report for On-device Sub-billion Multimodal AI Agent

    Authors: Wei Chen, Zhiyuan Li

    Abstract: A multimodal AI agent is characterized by its ability to process and learn from various types of data, including natural language, visual, and audio inputs, to inform its actions. Despite advancements in large language models that incorporate visual data, such as GPT-4V, effectively translating image-based data into actionable outcomes for AI agents continues to be challenging. In this paper, we i… ▽ More

    Submitted 18 April, 2024; v1 submitted 17 April, 2024; originally announced April 2024.

  17. arXiv:2404.11228  [pdf, other]

    physics.bio-ph q-bio.QM

    Density estimation for ordinal biological sequences and its applications

    Authors: Wei-Chia Chen, Juannan Zhou, David M. McCandlish

    Abstract: Biological sequences do not come at random. Instead, they appear with particular frequencies that reflect properties of the associated system or phenomenon. Knowing how biological sequences are distributed in sequence space is thus a natural first step toward understanding the underlying mechanisms. Here we propose a new method for inferring the probability distribution from which a sample of biol… ▽ More

    Submitted 17 April, 2024; originally announced April 2024.

  18. arXiv:2404.11001  [pdf]

    cond-mat.supr-con

    Modulation of the Octahedral Structure and Potential Superconductivity of La3Ni2O7 at Ambient Pressure by Compressive Strain

    Authors: Zihao Huo, Peng Zhang, Aiqin Yang, Zhengtao Liu, Xiangru Tao, Zihan Zhang, Qiwen Jiang, Wenxuan Chen, Defang Duan, Tian Cui

    Abstract: Superconductivity at Tc = 80 K has recently been reported above 14 GPa in La3Ni2O7, which thus introduces a new family of high-temperature superconductors. Using a first-principles calculation with Coulomb repulsion, we unveil a surprising new route to obtain superconductivity in La3Ni2O7 at ambient pressure by introducing compressive strain along the [001] direction. The shape of the NiO6 octahed… ▽ More

    Submitted 16 April, 2024; originally announced April 2024.

  19. arXiv:2404.10331  [pdf, ps, other]

    math.CO

    Increasing Binary Trees and the $(α,β)$-Eulerian Polynomials

    Authors: William Y. C. Chen, Amy M. Fu

    Abstract: In light of the grammar given by Ji for the $(α,β)$-Eulerian polynomials introduced by Carlitz and Scoville, we provide a labeling scheme for increasing binary trees. In this setting, we obtain a combinatorial interpretation of the $γ$-coefficients of the $α$-Eulerian polynomials in terms of forests of planted 0-1-2-plane trees, which specializes to a combinatorial interpretation of the $γ$-coeffi… ▽ More

    Submitted 16 April, 2024; originally announced April 2024.

    Comments: 15 pages, 7 figures

    MSC Class: 05A15; 05A19

  20. arXiv:2404.10296  [pdf, other]

    cs.LG cs.AI cs.NE

    Engineering software 2.0 by interpolating neural networks: unifying training, solving, and calibration

    Authors: Chanwook Park, Sourav Saha, Jiachen Guo, Xiaoyu Xie, Satyajit Mojumder, Miguel A. Bessa, Dong Qian, Wei Chen, Gregory J. Wagner, Jian Cao, Wing Kam Liu

    Abstract: The evolution of artificial intelligence (AI) and neural network theories has revolutionized the way software is programmed, shifting from a hard-coded series of codes to a vast neural network. However, this transition in engineering software has faced challenges such as data scarcity, multi-modality of data, low model accuracy, and slow inference. Here, we propose a new network based on interpola… ▽ More

    Submitted 22 April, 2024; v1 submitted 16 April, 2024; originally announced April 2024.

    Comments: 9 pages, 3 figures

  21. arXiv:2404.09595  [pdf, ps, other]

    eess.SP cs.AI

    Building Semantic Communication System via Molecules: An End-to-End Training Approach

    Authors: Yukun Cheng, Wei Chen, Bo Ai

    Abstract: The concept of semantic communication provides a novel approach for applications in scenarios with limited communication resources. In this paper, we propose an end-to-end (E2E) semantic molecular communication system, aiming to enhance the efficiency of molecular communication systems by reducing the transmitted information. Specifically, following the joint source channel coding paradigm, the ne… ▽ More

    Submitted 15 April, 2024; originally announced April 2024.

  22. arXiv:2404.09512  [pdf, other]

    cs.CV

    Magic Clothing: Controllable Garment-Driven Image Synthesis

    Authors: Weifeng Chen, Tao Gu, Yuhao Xu, Chengcai Chen

    Abstract: We propose Magic Clothing, a latent diffusion model (LDM)-based network architecture for an unexplored garment-driven image synthesis task. Aiming at generating customized characters wearing the target garments with diverse text prompts, the image controllability is the most critical issue, i.e., to preserve the garment details and maintain faithfulness to the text prompts. To this end, we introdu… ▽ More

    Submitted 15 April, 2024; originally announced April 2024.

  23. arXiv:2404.09409  [pdf, other]

    math.PR math-ph

    Disorder Chaos in Short-Range, Diluted, and Lévy Spin Glasses

    Authors: Wei-Kuo Chen, Heejune Kim, Arnab Sen

    Abstract: In a recent breakthrough [arXiv:2301.04112], Chatterjee proved site disorder chaos in the Edwards-Anderson (EA) short-range spin glass model utilizing the Hermite spectral method. In this paper, we demonstrate the further usefulness of this Hermite spectral approach by extending the validity of site disorder chaos in three related spin glass models. The first, called the mixed even $p$-spin shor… ▽ More

    Submitted 13 June, 2024; v1 submitted 14 April, 2024; originally announced April 2024.

    Comments: A few paragraphs in the introduction revised for clarity. 24 pages, 3 Figures

    MSC Class: 60K35; 82B44

  24. arXiv:2404.09292  [pdf, other]

    cs.CV cs.AI

    Bridging Data Islands: Geographic Heterogeneity-Aware Federated Learning for Collaborative Remote Sensing Semantic Segmentation

    Authors: Jieyi Tan, Yansheng Li, Sergey A. Bartalev, Bo Dang, Wei Chen, Yongjun Zhang, Liangqi Yuan

    Abstract: Remote sensing semantic segmentation (RSS) is an essential task in Earth Observation missions. Due to data privacy concerns, high-quality remote sensing images with annotations cannot be well shared among institutions, making it difficult to fully utilize RSS data to train a generalized model. Federated Learning (FL), a privacy-preserving collaborative learning technology, is a potential solution.… ▽ More

    Submitted 14 April, 2024; originally announced April 2024.

    Comments: 13 pages, 9 figures, 4 tables

  25. arXiv:2404.09199  [pdf]

    cond-mat.supr-con

    Observation of the Josephson effect in superhydrides: DC SQUID based on (La,Ce)H$_{10}$ with operating temperature of 179 K

    Authors: Dmitrii V. Semenok, Ivan A. Troyan, Di Zhou, Wuhao Chen, Ho-kwang Mao, Viktor V. Struzhkin

    Abstract: Among known materials, hydride superconductors have the highest critical temperatures and are very promising as a basis for electronic sensors. Superconducting quantum interference device (SQUID), due to its unique sensitivity to magnetic fields, is the most important application of superconductors in microelectronics. In this work, we describe a direct current SQUID made of lanthanum-cerium super… ▽ More

    Submitted 14 April, 2024; originally announced April 2024.

    Comments: Raw data: https://github.com/mark6871/LaCeH10/tree/SQUID. Video: https://www.youtube.com/watch?v=x5Yh220RvJQ&t=71s

  26. arXiv:2404.09187  [pdf]

    physics.acc-ph

    Stable Acceleration of a LHe-Free Nb3Sn demo SRF e-linac Based on Conduction Cooling

    Authors: Ziqin Yang, Yuan He, Tiancai Jiang, Feng Bai, Fengfeng Wang, Weilong Chen, Guangze Jiang, Yimeng Chu, Hangxu Li, Bo Zhao, Guozhen Sun, Zongheng Xue, Yugang Zhao, Zheng Gao, Yaguang Li, **ran Xiong, Hao Guo, Liepeng Sun, Guirong Huang, Zhijun Wang, Junhui Zhang, Teng Tan, Hongwei Zhao, Wenlong Zhan

    Abstract: The design, construction, and commissioning of a conduction-cooled Nb3Sn demonstration superconducting radio frequency (SRF) electron accelerator at the Institute of Modern Physics of the Chinese Academy of Sciences (IMP, CAS) will be presented. In the context of engineering application planning for Nb3Sn thin-film SRF cavities within the CiADS project, a 650MHz 5-cell elliptical cavity was coated… ▽ More

    Submitted 14 April, 2024; originally announced April 2024.

  27. arXiv:2404.08793  [pdf, other]

    cs.CR cs.CL cs.HC

    JailbreakLens: Visual Analysis of Jailbreak Attacks Against Large Language Models

    Authors: Yingchaojie Feng, Zhizhang Chen, Zhining Kang, Sijia Wang, Minfeng Zhu, Wei Zhang, Wei Chen

    Abstract: The proliferation of large language models (LLMs) has underscored concerns regarding their security vulnerabilities, notably against jailbreak attacks, where adversaries design jailbreak prompts to circumvent safety mechanisms for potential misuse. Addressing these concerns necessitates a comprehensive analysis of jailbreak prompts to evaluate LLMs' defensive capabilities and identify potential we… ▽ More

    Submitted 12 April, 2024; originally announced April 2024.

    Comments: Submitted to VIS 2024

  28. arXiv:2404.08571  [pdf, other]

    astro-ph.CO

    Flashlights: Microlensing vs Stellar Variability of Transients in the Star Clusters of the Dragon Arc

    Authors: Sung Kei Li, Patrick L. Kelly, Jose M. Diego, Jeremy Lim, WenLei Chen, Amruth Alfred, Liliya L. R. Williams, Thomas J. Broadhurst, Ashish. K. Meena, Adi Zitrin, Alex Chow

    Abstract: We study the nature of transient events detected in the "Dragon Arc", a star-forming galaxy at a redshift of $0.7251$ that is gravitationally lensed by the galaxy cluster Abell 370. In particular, we focus on a subset of ten transients that are identified as unresolved young star clusters in the deep broadband, F200LP, taken as part of the "Flashlights" Hubble Space Telescope program, showing flux… ▽ More

    Submitted 12 April, 2024; originally announced April 2024.

    Comments: 28 pages, 14 figures. To be submitted, comments welcomed

  29. arXiv:2404.08559  [pdf, other]

    cs.CL

    MoPE: Mixture of Prefix Experts for Zero-Shot Dialogue State Tracking

    Authors: Tianwen Tang, Tong Zhu, Haodong Liu, Yin Bai, Jia Cheng, Wenliang Chen

    Abstract: Zero-shot dialogue state tracking (DST) transfers knowledge to unseen domains, reducing the cost of annotating new datasets. Previous zero-shot DST models mainly suffer from domain transferring and partial prediction problems. To address these challenges, we propose Mixture of Prefix Experts (MoPE) to establish connections between similar slots in different domains, which strengthens the model tra… ▽ More

    Submitted 12 April, 2024; originally announced April 2024.

    Comments: Accepted to LREC-COLING 2024

  30. arXiv:2404.08324  [pdf, other]

    cs.DC

    Communication-Efficient Model Aggregation with Layer Divergence Feedback in Federated Learning

    Authors: Liwei Wang, Jun Li, Wen Chen, Qingqing Wu, Ming Ding

    Abstract: Federated Learning (FL) facilitates collaborative machine learning by training models on local datasets, and subsequently aggregating these local models at a central server. However, the frequent exchange of model parameters between clients and the central server can result in significant communication overhead during the FL training process. To solve this problem, this paper proposes a novel FL f… ▽ More

    Submitted 12 April, 2024; originally announced April 2024.

  31. arXiv:2404.08045  [pdf, other]

    astro-ph.GA astro-ph.CO

    JWST Discovery of $40+$ Microlensed Stars in a Magnified Galaxy, the "Dragon" behind Abell 370

    Authors: Yoshinobu Fudamoto, Fengwu Sun, Jose M. Diego, Liang Dai, Masamune Oguri, Adi Zitrin, Erik Zackrisson, Mathilde Jauzac, David J. Lagattuta, Eiichi Egami, Edoardo Iani, Rogier A. Windhorst, Katsuya T. Abe, Franz Erik Bauer, Fuyan Bian, Rachana Bhatawdekar, Thomas J. Broadhurst, Zheng Cai, Chian-Chou Chen, Wenlei Chen, Seth H. Cohen, Christopher J. Conselice, Daniel Espada, Nicholas Foo, Brenda L. Frye, et al. (21 additional authors not shown)

    Abstract: Strong gravitational magnification by massive galaxy clusters enable us to detect faint background sources, resolve their detailed internal structures, and in the most extreme cases identify and study individual stars in distant galaxies. Highly magnified individual stars allow for a wide range of applications, including studies of stellar populations in distant galaxies and constraining small-sca… ▽ More

    Submitted 11 April, 2024; originally announced April 2024.

    Comments: 15 pages, 4 figures, 1 table submitted to Nature Astronomy

  32. arXiv:2404.07965  [pdf, other]

    cs.CL cs.AI

    Rho-1: Not All Tokens Are What You Need

    Authors: Zhenghao Lin, Zhibin Gou, Yeyun Gong, Xiao Liu, Yelong Shen, Ruochen Xu, Chen Lin, Yujiu Yang, Jian Jiao, Nan Duan, Weizhu Chen

    Abstract: Previous language model pre-training methods have uniformly applied a next-token prediction loss to all training tokens. Challenging this norm, we posit that ''Not all tokens in a corpus are equally important for language model training''. Our initial analysis examines token-level training dynamics of language model, revealing distinct loss patterns for different tokens. Leveraging these insights,… ▽ More

    Submitted 23 May, 2024; v1 submitted 11 April, 2024; originally announced April 2024.

    Comments: First two authors equal contribution

  33. arXiv:2404.07919  [pdf, other]

    cs.LG cs.AI

    Low-rank Adaptation for Spatio-Temporal Forecasting

    Authors: Weilin Ruan, Wei Chen, Xilin Dang, Jianxiang Zhou, Weichuang Li, Xu Liu, Yuxuan Liang

    Abstract: Spatio-temporal forecasting is crucial in real-world dynamic systems, predicting future changes using historical data from diverse locations. Existing methods often prioritize the development of intricate neural networks to capture the complex dependencies of the data, yet their accuracy fails to show sustained improvement. Besides, these methods also overlook node heterogeneity, hindering customi… ▽ More

    Submitted 11 April, 2024; originally announced April 2024.

  34. arXiv:2404.07448  [pdf, other]

    cs.CV cs.CL eess.IV

    Transferable and Principled Efficiency for Open-Vocabulary Segmentation

    Authors: Jingxuan Xu, Wuyang Chen, Yao Zhao, Yunchao Wei

    Abstract: Recent success of pre-trained foundation vision-language models makes Open-Vocabulary Segmentation (OVS) possible. Despite the promising performance, this approach introduces heavy computational overheads for two challenges: 1) large model sizes of the backbone; 2) expensive costs during the fine-tuning. These challenges hinder this OVS strategy from being widely applicable and affordable in real-… ▽ More

    Submitted 3 June, 2024; v1 submitted 10 April, 2024; originally announced April 2024.

  35. arXiv:2404.06880  [pdf, ps, other]

    cs.IT eess.SP

    Joint Active And Passive IRS Aided Wireless Communication: Elements Allocation and Achievable Rate

    Authors: Chaoying Huang, Wen Chen, Qingqing Wu

    Abstract: Equipping reflecting elements at the active intelligent reflecting surface (AIRS) enhances signal amplification capability but meanwhile incurs non-negligible amplification noise, which thus challenges the determination of elements allocation for maximizing achievable rate in multi-cooperative AIRS and passive IRS (PIRS) jointly aided wireless communication system. To tackle this issue, we conside… ▽ More

    Submitted 10 April, 2024; v1 submitted 10 April, 2024; originally announced April 2024.

  36. arXiv:2404.06833  [pdf, other]

    cs.CL

    Does Mapo Tofu Contain Coffee? Probing LLMs for Food-related Cultural Knowledge

    Authors: Li Zhou, Taelin Karidi, Nicolas Garneau, Yong Cao, Wanlong Liu, Wenyu Chen, Daniel Hershcovich

    Abstract: Recent studies have highlighted the presence of cultural biases in Large Language Models (LLMs), yet often lack a robust methodology to dissect these phenomena comprehensively. Our work aims to bridge this gap by delving into the Food domain, a universally relevant yet culturally diverse aspect of human life. We introduce FmLAMA, a multilingual dataset centered on food-related cultural facts and v… ▽ More

    Submitted 10 April, 2024; originally announced April 2024.

    Comments: 20 pages, 8 figures

  37. arXiv:2404.06760  [pdf, other]

    cs.CL cs.AI

    DiffusionDialog: A Diffusion Model for Diverse Dialog Generation with Latent Space

    Authors: Jianxiang Xiang, Zhenhua Liu, Haodong Liu, Yin Bai, Jia Cheng, Wenliang Chen

    Abstract: In real-life conversations, the content is diverse, and there exists the one-to-many problem that requires diverse generation. Previous studies attempted to introduce discrete or Gaussian-based continuous latent variables to address the one-to-many problem, but the diversity is limited. Recently, diffusion models have made breakthroughs in computer vision, and some attempts have been made in natur… ▽ More

    Submitted 10 April, 2024; originally announced April 2024.

    Comments: LREC-COLING 2024 camera ready

  38. arXiv:2404.06710  [pdf, other]

    cs.CV cs.AI

    SpikeNVS: Enhancing Novel View Synthesis from Blurry Images via Spike Camera

    Authors: Gaole Dai, Zhenyu Wang, Qinwen Xu, Ming Lu, Wen Chen, Boxin Shi, Shanghang Zhang, Tiejun Huang

    Abstract: One of the most critical factors in achieving sharp Novel View Synthesis (NVS) using neural field methods like Neural Radiance Fields (NeRF) and 3D Gaussian Splatting (3DGS) is the quality of the training images. However, Conventional RGB cameras are susceptible to motion blur. In contrast, neuromorphic cameras like event and spike cameras inherently capture more comprehensive temporal information… ▽ More

    Submitted 12 April, 2024; v1 submitted 9 April, 2024; originally announced April 2024.

  39. arXiv:2404.06695  [pdf, other]

    eess.IV physics.med-ph

    Spiral Scanning and Self-Supervised Image Reconstruction Enable Ultra-Sparse Sampling Multispectral Photoacoustic Tomography

    Authors: Yutian Zhong, Xiaoming Zhang, Zongxin Mo, Shuangyang Zhang, Wufan Chen, Li Qi

    Abstract: Multispectral photoacoustic tomography (PAT) is an imaging modality that utilizes the photoacoustic effect to achieve non-invasive and high-contrast imaging of internal tissues. However, the hardware cost and computational demand of a multispectral PAT system consisting of up to thousands of detectors are huge. To address this challenge, we propose an ultra-sparse spiral sampling strategy for mult… ▽ More

    Submitted 9 April, 2024; originally announced April 2024.

  40. arXiv:2404.06393  [pdf, other]

    cs.SD cs.AI eess.AS

    MuPT: A Generative Symbolic Music Pretrained Transformer

    Authors: Xingwei Qu, Yuelin Bai, Yinghao Ma, Ziya Zhou, Ka Man Lo, Jiaheng Liu, Ruibin Yuan, Lejun Min, Xueling Liu, Tianyu Zhang, Xinrun Du, Shuyue Guo, Yiming Liang, Yizhi Li, Shangda Wu, Junting Zhou, Tianyu Zheng, Ziyang Ma, Fengze Han, Wei Xue, Gus Xia, Emmanouil Benetos, Xiang Yue, Chenghua Lin, Xu Tan, et al. (4 additional authors not shown)

    Abstract: In this paper, we explore the application of Large Language Models (LLMs) to the pre-training of music. While the prevalent use of MIDI in music modeling is well-established, our findings suggest that LLMs are inherently more compatible with ABC Notation, which aligns more closely with their design and strengths, thereby enhancing the model's performance in musical composition. To address the chal… ▽ More

    Submitted 10 April, 2024; v1 submitted 9 April, 2024; originally announced April 2024.

  41. arXiv:2404.06050  [pdf, other]

    cs.CV cs.RO

    Incremental Joint Learning of Depth, Pose and Implicit Scene Representation on Monocular Camera in Large-scale Scenes

    Authors: Tianchen Deng, Nailin Wang, Chongdi Wang, Shenghai Yuan, Jingchuan Wang, Danwei Wang, Weidong Chen

    Abstract: Dense scene reconstruction for photo-realistic view synthesis has various applications, such as VR/AR and autonomous vehicles. However, most existing methods have difficulties in large-scale scenes due to three core challenges: \textit{(a) inaccurate depth input.} Accurate depth input is impossible to get in real-world large-scale scenes. \textit{(b) inaccurate pose estimation.} Most existing approac…

    Submitted 9 April, 2024; originally announced April 2024.

  42. Cymatics Cup: Shape-Changing Drinks by Leveraging Cymatics

    Authors: Weijen Chen, Yang Yang, Kao-Hua Liu, Yun Suen Pai, Junichi Yamaoka, Kouta Minamizawa

    Abstract: To enhance the dining experience, prior studies in Human-Computer Interaction (HCI) and gastrophysics have demonstrated that modifying the static shape of solid foods can amplify taste perception. However, the exploration of dynamic shape-changing mechanisms in liquid foods remains largely untapped. In the present study, we employ cymatics, a scientific discipline focused on utilizing sound freque…

    Submitted 9 April, 2024; originally announced April 2024.

    Journal ref: Proceedings of the CHI Conference on Human Factors in Computing Systems (CHI '24), May 11--16, 2024, Honolulu, HI, USA

  43. arXiv:2404.05943  [pdf, other]

    cs.CL cs.AI

    Interplay of Machine Translation, Diacritics, and Diacritization

    Authors: Wei-Rui Chen, Ife Adebara, Muhammad Abdul-Mageed

    Abstract: We investigate two research questions: (1) how do machine translation (MT) and diacritization influence the performance of each other in a multi-task learning setting, and (2) what is the effect of keeping (vs. removing) diacritics on MT performance. We examine these two questions in both high-resource (HR) and low-resource (LR) settings across 55 different languages (36 African languages and 19 European langu…

    Submitted 8 April, 2024; originally announced April 2024.

    Comments: Accepted to NAACL 2024 Main Conference

  44. Observation of dichotomic field-tunable electronic structure in twisted monolayer-bilayer graphene

    Authors: Hongyun Zhang, Qian Li, Youngju Park, Yujin Jia, Wanying Chen, Jiaheng Li, Qinxin Liu, Changhua Bao, Nicolas Leconte, Shaohua Zhou, Yuan Wang, Kenji Watanabe, Takashi Taniguchi, Jose Avila, Pavel Dudin, Pu Yu, Hongming Weng, Wenhui Duan, Quansheng Wu, Jeil Jung, Shuyun Zhou

    Abstract: Twisted bilayer graphene (tBLG) provides a fascinating platform for engineering flat bands and inducing correlated phenomena. By designing the stacking architecture of graphene layers, twisted multilayer graphene can exhibit different symmetries with rich tunability. For example, in twisted monolayer-bilayer graphene (tMBG) which breaks the C2z symmetry, transport measurements reveal an asymmetric…

    Submitted 8 April, 2024; originally announced April 2024.

    Comments: 4 figures

    Journal ref: Nat Commun 15, 3737 (2024)

  45. arXiv:2404.05199  [pdf, other]

    eess.SP cs.IT

    Decision Transformer for Wireless Communications: A New Paradigm of Resource Management

    Authors: Jie Zhang, Jun Li, Long Shi, Zhe Wang, Shi Jin, Wen Chen, H. Vincent Poor

    Abstract: As the next generation of mobile systems evolves, artificial intelligence (AI) is expected to deeply integrate with wireless communications for resource management in variable environments. In particular, deep reinforcement learning (DRL) is an important tool for addressing stochastic optimization issues of resource allocation. However, DRL has to start each new training process from the beginning…

    Submitted 8 April, 2024; originally announced April 2024.

  46. arXiv:2404.05086  [pdf, ps, other]

    cs.LG cs.AI cs.CL

    A Note on LoRA

    Authors: Vlad Fomenko, Han Yu, Jongho Lee, Stanley Hsieh, Weizhu Chen

    Abstract: LoRA (Low-Rank Adaptation) has emerged as a preferred method for efficiently adapting Large Language Models (LLMs) with remarkable simplicity and efficacy. This note extends the original LoRA paper by offering new perspectives that were not initially discussed and presents a series of insights for deploying LoRA at scale. Without introducing new experiments, we aim to improve the understanding and…

    Submitted 7 April, 2024; originally announced April 2024.

  47. arXiv:2404.04167  [pdf, other]

    cs.CL cs.AI

    Chinese Tiny LLM: Pretraining a Chinese-Centric Large Language Model

    Authors: Xinrun Du, Zhouliang Yu, Songyang Gao, Ding Pan, Yuyang Cheng, Ziyang Ma, Ruibin Yuan, Xingwei Qu, Jiaheng Liu, Tianyu Zheng, Xinchen Luo, Guorui Zhou, Binhang Yuan, Wenhu Chen, Jie Fu, Ge Zhang

    Abstract: In this study, we introduce CT-LLM, a 2B large language model (LLM) that illustrates a pivotal shift towards prioritizing the Chinese language in developing LLMs. Uniquely initiated from scratch, CT-LLM diverges from the conventional methodology by primarily incorporating Chinese textual data, utilizing an extensive corpus of 1,200 billion tokens, including 800 billion Chinese tokens, 300 billion…

    Submitted 9 April, 2024; v1 submitted 5 April, 2024; originally announced April 2024.

  48. arXiv:2404.03882  [pdf, ps, other]

    astro-ph.HE astro-ph.SR

    Evolutionary Origin of Ultra-long Period Radio Transients

    Authors: Yun-Ning Fan, Kun Xu, Wen-Cong Chen

    Abstract: Recently, two ultra-long period radio transients, GLEAM-X J162759.5-523504.3 (J1627) and GPM J1839$-$10 (J1839), were discovered with spin periods longer than 1000 s. The origin of these two ultra-long period radio transients is intriguing in understanding the spin evolution of neutron stars (NSs). In this work, we diagnose whether the interaction between strongly magnetized NSs and fallback disks can s…

    Submitted 5 April, 2024; originally announced April 2024.

    Comments: 12 pages, 10 figures, ApJ in press

  49. arXiv:2404.03554  [pdf, other]

    cs.MA

    No Panacea in Planning: Algorithm Selection for Suboptimal Multi-Agent Path Finding

    Authors: Weizhe Chen, Zhihan Wang, Jiaoyang Li, Sven Koenig, Bistra Dilkina

    Abstract: Since more and more algorithms are proposed for multi-agent path finding (MAPF) and each of them has its strengths, choosing the correct one for a specific scenario that fulfills some specified requirements is an important task. Previous research in algorithm selection for MAPF built a standard workflow and showed that machine learning can help. In this paper, we study general solvers for MAPF, wh…

    Submitted 4 April, 2024; originally announced April 2024.

  50. arXiv:2404.03543  [pdf, other]

    cs.SE cs.AI cs.CL cs.LG

    CodeEditorBench: Evaluating Code Editing Capability of Large Language Models

    Authors: Jiawei Guo, Ziming Li, Xueling Liu, Kaijing Ma, Tianyu Zheng, Zhouliang Yu, Ding Pan, Yizhi Li, Ruibo Liu, Yue Wang, Shuyue Guo, Xingwei Qu, Xiang Yue, Ge Zhang, Wenhu Chen, Jie Fu

    Abstract: Large Language Models (LLMs) for code are rapidly evolving, with code editing emerging as a critical capability. We introduce CodeEditorBench, an evaluation framework designed to rigorously assess the performance of LLMs in code editing tasks, including debugging, translating, polishing, and requirement switching. Unlike existing benchmarks focusing solely on code generation, CodeEditorBench empha…

    Submitted 6 April, 2024; v1 submitted 4 April, 2024; originally announced April 2024.