
Showing 151–200 of 4,086 results for author: Zhu, Y

  1. arXiv:2405.05254  [pdf, other]

    cs.CL

    You Only Cache Once: Decoder-Decoder Architectures for Language Models

    Authors: Yutao Sun, Li Dong, Yi Zhu, Shaohan Huang, Wenhui Wang, Shuming Ma, Quanlu Zhang, Jianyong Wang, Furu Wei

    Abstract: We introduce a decoder-decoder architecture, YOCO, for large language models, which only caches key-value pairs once. It consists of two components, i.e., a cross-decoder stacked upon a self-decoder. The self-decoder efficiently encodes global key-value (KV) caches that are reused by the cross-decoder via cross-attention. The overall model behaves like a decoder-only Transformer, although YOCO onl… ▽ More

    Submitted 9 May, 2024; v1 submitted 8 May, 2024; originally announced May 2024.

  2. arXiv:2405.04884  [pdf, other]

    quant-ph

    Cost of Locally Approximating High-Dimensional Ground States of Contextual Quantum Models

    Authors: Kaiyan Yang, Yanzheng Zhu, Xiao Zeng, Zuoheng Zou, Man-Hong Yung, Zizhu Wang

    Abstract: Contextuality, one of the strongest forms of quantum correlations, delineates the quantum world and the classical one. It has been shown recently that some quantum models, in the form of infinite one-dimensional translation-invariant Hamiltonians with nearest- and next-to-nearest-neighbor interactions, have the lowest ground state energy density allowed in quantum physics. However, these models al… ▽ More

    Submitted 8 May, 2024; originally announced May 2024.

    Comments: 11 pages, 6 figures, including one figure spanning two pages

  3. arXiv:2405.04867  [pdf, other]

    eess.IV cs.CV

    MIPI 2024 Challenge on Demosaic for HybridEVS Camera: Methods and Results

    Authors: Yaqi Wu, Zhihao Fan, Xiaofeng Chu, Jimmy S. Ren, Xiaoming Li, Zongsheng Yue, Chongyi Li, Shangcheng Zhou, Ruicheng Feng, Yuekun Dai, Peiqing Yang, Chen Change Loy, Senyan Xu, Zhijing Sun, Jiaying Zhu, Yurui Zhu, Xueyang Fu, Zheng-Jun Zha, Jun Cao, Cheng Li, Shu Chen, Liang Ma, Shiyang Zhou, Haijin Zeng, Kai Feng , et al. (24 additional authors not shown)

    Abstract: The increasing demand for computational photography and imaging on mobile platforms has led to the widespread development and integration of advanced image sensors with novel algorithms in camera systems. However, the scarcity of high-quality data for research and the rare opportunity for in-depth exchange of views from industry and academia constrain the development of mobile intelligent photogra… ▽ More

    Submitted 8 May, 2024; originally announced May 2024.

    Comments: MIPI@CVPR2024. Website: https://mipi-challenge.org/MIPI2024/

  4. arXiv:2405.04536  [pdf, other]

    cs.CV cs.AI cs.LG

    When Training-Free NAS Meets Vision Transformer: A Neural Tangent Kernel Perspective

    Authors: Qiqi Zhou, Yichen Zhu

    Abstract: This paper investigates the Neural Tangent Kernel (NTK) to search vision transformers without training. In contrast with the previous observation that NTK-based metrics can effectively predict CNNs performance at initialization, we empirically show their inefficacy in the ViT search space. We hypothesize that the fundamental feature learning preference within ViT contributes to the ineffectiveness… ▽ More

    Submitted 15 March, 2024; originally announced May 2024.

    Comments: ICASSP2024 oral

  5. arXiv:2405.04434  [pdf, other]

    cs.CL cs.AI

    DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model

    Authors: DeepSeek-AI, Aixin Liu, Bei Feng, Bin Wang, Bingxuan Wang, Bo Liu, Chenggang Zhao, Chengqi Dengr, Chong Ruan, Damai Dai, Daya Guo, Dejian Yang, Deli Chen, Dongjie Ji, Erhang Li, Fangyun Lin, Fuli Luo, Guangbo Hao, Guanting Chen, Guowei Li, H. Zhang, Hanwei Xu, Hao Yang, Haowei Zhang, Honghui Ding , et al. (132 additional authors not shown)

    Abstract: We present DeepSeek-V2, a strong Mixture-of-Experts (MoE) language model characterized by economical training and efficient inference. It comprises 236B total parameters, of which 21B are activated for each token, and supports a context length of 128K tokens. DeepSeek-V2 adopts innovative architectures including Multi-head Latent Attention (MLA) and DeepSeekMoE. MLA guarantees efficient inference… ▽ More

    Submitted 19 June, 2024; v1 submitted 7 May, 2024; originally announced May 2024.

  6. arXiv:2405.03804  [pdf, other]

    quant-ph

    EPOC: A Novel Pulse Generation Framework Incorporating Advanced Synthesis Techniques for Quantum Circuits

    Authors: Jinglei Cheng, Yuchen Zhu, Yidong Zhou, Hang Ren, Zhixin Song, Zhiding Liang

    Abstract: In this paper we propose EPOC, an efficient pulse generation framework for quantum circuits that combines ZX-Calculus, circuit partitioning, and circuit synthesis to accelerate pulse generation. Unlike previous works that focus on generating pulses from unitary matrices without exploring equivalent representations, EPOC employs a finer granularity approach by grouping quantum gates and decomposing… ▽ More

    Submitted 6 May, 2024; originally announced May 2024.

  7. arXiv:2405.03712  [pdf, other]

    cs.LG cs.AI cs.CR cs.NE

    Your Network May Need to Be Rewritten: Network Adversarial Based on High-Dimensional Function Graph Decomposition

    Authors: Xiaoyan Su, Yinghao Zhu, Run Li

    Abstract: In the past, research on a single low dimensional activation function in networks has led to internal covariate shift and gradient deviation problems. A relatively small research area is how to use function combinations to provide property completion for a single activation function application. We propose a network adversarial method to address the aforementioned challenges. This is the first met… ▽ More

    Submitted 4 May, 2024; originally announced May 2024.

  8. arXiv:2405.03697  [pdf, other]

    cs.HC

    GeoViz: A Multi-View Visualization Platform for Spatio-temporal Knowledge Graph

    Authors: Jianping Zhou, Junhao Li, Guanjie Zheng, Yunqiang Zhu, Xinbing Wang, Chenghu Zhou

    Abstract: In this paper, we propose a multi-view visualization technology for spatio-temporal knowledge graph (STKG), which utilizes three distinct perspectives: knowledge tree, knowledge net, and knowledge map, to facilitate a comprehensive analysis of the STKG. The knowledge tree enables the visualization of hierarchical interrelation within the STKG, while the knowledge net elucidates semantic relationshi… ▽ More

    Submitted 29 April, 2024; originally announced May 2024.

    Comments: 4 pages, 2 figures

  9. arXiv:2405.02376  [pdf, other]

    physics.med-ph quant-ph

    Non-invasive magnetocardiography of living rat based on diamond quantum sensor

    Authors: Ziyun Yu, Yijin Xie, Guodong Jin, Yunbin Zhu, Qi Zhang, Fazhan Shi, Fang-yan Wan, Hongmei Luo, Ai-hui Tang, Xing Rong

    Abstract: Magnetocardiography (MCG) has emerged as a sensitive and precise method to diagnose cardiovascular diseases, providing more diagnostic information than traditional technology. However, the sensor limitations of conventional MCG systems, such as large size and cryogenic requirement, have hindered the widespread application and in-depth understanding of this technology. In this study, we present a h… ▽ More

    Submitted 3 May, 2024; originally announced May 2024.

  10. arXiv:2405.02194  [pdf, other]

    physics.atom-ph physics.optics

    Coherent XUV super continuum emission from atomic bound states

    Authors: Jing Zhao, Xiaowei Wang, Li Wang, Jiacan Wang, Yalei Zhu, Fan Xiao, Wenkai Tao, Zhigang Zheng, Haizhong Wu, Xu Sun, Yue Lang, Congsen Meng, Dongwen Zhang, Zhihui Lv, Jinlei Liu, Zengxiu Zhao

    Abstract: Coherent supercontinuum radiation in the extreme-ultraviolet (XUV) range is indispensable for synthesizing attosecond light pulses and for exploring transient atomic structures. Here, we report the striking observations of coherent XUV supercontinuum (XSC) extended from below to far above the ionization threshold, which exhibits completely different temporal and spatial properties comparing to the… ▽ More

    Submitted 3 May, 2024; originally announced May 2024.

  11. Towards Building Autonomous Data Services on Azure

    Authors: Yiwen Zhu, Yuanyuan Tian, Joyce Cahoon, Subru Krishnan, Ankita Agarwal, Rana Alotaibi, Jesús Camacho-Rodríguez, Bibin Chundatt, Andrew Chung, Niharika Dutta, Andrew Fogarty, Anja Gruenheid, Brandon Haynes, Matteo Interlandi, Minu Iyer, Nick Jurgens, Sumeet Khushalani, Brian Kroth, Manoj Kumar, Jyoti Leeka, Sergiy Matusevych, Minni Mittal, Andreas Mueller, Kartheek Muthyala, Harsha Nagulapalli , et al. (13 additional authors not shown)

    Abstract: Modern cloud has turned data services into easily accessible commodities. With just a few clicks, users are now able to access a catalog of data processing systems for a wide range of tasks. However, the cloud brings in both complexity and opportunity. While cloud users can quickly start an application by using various data services, it can be difficult to configure and optimize these services to… ▽ More

    Submitted 2 May, 2024; originally announced May 2024.

    Comments: SIGMOD Companion of the 2023 International Conference on Management of Data. 2023

  12. arXiv:2405.01751  [pdf]

    cond-mat.mtrl-sci cond-mat.mes-hall

    Theory of nonlinear terahertz susceptibility in ferroelectrics

    Authors: Yujie Zhu, Taorui Chen, Aiden Ross, Bo Wang, Xiangwei Guo, Venkatraman Gopalan, Long-Qing Chen, Jia-Mian Hu

    Abstract: An analytical theory is developed for predicting the nonlinear susceptibility of ionic polarization to continuous electromagnetic waves in both bulk and strained thin film ferroelectrics. Using a perturbation method for solving the nonlinear equation of motion for ionic polarization within the framework of Landau-Ginzburg-Devonshire theory, the full second-order nonlinear susceptibility tensor is… ▽ More

    Submitted 2 May, 2024; originally announced May 2024.

  13. arXiv:2405.01701  [pdf]

    cs.CV

    Active Learning Enabled Low-cost Cell Image Segmentation Using Bounding Box Annotation

    Authors: Yu Zhu, Qiang Yang, Li Xu

    Abstract: Cell image segmentation is usually implemented using fully supervised deep learning methods, which heavily rely on extensive annotated training data. Yet, due to the complexity of cell morphology and the requirement for specialized knowledge, pixel-level annotation of cell images has become a highly labor-intensive task. To address the above problems, we propose an active learning framework for ce… ▽ More

    Submitted 2 May, 2024; originally announced May 2024.

  14. arXiv:2405.00891  [pdf, other]

    math.OC math.AP math.NA

    An interacting particle consensus method for constrained global optimization

    Authors: José A. Carrillo, Shi Jin, Haoyu Zhang, Yuhua Zhu

    Abstract: This paper presents a particle-based optimization method designed for addressing minimization problems with equality constraints, particularly in cases where the loss function exhibits non-differentiability or non-convexity. The proposed method combines components from consensus-based optimization algorithm with a newly introduced forcing term directed at the constraint set. A rigorous mean-field… ▽ More

    Submitted 12 May, 2024; v1 submitted 1 May, 2024; originally announced May 2024.

    MSC Class: 90C56; 65C35; 35Q70; 82C22; 35Q84

  15. arXiv:2405.00704  [pdf, ps, other]

    cs.CL cs.AI

    A Survey on the Real Power of ChatGPT

    Authors: Ming Liu, Ran Liu, Ye Zhu, Hua Wang, Youyang Qu, Rongsheng Li, Yongpan Sheng, Wray Buntine

    Abstract: ChatGPT has changed the AI community and an active research line is the performance evaluation of ChatGPT. A key challenge for the evaluation is that ChatGPT is still closed-source and traditional benchmark datasets may have been used by ChatGPT as the training data. In this paper, (i) we survey recent studies which uncover the real performance levels of ChatGPT in seven categories of NLP tasks, (… ▽ More

    Submitted 9 May, 2024; v1 submitted 22 April, 2024; originally announced May 2024.

    Comments: 18 pages, 2 tables

  16. arXiv:2405.00513  [pdf]

    q-bio.QM

    3D MR Fingerprinting for Dynamic Contrast-Enhanced Imaging of Whole Mouse Brain

    Authors: Yuran Zhu, Guanhua Wang, Yuning Gu, Walter Zhao, Jiahao Lu, Junqing Zhu, Christina J. MacAskill, Andrew Dupuis, Mark A. Griswold, Dan Ma, Chris A. Flask, Xin Yu

    Abstract: Quantitative MRI enables direct quantification of contrast agent concentrations in contrast-enhanced scans. However, the lengthy scan times required by conventional methods are inadequate for tracking contrast agent transport dynamically in mouse brain. We developed a 3D MR fingerprinting (MRF) method for simultaneous T1 and T2 mapping across the whole mouse brain with 4.3-min temporal resolution.… ▽ More

    Submitted 1 May, 2024; originally announced May 2024.

  17. arXiv:2404.19585  [pdf, other]

    cs.RO

    Integrating Visuo-tactile Sensing with Haptic Feedback for Teleoperated Robot Manipulation

    Authors: Noah Becker, Erik Gattung, Kay Hansel, Tim Schneider, Yaonan Zhu, Yasuhisa Hasegawa, Jan Peters

    Abstract: Telerobotics enables humans to overcome spatial constraints and allows them to physically interact with the environment in remote locations. However, the sensory feedback provided by the system to the operator is often purely visual, limiting the operator's dexterity in manipulation tasks. In this work, we address this issue by equipping the robot's end-effector with high-resolution visuotactile G… ▽ More

    Submitted 30 April, 2024; originally announced April 2024.

  18. arXiv:2404.19469  [pdf, other]

    cond-mat.mes-hall cond-mat.quant-gas

    Novel Topological Insulators with Hybrid-order Boundary States

    Authors: Yan-Qing Zhu, Zhen Zheng, Giandomenico Palumbo, Z. D. Wang

    Abstract: We report the discovery of several classes of novel topological insulators (TIs) with hybrid-order boundary states generated from the first-order TIs with additional crystalline symmetries. Unlike the current studies on hybrid-order TIs where different-order topology arises from merging different-order TIs in various energy, these novel TIs exhibit a remarkable coexistence of first-order gapless m… ▽ More

    Submitted 27 May, 2024; v1 submitted 30 April, 2024; originally announced April 2024.

    Comments: More relevant references and the supplemental materials are added in this new version

  19. Interest Clock: Time Perception in Real-Time Streaming Recommendation System

    Authors: Yongchun Zhu, Jingwu Chen, Ling Chen, Yitan Li, Feng Zhang, Zuotao Liu

    Abstract: User preferences follow a dynamic pattern over a day, e.g., at 8 am, a user might prefer to read news, while at 8 pm, they might prefer to watch movies. Time modeling aims to enable recommendation systems to perceive time changes to capture users' dynamic preferences over time, which is an important and challenging problem in recommendation systems. Especially, streaming recommendation systems in… ▽ More

    Submitted 30 April, 2024; originally announced April 2024.

    Comments: Accepted by SIGIR 2024

  20. arXiv:2404.18948  [pdf, other]

    cs.LG

    Sub-Adjacent Transformer: Improving Time Series Anomaly Detection with Reconstruction Error from Sub-Adjacent Neighborhoods

    Authors: Wenzhen Yue, Xianghua Ying, Ruohao Guo, DongDong Chen, Ji Shi, Bowei Xing, Yuqing Zhu, Taiyan Chen

    Abstract: In this paper, we present the Sub-Adjacent Transformer with a novel attention mechanism for unsupervised time series anomaly detection. Unlike previous approaches that rely on all the points within some neighborhood for time point reconstruction, our method restricts the attention to regions not immediately adjacent to the target points, termed sub-adjacent neighborhoods. Our key observation is th… ▽ More

    Submitted 27 April, 2024; originally announced April 2024.

    Comments: IJCAI 2024

  21. arXiv:2404.18580  [pdf, other]

    cs.RO eess.SY

    Data-Driven Dynamics Modeling of Miniature Robotic Blimps Using Neural ODEs With Parameter Auto-Tuning

    Authors: Yongjian Zhu, Hao Cheng, Feitian Zhang

    Abstract: Miniature robotic blimps, as one type of lighter-than-air aerial vehicles, have attracted increasing attention in the science and engineering community for their enhanced safety, extended endurance, and quieter operation compared to quadrotors. Accurately modeling the dynamics of these robotic blimps poses a significant challenge due to the complex aerodynamics stemming from their large lifting bo… ▽ More

    Submitted 29 April, 2024; originally announced April 2024.

    Comments: 8 pages, 8 figures

  22. arXiv:2404.18491  [pdf, other]

    cond-mat.dis-nn

    Emergent Non-Abelian Thouless Pumping Induced by the Quasiperiodic Disorder

    Authors: Sen Huang, Yan-Qing Zhu, Zhi Li

    Abstract: We investigate the non-Abelian Thouless pumping in a disorder tunable Lieb chain with degenerate flat bands. The results reveal that quasiperiodic disorder will cause a topological phase transition from the trivial (without non-Abelian Thouless pumping) to the non-trivial (with non-Abelian Thouless pumping) phase. The mechanism behind is that the monopole originally outside the topological region… ▽ More

    Submitted 29 April, 2024; originally announced April 2024.

    Comments: 9 pages, 5 figures

  23. arXiv:2404.18443  [pdf, other]

    cs.CL cs.AI cs.IR q-bio.QM

    BMRetriever: Tuning Large Language Models as Better Biomedical Text Retrievers

    Authors: Ran Xu, Wenqi Shi, Yue Yu, Yuchen Zhuang, Yanqiao Zhu, May D. Wang, Joyce C. Ho, Chao Zhang, Carl Yang

    Abstract: Developing effective biomedical retrieval models is important for excelling at knowledge-intensive biomedical tasks but still challenging due to the deficiency of sufficient publicly annotated biomedical data and computational resources. We present BMRetriever, a series of dense retrievers for enhancing biomedical retrieval via unsupervised pre-training on large biomedical corpora, followed by ins… ▽ More

    Submitted 29 April, 2024; originally announced April 2024.

    Comments: Work in progress. The model and data will be uploaded to https://github.com/ritaranx/BMRetriever

  24. arXiv:2404.18319  [pdf, other]

    cs.IR

    User Welfare Optimization in Recommender Systems with Competing Content Creators

    Authors: Fan Yao, Yiming Liao, Mingzhe Wu, Chuanhao Li, Yan Zhu, James Yang, Qifan Wang, Haifeng Xu, Hongning Wang

    Abstract: Driven by the new economic opportunities created by the creator economy, an increasing number of content creators rely on and compete for revenue generated from online content recommendation platforms. This burgeoning competition reshapes the dynamics of content distribution and profoundly impacts long-term user welfare on the platform. However, the absence of a comprehensive picture of global use… ▽ More

    Submitted 28 April, 2024; originally announced April 2024.

  25. arXiv:2404.17955  [pdf, other]

    cs.SE

    A Survey of Third-Party Library Security Research in Application Software

    Authors: Jia Zeng, Dan Han, Yaling Zhu, Yangzhong Wang, Fangchen Weng

    Abstract: In the current software development environment, third-party libraries play a crucial role. They provide developers with rich functionality and convenient solutions, speeding up the pace and efficiency of software development. However, with the widespread use of third-party libraries, associated security risks and potential vulnerabilities are increasingly apparent. Malicious attackers can exploit… ▽ More

    Submitted 27 April, 2024; originally announced April 2024.

    Comments: 21 pages, 3 figures, one table

  26. arXiv:2404.17521  [pdf, other]

    cs.RO cs.CV

    Ag2Manip: Learning Novel Manipulation Skills with Agent-Agnostic Visual and Action Representations

    Authors: Puhao Li, Tengyu Liu, Yuyang Li, Muzhi Han, Haoran Geng, Shu Wang, Yixin Zhu, Song-Chun Zhu, Siyuan Huang

    Abstract: Autonomous robotic systems capable of learning novel manipulation tasks are poised to transform industries from manufacturing to service automation. However, modern methods (e.g., VIP and R3M) still face significant hurdles, notably the domain gap among robotic embodiments and the sparsity of successful task executions within specific action spaces, resulting in misaligned and ambiguous task repre… ▽ More

    Submitted 26 April, 2024; originally announced April 2024.

    Comments: Project website and open-source code: https://xiaoyao-li.github.io/research/ag2manip

  27. arXiv:2404.17028  [pdf, ps, other]

    cs.HC cs.AI

    Generative AI in Color-Changing Systems: Re-Programmable 3D Object Textures with Material and Design Constraints

    Authors: Yunyi Zhu, Faraz Faruqi, Stefanie Mueller

    Abstract: Advances in Generative AI tools have allowed designers to manipulate existing 3D models using text or image-based prompts, enabling creators to explore different design goals. Photochromic color-changing systems, on the other hand, allow for the reprogramming of surface texture of 3D models, enabling easy customization of physical objects and opening up the possibility of using object surfaces for… ▽ More

    Submitted 25 April, 2024; originally announced April 2024.

  28. arXiv:2404.16831  [pdf, other]

    cs.CV

    The Third Monocular Depth Estimation Challenge

    Authors: Jaime Spencer, Fabio Tosi, Matteo Poggi, Ripudaman Singh Arora, Chris Russell, Simon Hadfield, Richard Bowden, GuangYuan Zhou, ZhengXin Li, Qiang Rao, Yiping Bao, Xiao Liu, Dohyeong Kim, Jinseong Kim, Myunghyun Kim, Mykola Lavreniuk, Rui Li, Qing Mao, Jiang Wu, Yu Zhu, Jinqiu Sun, Yanning Zhang, Suraj Patni, Aradhye Agarwal, Chetan Arora , et al. (16 additional authors not shown)

    Abstract: This paper discusses the results of the third edition of the Monocular Depth Estimation Challenge (MDEC). The challenge focuses on zero-shot generalization to the challenging SYNS-Patches dataset, featuring complex scenes in natural and indoor settings. As with the previous edition, methods can use any form of supervision, i.e. supervised or self-supervised. The challenge received a total of 19 su… ▽ More

    Submitted 27 April, 2024; v1 submitted 25 April, 2024; originally announced April 2024.

    Comments: To appear in CVPRW2024

  29. arXiv:2404.16666  [pdf, other]

    cs.CV

    PhyRecon: Physically Plausible Neural Scene Reconstruction

    Authors: Junfeng Ni, Yixin Chen, Bohan Jing, Nan Jiang, Bin Wang, Bo Dai, Puhao Li, Yixin Zhu, Song-Chun Zhu, Siyuan Huang

    Abstract: Neural implicit representations have gained popularity in multi-view 3D reconstruction. However, most previous work struggles to yield physically plausible results, limiting their utility in domains requiring rigorous physical accuracy, such as embodied AI and robotics. This lack of plausibility stems from the absence of physics modeling in existing methods and their inability to recover intricate… ▽ More

    Submitted 2 June, 2024; v1 submitted 25 April, 2024; originally announced April 2024.

    Comments: project page: https://phyrecon.github.io/. arXiv admin note: text overlap with arXiv:2303.08605 by other authors

  30. arXiv:2404.16425  [pdf, other]

    astro-ph.HE

    Soft X-ray prompt emission from a high-redshift gamma-ray burst EP240315a

    Authors: Y. Liu, H. Sun, D. Xu, D. S. Svinkin, J. Delaunay, N. R. Tanvir, H. Gao, C. Zhang, Y. Chen, X. -F. Wu, B. Zhang, W. Yuan, J. An, G. Bruni, D. D. Frederiks, G. Ghirlanda, J. -W. Hu, A. Li, C. -K. Li, J. -D. Li, D. B. Malesani, L. Piro, G. Raman, R. Ricci, E. Troja , et al. (170 additional authors not shown)

    Abstract: Long gamma-ray bursts (GRBs) are believed to originate from core collapse of massive stars. High-redshift GRBs can probe the star formation and reionization history of the early universe, but their detection remains rare. Here we report the detection of a GRB triggered in the 0.5--4 keV band by the Wide-field X-ray Telescope (WXT) on board the Einstein Probe (EP) mission, designated as EP240315a,… ▽ More

    Submitted 25 April, 2024; originally announced April 2024.

    Comments: 41 pages, 8 figures, 7 tables

  31. arXiv:2404.15956  [pdf, other]

    cs.CV

    A Survey on Visual Mamba

    Authors: Hanwei Zhang, Ying Zhu, Dan Wang, Lijun Zhang, Tianxiang Chen, Zi Ye

    Abstract: State space models (SSMs) with selection mechanisms and hardware-aware architectures, namely Mamba, have recently demonstrated significant promise in long-sequence modeling. Since the self-attention mechanism in transformers has quadratic complexity with image size and increasing computational demands, the researchers are now exploring how to adapt Mamba for computer vision tasks. This paper is th… ▽ More

    Submitted 26 April, 2024; v1 submitted 24 April, 2024; originally announced April 2024.

  32. arXiv:2404.15954  [pdf, other]

    cs.IR cs.LG

    Mixed Supervised Graph Contrastive Learning for Recommendation

    Authors: Weizhi Zhang, Liangwei Yang, Zihe Song, Henry Peng Zou, Ke Xu, Yuanjie Zhu, Philip S. Yu

    Abstract: Recommender systems (RecSys) play a vital role in online platforms, offering users personalized suggestions amidst vast information. Graph contrastive learning aims to learn from high-order collaborative filtering signals with unsupervised augmentation on the user-item bipartite graph, which predominantly relies on the multi-task learning framework involving both the pair-wise recommendation loss… ▽ More

    Submitted 25 April, 2024; v1 submitted 24 April, 2024; originally announced April 2024.

  33. arXiv:2404.15733  [pdf, other]

    cs.AR

    BlissCam: Boosting Eye Tracking Efficiency with Learned In-Sensor Sparse Sampling

    Authors: Yu Feng, Tianrui Ma, Yuhao Zhu, Xuan Zhang

    Abstract: Eye tracking is becoming an increasingly important task domain in emerging computing platforms such as Augmented/Virtual Reality (AR/VR). Today's eye tracking system suffers from long end-to-end tracking latency and can easily eat up half of the power budget of a mobile VR device. Most existing optimization efforts exclusively focus on the computation pipeline by optimizing the algorithm and/or de… ▽ More

    Submitted 24 April, 2024; originally announced April 2024.

  34. arXiv:2404.15380  [pdf, other]

    cs.LG cs.AI

    ControlTraj: Controllable Trajectory Generation with Topology-Constrained Diffusion Model

    Authors: Yuanshao Zhu, James Jianqiao Yu, Xiangyu Zhao, Qidong Liu, Yongchao Ye, Wei Chen, Zijian Zhang, Xuetao Wei, Yuxuan Liang

    Abstract: Generating trajectory data is among promising solutions to addressing privacy concerns, collection costs, and proprietary restrictions usually associated with human mobility analyses. However, existing trajectory generation methods are still in their infancy due to the inherent diversity and unpredictability of human activities, grappling with issues such as fidelity, flexibility, and generalizabi… ▽ More

    Submitted 23 April, 2024; originally announced April 2024.

  35. arXiv:2404.14851  [pdf, other]

    cs.IR cs.AI cs.CL

    From Matching to Generation: A Survey on Generative Information Retrieval

    Authors: Xiaoxi Li, Jiajie Jin, Yujia Zhou, Yuyao Zhang, Peitian Zhang, Yutao Zhu, Zhicheng Dou

    Abstract: Information Retrieval (IR) systems are crucial tools for users to access information, widely applied in scenarios like search engines, question answering, and recommendation systems. Traditional IR methods, based on similarity matching to return ranked lists of documents, have been reliable means of information acquisition, dominating the IR field for years. With the advancement of pre-trained lan… ▽ More

    Submitted 15 May, 2024; v1 submitted 23 April, 2024; originally announced April 2024.

  36. arXiv:2404.14248  [pdf, other]

    cs.CV

    NTIRE 2024 Challenge on Low Light Image Enhancement: Methods and Results

    Authors: Xiaoning Liu, Zongwei Wu, Ao Li, Florin-Alexandru Vasluianu, Yulun Zhang, Shuhang Gu, Le Zhang, Ce Zhu, Radu Timofte, Zhi Jin, Hongjun Wu, Chenxi Wang, Haitao Ling, Yuanhao Cai, Hao Bian, Yuxin Zheng, Jing Lin, Alan Yuille, Ben Shao, Jin Guo, Tianli Liu, Mohao Wu, Yixu Feng, Shuo Hou, Haotian Lin , et al. (87 additional authors not shown)

    Abstract: This paper reviews the NTIRE 2024 low light image enhancement challenge, highlighting the proposed solutions and results. The aim of this challenge is to discover an effective network design or solution capable of generating brighter, clearer, and visually appealing results when dealing with a variety of conditions, including ultra-high resolution (4K and beyond), non-uniform illumination, backlig… ▽ More

    Submitted 22 April, 2024; originally announced April 2024.

    Comments: NTIRE 2024 Challenge Report

  37. arXiv:2404.14182  [pdf]

    cond-mat.supr-con

    Record high superconducting transition temperature in Ti$_{1-x}$Mn$_x$ alloy with rich magnetic element Mn

    Authors: Ying-Jie Zhang, Yijie Zhu, Qing Li, Zhe-Ning Xiang, Tianheng Huang, Jian Sun, Hai-Hu Wen

    Abstract: It is well-known that magnetic moments are very harmful to superconductivity. A typical example is the element Mn whose compounds usually exhibit strong magnetism. Thus, it is very hard to achieve superconductivity in materials containing Mn. Here, we report enhanced superconductivity with the superconducting transition temperature ($T_\text{c}$) up to a record high-value of about 26 K in a beta-p… ▽ More

    Submitted 22 April, 2024; originally announced April 2024.

    Comments: 28 pages, 5 figures. Comments are welcome and appreciated

  38. arXiv:2404.14173  [pdf, other]

    quant-ph

    Noiseless linear amplification-based quantum Ziv-Zakai bound for phase estimation and its Heisenberg error limits in noisy scenarios

    Authors: Wei Ye, Peng Xiao, Xiaofan Xu, Xiang Zhu, Yunbin Yan, Lu Wang, Jie Ren, Yuxuan Zhu, Ying Xia, Xuan Rao, Shoukang Chang

    Abstract: In this work, we address the central problem about how to effectively find the available precision limit of unknown parameters. In the framework of the quantum Ziv-Zakai bound (QZZB), we employ noiseless linear amplification (NLA) techniques to an initial coherent state (CS) as the probe state, and focus on whether the phase estimation performance is improved significantly in noisy scenarios, invol… ▽ More

    Submitted 22 April, 2024; originally announced April 2024.

    Comments: 10 pages, 9 figures

  39. Multi-agent Reinforcement Learning-based Joint Precoding and Phase Shift Optimization for RIS-aided Cell-Free Massive MIMO Systems

    Authors: Yiyang Zhu, Enyu Shi, Ziheng Liu, Jiayi Zhang, Bo Ai

    Abstract: Cell-free (CF) massive multiple-input multiple-output (mMIMO) is a promising technique for achieving high spectral efficiency (SE) using multiple distributed access points (APs). However, harsh propagation environments often lead to significant communication performance degradation due to high penetration loss. To overcome this issue, we introduce the reconfigurable intelligent surface (RIS) into… ▽ More

    Submitted 22 April, 2024; originally announced April 2024.

  40. arXiv:2404.14073  [pdf, other]

    cs.LG cs.AI

    Towards Robust Trajectory Representations: Isolating Environmental Confounders with Causal Learning

    Authors: Kang Luo, Yuanshao Zhu, Wei Chen, Kun Wang, Zhengyang Zhou, Sijie Ruan, Yuxuan Liang

    Abstract: Trajectory modeling refers to characterizing human movement behavior, serving as a pivotal step in understanding mobility patterns. Nevertheless, existing studies typically ignore the confounding effects of geospatial context, leading to the acquisition of spurious correlations and limited generalization capabilities. To bridge this gap, we initially formulate a Structural Causal Model (SCM) to de… ▽ More

    Submitted 22 April, 2024; originally announced April 2024.

    Comments: The paper has been accepted by IJCAI 2024

  41. arXiv:2404.14061  [pdf, other

    cs.LG cs.AI cs.DB cs.SI

    FedTAD: Topology-aware Data-free Knowledge Distillation for Subgraph Federated Learning

    Authors: Yinlin Zhu, Xunkai Li, Zhengyu Wu, Di Wu, Miao Hu, Rong-Hua Li

    Abstract: Subgraph federated learning (subgraph-FL) is a new distributed paradigm that facilitates the collaborative training of graph neural networks (GNNs) by multi-client subgraphs. Unfortunately, a significant challenge of subgraph-FL arises from subgraph heterogeneity, which stems from node and topology variation, causing the impaired performance of the global GNN. Despite various studies, they have no…

    Submitted 25 April, 2024; v1 submitted 22 April, 2024; originally announced April 2024.

    Comments: Accepted by IJCAI 2024

  42. arXiv:2404.13840  [pdf, other

    hep-ex

    Study of $e^+e^-\toωX(3872)$ and $γX(3872)$ from 4.66 to 4.95 GeV

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, O. Afedulidis, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere , et al. (634 additional authors not shown)

    Abstract: Using data samples with an integrated luminosity of $4.5~\text{fb}^{-1}$ collected by the BESIII detector at center-of-mass energies ranging from 4.66 to 4.95 GeV, we study the processes of $e^+e^-\toωX(3872)$ and $e^+e^-\toγX(3872)$. With the $e^+e^-\toωX(3872)$ process, the branching fraction ratio $R\equiv\frac{\mathcal{B}(X(3872)\toγJ/ψ)}{\mathcal{B}(X(3872)\toπ^+π^- J/ψ)}$ is measured to be…

    Submitted 21 April, 2024; originally announced April 2024.

    Comments: 19 pages, 10 figures

  43. arXiv:2404.13515  [pdf, other

    cs.LG cs.AI cs.DC

    FedTrans: Efficient Federated Learning via Multi-Model Transformation

    Authors: Yuxuan Zhu, Jiachen Liu, Mosharaf Chowdhury, Fan Lai

    Abstract: Federated learning (FL) aims to train machine learning (ML) models across potentially millions of edge client devices. Yet, training and customizing models for FL clients is notoriously challenging due to the heterogeneity of client data, device capabilities, and the massive scale of clients, making individualized model exploration prohibitively expensive. State-of-the-art FL solutions personalize…

    Submitted 25 April, 2024; v1 submitted 20 April, 2024; originally announced April 2024.

    Journal ref: MLSys (2024)

  44. arXiv:2404.12666  [pdf, other

    cs.DC cs.CR cs.ET

    A Survey on Federated Analytics: Taxonomy, Enabling Techniques, Applications and Open Issues

    Authors: Zibo Wang, Haichao Ji, Yifei Zhu, Dan Wang, Zhu Han

    Abstract: The escalating influx of data generated by networked edge devices, coupled with the growing awareness of data privacy, has promoted a transformative shift in computing paradigms from centralized data processing to privacy-preserved distributed data processing. Federated analytics (FA) is an emerging technique to support collaborative data analytics among diverse data owners without centralizing th…

    Submitted 19 April, 2024; originally announced April 2024.

    Comments: This survey has been submitted to IEEE Communications Surveys & Tutorials

  45. arXiv:2404.12585  [pdf, other

    astro-ph.CO

    IGM damping wing constraints on the tail end of reionisation from the enlarged XQR-30 sample

    Authors: Bradley Greig, Andrei Mesinger, Eduardo Bañados, George D. Becker, Sarah E. I. Bosman, Huanqing Chen, Frederick B. Davies, Valentina D'Odorico, Anna-Christina Eilers, Simona Gallerani, Martin G. Haehnelt, Laura Keating, Samuel Lai, Yuxiang Qin, Emma Ryan-Weber, Sindhu Satyavolu, Feige Wang, Jinyi Yang, Yongda Zhu

    Abstract: The attenuation of Ly$α$ photons by neutral hydrogen in the intergalactic medium (IGM) at $z\gtrsim5$ continues to be a powerful probe for studying the epoch of reionisation. Given a framework to estimate the intrinsic (true) Ly$α$ emission of high-$z$ sources, one can infer the ionisation state of the IGM during reionisation. In this work, we use the enlarged XQR-30 sample of 42 high-resolution a…

    Submitted 18 April, 2024; originally announced April 2024.

    Comments: 21 pages, 9 figures and 2 tables. Accepted for publication in MNRAS

  46. arXiv:2404.12522  [pdf, other

    cs.LG cs.AI

    Neural Active Learning Beyond Bandits

    Authors: Yikun Ban, Ishika Agarwal, Ziwei Wu, Yada Zhu, Kommy Weldemariam, Hanghang Tong, Jingrui He

    Abstract: We study both stream-based and pool-based active learning with neural network approximations. A recent line of works proposed bandit-based approaches that transformed active learning into a bandit problem, achieving both theoretical and empirical success. However, the performance and computational costs of these methods may be susceptible to the number of classes, denoted as $K$, due to this trans…

    Submitted 18 April, 2024; originally announced April 2024.

    Comments: Published on ICLR 2024, 40 Pages

  47. Multi-phase black-hole feedback and a bright [CII] halo in a Lo-BAL quasar at $z\sim6.6$

    Authors: Manuela Bischetti, Hyunseop Choi, Fabrizio Fiore, Chiara Feruglio, Stefano Carniani, Valentina D'Odorico, Eduardo Bañados, Huanqing Chen, Roberto Decarli, Simona Gallerani, Julie Hlavacek-Larrondo, Samuel Lai, Karen M. Leighly, Chiara Mazzucchelli, Laurence Perreault-Levasseur, Roberta Tripodi, Fabian Walter, Feige Wang, Jinyi Yang, Maria Vittoria Zanchettin, Yongda Zhu

    Abstract: Although the mass growth of supermassive black holes during the Epoch of Reionisation is expected to play a role in shaping the concurrent growth of their host-galaxies, observational evidence of feedback at z$\gtrsim$6 is still sparse. We perform the first multi-scale and multi-phase characterisation of black-hole driven outflows in the $z\sim6.6$ quasar J0923+0402 and assess how these winds impa…

    Submitted 16 May, 2024; v1 submitted 18 April, 2024; originally announced April 2024.

    Comments: Accepted for publication in ApJ

  48. arXiv:2404.12312  [pdf, ps, other

    cs.LG math.OC stat.ML

    A Mean-Field Analysis of Neural Stochastic Gradient Descent-Ascent for Functional Minimax Optimization

    Authors: Yuchen Zhu, Yufeng Zhang, Zhaoran Wang, Zhuoran Yang, Xiaohong Chen

    Abstract: This paper studies minimax optimization problems defined over infinite-dimensional function classes of overparameterized two-layer neural networks. In particular, we consider the minimax optimization problem stemming from estimating linear functional equations defined by conditional expectations, where the objective functions are quadratic in the functional spaces. We address (i) the convergence o…

    Submitted 25 May, 2024; v1 submitted 18 April, 2024; originally announced April 2024.

    Comments: Submitted

  49. arXiv:2404.12000  [pdf, other

    cs.SE

    How far are AI-powered programming assistants from meeting developers' needs?

    Authors: Xin Tan, Xiao Long, Xianjun Ni, Yinghao Zhu, Jing Jiang, Li Zhang

    Abstract: Recent In-IDE AI coding assistant tools (ACATs) like GitHub Copilot have significantly impacted developers' coding habits. While some studies have examined their effectiveness, there lacks in-depth investigation into the actual assistance process. To bridge this gap, we simulate real development scenarios encompassing three typical types of software development tasks and recruit 27 computer scienc…

    Submitted 24 April, 2024; v1 submitted 18 April, 2024; originally announced April 2024.

  50. arXiv:2404.11967  [pdf, other

    math.OC

    Multi-Agent Relative Investment Games in a Jump Diffusion Market with Deep Reinforcement Learning Algorithm

    Authors: Liwei Lu, Ruimeng Hu, Xu Yang, Yi Zhu

    Abstract: This paper focuses on multi-agent stochastic differential games for jump-diffusion systems. On one hand, we study the multi-agent game for optimal investment in a jump-diffusion market. We derive constant Nash equilibria and provide sufficient conditions for their existence and uniqueness for exponential, power, and logarithmic utilities, respectively. On the other hand, we introduce a computation…

    Submitted 18 April, 2024; originally announced April 2024.