Skip to main content

Showing 151–200 of 2,691 results for author: Zhang, G

.
  1. arXiv:2403.12536  [pdf, other

    cs.CV

    Vox-Fusion++: Voxel-based Neural Implicit Dense Tracking and Map** with Multi-maps

    Authors: Hongjia Zhai, Hai Li, Xingrui Yang, Gan Huang, Yuhang Ming, Hujun Bao, Guofeng Zhang

    Abstract: In this paper, we introduce Vox-Fusion++, a multi-maps-based robust dense tracking and map** system that seamlessly fuses neural implicit representations with traditional volumetric fusion techniques. Building upon the concept of implicit map** and positioning systems, our approach extends its applicability to real-world scenarios. Our system employs a voxel-based neural implicit surface repre… ▽ More

    Submitted 19 March, 2024; originally announced March 2024.

    Comments: 14 pages. arXiv admin note: text overlap with arXiv:2210.15858

  2. arXiv:2403.12451  [pdf, other

    cs.AI

    End-to-End Neuro-Symbolic Reinforcement Learning with Textual Explanations

    Authors: Lirui Luo, Guoxi Zhang, Hongming Xu, Yaodong Yang, Cong Fang, Qing Li

    Abstract: Neuro-symbolic reinforcement learning (NS-RL) has emerged as a promising paradigm for explainable decision-making, characterized by the interpretability of symbolic policies. NS-RL entails structured state representations for tasks with visual observations, but previous methods cannot refine the structured states with rewards due to a lack of efficiency. Accessibility also remains an issue, as ext… ▽ More

    Submitted 13 June, 2024; v1 submitted 19 March, 2024; originally announced March 2024.

    Comments: ICML 2024. Project page: https://ins-rl.github.io/

  3. arXiv:2403.10901  [pdf, other

    astro-ph.GA

    $Herschel$ investigation of cores and filamentary structures in L1251 located in the Cepheus flare

    Authors: Divyansh Dewan, Archana Soam, Guo-Yin Zhang, Akhil Lasrado, Saikhom Pravash Singh, Chang Won Lee

    Abstract: Context: Molecular clouds are the prime locations of star formation. These clouds contain filamentary structures and cores which are crucial in the formation of young stars. Aims: In this work, we aim to quantify the physical properties of structural characteristics within the molecular cloud L1251 to better understand the initial conditions for star formation. Methods: We applied the getsf algori… ▽ More

    Submitted 16 March, 2024; originally announced March 2024.

    Comments: 15 pages, 20 figures, 2 tables, accepted for publication in JAA

  4. arXiv:2403.10877  [pdf, ps, other

    hep-ex hep-ph

    Test of lepton universality and measurement of the form factors of $D^0\to K^{*}(892)^-μ^+ν_μ$

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, O. Afedulidis, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere , et al. (637 additional authors not shown)

    Abstract: We report a first study of the semileptonic decay $D^0\rightarrow K^-π^0μ^{+}ν_μ$ by analyzing an $e^+e^-$ annihilation data sample of $7.9~\mathrm{fb}^{-1}$ collected at the center-of-mass energy of 3.773 GeV with the BESIII detector. The absolute branching fraction of $D^0\to K^-π^0μ^{+}ν_μ$ is measured for the first time to be $(0.729 \pm 0.014_{\rm stat} \pm 0.011_{\rm syst})\%$. Based on an a… ▽ More

    Submitted 16 March, 2024; originally announced March 2024.

    Comments: 9 pages, 3 figures

  5. arXiv:2403.10160  [pdf, other

    cs.LG

    Online Policy Learning from Offline Preferences

    Authors: Guoxi Zhang, Han Bao, Hisashi Kashima

    Abstract: In preference-based reinforcement learning (PbRL), a reward function is learned from a type of human feedback called preference. To expedite preference collection, recent works have leveraged \emph{offline preferences}, which are preferences collected for some offline data. In this scenario, the learned reward function is fitted on the offline data. If a learning agent exhibits behaviors that do n… ▽ More

    Submitted 15 March, 2024; originally announced March 2024.

  6. arXiv:2403.10039  [pdf, other

    cs.CV cs.AI

    Rethinking Low-quality Optical Flow in Unsupervised Surgical Instrument Segmentation

    Authors: Peiran Wu, Yang Liu, Jiayu Huo, Gongyu Zhang, Christos Bergeles, Rachel Sparks, Prokar Dasgupta, Alejandro Granados, Sebastien Ourselin

    Abstract: Video-based surgical instrument segmentation plays an important role in robot-assisted surgeries. Unlike supervised settings, unsupervised segmentation relies heavily on motion cues, which are challenging to discern due to the typically lower quality of optical flow in surgical footage compared to natural scenes. This presents a considerable burden for the advancement of unsupervised segmentation… ▽ More

    Submitted 15 March, 2024; originally announced March 2024.

  7. arXiv:2403.08716  [pdf, other

    cs.RO

    DIFFTACTILE: A Physics-based Differentiable Tactile Simulator for Contact-rich Robotic Manipulation

    Authors: Zilin Si, Gu Zhang, Qingwei Ben, Branden Romero, Zhou Xian, Chao Liu, Chuang Gan

    Abstract: We introduce DIFFTACTILE, a physics-based differentiable tactile simulation system designed to enhance robotic manipulation with dense and physically accurate tactile feedback. In contrast to prior tactile simulators which primarily focus on manipulating rigid bodies and often rely on simplified approximations to model stress and deformations of materials in contact, DIFFTACTILE emphasizes physics… ▽ More

    Submitted 13 March, 2024; originally announced March 2024.

  8. arXiv:2403.07868  [pdf, other

    cs.NI cs.IT

    Online Digital Twin-Empowered Content Resale Mechanism in Age of Information-Aware Edge Caching Networks

    Authors: Yuhan Yi, Guanglin Zhang, Hai Jiang

    Abstract: For users requesting popular contents from content providers, edge caching can alleviate backhaul pressure and enhance the quality of experience of users. Recently there is also a growing concern about content freshness that is quantified by age of information (AoI). Therefore, AoI-aware online caching algorithms are required, which is challenging because the content popularity is usually unknown… ▽ More

    Submitted 12 March, 2024; originally announced March 2024.

  9. arXiv:2403.07023  [pdf

    q-bio.OT

    Propensity-score matching analysis in COVID-19-related studies: a method and quality systematic review

    Authors: Chunhui Gu, Ruosha Li, Guoqiang Zhang

    Abstract: Objectives: To provide an overall quality assessment of the methods used for COVID-19-related studies using propensity score matching (PSM). Study Design and Setting: A systematic search was conducted in June 2021 on PubMed to identify COVID-19-related studies that use the PSM analysis between 2020 and 2021. Key information about study design and PSM analysis were extracted, such as covariates,… ▽ More

    Submitted 10 March, 2024; originally announced March 2024.

  10. arXiv:2403.06766  [pdf, other

    hep-ex

    Determination of the number of $ψ(3686)$ events taken at BESIII

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, O. Afedulidis, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere , et al. (639 additional authors not shown)

    Abstract: The number of $ψ(3686)$ events collected by the BESIII detector during the 2021 run period is determined to be $(2259.3\pm 11.1)\times 10^6$ by counting inclusive $ψ(3686)$ hadronic events. The uncertainty is systematic and the statistical uncertainty is negligible. Meanwhile, the numbers of $ψ(3686)$ events collected during the 2009 and 2012 run periods are updated to be… ▽ More

    Submitted 28 May, 2024; v1 submitted 11 March, 2024; originally announced March 2024.

  11. arXiv:2403.06062  [pdf, other

    quant-ph

    Higher-order exceptional surface in a pseudo-Hermitian superconducting circuit

    Authors: Guo-Qiang Zhang, Wei Feng, Yu Wang, Chui-** Yang

    Abstract: In the last few years, much attention has been paid to exceptional surfaces (ESs) owing to various important physical phenomena and potential applications. However, high-order ESs in pseudo-Hermitian systems have not been reported until now. Here, we study the high-order ES in a pseudo-Hermitian superconducting (SC) circuit system. In our proposal, the SC circuit system is composed of three circul… ▽ More

    Submitted 9 March, 2024; originally announced March 2024.

    Comments: 8 pages, 5 figures

  12. arXiv:2403.05890  [pdf, other

    cs.LG cs.DC

    Towards Efficient Replay in Federated Incremental Learning

    Authors: Yichen Li, Qunwei Li, Haozhao Wang, Ruixuan Li, Wenliang Zhong, Guannan Zhang

    Abstract: In Federated Learning (FL), the data in each client is typically assumed fixed or static. However, data often comes in an incremental manner in real-world applications, where the data domain may increase dynamically. In this work, we study catastrophic forgetting with data heterogeneity in Federated Incremental Learning (FIL) scenarios where edge clients may lack enough storage space to retain ful… ▽ More

    Submitted 3 June, 2024; v1 submitted 9 March, 2024; originally announced March 2024.

  13. arXiv:2403.05817  [pdf, other

    cs.CV

    SAFDNet: A Simple and Effective Network for Fully Sparse 3D Object Detection

    Authors: Gang Zhang, Junnan Chen, Guohuan Gao, Jianmin Li, Si Liu, Xiaolin Hu

    Abstract: LiDAR-based 3D object detection plays an essential role in autonomous driving. Existing high-performing 3D object detectors usually build dense feature maps in the backbone network and prediction head. However, the computational costs introduced by the dense feature maps grow quadratically as the perception range increases, making these models hard to scale up to long-range detection. Some recent… ▽ More

    Submitted 22 April, 2024; v1 submitted 9 March, 2024; originally announced March 2024.

    Comments: Accepted by CVPR 2024 (Oral)

  14. arXiv:2403.05314  [pdf, other

    q-bio.BM

    Advances of Deep Learning in Protein Science: A Comprehensive Survey

    Authors: Bozhen Hu, Cheng Tan, Lirong Wu, Jiangbin Zheng, Jun Xia, Zhangyang Gao, Zicheng Liu, Fandi Wu, Guijun Zhang, Stan Z. Li

    Abstract: Protein representation learning plays a crucial role in understanding the structure and function of proteins, which are essential biomolecules involved in various biological processes. In recent years, deep learning has emerged as a powerful tool for protein modeling due to its ability to learn complex patterns and representations from large-scale protein data. This comprehensive survey aims to pr… ▽ More

    Submitted 8 March, 2024; originally announced March 2024.

  15. arXiv:2403.05047  [pdf, other

    cs.CV

    REPS: Reconstruction-based Point Cloud Sampling

    Authors: Guoqing Zhang, Wenbo Zhao, Jian Liu, Xianming Liu

    Abstract: Sampling is widely used in various point cloud tasks as it can effectively reduce resource consumption. Recently, some methods have proposed utilizing neural networks to optimize the sampling process for various task requirements. Currently, deep downsampling methods can be categorized into two main types: generative-based and score-based. Generative-based methods directly generate sampled point c… ▽ More

    Submitted 7 March, 2024; originally announced March 2024.

    Comments: project page: https://github.com/hitcslj/REPS

  16. arXiv:2403.04652  [pdf, other

    cs.CL cs.AI

    Yi: Open Foundation Models by 01.AI

    Authors: 01. AI, :, Alex Young, Bei Chen, Chao Li, Chengen Huang, Ge Zhang, Guanwei Zhang, Heng Li, Jiangcheng Zhu, Jianqun Chen, **g Chang, Kaidong Yu, Peng Liu, Qiang Liu, Shawn Yue, Senbin Yang, Shiming Yang, Tao Yu, Wen Xie, Wenhao Huang, Xiaohui Hu, Xiaoyi Ren, Xinyao Niu, Pengcheng Nie , et al. (7 additional authors not shown)

    Abstract: We introduce the Yi model family, a series of language and multimodal models that demonstrate strong multi-dimensional capabilities. The Yi model family is based on 6B and 34B pretrained language models, then we extend them to chat models, 200K long context models, depth-upscaled models, and vision-language models. Our base models achieve strong performance on a wide range of benchmarks like MMLU,… ▽ More

    Submitted 7 March, 2024; originally announced March 2024.

  17. arXiv:2403.04450  [pdf, ps, other

    nucl-th

    Cluster radioactivity preformation probability of trans-lead nuclei in the scheme of NpNn

    Authors: Lin-**g Qi, Dong-Meng Zhang, Song Luo, Gui-Qing Zhang, Peng-Cheng Chu, Xi-Jun Wu, Xiao-Hua Li

    Abstract: In the present work, the cluster radioactivity preformation probability Pc in the scheme of NpNn for the effective number of the valence particles (holes) in trans-lead nuclei has been systematically investigated. This quantity has been explored in the simplified parametrization of NpNn as well as the multiplication NpNnI of this product with the isospin asymmetry I. The calculations for Pc are bo… ▽ More

    Submitted 7 March, 2024; originally announced March 2024.

  18. arXiv:2403.04437  [pdf, other

    cs.CV

    StableDrag: Stable Dragging for Point-based Image Editing

    Authors: Yutao Cui, Xiaotong Zhao, Guozhen Zhang, Shengming Cao, Kai Ma, Limin Wang

    Abstract: Point-based image editing has attracted remarkable attention since the emergence of DragGAN. Recently, DragDiffusion further pushes forward the generative quality via adapting this dragging technique to diffusion models. Despite these great success, this dragging scheme exhibits two major drawbacks, namely inaccurate point tracking and incomplete motion supervision, which may result in unsatisfact… ▽ More

    Submitted 7 March, 2024; originally announced March 2024.

  19. arXiv:2403.04233  [pdf, other

    cs.CL cs.AI

    DEEP-ICL: Definition-Enriched Experts for Language Model In-Context Learning

    Authors: Xingwei Qu, Yiming Liang, Yucheng Wang, Tianyu Zheng, Tommy Yue, Lei Ma, Stephen W. Huang, Jiajun Zhang, Yinan Shi, Chenghua Lin, Jie Fu, Ge Zhang

    Abstract: It has long been assumed that the sheer number of parameters in large language models (LLMs) drives in-context learning (ICL) capabilities, enabling remarkable performance improvements by leveraging task-specific demonstrations. Challenging this hypothesis, we introduce DEEP-ICL, a novel task Definition Enriched ExPert Ensembling methodology for ICL. DEEP-ICL explicitly extracts task definitions f… ▽ More

    Submitted 16 June, 2024; v1 submitted 7 March, 2024; originally announced March 2024.

  20. arXiv:2403.03954  [pdf, other

    cs.RO cs.CV cs.LG

    3D Diffusion Policy: Generalizable Visuomotor Policy Learning via Simple 3D Representations

    Authors: Yanjie Ze, Gu Zhang, Kangning Zhang, Chenyuan Hu, Muhan Wang, Huazhe Xu

    Abstract: Imitation learning provides an efficient way to teach robots dexterous skills; however, learning complex skills robustly and generalizablely usually consumes large amounts of human demonstrations. To tackle this challenging problem, we present 3D Diffusion Policy (DP3), a novel visual imitation learning approach that incorporates the power of 3D visual representations into diffusion policies, a cl… ▽ More

    Submitted 8 June, 2024; v1 submitted 6 March, 2024; originally announced March 2024.

    Comments: Published at Robotics: Science and Systems (RSS) 2024. Videos, code, and data: https://3d-diffusion-policy.github.io

  21. arXiv:2403.03500  [pdf, other

    hep-ex

    Observation of the decay $h_{c}\to3(π^{+}π^{-})π^{0}$

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, O. Afedulidis, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere , et al. (639 additional authors not shown)

    Abstract: Based on $(2712.4\pm14.1)\times10^{6}$ $ψ(3686)$ events collected with the BESIII detector, we study the decays $h_{c}\to3(π^{+}π^{-})π^{0}$, $h_{c}\to2(π^{+}π^{-})ω$, $h_{c}\to2(π^{+}π^{-})π^{0}η$, $h_{c}\to2(π^{+}π^{-})η$, and $h_{c}\to p\bar{p}$ via $ψ(3686)\toπ^{0}h_{c}$. The decay channel $h_{c}\to3(π^{+}π^{-})π^{0}$ is observed for the first time, and its branching fraction is determined to… ▽ More

    Submitted 6 March, 2024; originally announced March 2024.

    Comments: 11 pages, 3 figures

  22. arXiv:2403.03416  [pdf, other

    eess.SY

    On discrete-time polynomial dynamical systems on hypergraphs

    Authors: Shaoxuan Cui, Guofeng Zhang, Hildeberto Jardón-Kojakhmetov, Ming Cao

    Abstract: This paper studies the stability of discrete-time polynomial dynamical systems on hypergraphs by utilizing the Perron-Frobenius theorem for nonnegative tensors with respect to the tensors Z-eigenvalues and Z-eigenvectors. Firstly, for a multilinear polynomial system on a uniform hypergraph, we study the stability of the origin of the corresponding systems. Next, we extend our results to non-homoge… ▽ More

    Submitted 5 June, 2024; v1 submitted 5 March, 2024; originally announced March 2024.

    Comments: arXiv admin note: text overlap with arXiv:2401.03652

  23. arXiv:2403.02914  [pdf, ps, other

    cs.AI

    DynST: Dynamic Sparse Training for Resource-Constrained Spatio-Temporal Forecasting

    Authors: Hao Wu, Haomin Wen, Guibin Zhang, Yutong Xia, Kai Wang, Yuxuan Liang, Yu Zheng, Kun Wang

    Abstract: The ever-increasing sensor service, though opening a precious path and providing a deluge of earth system data for deep-learning-oriented earth science, sadly introduce a daunting obstacle to their industrial level deployment. Concretely, earth science systems rely heavily on the extensive deployment of sensors, however, the data collection from sensors is constrained by complex geographical and s… ▽ More

    Submitted 5 March, 2024; originally announced March 2024.

  24. arXiv:2403.02874  [pdf, other

    astro-ph.HE

    The bright black hole X-ray binary 4U 1543-47 during 2021 outburst. A clear state transition from super-Eddington to sub-Eddington accretion revealed by Insight-HXMT

    Authors: Pei **, Guobao Zhang, Yuexin Zhang, Mariano Méndez, **lu Qu, David M. Russell, Jiancheng Wang, Shuangnan Zhang, Yi-Jung Yang, Shumei Jia, Zixu Yang, Hexin Liu

    Abstract: We present a detailed analysis of the observations with the Hard X-ray Modulation Telescope of the black hole X-ray transient 4U~1543-47 during its outburst in 2021. We find a clear state transition during the outburst decay of the source. Using previous measurements of the black-hole mass and distance to the source, the source luminosity during this transition is close to the Eddington limit. The… ▽ More

    Submitted 5 March, 2024; originally announced March 2024.

  25. arXiv:2403.02710  [pdf, other

    cs.CV cs.RO

    FastOcc: Accelerating 3D Occupancy Prediction by Fusing the 2D Bird's-Eye View and Perspective View

    Authors: Jiawei Hou, Xiaoyan Li, Wenhao Guan, Gang Zhang, Di Feng, Yuheng Du, Xiangyang Xue, Jian Pu

    Abstract: In autonomous driving, 3D occupancy prediction outputs voxel-wise status and semantic labels for more comprehensive understandings of 3D scenes compared with traditional perception tasks, such as 3D object detection and bird's-eye view (BEV) semantic segmentation. Recent researchers have extensively explored various aspects of this task, including view transformation techniques, ground-truth label… ▽ More

    Submitted 5 March, 2024; originally announced March 2024.

    Comments: Accepted by ICRA 2024

  26. arXiv:2403.02622  [pdf, other

    cs.LG cs.AI cs.RO

    World Models for Autonomous Driving: An Initial Survey

    Authors: Yanchen Guan, Haicheng Liao, Zhenning Li, Jia Hu, Runze Yuan, Yunjian Li, Guohui Zhang, Chengzhong Xu

    Abstract: In the rapidly evolving landscape of autonomous driving, the capability to accurately predict future events and assess their implications is paramount for both safety and efficiency, critically aiding the decision-making process. World models have emerged as a transformative approach, enabling autonomous driving systems to synthesize and interpret vast amounts of sensor data, thereby predicting po… ▽ More

    Submitted 7 May, 2024; v1 submitted 4 March, 2024; originally announced March 2024.

  27. arXiv:2403.01761  [pdf, other

    hep-ex

    Observation of $ψ(3686)\to 3φ$

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, O. Afedulidis, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere , et al. (645 additional authors not shown)

    Abstract: Using $(2.712\pm0.014)\times 10^9$ $ψ(3686)$ events collected by the BESIII detector operating at the BEPCII collider, we report the first observation of $ψ(3686)\to 3φ$ decay with a significance larger than 10$σ$. The branching fraction of this decay is determined to be $(1.46\pm0.05\pm0.17)\times10^{-5}$, where the first uncertainty is statistical and the second is systematic. No significant str… ▽ More

    Submitted 4 March, 2024; originally announced March 2024.

  28. arXiv:2403.01741  [pdf, ps, other

    math.AP

    Local regularity for inhomogeneous parabolic systems with a skew-symmetric part in BMO

    Authors: Guoming Zhang

    Abstract: In this note we aim to establish the local regularity for weak solutions to inhomogeneous parabolic systems having a real and anti-symmetric part in BMO, which can be seen as a generalization of the corresponding results for parabolic systems with bounded coefficients in the work of Auscher, Bortz, Egert and Saari [J. Math. Pures Appl.(9) 2019].

    Submitted 17 April, 2024; v1 submitted 4 March, 2024; originally announced March 2024.

  29. arXiv:2403.01515  [pdf, other

    physics.bio-ph cond-mat.soft

    Cell sorting by active forces in a phase-field model of cell monolayers

    Authors: James N. Graham, Guanming Zhang, Julia M. Yeomans

    Abstract: Cell sorting, the segregation of cells with different properties into distinct domains, is a key phenomenon in biological processes such as embryogenesis. We use a phase-field model of a confluent cell layer to study the role of activity in cell sorting. We find that a mixture of cells with extensile or contractile dipolar activity, and which are identical apart from their activity, quickly sort i… ▽ More

    Submitted 3 March, 2024; originally announced March 2024.

    Comments: 6 pages, 3 figures

  30. arXiv:2403.01179  [pdf, other

    quant-ph

    Optomechanical cooling with simultaneous intracavity and extracavity squeezed light

    Authors: S. S. Zheng, F. X. Sun, M. Asjad, G. W. Zhang, J. Huo, J. Li, J. Zhou, Z. Ma, Q. Y. He

    Abstract: We propose a novel and experimentally feasible approach to achieve high-efficiency ground-state cooling of a mechanical oscillator in an optomechanical system under the deeply unresolved sideband condition with the assistance of both intracavity and extracavity squeezing. In the scheme, a degenerate optical parametric amplifier is placed inside the optical cavity, generating the intracavity squeez… ▽ More

    Submitted 2 March, 2024; originally announced March 2024.

    Comments: 12 pages, 5 figures

  31. arXiv:2402.19274  [pdf, other

    cond-mat.mtrl-sci

    Mixed-halide perovskite alloys $\text{CsPb}(\text{I}_{1-x}^{}\text{Br}_x^{})_3^{}$ and $\text{CsPb}(\text{Br}_{1-x}^{}\text{Cl}_x^{})_3^{}$: New insight of configuration entropy effect from first principles and phase diagrams

    Authors: Fang Pan, Junni Zhai, **yu Chen, Lin Yang, Hua Dong, Fang Yuan, Zhuangde Jiang, Wei Ren, Zuo-Guang Ye, Guo-Xu Zhang, **grui Li

    Abstract: Stability is one of the key issues in mixed-halide perovskite alloys which are promising in emergent optoelectronics. Previous density-functional-theory (DFT) and machine learning studies indicate that the formation-energy convex hulls of these materials are very shallow, and stable alloy compositions are rare. In this work, we revisit this problem using DFT with special focus on the effects of co… ▽ More

    Submitted 29 February, 2024; originally announced February 2024.

  32. arXiv:2402.18595  [pdf, other

    cs.AR cs.CE cs.LG

    EncodingNet: A Novel Encoding-based MAC Design for Efficient Neural Network Acceleration

    Authors: Bo Liu, Grace Li Zhang, Xunzhao Yin, Ulf Schlichtmann, Bing Li

    Abstract: Deep neural networks (DNNs) have achieved great breakthroughs in many fields such as image classification and natural language processing. However, the execution of DNNs needs to conduct massive numbers of multiply-accumulate (MAC) operations on hardware and thus incurs a large power consumption. To address this challenge, we propose a novel digital MAC design based on encoding. In this new design… ▽ More

    Submitted 25 February, 2024; originally announced February 2024.

  33. arXiv:2402.17726  [pdf, other

    cs.CV

    VRP-SAM: SAM with Visual Reference Prompt

    Authors: Yanpeng Sun, Jiahui Chen, Shan Zhang, Xinyu Zhang, Qiang Chen, Gang Zhang, Errui Ding, **gdong Wang, Zechao Li

    Abstract: In this paper, we propose a novel Visual Reference Prompt (VRP) encoder that empowers the Segment Anything Model (SAM) to utilize annotated reference images as prompts for segmentation, creating the VRP-SAM model. In essence, VRP-SAM can utilize annotated reference images to comprehend specific objects and perform segmentation of specific objects in target image. It is note that the VRP encoder ca… ▽ More

    Submitted 30 March, 2024; v1 submitted 27 February, 2024; originally announced February 2024.

    Comments: Accepted by CVPR 2024; The camera-ready version

  34. arXiv:2402.17298  [pdf, other

    cs.CV

    ArcSin: Adaptive ranged cosine Similarity injected noise for Language-Driven Visual Tasks

    Authors: Yang Liu, Xiaomin Yu, Gongyu Zhang, Christos Bergeles, Prokar Dasgupta, Alejandro Granados, Sebastien Ourselin

    Abstract: In this study, we address the challenging task of bridging the modality gap between learning from language and inference for visual tasks, including Visual Question Answering (VQA), Image Captioning (IC) and Visual Entailment (VE). We train models for these tasks in a zero-shot cross-modal transfer setting, a domain where the previous state-of-the-art method relied on the fixed scale noise injecti… ▽ More

    Submitted 27 February, 2024; originally announced February 2024.

  35. arXiv:2402.16671  [pdf, other

    cs.CL

    StructLM: Towards Building Generalist Models for Structured Knowledge Grounding

    Authors: Alex Zhuang, Ge Zhang, Tianyu Zheng, Xinrun Du, Junjie Wang, Weiming Ren, Stephen W. Huang, Jie Fu, Xiang Yue, Wenhu Chen

    Abstract: Structured data sources, such as tables, graphs, and databases, are ubiquitous knowledge sources. Despite the demonstrated capabilities of large language models (LLMs) on plain text, their proficiency in interpreting and utilizing structured data remains limited. Our investigation reveals a notable deficiency in LLMs' ability to process structured data, e.g., ChatGPT lags behind state-of-the-art (… ▽ More

    Submitted 24 April, 2024; v1 submitted 26 February, 2024; originally announced February 2024.

    Comments: Technical Report

  36. arXiv:2402.16297  [pdf, other

    cs.LG cs.AI

    A Poisson-Gamma Dynamic Factor Model with Time-Varying Transition Dynamics

    Authors: Jiahao Wang, Sikun Yang, Heinz Koeppl, Xiuzhen Cheng, Pengfei Hu, Guoming Zhang

    Abstract: Probabilistic approaches for handling count-valued time sequences have attracted amounts of research attentions because their ability to infer explainable latent structures and to estimate uncertainties, and thus are especially suitable for dealing with \emph{noisy} and \emph{incomplete} count data. Among these models, Poisson-Gamma Dynamical Systems (PGDSs) are proven to be effective in capturing… ▽ More

    Submitted 23 May, 2024; v1 submitted 25 February, 2024; originally announced February 2024.

  37. arXiv:2402.16153  [pdf, other

    cs.SD cs.AI cs.CL cs.LG cs.MM eess.AS

    ChatMusician: Understanding and Generating Music Intrinsically with LLM

    Authors: Ruibin Yuan, Hanfeng Lin, Yi Wang, Zeyue Tian, Shangda Wu, Tianhao Shen, Ge Zhang, Yuhang Wu, Cong Liu, Ziya Zhou, Ziyang Ma, Liumeng Xue, Ziyu Wang, Qin Liu, Tianyu Zheng, Yizhi Li, Yinghao Ma, Yiming Liang, Xiaowei Chi, Ruibo Liu, Zili Wang, Pengfei Li, **gcheng Wu, Chenghua Lin, Qifeng Liu , et al. (10 additional authors not shown)

    Abstract: While Large Language Models (LLMs) demonstrate impressive capabilities in text generation, we find that their ability has yet to be generalized to music, humanity's creative language. We introduce ChatMusician, an open-source LLM that integrates intrinsic musical abilities. It is based on continual pre-training and finetuning LLaMA2 on a text-compatible music representation, ABC notation, and the… ▽ More

    Submitted 25 February, 2024; originally announced February 2024.

    Comments: GitHub: https://shanghaicannon.github.io/ChatMusician/

  38. arXiv:2402.15414  [pdf, other

    cs.LG cs.CV

    Does Combining Parameter-efficient Modules Improve Few-shot Transfer Accuracy?

    Authors: Nader Asadi, Mahdi Beitollahi, Yasser Khalil, Yinchuan Li, Guojun Zhang, Xi Chen

    Abstract: Parameter-efficient fine-tuning stands as the standard for efficiently fine-tuning large language and vision models on downstream tasks. Specifically, the efficiency of low-rank adaptation has facilitated the creation and sharing of hundreds of custom LoRA modules, each trained on distinct data from various downstream tasks. In this paper, we explore the composability of LoRA modules, examining if… ▽ More

    Submitted 23 February, 2024; originally announced February 2024.

  39. arXiv:2402.15187  [pdf

    nucl-ex physics.plasm-ph

    Ultra-short lifetime isomer studies from photonuclear reactions using laser-driven ultra-intense γ-ray

    Authors: Di Wu, Haoyang Lan, Jiaxing Liu, Huangang Lu, Jianyao Zhang, Jianfeng Lv, Xuezhi Wu, Hui Zhang, Yadong Xia, Qiangyou He, Jie Cai, Qianyi Ma, Yuhui Xia, Zhenan Wang, Meizhi Wang, Zhiyan Yang, Xinlu Xu, Yixing Geng, Chen Lin, Wenjun Ma, Yanying Zhao, Haoran Wang, Fulong Liu, Chuangye He, **qing Yu , et al. (7 additional authors not shown)

    Abstract: Isomers, ubiquitous populations of relatively long-lived nuclear excited states, play a crucial role in nuclear physics. However, isomers with half-life times of several seconds or less barely had experimental cross section data due to the lack of a suitable measuring method. We report a method of online γ spectroscopy for ultra-short-lived isomers from photonuclear reactions using laser-driven ul… ▽ More

    Submitted 23 February, 2024; originally announced February 2024.

  40. arXiv:2402.14708  [pdf, other

    cs.LG cs.AI q-fin.ST

    CaT-GNN: Enhancing Credit Card Fraud Detection via Causal Temporal Graph Neural Networks

    Authors: Yifan Duan, Guibin Zhang, Shilong Wang, Xiaojiang Peng, Wang Ziqi, Junyuan Mao, Hao Wu, Xinke Jiang, Kun Wang

    Abstract: Credit card fraud poses a significant threat to the economy. While Graph Neural Network (GNN)-based fraud detection methods perform well, they often overlook the causal effect of a node's local structure on predictions. This paper introduces a novel method for credit card fraud detection, the \textbf{\underline{Ca}}usal \textbf{\underline{T}}emporal \textbf{\underline{G}}raph \textbf{\underline{N}… ▽ More

    Submitted 22 February, 2024; originally announced February 2024.

  41. arXiv:2402.14658  [pdf, other

    cs.SE cs.AI cs.CL

    OpenCodeInterpreter: Integrating Code Generation with Execution and Refinement

    Authors: Tianyu Zheng, Ge Zhang, Tianhao Shen, Xueling Liu, Bill Yuchen Lin, Jie Fu, Wenhu Chen, Xiang Yue

    Abstract: The introduction of large language models has significantly advanced code generation. However, open-source models often lack the execution capabilities and iterative refinement of advanced systems like the GPT-4 Code Interpreter. To address this, we introduce OpenCodeInterpreter, a family of open-source code systems designed for generating, executing, and iteratively refining code. Supported by Co… ▽ More

    Submitted 27 February, 2024; v1 submitted 22 February, 2024; originally announced February 2024.

  42. arXiv:2402.14493  [pdf, ps, other

    cs.DS

    An Improved Pseudopolynomial Time Algorithm for Subset Sum

    Authors: Lin Chen, Jiayi Lian, Yuchen Mao, Guochuan Zhang

    Abstract: We investigate pseudo-polynomial time algorithms for Subset Sum. Given a multi-set $X$ of $n$ positive integers and a target $t$, Subset Sum asks whether some subset of $X$ sums to $t$. Bringmann proposes an $\tilde{O}(n + t)$-time algorithm [Bringmann SODA'17], and an open question has naturally arisen: can Subset Sum be solved in $O(n + w)$ time? Here $w$ is the maximum integer in $X$. We make a… ▽ More

    Submitted 4 April, 2024; v1 submitted 22 February, 2024; originally announced February 2024.

    Comments: In first version, we falsely claimed that our algorithm is also able to reconstruct a subset that sums to t. In the latest version, we removed this false claim and explained why we cannot do reconstruction

  43. arXiv:2402.14323  [pdf, other

    cs.SE cs.AI

    REPOFUSE: Repository-Level Code Completion with Fused Dual Context

    Authors: Ming Liang, Xiaoheng Xie, Gehao Zhang, Xun** Zheng, Peng Di, wei jiang, Hongwei Chen, Chengpeng Wang, Gang Fan

    Abstract: The success of language models in code assistance has spurred the proposal of repository-level code completion as a means to enhance prediction accuracy, utilizing the context from the entire codebase. However, this amplified context can inadvertently increase inference latency, potentially undermining the developer experience and deterring tool adoption - a challenge we termed the Context-Latency… ▽ More

    Submitted 22 February, 2024; v1 submitted 22 February, 2024; originally announced February 2024.

  44. arXiv:2402.13750  [pdf, other

    cs.IR cs.AI cs.CL

    Breaking the Barrier: Utilizing Large Language Models for Industrial Recommendation Systems through an Inferential Knowledge Graph

    Authors: Qian Zhao, Hao Qian, Ziqi Liu, Gong-Duo Zhang, Lihong Gu

    Abstract: Recommendation systems are widely used in e-commerce websites and online platforms to address information overload. However, existing systems primarily rely on historical data and user feedback, making it difficult to capture user intent transitions. Recently, Knowledge Base (KB)-based models are proposed to incorporate expert knowledge, but it struggle to adapt to new items and the evolving e-com… ▽ More

    Submitted 21 February, 2024; originally announced February 2024.

    Comments: 9 pages, 5 figures

  45. arXiv:2402.13491  [pdf, other

    math.OC

    Algebraic Riccati Tensor Equations with Applications in Multilinear Control Systems

    Authors: Yuchao Wang, Yimin Wei, Guofeng Zhang, Shih Yu Chang

    Abstract: In a recent interesting paper [8], Chen et al. initialized the control-theoretic study of a class of discrete-time multilinear time-invariant (MLTI) control systems, where system states, inputs and outputs are all tensors endowed with the Einstein product. Criteria for fundamental system-theoretic notions such as stability, reachability and observability are established by means of tensor decompos… ▽ More

    Submitted 20 February, 2024; originally announced February 2024.

    Comments: 25 pages, 6 figures

    MSC Class: 15A69; 93B35; 93C05; 93D15

  46. arXiv:2402.13145  [pdf, other

    cs.CL cs.AI

    CMDAG: A Chinese Metaphor Dataset with Annotated Grounds as CoT for Boosting Metaphor Generation

    Authors: Yujie Shao, Xinrong Yao, Xingwei Qu, Chenghua Lin, Shi Wang, Stephen W. Huang, Ge Zhang, Jie Fu

    Abstract: Metaphor is a prominent linguistic device in human language and literature, as they add color, imagery, and emphasis to enhance effective communication. This paper introduces a large-scale high quality annotated Chinese Metaphor Corpus, which comprises around 28K sentences drawn from a diverse range of Chinese literary sources, such as poems, prose, song lyrics, etc. To ensure the accuracy and con… ▽ More

    Submitted 20 February, 2024; v1 submitted 20 February, 2024; originally announced February 2024.

  47. arXiv:2402.13109  [pdf, other

    cs.CL cs.AI

    CIF-Bench: A Chinese Instruction-Following Benchmark for Evaluating the Generalizability of Large Language Models

    Authors: Yizhi LI, Ge Zhang, Xingwei Qu, Jiali Li, Zhaoqun Li, Zekun Wang, Hao Li, Ruibin Yuan, Yinghao Ma, Kai Zhang, Wangchunshu Zhou, Yiming Liang, Lei Zhang, Lei Ma, Jiajun Zhang, Zuowen Li, Stephen W. Huang, Chenghua Lin, Jie Fu

    Abstract: The advancement of large language models (LLMs) has enhanced the ability to generalize across a wide range of unseen natural language processing (NLP) tasks through instruction-following. Yet, their effectiveness often diminishes in low-resource languages like Chinese, exacerbated by biased evaluations from data leakage, casting doubt on their true generalizability to new linguistic territories. I… ▽ More

    Submitted 4 June, 2024; v1 submitted 20 February, 2024; originally announced February 2024.

    Comments: Camera-ready version for ACL 2024. Project page at https://yizhilll.github.io/CIF-Bench/

  48. arXiv:2402.12845  [pdf, other

    cs.AI cs.GT

    MORE-3S:Multimodal-based Offline Reinforcement Learning with Shared Semantic Spaces

    Authors: Tianyu Zheng, Ge Zhang, Xingwei Qu, Ming Kuang, Stephen W. Huang, Zhaofeng He

    Abstract: Drawing upon the intuition that aligning different modalities to the same semantic embedding space would allow models to understand states and actions more easily, we propose a new perspective to the offline reinforcement learning (RL) challenge. More concretely, we transform it into a supervised learning task by integrating multimodal and pre-trained language models. Our approach incorporates sta… ▽ More

    Submitted 20 February, 2024; originally announced February 2024.

  49. arXiv:2402.12797  [pdf, other

    cs.CV cs.CG

    A Geometric Algorithm for Tubular Shape Reconstruction from Skeletal Representation

    Authors: Guoqing Zhang, Yang Li

    Abstract: We introduce a novel approach for the reconstruction of tubular shapes from skeletal representations. Our method processes all skeletal points as a whole, eliminating the need for splitting input structure into multiple segments. We represent the tubular shape as a truncated signed distance function (TSDF) in a voxel hashing manner, in which the signed distance between a voxel center and the objec… ▽ More

    Submitted 1 July, 2024; v1 submitted 20 February, 2024; originally announced February 2024.

    Comments: 9 pages (without reference), 6 figures

  50. arXiv:2402.12226  [pdf, other

    cs.CL cs.AI cs.CV cs.LG

    AnyGPT: Unified Multimodal LLM with Discrete Sequence Modeling

    Authors: Jun Zhan, Junqi Dai, Jiasheng Ye, Yunhua Zhou, Dong Zhang, Zhigeng Liu, Xin Zhang, Ruibin Yuan, Ge Zhang, Linyang Li, Hang Yan, Jie Fu, Tao Gui, Tianxiang Sun, Yugang Jiang, Xipeng Qiu

    Abstract: We introduce AnyGPT, an any-to-any multimodal language model that utilizes discrete representations for the unified processing of various modalities, including speech, text, images, and music. AnyGPT can be trained stably without any alterations to the current large language model (LLM) architecture or training paradigms. Instead, it relies exclusively on data-level preprocessing, facilitating the… ▽ More

    Submitted 7 March, 2024; v1 submitted 19 February, 2024; originally announced February 2024.

    Comments: 28 pages, 16 figures, under review, work in progress