
Showing 201–250 of 2,116 results for author: Gao, J

  1. arXiv:2401.15584  [pdf, other]

    cs.LG

    DGNN: Decoupled Graph Neural Networks with Structural Consistency between Attribute and Graph Embedding Representations

    Authors: Jinlu Wang, Jipeng Guo, Yanfeng Sun, Junbin Gao, Shaofan Wang, Yachao Yang, Baocai Yin

    Abstract: Graph neural networks (GNNs) demonstrate a robust capability for representation learning on graphs with complex structures, showcasing superior performance in various applications. The majority of existing GNNs employ a graph convolution operation by using both attribute and structure information through coupled learning. In essence, GNNs, from an optimization perspective, seek to learn a consensu…

    Submitted 28 January, 2024; originally announced January 2024.

  2. arXiv:2401.14580  [pdf, other]

    cs.LG

    Design Your Own Universe: A Physics-Informed Agnostic Method for Enhancing Graph Neural Networks

    Authors: Dai Shi, Andi Han, Lequan Lin, Yi Guo, Zhiyong Wang, Junbin Gao

    Abstract: Physics-informed Graph Neural Networks have achieved remarkable performance in learning through graph-structured data by mitigating common GNN challenges such as over-smoothing, over-squashing, and heterophily adaption. Despite these advancements, the development of a simple yet effective paradigm that appropriately integrates previous methods for handling all these challenges is still underway. I…

    Submitted 12 June, 2024; v1 submitted 25 January, 2024; originally announced January 2024.

  3. arXiv:2401.13986  [pdf, other]

    cs.CL cs.AI cs.LG

    Towards Consistent Natural-Language Explanations via Explanation-Consistency Finetuning

    Authors: Yanda Chen, Chandan Singh, Xiaodong Liu, Simiao Zuo, Bin Yu, He He, Jianfeng Gao

    Abstract: Large language models (LLMs) often generate convincing, fluent explanations. However, different from humans, they often generate inconsistent explanations on different inputs. For example, an LLM may generate the explanation "all birds can fly" when answering the question "Can sparrows fly?" but meanwhile answer "no" to the related question "Can penguins fly?". Explanations should be consistent ac…

    Submitted 25 January, 2024; originally announced January 2024.

    Comments: arXiv admin note: text overlap with arXiv:2307.08678

  4. arXiv:2401.13366  [pdf, other]

    cs.LG

    Mitigating System Bias in Resource Constrained Asynchronous Federated Learning Systems

    Authors: Jikun Gao, Ioannis Mavromatis, Peizheng Li, Pietro Carnelli, Aftab Khan

    Abstract: Federated learning (FL) systems face performance challenges in dealing with heterogeneous devices and non-identically distributed data across clients. We propose a dynamic global model aggregation method within Asynchronous Federated Learning (AFL) deployments to address these issues. Our aggregation method scores and adjusts the weighting of client model updates based on their upload frequency to…

    Submitted 1 February, 2024; v1 submitted 24 January, 2024; originally announced January 2024.

    Comments: 6 pages, 5 figures. This work has been accepted by PerCom PerconAI workshop 2024

  5. arXiv:2401.12881  [pdf, other]

    cs.DS cs.CG

    Computing Diameter+2 in Truly Subquadratic Time for Unit-Disk Graphs

    Authors: Hsien-Chih Chang, Jie Gao, Hung Le

    Abstract: Finding the diameter of a graph in general cannot be done in truly subquadratic time assuming the Strong Exponential Time Hypothesis (SETH), even when the underlying graph is unweighted and sparse. When restricting to concrete classes of graphs and assuming SETH, planar graphs and minor-free graphs admit truly subquadratic algorithms, while geometric intersection graphs of unit balls, congruent equilat…

    Submitted 23 January, 2024; originally announced January 2024.

    Comments: 28 pages, 7 figures

  6. arXiv:2401.12813  [pdf, other]

    astro-ph.GA astro-ph.IM gr-qc

    Bayesian parameter estimation of massive black hole binaries with TianQin-LISA

    Authors: Jie Gao, Yi-Ming Hu, En-Kun Li, Jian-dong Zhang, Jianwei Mei

    Abstract: This paper analyses the impact of various parameter changes on the estimation of parameters for massive black hole binary (MBHB) systems using a Bayesian inference technique. Several designed MBHB systems were chosen for comparison with a fiducial system to explore the influence of parameters such as sky location, inclination angle, anti-spin, large mass ratio and light mass. And the two reported…

    Submitted 23 January, 2024; originally announced January 2024.

    Comments: 17 pages, 10 figures

  7. arXiv:2401.12137  [pdf, ps, other]

    math.DG

    Generalized Minkowski formulas and rigidity results for anisotropic capillary hypersurfaces

    Authors: Jinyu Gao, Guanghan Li

    Abstract: We show the generalization of Hsiung-Minkowski integral formula for anisotropic capillary hypersurfaces in the half-space, which includes the weighted Hsiung-Minkowski formula and classical anisotropic Minkowski identity for closed hypersurfaces as special cases. As applications, we prove some anisotropic Alexandrov-type theorems and rigidity results for anisotropic capillary hypersurfaces. Specia…

    Submitted 22 January, 2024; originally announced January 2024.

    Comments: 16 pages

    MSC Class: 53A10; 53C24; 53C40

  8. arXiv:2401.10820  [pdf, other]

    cs.HC

    Help Me Reflect: Leveraging Self-Reflection Interface Nudges to Enhance Deliberativeness on Online Deliberation Platforms

    Authors: Shun Yi Yeo, Gionnieve Lim, Jie Gao, Weiyu Zhang, Simon Tangi Perrault

    Abstract: The deliberative potential of online platforms has been widely examined. However, little is known about how various interface-based reflection nudges impact the quality of deliberation. This paper presents two user studies with 12 and 120 participants, respectively, to investigate the impacts of different reflective nudges on the quality of deliberation. In the first study, we examined five distin…

    Submitted 19 January, 2024; originally announced January 2024.

  9. NWPU-MOC: A Benchmark for Fine-grained Multi-category Object Counting in Aerial Images

    Authors: Junyu Gao, Liangliang Zhao, Xuelong Li

    Abstract: Object counting is a hot topic in computer vision, which aims to estimate the number of objects in a given image. However, most methods only count objects of a single category for an image, which cannot be applied to scenes that need to count objects with multiple categories simultaneously, especially in aerial scenes. To this end, this paper introduces a Multi-category Object Counting (MOC) task…

    Submitted 19 January, 2024; originally announced January 2024.

  10. arXiv:2401.08119  [pdf, other]

    cs.LG

    SpecSTG: A Fast Spectral Diffusion Framework for Probabilistic Spatio-Temporal Traffic Forecasting

    Authors: Lequan Lin, Dai Shi, Andi Han, Junbin Gao

    Abstract: Traffic forecasting, a crucial application of spatio-temporal graph (STG) learning, has traditionally relied on deterministic models for accurate point estimations. Yet, these models fall short of identifying latent risks of unexpected volatility in future observations. To address this gap, probabilistic methods, especially variants of diffusion models, have emerged as uncertainty-aware solutions.…

    Submitted 23 January, 2024; v1 submitted 16 January, 2024; originally announced January 2024.

  11. arXiv:2401.08045  [pdf, other]

    cs.CV

    Forging Vision Foundation Models for Autonomous Driving: Challenges, Methodologies, and Opportunities

    Authors: Xu Yan, Haiming Zhang, Yingjie Cai, Jingming Guo, Weichao Qiu, Bin Gao, Kaiqiang Zhou, Yue Zhao, Huan Jin, Jiantao Gao, Zhen Li, Lihui Jiang, Wei Zhang, Hongbo Zhang, Dengxin Dai, Bingbing Liu

    Abstract: The rise of large foundation models, trained on extensive datasets, is revolutionizing the field of AI. Models such as SAM, DALL-E2, and GPT-4 showcase their adaptability by extracting intricate patterns and performing effectively across diverse tasks, thereby serving as potent building blocks for a wide range of AI applications. Autonomous driving, a vibrant front in AI applications, remains chal…

    Submitted 15 January, 2024; originally announced January 2024.

    Comments: Github Repo: https://github.com/zhanghm1995/Forge_VFM4AD

  12. arXiv:2401.06374  [pdf, other]

    cs.CV

    SamLP: A Customized Segment Anything Model for License Plate Detection

    Authors: Haoxuan Ding, Junyu Gao, Yuan Yuan, Qi Wang

    Abstract: With the emergence of foundation models, this novel paradigm of deep learning has encouraged many powerful achievements in natural language processing and computer vision. Foundation models have many advantages, such as excellent feature extraction power, strong generalization ability, and great few-shot and zero-shot learning capacity, which are beneficial to vision tasks. As the unique id…

    Submitted 11 January, 2024; originally announced January 2024.

  13. arXiv:2401.05561  [pdf, other]

    cs.CL

    TrustLLM: Trustworthiness in Large Language Models

    Authors: Lichao Sun, Yue Huang, Haoran Wang, Siyuan Wu, Qihui Zhang, Yuan Li, Chujie Gao, Yixin Huang, Wenhan Lyu, Yixuan Zhang, Xiner Li, Zhengliang Liu, Yixin Liu, Yijue Wang, Zhikun Zhang, Bertie Vidgen, Bhavya Kailkhura, Caiming Xiong, Chaowei Xiao, Chunyuan Li, Eric Xing, Furong Huang, Hao Liu, Heng Ji, Hongyi Wang, et al. (45 additional authors not shown)

    Abstract: Large language models (LLMs), exemplified by ChatGPT, have gained considerable attention for their excellent natural language processing capabilities. Nonetheless, these LLMs present many challenges, particularly in the realm of trustworthiness. Therefore, ensuring the trustworthiness of LLMs emerges as an important topic. This paper introduces TrustLLM, a comprehensive study of trustworthiness in…

    Submitted 17 March, 2024; v1 submitted 10 January, 2024; originally announced January 2024.

    Comments: This work is still under work and we welcome your contribution

  14. arXiv:2401.03778  [pdf, other]

    astro-ph.GA astro-ph.SR

    Evolved Massive Stars at Low-metallicity VI. Mass-Loss Rate of Red Supergiant Stars in the Large Magellanic Cloud

    Authors: Jing Wen, Jian Gao, Ming Yang, Bingqiu Chen, Yi Ren, Tianding Wang, Biwei Jiang

    Abstract: Mass loss is a crucial process that affects the observational properties, evolution path and fate of highly evolved stars. However, the mechanism of mass loss is still unclear, and the mass-loss rate (MLR) of red supergiant stars (RSGs) requires further research and precise evaluation. To address this, we utilized an updated and complete sample of RSGs in the Large Magellanic Cloud (LMC) and emplo…

    Submitted 8 January, 2024; originally announced January 2024.

    Comments: Accepted for publication in AJ

  15. arXiv:2401.03568  [pdf, other]

    cs.AI cs.HC cs.LG

    Agent AI: Surveying the Horizons of Multimodal Interaction

    Authors: Zane Durante, Qiuyuan Huang, Naoki Wake, Ran Gong, Jae Sung Park, Bidipta Sarkar, Rohan Taori, Yusuke Noda, Demetri Terzopoulos, Yejin Choi, Katsushi Ikeuchi, Hoi Vo, Li Fei-Fei, Jianfeng Gao

    Abstract: Multi-modal AI systems will likely become a ubiquitous presence in our everyday lives. A promising approach to making these systems more interactive is to embody them as agents within physical and virtual environments. At present, systems leverage existing foundation models as the basic building blocks for the creation of embodied agents. Embedding agents within such environments facilitates the a…

    Submitted 25 January, 2024; v1 submitted 7 January, 2024; originally announced January 2024.

  16. arXiv:2401.02992  [pdf]

    cs.CL cs.AI

    Advanced Unstructured Data Processing for ESG Reports: A Methodology for Structured Transformation and Enhanced Analysis

    Authors: Jiahui Peng, Jing Gao, Xin Tong, Jing Guo, Hang Yang, Jianchuan Qi, Ruiqiao Li, Nan Li, Ming Xu

    Abstract: In the evolving field of corporate sustainability, analyzing unstructured Environmental, Social, and Governance (ESG) reports is a complex challenge due to their varied formats and intricate content. This study introduces an innovative methodology utilizing the "Unstructured Core Library", specifically tailored to address these challenges by transforming ESG reports into structured, analyzable for…

    Submitted 4 January, 2024; originally announced January 2024.

  17. arXiv:2401.02781  [pdf, other]

    hep-ph hep-ex nucl-th

    Simultaneous Determination of Fragmentation Functions and Test on Momentum Sum Rule

    Authors: Jun Gao, ChongYang Liu, XiaoMin Shen, Hongxi Xing, Yuxiang Zhao

    Abstract: We perform a simultaneous global analysis of hadron fragmentation functions (FFs) to various charged hadrons at next-to-leading order in QCD. The world data set includes results from electron-positron single-inclusive annihilation, semi-inclusive deep inelastic scattering, as well as proton-proton collisions including jet fragmentation measurements which lead to strong constraints on the gluon fra…

    Submitted 29 June, 2024; v1 submitted 5 January, 2024; originally announced January 2024.

    Comments: published version; link to FF grids provided

    Journal ref: Phys.Rev.Lett. 132 (2024) 26, 261903

  18. arXiv:2401.02458  [pdf, other]

    cs.LG cs.AI

    Data-Centric Foundation Models in Computational Healthcare: A Survey

    Authors: Yunkun Zhang, Jin Gao, Zheling Tan, Lingfeng Zhou, Kexin Ding, Mu Zhou, Shaoting Zhang, Dequan Wang

    Abstract: The advent of foundation models (FMs) as an emerging suite of AI techniques has struck a wave of opportunities in computational healthcare. The interactive nature of these models, guided by pre-training data and human instructions, has ignited a data-centric AI paradigm that emphasizes better data characterization, quality, and scale. In healthcare AI, obtaining and processing high-quality clinica…

    Submitted 4 January, 2024; originally announced January 2024.

  19. arXiv:2401.02099  [pdf]

    cs.CV cs.SD eess.AS

    Oceanship: A Large-Scale Dataset for Underwater Audio Target Recognition

    Authors: Zeyu Li, Suncheng Xiang, Tong Yu, Jingsheng Gao, Jiacheng Ruan, Yanping Hu, Ting Liu, Yuzhuo Fu

    Abstract: The recognition of underwater audio plays a significant role in identifying a vessel while it is in motion. Underwater target recognition tasks have a wide range of applications in areas such as marine environmental protection, detection of ship radiated noise, underwater noise control, and coastal vessel dispatch. The traditional UATR task involves training a network to extract features from audi…

    Submitted 10 June, 2024; v1 submitted 4 January, 2024; originally announced January 2024.

    Comments: Accepted by ICIC 2024

  20. arXiv:2401.01600  [pdf, other]

    cs.CL cs.AI cs.CE cs.LG

    PLLaMa: An Open-source Large Language Model for Plant Science

    Authors: Xianjun Yang, Junfeng Gao, Wenxin Xue, Erik Alexandersson

    Abstract: Large Language Models (LLMs) have exhibited remarkable capabilities in understanding and interacting with natural language across various sectors. However, their effectiveness is limited in specialized areas requiring high accuracy, such as plant science, due to a lack of specific expertise in these fields. This paper introduces PLLaMa, an open-source language model that evolved from LLaMa-2. It's…

    Submitted 3 January, 2024; originally announced January 2024.

    Comments: Work in progress

  21. arXiv:2312.17493  [pdf, other]

    cs.LG cs.CR

    Differentially Private Low-Rank Adaptation of Large Language Model Using Federated Learning

    Authors: Xiao-Yang Liu, Rongyi Zhu, Daochen Zha, Jiechao Gao, Shan Zhong, Matt White, Meikang Qiu

    Abstract: The surge in interest and application of large language models (LLMs) has sparked a drive to fine-tune these models to suit specific applications, such as finance and medical science. However, concerns regarding data privacy have emerged, especially when multiple stakeholders aim to collaboratively enhance LLMs using sensitive data. In this scenario, federated learning becomes a natural choice, al…

    Submitted 2 June, 2024; v1 submitted 29 December, 2023; originally announced December 2023.

    Comments: 21 pages, 1 figure, 19 tables

  22. arXiv:2312.17030  [pdf, other]

    eess.IV cs.CV

    Learning Multi-axis Representation in Frequency Domain for Medical Image Segmentation

    Authors: Jiacheng Ruan, Jingsheng Gao, Mingye Xie, Suncheng Xiang

    Abstract: Recently, Visual Transformer (ViT) has been extensively used in medical image segmentation (MIS) because it applies a self-attention mechanism in the spatial domain to model global knowledge. However, many studies have focused on improving models in the spatial domain while neglecting the importance of frequency domain information. Therefore, we propose Multi-axis External Weights UNet (MEW-UNet) ba…

    Submitted 28 December, 2023; originally announced December 2023.

    Comments: arXiv admin note: text overlap with arXiv:2210.14007

  23. arXiv:2312.15224  [pdf, other]

    cs.AI cs.HC

    LLM-Powered Hierarchical Language Agent for Real-time Human-AI Coordination

    Authors: Jijia Liu, Chao Yu, Jiaxuan Gao, Yuqing Xie, Qingmin Liao, Yi Wu, Yu Wang

    Abstract: AI agents powered by Large Language Models (LLMs) have made significant advances, enabling them to assist humans in diverse complex tasks and leading to a revolution in human-AI coordination. LLM-powered agents typically require invoking LLM APIs and employing artificially designed complex prompts, which results in high inference latency. While this paradigm works well in scenarios with minimal in…

    Submitted 9 January, 2024; v1 submitted 23 December, 2023; originally announced December 2023.

    Comments: This paper is accepted by AAMAS 2024. More demonstrations can be seen on our website https://sites.google.com/view/overcooked-hla/

  24. arXiv:2312.14455  [pdf, other]

    cond-mat.mtrl-sci cond-mat.str-el

    Evidence for an Excitonic Insulator State in Ta$_2$Pd$_3$Te$_5$

    Authors: Jierui Huang, Bei Jiang, Jingyu Yao, Dayu Yan, Xincheng Lei, Jiacheng Gao, Zhaopeng Guo, Feng Jin, Yupeng Li, Zhenyu Yuan, Congcong Chai, Haohao Sheng, Mojun Pan, Famin Chen, Junde Liu, Shunye Gao, Gexing Qu, Bo Liu, Zhicheng Jiang, Zhengtai Liu, Xiaoyan Ma, Shiming Zhou, Yaobo Huang, Chenxia Yun, Qingming Zhang, et al. (8 additional authors not shown)

    Abstract: The excitonic insulator (EI) is an exotic ground state of narrow-gap semiconductors and semimetals arising from spontaneous condensation of electron-hole pairs bound by attractive Coulomb interaction. Despite research on EIs dating back to half a century ago, their existence in real materials remains a subject of ongoing debate. In this study, through systematic experimental and theoretical invest…

    Submitted 14 March, 2024; v1 submitted 22 December, 2023; originally announced December 2023.

    Comments: 10 pages, 5 figures

    Journal ref: Phys. Rev. X 14, 011046, 2024

  25. arXiv:2312.13877  [pdf, other]

    quant-ph

    A complete continuous-variable quantum computation architecture: from cluster state generation to fault-tolerant accomplishment

    Authors: Peilin Du, Jing Zhang, Tiancai Zhang, Rongguo Yang, Jiangrui Gao

    Abstract: Continuous-variable measurement-based quantum computation, which requires deterministically generated large-scale cluster state, is a promising candidate for practical, scalable, universal, and fault-tolerant quantum computation. In this work, a complete architecture including cluster state preparation, gate implementations, and error correction, is demonstrated. First, a scheme for generating two…

    Submitted 31 January, 2024; v1 submitted 21 December, 2023; originally announced December 2023.

    Comments: 12 pages, 12 figures

  26. arXiv:2312.13752  [pdf]

    eess.IV cs.AI cs.CV

    Hunting imaging biomarkers in pulmonary fibrosis: Benchmarks of the AIIB23 challenge

    Authors: Yang Nan, Xiaodan Xing, Shiyi Wang, Zeyu Tang, Federico N Felder, Sheng Zhang, Roberta Eufrasia Ledda, Xiaoliu Ding, Ruiqi Yu, Wei** Liu, Feng Shi, Tianyang Sun, Zehong Cao, Minghui Zhang, Yun Gu, Hanxiao Zhang, Jian Gao, **yu Wang, Wen Tang, Pengxin Yu, Han Kang, Junqiang Chen, Xing Lu, Boyu Zhang, Michail Mamalakis , et al. (16 additional authors not shown)

    Abstract: Airway-related quantitative imaging biomarkers are crucial for examination, diagnosis, and prognosis in pulmonary diseases. However, the manual delineation of airway trees remains prohibitively time-consuming. While significant efforts have been made towards enhancing airway modelling, current publicly available datasets concentrate on lung diseases with moderate morphological variations. The intric…

    Submitted 16 April, 2024; v1 submitted 21 December, 2023; originally announced December 2023.

    Comments: 19 pages

  27. arXiv:2312.13183  [pdf, other]

    math.NA

    A framework for stable spectral methods in $d$-dimensional unit balls

    Authors: Jing Gao, Arieh Iserles

    Abstract: The subject of this paper is the design of efficient and stable spectral methods for time-dependent partial differential equations in unit balls. We commence by sketching the desired features of a spectral method, which is defined by a choice of an orthonormal basis acting in the spatial domain. We continue by considering in detail the choice of a $W$-function basis in a disc in $\mathbb{R}^2$. Th…

    Submitted 20 December, 2023; originally announced December 2023.

    MSC Class: 65M70; 42C05

  28. arXiv:2312.12970  [pdf, other]

    cs.CV

    D3Former: Jointly Learning Repeatable Dense Detectors and Feature-enhanced Descriptors via Saliency-guided Transformer

    Authors: Junjie Gao, Pengfei Wang, Qiujie Dong, Qiong Zeng, Shiqing Xin, Caiming Zhang

    Abstract: Establishing accurate and representative matches is a crucial step in addressing the point cloud registration problem. A commonly employed approach involves detecting keypoints with salient geometric features and subsequently mapping these keypoints from one frame of the point cloud to another. However, methods within this category are hampered by the repeatability of the sampled keypoints. In thi…

    Submitted 20 December, 2023; originally announced December 2023.

    Comments: 15 pages, 6 figures

  29. Solar neutrino measurements using the full data period of Super-Kamiokande-IV

    Authors: Super-Kamiokande Collaboration: K. Abe, C. Bronner, Y. Hayato, K. Hiraide, K. Hosokawa, K. Ieki, M. Ikeda, S. Imaizumi, K. Iyogi, J. Kameda, Y. Kanemura, R. Kaneshima, Y. Kashiwagi, Y. Kataoka, Y. Kato, Y. Kishimoto, S. Miki, S. Mine, M. Miura, T. Mochizuki, S. Moriyama, Y. Nagao, M. Nakahata, et al. (305 additional authors not shown)

    Abstract: An analysis of solar neutrino data from the fourth phase of Super-Kamiokande (SK-IV) from October 2008 to May 2018 is performed and the results are presented. The observation time of the data set of SK-IV corresponds to $2970$ days and the total live time for all four phases is $5805$ days. For more precise solar neutrino measurements, several improvements are applied in this analysis: lowering th…

    Submitted 20 February, 2024; v1 submitted 20 December, 2023; originally announced December 2023.

    Comments: 47 pages, 61 figures

    Journal ref: Phys. Rev. D 109, 092001 (2024)

  30. arXiv:2312.11829  [pdf, other]

    cs.CV

    RadOcc: Learning Cross-Modality Occupancy Knowledge through Rendering Assisted Distillation

    Authors: Haiming Zhang, Xu Yan, Dongfeng Bai, Jiantao Gao, Pan Wang, Bingbing Liu, Shuguang Cui, Zhen Li

    Abstract: 3D occupancy prediction is an emerging task that aims to estimate the occupancy states and semantics of 3D scenes using multi-view images. However, image-based scene perception encounters significant challenges in achieving accurate prediction due to the absence of geometric priors. In this paper, we address this issue by exploring cross-modal knowledge distillation in this task, i.e., we leverage…

    Submitted 18 December, 2023; originally announced December 2023.

    Comments: Accepted by AAAI 2024

  31. Application of AI in Nutrition

    Authors: Ritu Ramakrishnan, Tianxiang Xing, Tianfeng Chen, Ming-Hao Lee, Jinzhu Gao

    Abstract: In healthcare, artificial intelligence (AI) has been changing the way doctors and health experts take care of people. This paper will cover how AI is making major changes in the health care system, especially with nutrition. Various machine learning and deep learning algorithms have been developed to extract valuable information from healthcare data which help doctors, nutritionists, and health ex…

    Submitted 17 December, 2023; originally announced December 2023.

    Journal ref: Journal of Advances in Information Science and Technology, Volume 1, Issue 1, 2023, Pages 7-12

  32. arXiv:2312.11460  [pdf, other]

    cs.RO cs.AI cs.CV cs.LG eess.SY

    Hybrid Internal Model: Learning Agile Legged Locomotion with Simulated Robot Response

    Authors: Junfeng Long, Zirui Wang, Quanyi Li, Jiawei Gao, Liu Cao, Jiangmiao Pang

    Abstract: Robust locomotion control depends on accurate state estimations. However, the sensors of most legged robots can only provide partial and noisy observations, making the estimation particularly challenging, especially for external states like terrain frictions and elevation maps. Inspired by the classical Internal Model Control principle, we consider these external states as disturbances and introdu…

    Submitted 1 January, 2024; v1 submitted 18 December, 2023; originally announced December 2023.

    Comments: Use 1 hour to train a quadruped robot capable of traversing any terrain under any disturbances in the open world, Project Page: https://github.com/OpenRobotLab/HIMLoco

  33. arXiv:2312.11370  [pdf, other]

    cs.CL

    G-LLaVA: Solving Geometric Problem with Multi-Modal Large Language Model

    Authors: Jiahui Gao, Renjie Pi, Jipeng Zhang, Jiacheng Ye, Wanjun Zhong, Yufei Wang, Lanqing Hong, Jianhua Han, Hang Xu, Zhenguo Li, Lingpeng Kong

    Abstract: Large language models (LLMs) have shown remarkable proficiency in human-level reasoning and generation capabilities, which encourages extensive research on their application in mathematical problem solving. However, current work has been largely focused on text-based mathematical problems, with limited investigation in problems involving geometric information. Addressing this gap, we aim to enable…

    Submitted 18 December, 2023; originally announced December 2023.

    Comments: 10 pages

  34. Multi-Correlation Siamese Transformer Network with Dense Connection for 3D Single Object Tracking

    Authors: Shihao Feng, Pengpeng Liang, Jin Gao, Erkang Cheng

    Abstract: Point cloud-based 3D object tracking is an important task in autonomous driving. Though great advances regarding Siamese-based 3D tracking have been made recently, it remains challenging to learn the correlation between the template and search branches effectively with the sparse LIDAR point cloud data. Instead of performing correlation of the two branches at just one point in the network, in this…

    Submitted 18 December, 2023; originally announced December 2023.

    Comments: Preprint version for IEEE Robotics and Automation Letters (RAL)

    Journal ref: IEEE Robotics and Automation Letters (RAL), vol. 8, no. 12, pp. 8066-8073, 2023

  35. arXiv:2312.10704  [pdf, ps, other]

    math.RA math.FA

    A m-weak group inverse for rectangular matrices

    Authors: Jiale Gao, Kezheng Zuo, Qing-wen Wang

    Abstract: The purpose of this paper is to extend the definition of the m-weak group inverse from a square matrix to a rectangular matrix, called the W-weighted m-weak group inverse. This new generalized inverse is also a generalization of the weak group inverse, generalized group inverse, Drazin inverse, weighted weak group inverse and W-weighted Drazin inverse. Furthermore, we discuss some properties, char…

    Submitted 17 December, 2023; originally announced December 2023.

    Comments: 25 pages, 0 figure

    MSC Class: 15A09; 15A24

  36. arXiv:2312.09685  [pdf, other]

    cs.SE

    When Contracts Meets Crypto: Exploring Developers' Struggles with Ethereum Cryptographic APIs

    Authors: Jiashuo Zhang, Jiachi Chen, Zhiyuan Wan, Ting Chen, Jianbo Gao, Zhong Chen

    Abstract: To empower smart contracts with the promising capabilities of cryptography, Ethereum officially introduced a set of cryptographic APIs that facilitate basic cryptographic operations within smart contracts, such as elliptic curve operations. However, since developers are not necessarily cryptography experts, requiring them to directly interact with these basic APIs has caused real-world security is…

    Submitted 15 December, 2023; originally announced December 2023.

    Comments: To appear at ICSE'24

  37. arXiv:2312.08212  [pdf, other]

    cs.CV

    LAMM: Label Alignment for Multi-Modal Prompt Learning

    Authors: Jingsheng Gao, Jiacheng Ruan, Suncheng Xiang, Zefang Yu, Ke Ji, Mingye Xie, Ting Liu, Yuzhuo Fu

    Abstract: With the success of pre-trained visual-language (VL) models such as CLIP in visual representation tasks, transferring pre-trained models to downstream tasks has become a crucial paradigm. Recently, the prompt tuning paradigm, which draws inspiration from natural language processing (NLP), has made significant progress in VL field. However, preceding methods mainly focus on constructing prompt temp…

    Submitted 13 December, 2023; originally announced December 2023.

    Comments: Accepted at AAAI 2024 Main Conference

  38. arXiv:2312.07485  [pdf, other]

    cs.CV

    MinD-3D: Reconstruct High-quality 3D objects in Human Brain

    Authors: Jianxiong Gao, Yuqian Fu, Yun Wang, Xuelin Qian, Jianfeng Feng, Yanwei Fu

    Abstract: In this paper, we introduce Recon3DMind, an innovative task aimed at reconstructing 3D visuals from Functional Magnetic Resonance Imaging (fMRI) signals, marking a significant advancement in the fields of cognitive neuroscience and computer vision. To support this pioneering task, we present the fMRI-Shape dataset, which includes data from 14 participants and features 360-degree videos of 3D objec…

    Submitted 21 March, 2024; v1 submitted 12 December, 2023; originally announced December 2023.

    Comments: 26 pages, 13 figures

  39. arXiv:2312.07255  [pdf, other]

    cs.CL cs.CV

    GIST: Improving Parameter Efficient Fine Tuning via Knowledge Interaction

    Authors: Jiacheng Ruan, Jingsheng Gao, Mingye Xie, Suncheng Xiang, Zefang Yu, Ting Liu, Yuzhuo Fu

    Abstract: The Parameter-Efficient Fine-Tuning (PEFT) method, which adjusts or introduces fewer trainable parameters to calibrate pre-trained models on downstream tasks, has become a recent research interest. However, existing PEFT methods within the traditional fine-tuning framework have two main shortcomings: 1) They overlook the explicit association between trainable parameters and downstream task knowle…

    Submitted 12 December, 2023; originally announced December 2023.

    Comments: 17 pages, 8 figures, 22 tables, Work in progress

  40. arXiv:2312.05758  [pdf, other]

    cs.LG stat.AP

    CLeaRForecast: Contrastive Learning of High-Purity Representations for Time Series Forecasting

    Authors: Jiaxin Gao, Yuxiao Hu, Qinglong Cao, Siqi Dai, Yuntian Chen

    Abstract: Time series forecasting (TSF) holds significant importance in modern society, spanning numerous domains. Previous representation learning-based TSF algorithms typically embrace a contrastive learning paradigm featuring segregated trend-periodicity representations. Yet, these methodologies disregard the inherent high-impact noise embedded within time series data, resulting in representation inaccur…

    Submitted 9 December, 2023; originally announced December 2023.

  41. arXiv:2312.04837  [pdf, other]

    cs.AI cs.CL cs.CV

    Localized Symbolic Knowledge Distillation for Visual Commonsense Models

    Authors: Jae Sung Park, Jack Hessel, Khyathi Raghavi Chandu, Paul Pu Liang, Ximing Lu, Peter West, Youngjae Yu, Qiuyuan Huang, Jianfeng Gao, Ali Farhadi, Yejin Choi

    Abstract: Instruction following vision-language (VL) models offer a flexible interface that supports a broad range of multimodal tasks in a zero-shot fashion. However, interfaces that operate on full images do not directly enable the user to "point to" and access specific regions within images. This capability is important not only to support reference-grounded VL benchmarks, but also, for practical applica…

    Submitted 12 December, 2023; v1 submitted 8 December, 2023; originally announced December 2023.

    Comments: NeurIPS 2023

  42. arXiv:2312.04420  [pdf, other]

    cond-mat.str-el cond-mat.stat-mech quant-ph

    Finite-Temperature Simulations of Quantum Lattice Models with Stochastic Matrix Product States

    Authors: Jianxin Gao, Yuan Gao, Qiaoyi Li, Wei Li

    Abstract: In this work, we develop a stochastic matrix product state (stoMPS) approach that combines the MPS technique and Monte Carlo samplings and can be applied to simulate quantum lattice models down to low temperature. In particular, we exploit a procedure to unbiasedly sample the local tensors in the matrix product states, which has one physical index of dimension $d$ and two geometric indices of dime…

    Submitted 7 December, 2023; originally announced December 2023.

  43. arXiv:2312.04119  [pdf, other]

    cs.CV

    A brief introduction to a framework named Multilevel Guidance-Exploration Network

    Authors: Guoqing Yang, Zhiming Luo, Jianzhe Gao, Yingxin Lai, Kun Yang, Yifan He, Shaozi Li

    Abstract: Human behavior anomaly detection aims to identify unusual human actions, playing a crucial role in intelligent surveillance and other areas. The current mainstream methods still adopt reconstruction or future frame prediction techniques. However, reconstructing or predicting low-level pixel features easily enables the network to achieve overly strong generalization ability, allowing anomalies to b…

    Submitted 9 June, 2024; v1 submitted 7 December, 2023; originally announced December 2023.

    Comments: More reasonable

  44. arXiv:2312.02949  [pdf, other]

    cs.CV

    LLaVA-Grounding: Grounded Visual Chat with Large Multimodal Models

    Authors: Hao Zhang, Hongyang Li, Feng Li, Tianhe Ren, Xueyan Zou, Shilong Liu, Shijia Huang, Jianfeng Gao, Lei Zhang, Chunyuan Li, Jianwei Yang

    Abstract: With the recent significant advancements in large multi-modal models (LMMs), the importance of their grounding capability in visual chat is increasingly recognized. Despite recent efforts to enable LMMs to support grounding, their capabilities for grounding and chat are usually separate, and their chat performance drops dramatically when asked to ground. The problem is the lack of a dataset for gr…

    Submitted 5 December, 2023; originally announced December 2023.

  45. arXiv:2312.02573  [pdf, other]

    cs.LG cs.AI

    UTBoost: A Tree-boosting based System for Uplift Modeling

    Authors: Junjie Gao, Xiangyu Zheng, DongDong Wang, Zhixiang Huang, Bangqi Zheng, Kai Yang

    Abstract: Uplift modeling refers to the set of machine learning techniques that a manager may use to estimate customer uplift, that is, the net effect of an action on some customer outcome. By identifying the subset of customers for whom a treatment will have the greatest effect, uplift models assist decision-makers in optimizing resource allocations and maximizing overall returns. Accurately estimating cus…

    Submitted 5 December, 2023; originally announced December 2023.

    Comments: 11 pages, 3 figures

  46. arXiv:2312.02298  [pdf, other]

    eess.SP cs.CV cs.LG stat.AP

    MoE-AMC: Enhancing Automatic Modulation Classification Performance Using Mixture-of-Experts

    Authors: Jiaxin Gao, Qinglong Cao, Yuntian Chen

    Abstract: Automatic Modulation Classification (AMC) plays a vital role in time series analysis, such as signal classification and identification within wireless communications. Deep learning-based AMC models have demonstrated significant potential in this domain. However, current AMC models inadequately consider the disparities in handling signals under conditions of low and high Signal-to-Noise Ratio (SNR)…

    Submitted 4 December, 2023; originally announced December 2023.

  47. arXiv:2312.01771  [pdf, other]

    cs.CV

    IMProv: Inpainting-based Multimodal Prompting for Computer Vision Tasks

    Authors: Jiarui Xu, Yossi Gandelsman, Amir Bar, Jianwei Yang, Jianfeng Gao, Trevor Darrell, Xiaolong Wang

    Abstract: In-context learning allows adapting a model to new tasks given a task description at test time. In this paper, we present IMProv - a generative model that is able to in-context learn visual tasks from multimodal prompts. Given a textual description of a visual task (e.g. "Left: input image, Right: foreground segmentation"), a few input-output visual examples, or both, the model in-context learns t…

    Submitted 4 December, 2023; originally announced December 2023.

    Comments: Project page: https://jerryxu.net/IMProv

  48. Orbital angular momentum-enhanced phase estimation using non-Gaussian state with photon loss

    Authors: Yong-Jian Chen, Jin-Wei Gao, Jin-Xuan Han, Zhong-Hui Yuan, Ruo-Qi Li, Yong-Yuan Jiang, Jie Song

    Abstract: This study investigates the use of orbital angular momentum (OAM) to enhance phase estimation in Mach-Zehnder interferometers (MZIs) by employing non-Gaussian states as input resources in the presence of noise. Our research demonstrates that non-Gaussian states, particularly the photon-subtraction-then-addition (PSA) state, exhibit the best sensitivity in the presence of symmetric noise. Additional…

    Submitted 4 December, 2023; originally announced December 2023.

    Comments: 14 pages, 18 figures

    Journal ref: Phys. Rev. A 108, 022613 (2023)

  49. Coupled Dark Sector Models and Cosmological Tensions

    Authors: Gang Liu, Jiaze Gao, Yufen Han, Yuhao Mu, Lixin Xu

    Abstract: In this paper, we introduce two coupling models of early dark energy (EDE) and cold dark matter aimed at alleviating cosmological tensions. We utilize the EDE component in the coupling models to relieve the Hubble tension, while leveraging the interaction between dark matter and dark energy to alleviate the large-scale structure tension. The interaction is implemented in the form of pure momentum…

    Submitted 23 April, 2024; v1 submitted 3 December, 2023; originally announced December 2023.

    Comments: 14 pages, 9 figures. In this replacement, we have amalgamated the original content of this manuscript with that of a previous paper [arXiv:2310.09798]. arXiv admin note: substantial text overlap with arXiv:2310.09798

    Journal ref: Phys. Rev. D 109, 103531 (2024)

  50. arXiv:2312.01201  [pdf, other]

    cs.LG cs.AI

    PAC Privacy Preserving Diffusion Models

    Authors: Qipan Xu, Youlong Ding, Xinxi Zhang, Jie Gao, Hao Wang

    Abstract: Data privacy protection is garnering increased attention among researchers. Diffusion models (DMs), particularly with strict differential privacy, can potentially produce images with both high privacy and visual quality. However, challenges arise such as in ensuring robust protection in privatizing specific data attributes, areas where current models often fall short. To address these challenges,…

    Submitted 21 April, 2024; v1 submitted 2 December, 2023; originally announced December 2023.