Skip to main content

Showing 101–150 of 946 results for author: Gong, Y

.
  1. arXiv:2312.17133  [pdf, other

    cs.CV

    ARTrackV2: Prompting Autoregressive Tracker Where to Look and How to Describe

    Authors: Yifan Bai, Zeyang Zhao, Yihong Gong, Xing Wei

    Abstract: We present ARTrackV2, which integrates two pivotal aspects of tracking: determining where to look (localization) and how to describe (appearance analysis) the target object across video frames. Building on the foundation of its predecessor, ARTrackV2 extends the concept by introducing a unified generative framework to "read out" object's trajectory and "retell" its appearance in an autoregressive… ▽ More

    Submitted 13 February, 2024; v1 submitted 28 December, 2023; originally announced December 2023.

  2. arXiv:2312.16374  [pdf, other

    cs.CL cs.AI

    LLM Factoscope: Uncovering LLMs' Factual Discernment through Inner States Analysis

    Authors: **wen He, Yujia Gong, Kai Chen, Zi** Lin, Chengan Wei, Yue Zhao

    Abstract: Large Language Models (LLMs) have revolutionized various domains with extensive knowledge and creative capabilities. However, a critical issue with LLMs is their tendency to produce outputs that diverge from factual reality. This phenomenon is particularly concerning in sensitive applications such as medical consultation and legal advice, where accuracy is paramount. In this paper, we introduce th… ▽ More

    Submitted 29 December, 2023; v1 submitted 26 December, 2023; originally announced December 2023.

  3. arXiv:2312.15808  [pdf, other

    cs.NI

    Quantum-Assisted Online Task Offloading and Resource Allocation in MEC-Enabled Satellite-Aerial-Terrestrial Integrated Networks

    Authors: Yu Zhang, Yanmin Gong, Lei Fan, Yu Wang, Zhu Han, Yuanxiong Guo

    Abstract: In the era of Internet of Things (IoT), multi-access edge computing (MEC)-enabled satellite-aerial-terrestrial integrated network (SATIN) has emerged as a promising technology to provide massive IoT devices with seamless and reliable communication and computation services. This paper investigates the cooperation of low Earth orbit (LEO) satellites, high altitude platforms (HAPs), and terrestrial b… ▽ More

    Submitted 25 December, 2023; originally announced December 2023.

  4. arXiv:2312.14448  [pdf, other

    cs.NI eess.SP

    Quantum-Assisted Joint Caching and Power Allocation for Integrated Satellite-Terrestrial Networks

    Authors: Yu Zhang, Yanmin Gong, Lei Fan, Yu Wang, Zhu Han, Yuanxiong Guo

    Abstract: Low earth orbit (LEO) satellite network can complement terrestrial networks for achieving global wireless coverage and improving delay-sensitive Internet services. This paper proposes an integrated satellite-terrestrial network (ISTN) architecture to provide ground users with seamless and reliable content delivery services. For optimal service provisioning in this architecture, we formulate an opt… ▽ More

    Submitted 22 December, 2023; originally announced December 2023.

  5. arXiv:2312.13683  [pdf, other

    eess.SP cs.IT

    Joint Channel Estimation and Cooperative Localization for Near-Field Ultra-Massive MIMO

    Authors: Ruoxiao Cao, Hengtao He, Xianghao Yu, Shenghui Song, Kaibin Huang, Jun Zhang, Yi Gong, Khaled B. Letaief

    Abstract: The next-generation (6G) wireless networks are expected to provide not only seamless and high data-rate communications, but also ubiquitous sensing services. By providing vast spatial degrees of freedom (DoFs), ultra-massive multiple-input multiple-output (UM-MIMO) technology is a key enabler for both sensing and communications in 6G. However, the adoption of UM-MIMO leads to a shift from the far… ▽ More

    Submitted 21 December, 2023; originally announced December 2023.

    Comments: Submit to JSAC

  6. arXiv:2312.12949  [pdf, other

    astro-ph.CO astro-ph.GA astro-ph.IM

    Estimating Photometric Redshift from Mock Flux for CSST Survey by using Weighted Random Forest

    Authors: Junhao Lu, Zhijian Luo, Zhu Chen, Li** Fu, Wei Du, Yan Gong, Yicheng Li, Xian-Min Meng, Zhirui Tang, Shaohua Zhang, Chenggang Shu, Xingchen Zhou, Zuhui Fan

    Abstract: Accurate estimation of photometric redshifts (photo-$z$) is crucial in studies of both galaxy evolution and cosmology using current and future large sky surveys. In this study, we employ Random Forest (RF), a machine learning algorithm, to estimate photo-$z$ and investigate the systematic uncertainties affecting the results. Using galaxy flux and color as input features, we construct a map** bet… ▽ More

    Submitted 25 December, 2023; v1 submitted 20 December, 2023; originally announced December 2023.

  7. arXiv:2312.12716  [pdf, other

    cs.CV cs.CL cs.LG

    BloomVQA: Assessing Hierarchical Multi-modal Comprehension

    Authors: Yunye Gong, Robik Shrestha, Jared Claypoole, Michael Cogswell, Arijit Ray, Christopher Kanan, Ajay Divakaran

    Abstract: We propose a novel VQA dataset, BloomVQA, to facilitate comprehensive evaluation of large vision-language models on comprehension tasks. Unlike current benchmarks that often focus on fact-based memorization and simple reasoning tasks without theoretical grounding, we collect multiple-choice samples based on picture stories that reflect different levels of comprehension, as laid out in Bloom's Taxo… ▽ More

    Submitted 10 June, 2024; v1 submitted 19 December, 2023; originally announced December 2023.

    Comments: Accepted by ACL Findings (2024). Dataset available at https://huggingface.co/datasets/ygong/BloomVQA

  8. arXiv:2312.09738  [pdf, other

    cs.AI

    3DAxiesPrompts: Unleashing the 3D Spatial Task Capabilities of GPT-4V

    Authors: Dingning Liu, Xiaomeng Dong, Renrui Zhang, Xu Luo, Peng Gao, Xiaoshui Huang, Yongshun Gong, Zhihui Wang

    Abstract: In this work, we present a new visual prompting method called 3DAxiesPrompts (3DAP) to unleash the capabilities of GPT-4V in performing 3D spatial tasks. Our investigation reveals that while GPT-4V exhibits proficiency in discerning the position and interrelations of 2D entities through current visual prompting techniques, its abilities in handling 3D spatial tasks have yet to be explored. In our… ▽ More

    Submitted 15 December, 2023; originally announced December 2023.

  9. arXiv:2312.09681  [pdf, other

    cs.LG cs.CV cs.DB

    Urban Region Embedding via Multi-View Contrastive Prediction

    Authors: Zechen Li, Weiming Huang, Kai Zhao, Min Yang, Yongshun Gong, Meng Chen

    Abstract: Recently, learning urban region representations utilizing multi-modal data (information views) has become increasingly popular, for deep understanding of the distributions of various socioeconomic features in cities. However, previous methods usually blend multi-view information in a posteriors stage, falling short in learning coherent and consistent representations across different views. In this… ▽ More

    Submitted 15 December, 2023; originally announced December 2023.

  10. arXiv:2312.08740  [pdf, other

    cs.LG cs.CV

    Learning a Low-Rank Feature Representation: Achieving Better Trade-Off between Stability and Plasticity in Continual Learning

    Authors: Zhenrong Liu, Yang Li, Yi Gong, Yik-Chung Wu

    Abstract: In continual learning, networks confront a trade-off between stability and plasticity when trained on a sequence of tasks. To bolster plasticity without sacrificing stability, we propose a novel training algorithm called LRFR. This approach optimizes network parameters in the null space of the past tasks' feature representation matrix to guarantee the stability. Concurrently, we judiciously select… ▽ More

    Submitted 14 December, 2023; originally announced December 2023.

    Comments: Accepted for publication in the proceedings of ICASSP 2024

  11. arXiv:2312.05946  [pdf, other

    cs.LG cs.AI

    Uncertainty Propagation through Trained Deep Neural Networks Using Factor Graphs

    Authors: Angel Daruna, Yunye Gong, Abhinav Rajvanshi, Han-Pang Chiu, Yi Yao

    Abstract: Predictive uncertainty estimation remains a challenging problem precluding the use of deep neural networks as subsystems within safety-critical applications. Aleatoric uncertainty is a component of predictive uncertainty that cannot be reduced through model improvements. Uncertainty propagation seeks to estimate aleatoric uncertainty by propagating input uncertainties to network predictions. Exist… ▽ More

    Submitted 10 December, 2023; originally announced December 2023.

  12. arXiv:2312.02143  [pdf, other

    cs.CL cs.AI

    Competition-Level Problems are Effective LLM Evaluators

    Authors: Yiming Huang, Zhenghao Lin, Xiao Liu, Yeyun Gong, Shuai Lu, Fangyu Lei, Yaobo Liang, Yelong Shen, Chen Lin, Nan Duan, Weizhu Chen

    Abstract: Large language models (LLMs) have demonstrated impressive reasoning capabilities, yet there is ongoing debate about these abilities and the potential data contamination problem recently. This paper aims to evaluate the reasoning capacities of LLMs, specifically in solving recent competition-level programming problems in Codeforces, which are expert-crafted and unique, requiring deep understanding… ▽ More

    Submitted 4 June, 2024; v1 submitted 4 December, 2023; originally announced December 2023.

    Comments: ACL 2024

  13. Highly sensitive magnetic properties and large linear magnetoresistance in antiferromagnetic CrxSe(0.875\lex\le1)single crystals

    Authors: Yuqing Bai, Shuang Pan, Ziqian Lu, Yuanyuan Gong, Guizhou Xu, Feng Xu

    Abstract: CrxSe (x\le1) is a class of quasi-layered binary compounds with potential applications in spintronics due to its intriguing antiferromagnetic properties. In this work, CrxSe single crystals with high Cr content (x=0.87, 0.91 and 0.95) were grown, and their magnetic and transport properties were investigated in detail. It is found that with small increase of Cr content, the Néel temperature (TN) of… ▽ More

    Submitted 30 November, 2023; originally announced November 2023.

    Journal ref: Journal of Alloys and Compounds 968 (2023) 172080

  14. arXiv:2311.18311  [pdf, other

    cs.CV cs.GR

    Anisotropic Neural Representation Learning for High-Quality Neural Rendering

    Authors: Y. Wang, J. Xu, Y. Zeng, Y. Gong

    Abstract: Neural radiance fields (NeRFs) have achieved impressive view synthesis results by learning an implicit volumetric representation from multi-view images. To project the implicit representation into an image, NeRF employs volume rendering that approximates the continuous integrals of rays as an accumulation of the colors and densities of the sampled points. Although this approximation enables effici… ▽ More

    Submitted 10 March, 2024; v1 submitted 30 November, 2023; originally announced November 2023.

  15. arXiv:2311.16903  [pdf, other

    astro-ph.CO

    Forecasting the BAO Measurements of the CSST galaxy and AGN Spectroscopic Surveys

    Authors: Haitao Miao, Yan Gong, Xuelei Chen, Zhiqi Huang, Xiao-Dong Li, Hu Zhan

    Abstract: The spectroscopic survey of China's Space Survey Telescope (CSST) is expected to obtain a huge number of slitless spectra, including more than one hundred million galaxy spectra and millions of active galactic nuclei (AGN) spectra. By making use of these spectra, we can measure the Baryon Acoustic Oscillation (BAO) signals over large redshift ranges with excellent precisions. In this work, we pred… ▽ More

    Submitted 29 May, 2024; v1 submitted 28 November, 2023; originally announced November 2023.

    Comments: 15 pages, 9 figures, 4 tables. Accepted for publication in MNRAS

    Journal ref: MNRAS, 531, 3991-4005 (2024)

  16. arXiv:2311.14960  [pdf, other

    cs.CV

    Point Cloud Pre-training with Diffusion Models

    Authors: Xiao Zheng, Xiaoshui Huang, Guofeng Mei, Yuenan Hou, Zhaoyang Lyu, Bo Dai, Wanli Ouyang, Yongshun Gong

    Abstract: Pre-training a model and then fine-tuning it on downstream tasks has demonstrated significant success in the 2D image and NLP domains. However, due to the unordered and non-uniform density characteristics of point clouds, it is non-trivial to explore the prior knowledge of point clouds and pre-train a point cloud backbone. In this paper, we propose a novel pre-training method called Point cloud Di… ▽ More

    Submitted 25 November, 2023; originally announced November 2023.

  17. arXiv:2311.12490  [pdf, other

    cs.CV cs.GR cs.LG

    Hyb-NeRF: A Multiresolution Hybrid Encoding for Neural Radiance Fields

    Authors: Yifan Wang, Yi Gong, Yuan Zeng

    Abstract: Recent advances in Neural radiance fields (NeRF) have enabled high-fidelity scene reconstruction for novel view synthesis. However, NeRF requires hundreds of network evaluations per pixel to approximate a volume rendering integral, making it slow to train. Caching NeRFs into explicit data structures can effectively enhance rendering speed but at the cost of higher memory usage. To address these is… ▽ More

    Submitted 21 November, 2023; originally announced November 2023.

    Comments: WACV2024

  18. arXiv:2311.12276  [pdf, other

    astro-ph.GA astro-ph.SR

    The first Ka-band (26.1-35 GHz) blind line survey towards Orion KL

    Authors: Xunchuan Liu, Tie Liu, Zhiqiang Shen, Sheng-Li Qin, Qiuyi Luo, Yan Gong, Yu Cheng, Christian Henkel, Qilao Gu, Fengyao Zhu, Tianwei Zhang, Rongbing Zhao, Yajun Wu, Bin Li, Juan Li, Zhang Zhao, **qing Wang, Weiye Zhong, Qinghui Liu, Bo Xia, Li Fu, Zhen Yan, Chao Zhang, Lingling Wang, Qian Ye , et al. (9 additional authors not shown)

    Abstract: We conducted a Ka-band (26.1--35 GHz) line survey towards Orion KL using the TianMa 65-m Radio Telescope (TMRT). It is the first blind line survey in the Ka band, and achieves a sensitivity of mK level (1--3 mK at a spectral resolution of $\sim$1 km s$^{-1}$). In total, 592 Gaussian features are extracted. Among them, 257 radio recombination lines (RRLs) are identified. The maximum $Δn$ of RRLs of… ▽ More

    Submitted 20 November, 2023; originally announced November 2023.

    Comments: accepted by ApJS

  19. arXiv:2311.12259  [pdf, ps, other

    gr-qc astro-ph.CO hep-ph hep-th

    Analytical models of supermassive black holes in galaxies surrounded by dark matter halos

    Authors: Zibo Shen, Anzhong Wang, Yungui Gong, Shaoyu Yin

    Abstract: In this Letter, we present five analytical models in closed forms, each representing a supermassive black hole (SMBH) located at the center of a galaxy surrounded by dark matter (DM) halo. The density profile of the halo vanishes inside twice the Schwarzschild radius of the hole and satisfies the weak, strong, and dominant energy conditions. The spacetime are asymptotically flat, and the differenc… ▽ More

    Submitted 19 June, 2024; v1 submitted 20 November, 2023; originally announced November 2023.

    Comments: revtex4-2, no figures. Version to appear in Phys. Lett. B 855 (2024) 138797

    Journal ref: Phys. Lett. B 855 (2024) 138797

  20. arXiv:2311.11054  [pdf, other

    stat.ME

    Modern extreme value statistics for Utopian extremes

    Authors: Jordan Richards, Noura Alotaibi, Daniela Cisneros, Yan Gong, Matheus B. Guerrero, Paolo Redondo, Xuanjie Shao

    Abstract: Capturing the extremal behaviour of data often requires bespoke marginal and dependence models which are grounded in rigorous asymptotic theory, and hence provide reliable extrapolation into the upper tails of the data-generating distribution. We present a toolbox of four methodological frameworks, motivated by modern extreme value theory, that can be used to accurately estimate extreme exceedance… ▽ More

    Submitted 1 May, 2024; v1 submitted 18 November, 2023; originally announced November 2023.

  21. arXiv:2311.10766  [pdf, other

    cs.CL cs.AI

    Value FULCRA: Map** Large Language Models to the Multidimensional Spectrum of Basic Human Values

    Authors: **g Yao, Xiaoyuan Yi, Xiting Wang, Yifan Gong, Xing Xie

    Abstract: The rapid advancement of Large Language Models (LLMs) has attracted much attention to value alignment for their responsible development. However, how to define values in this context remains a largely unexplored question. Existing work mainly follows the Helpful, Honest, Harmless principle and specifies values as risk criteria formulated in the AI community, e.g., fairness and privacy protection,… ▽ More

    Submitted 15 November, 2023; originally announced November 2023.

  22. arXiv:2311.09850  [pdf, other

    cs.IT eess.SP

    Semantic-Relay-Aided Text Transmission: Placement Optimization and Bandwidth Allocation

    Authors: Tianyu Liu, Changsheng You, Zeyang Hu, Chenyu Wu, Yi Gong, Kaibin Huang

    Abstract: Semantic communication has emerged as a promising technology to break the Shannon limit by extracting the meaning of source data and sending relevant semantic information only. However, some mobile devices may have limited computation and storage resources, which renders it difficult to deploy and implement the resource-demanding deep learning based semantic encoder/decoder. To tackle this challen… ▽ More

    Submitted 16 November, 2023; originally announced November 2023.

    Comments: 6 pages, 4 figures, accepted for IEEE Global Communication Conference (GLOBECOM) 2023 Workshop

  23. arXiv:2311.09767  [pdf, other

    physics.optics cs.ET

    New advancements, challenges and opportunities of nanophotonics for neuromorphic computing: A state-of-the-art review

    Authors: Renjie Li, Yuanhao Gong, Hai Huang, Yuze Zhou, Sixuan Mao, Connie Chang-Hasnain, Zhaoyu Zhang

    Abstract: The expansion of optoelectronic devices on photonic integration platforms has led to significant growth in the field of photonic computing. Photonic integrated circuits have facilitated the creation of ultrafast artificial neural networks, forming the basis for a novel category of information processing devices. Their application extends to diverse domains such as medical diagnosis, language model… ▽ More

    Submitted 16 November, 2023; originally announced November 2023.

    Comments: 39 pages,17 figures

  24. arXiv:2311.09031  [pdf, other

    cs.IT

    Integrating Sensing, Communication, and Power Transfer: From Theory to Practice

    Authors: Xiaoyang Li, Zidong Han, Guangxu Zhu, Yuanming Shi, Jie Xu, Yi Gong, Qinyu Zhang, Kaibin Huang, Khaled B. Letaief

    Abstract: To support the development of internet-of-things applications, an enormous population of low-power devices are expected to be incorporated in wireless networks performing sensing and communication tasks. As a key technology for improving the data collection efficiency, integrated sensing and communication (ISAC) enables simultaneous data transmission and radar sensing by reusing the same radio sig… ▽ More

    Submitted 18 February, 2024; v1 submitted 15 November, 2023; originally announced November 2023.

    Comments: This paper has been accepted by IEEE Communications Magazine

  25. arXiv:2311.08154  [pdf, other

    cs.CL cs.AI

    Just Ask One More Time! Self-Agreement Improves Reasoning of Language Models in (Almost) All Scenarios

    Authors: Lei Lin, Jiayi Fu, Pengli Liu, Qingyang Li, Yan Gong, Junchen Wan, Fuzheng Zhang, Zhongyuan Wang, Di Zhang, Kun Gai

    Abstract: Although chain-of-thought (CoT) prompting combined with language models has achieved encouraging results on complex reasoning tasks, the naive greedy decoding used in CoT prompting usually causes the repetitiveness and local optimality. To address this shortcoming, ensemble-optimization tries to obtain multiple reasoning paths to get the final answer assembly. However, current ensemble-optimizatio… ▽ More

    Submitted 24 May, 2024; v1 submitted 14 November, 2023; originally announced November 2023.

    Comments: Accepted by Findings of ACL 2024

  26. arXiv:2311.05608  [pdf, other

    cs.CR cs.AI cs.CL

    FigStep: Jailbreaking Large Vision-language Models via Typographic Visual Prompts

    Authors: Yichen Gong, Delong Ran, **yuan Liu, Conglei Wang, Tianshuo Cong, Anyu Wang, Sisi Duan, Xiaoyun Wang

    Abstract: Ensuring the safety of artificial intelligence-generated content (AIGC) is a longstanding topic in the artificial intelligence (AI) community, and the safety concerns associated with Large Language Models (LLMs) have been widely investigated. Recently, large vision-language models (VLMs) represent an unprecedented revolution, as they are built upon LLMs but can incorporate additional modalities (e… ▽ More

    Submitted 13 December, 2023; v1 submitted 9 November, 2023; originally announced November 2023.

    Comments: Technical Report

  27. arXiv:2311.03798  [pdf, other

    cs.CL

    Noisy Pair Corrector for Dense Retrieval

    Authors: Hang Zhang, Yeyun Gong, Xingwei He, Dayiheng Liu, Daya Guo, Jiancheng Lv, Jian Guo

    Abstract: Most dense retrieval models contain an implicit assumption: the training query-document pairs are exactly matched. Since it is expensive to annotate the corpus manually, training pairs in real-world applications are usually collected automatically, which inevitably introduces mismatched-pair noise. In this paper, we explore an interesting and challenging problem in dense retrieval, how to train an… ▽ More

    Submitted 7 November, 2023; originally announced November 2023.

    Comments: Findings of EMNLP 2023

  28. arXiv:2310.13489  [pdf, other

    astro-ph.GA

    Maser Investigation toward Off-Plane Stars (MIOPS): detection of SiO masers in the Galactic thick disk and halo

    Authors: Wen** Yang, Yuanwei Wu, Yan Gong, Nicolas Mauron, Bo Zhang, Karl M. Menten, Xiaofeng Mai, Dejian Liu, Juan Li, **g**g Li

    Abstract: Studying stars that are located off the Galactic plane is important for understanding the formation history of the Milky Way. We searched for SiO masers toward off-plane O-rich asymptotic giant branch (AGB) stars from the catalog presented by Mauron et al. (2019) in order to shed light on the origin of these objects. A total of 102 stars were observed in the SiO $J$=1-0, $v=1$ and 2 transitions wi… ▽ More

    Submitted 20 October, 2023; originally announced October 2023.

    Comments: 15 pages, 6 figures, 3 tables, accepted for publication in ApJ

  29. arXiv:2310.10140  [pdf, other

    astro-ph.CO

    Constraining Ultralight Axions with CSST Weak Gravitational Lensing and Galaxy Clustering Photometric Surveys

    Authors: Hengjie Lin, Furen Deng, Yan Gong, Xuelei Chen

    Abstract: Ultralight axion (ULA) can be one of the potential candidates for dark matter. The extremely low mass of the ULA can lead to a de Broglie wavelength the size of galaxies which results in a suppression of the growth of structure on small scales. In this work, we forecast the constraint on the ULA particle mass $m_{\text{a}}$ and relative fraction to dark matter… ▽ More

    Submitted 28 February, 2024; v1 submitted 16 October, 2023; originally announced October 2023.

    Comments: 13 pages, 10 figures, and 2 tables. Accepted for publication in MNRAS

  30. A global view on star formation: The GLOSTAR Galactic plane survey. IX. Radio Source Catalog III: 2<l<28, 36<l<40, 56<l<60 and |b|<1, VLA B-configuration

    Authors: A. Y. Yang, S. A. Dzib, J. S. Urquhart, A. Brunthaler, S. -N. X. Medina, K. M. Menten, F. Wyrowski, G. N. Ortiz-León, W. D. Cotton, Y. Gong, R. Dokara, M. R. Rugel, H. Beuther, J. D. Pandian, T. Csengeri, V. S. Veena, N. Roy, H. Nguyen, B. Winkel, J. Ott, C. Carrasco-Gonzalez, S. Khan, A. Cheema

    Abstract: As part of the GLOSTAR (GLObal view of STAR formation in the Milky Way) survey, we present the high-resolution continuum source catalog for the regions (l = 2-28, 36-40, 56-60, &|b|<1.0), observed with the Karl G. Jansky Very Large Array (VLA) in its B-configuration. The continuum images are optimized to detect compact sources on angular scales up to 4", and have a typical noise level of 1sigma ~… ▽ More

    Submitted 23 October, 2023; v1 submitted 15 October, 2023; originally announced October 2023.

    Comments: 25pages, 21 figures, has been accepted for publication in Astronomy & Astrophysics (A&A)

    Journal ref: A&A 680, A92 (2023)

  31. arXiv:2310.08252  [pdf, other

    cs.LG cs.AI cs.NE

    MetaBox: A Benchmark Platform for Meta-Black-Box Optimization with Reinforcement Learning

    Authors: Zeyuan Ma, Hongshu Guo, Jiacheng Chen, Zhenrui Li, Guojun Peng, Yue-Jiao Gong, Yining Ma, Zhiguang Cao

    Abstract: Recently, Meta-Black-Box Optimization with Reinforcement Learning (MetaBBO-RL) has showcased the power of leveraging RL at the meta-level to mitigate manual fine-tuning of low-level black-box optimizers. However, this field is hindered by the lack of a unified benchmark. To fill this gap, we introduce MetaBox, the first benchmark platform expressly tailored for develo** and evaluating MetaBBO-RL… ▽ More

    Submitted 27 October, 2023; v1 submitted 12 October, 2023; originally announced October 2023.

    Comments: Accepted at NuerIPS 2023

  32. arXiv:2310.05033  [pdf

    cs.CR

    RSMS: Towards Reliable and Secure Metaverse Service Provision

    Authors: Yanwei Gong, Xiaolin Chang, Jelena Mišić, Vojislav B. Mišić, Yingying Yao

    Abstract: Establishing and sustaining Metaverse service necessitates an unprecedented scale of resources. This paper considers the deployment of Metaverse service in a cloud-edge resource architecture, which can satisfy the escalating demand for Metaverse service resources while ensuring both high bandwidth and low latency. We propose a novel mechanism, named Reliable and Secure Metaverse Service (RSMS), to… ▽ More

    Submitted 8 October, 2023; originally announced October 2023.

  33. arXiv:2310.01444  [pdf, other

    cs.CL cs.AI

    Adapting LLM Agents with Universal Feedback in Communication

    Authors: Kuan Wang, Yadong Lu, Michael Santacroce, Yeyun Gong, Chao Zhang, Yelong Shen

    Abstract: Recent advances in large language models (LLMs) have demonstrated potential for LLM agents. To facilitate the training for these agents with both linguistic feedback and non-linguistic reward signals, we introduce Learning through Communication (LTC). We design a universal buffer to store all the feedback, and an iterative pipeline to enable an LLM agent to explore and update its policy in an give… ▽ More

    Submitted 13 April, 2024; v1 submitted 1 October, 2023; originally announced October 2023.

    Comments: Preprint

  34. arXiv:2310.01342  [pdf, other

    cs.IT eess.SP

    Near-field Integrated Sensing and Communication: Opportunities and Challenges

    Authors: Jiayi Cong, Changsheng You, Jiapeng Li, Li Chen, Beixiong Zheng, Yuanwei Liu, Wen Wu, Yi Gong, Shi **, Rui Zhang

    Abstract: With the extremely large-scale array XL-array deployed in future wireless systems, wireless communication and sensing are expected to operate in the radiative near-field region, which needs to be characterized by the spherical rather than planar wavefronts. Unlike most existing works that considered far-field integrated sensing and communication (ISAC), we study in this article the new near-field… ▽ More

    Submitted 17 October, 2023; v1 submitted 2 October, 2023; originally announced October 2023.

    Comments: This work is submitted to IEEE for possible publication

  35. arXiv:2309.17452  [pdf, other

    cs.CL cs.AI

    ToRA: A Tool-Integrated Reasoning Agent for Mathematical Problem Solving

    Authors: Zhibin Gou, Zhihong Shao, Yeyun Gong, Yelong Shen, Yujiu Yang, Minlie Huang, Nan Duan, Weizhu Chen

    Abstract: Large language models have made significant progress in various language tasks, yet they still struggle with complex mathematics. In this paper, we propose ToRA a series of Tool-integrated Reasoning Agents designed to solve challenging mathematical problems by seamlessly integrating natural language reasoning with the utilization of external tools (e.g., computation libraries and symbolic solvers)… ▽ More

    Submitted 21 February, 2024; v1 submitted 29 September, 2023; originally announced September 2023.

    Comments: ICLR 2024; First two authors equal contribution

  36. P2I-NET: Map** Camera Pose to Image via Adversarial Learning for New View Synthesis in Real Indoor Environments

    Authors: Xujie Kang, Kanglin Liu, Jiang Duan, Yuanhao Gong, Guo** Qiu

    Abstract: Given a new $6DoF$ camera pose in an indoor environment, we study the challenging problem of predicting the view from that pose based on a set of reference RGBD views. Existing explicit or implicit 3D geometry construction methods are computationally expensive while those based on learning have predominantly focused on isolated views of object categories with regular geometric structure. Differing… ▽ More

    Submitted 27 September, 2023; originally announced September 2023.

  37. Sulfur isotope ratios in the Large Magellanic Cloud

    Authors: Y. Gong, C. Henkel, K. M. Menten, C. -H. R. Chen, Z. Y. Zhang, Y. T. Yan, A. Weiss, N. Langer, J. Z. Wang, R. Q. Mao, X. D. Tang, W. Yang, Y. P. Ao, M. Wang

    Abstract: Sulfur isotope ratios have emerged as a promising tool for tracing stellar nucleosynthesis, quantifying stellar populations, and investigating the chemical evolution of galaxies. While extensively studied in the Milky Way, in extragalactic environments they remain largely unexplored. We focus on investigating the sulfur isotope ratios in the Large Magellanic Cloud (LMC) to gain insights into sulfu… ▽ More

    Submitted 18 October, 2023; v1 submitted 26 September, 2023; originally announced September 2023.

    Comments: 6 pages, 1 figures, 2 tables, accepted for publication in A&A, adjusted to the final version

    Journal ref: A&A 679, L6 (2023)

  38. arXiv:2309.14859  [pdf, other

    cs.CV cs.AI cs.GR cs.LG

    Navigating Text-To-Image Customization: From LyCORIS Fine-Tuning to Model Evaluation

    Authors: Shih-Ying Yeh, Yu-Guan Hsieh, Zhidong Gao, Bernard B W Yang, Giyeong Oh, Yanmin Gong

    Abstract: Text-to-image generative models have garnered immense attention for their ability to produce high-fidelity images from text prompts. Among these, Stable Diffusion distinguishes itself as a leading open-source model in this fast-growing field. However, the intricacies of fine-tuning these models pose multiple challenges from new methodology integration to systematic evaluation. Addressing these iss… ▽ More

    Submitted 11 March, 2024; v1 submitted 26 September, 2023; originally announced September 2023.

    Comments: In International Conference on Learning Representations 12 (ICLR 2024) [79 pages, 54 figures, 7 tables]

  39. arXiv:2309.14726  [pdf, other

    cs.CV cs.AI cs.CE cs.CL cs.LG

    PLMM: Personal Large Language Models on Mobile Devices

    Authors: Yuanhao Gong

    Abstract: Inspired by Federated Learning, in this paper, we propose personal large models that are distilled from traditional large language models but more adaptive to local users' personal information such as education background and hobbies. We classify the large language models into three levels: the personal level, expert level and traditional level. The personal level models are adaptive to users' per… ▽ More

    Submitted 4 May, 2024; v1 submitted 26 September, 2023; originally announced September 2023.

    Comments: arXiv admin note: substantial text overlap with arXiv:2307.13221

  40. arXiv:2309.14405  [pdf, other

    cs.SD cs.AI eess.AS

    Joint Audio and Speech Understanding

    Authors: Yuan Gong, Alexander H. Liu, Hongyin Luo, Leonid Karlinsky, James Glass

    Abstract: Humans are surrounded by audio signals that include both speech and non-speech sounds. The recognition and understanding of speech and non-speech audio events, along with a profound comprehension of the relationship between them, constitute fundamental cognitive capabilities. For the first time, we build a machine learning model, called LTU-AS, that has a conceptually similar universal audio perce… ▽ More

    Submitted 10 December, 2023; v1 submitted 25 September, 2023; originally announced September 2023.

    Comments: Accepted at ASRU 2023. Code, dataset, and pretrained models are at https://github.com/yuangongnd/ltu. Interactive demo at https://huggingface.co/spaces/yuangongfdu/ltu-2

  41. arXiv:2309.13643  [pdf, other

    cs.LG cs.NI

    REWAFL: Residual Energy and Wireless Aware Participant Selection for Efficient Federated Learning over Mobile Devices

    Authors: Y. Li, X. Qin, J. Geng, R. Chen, Y. Hou, Y. Gong, M. Pan, P. Zhang

    Abstract: Participant selection (PS) helps to accelerate federated learning (FL) convergence, which is essential for the practical deployment of FL over mobile devices. While most existing PS approaches focus on improving training accuracy and efficiency rather than residual energy of mobile devices, which fundamentally determines whether the selected devices can participate. Meanwhile, the impacts of mobil… ▽ More

    Submitted 24 September, 2023; originally announced September 2023.

  42. arXiv:2309.12510  [pdf, other

    cs.LG

    Confidence Calibration for Systems with Cascaded Predictive Modules

    Authors: Yunye Gong, Yi Yao, Xiao Lin, Ajay Divakaran, Melinda Gervasio

    Abstract: Existing conformal prediction algorithms estimate prediction intervals at target confidence levels to characterize the performance of a regression model on new test samples. However, considering an autonomous system consisting of multiple modules, prediction intervals constructed for individual modules fall short of accommodating uncertainty propagation over different modules and thus cannot provi… ▽ More

    Submitted 21 September, 2023; originally announced September 2023.

  43. arXiv:2309.11161  [pdf, other

    cs.IT eess.SP

    Beamforming Design for RIS-Aided THz Wideband Communication Systems

    Authors: Yihang Jiang, Ziqin Zhou, Xiaoyang Li, Yi Gong

    Abstract: Benefiting from tens of GHz of bandwidth, terahertz (THz) communications has become a promising technology for future 6G networks. However, the conventional hybrid beamforming architecture based on frequency-independent phase-shifters is not able to cope with the beam split effect (BSE) in THz massive multiple-input multiple-output (MIMO) systems. Despite some work introducing the frequency-depend… ▽ More

    Submitted 21 September, 2023; v1 submitted 20 September, 2023; originally announced September 2023.

  44. arXiv:2309.10814  [pdf, other

    cs.CL

    Natural Language Embedded Programs for Hybrid Language Symbolic Reasoning

    Authors: Tianhua Zhang, Jiaxin Ge, Hongyin Luo, Yung-Sung Chuang, Mingye Gao, Yuan Gong, Xixin Wu, Yoon Kim, Helen Meng, James Glass

    Abstract: How can we perform computations over natural language representations to solve tasks that require symbolic and numeric reasoning? We propose natural language embedded programs (NLEP) as a unifying framework for addressing math/symbolic reasoning, natural language understanding, and instruction following tasks. Our approach prompts a language model to generate full Python programs that define funct… ▽ More

    Submitted 28 March, 2024; v1 submitted 19 September, 2023; originally announced September 2023.

    Comments: NAACL 2024

  45. An Unified Search and Recommendation Foundation Model for Cold-Start Scenario

    Authors: Yuqi Gong, Xichen Ding, Yehui Su, Kaiming Shen, Zhongyi Liu, Guannan Zhang

    Abstract: In modern commercial search engines and recommendation systems, data from multiple domains is available to jointly train the multi-domain model. Traditional methods train multi-domain models in the multi-task setting, with shared parameters to learn the similarity of multiple tasks, and task-specific parameters to learn the divergence of features, labels, and sample distributions of individual tas… ▽ More

    Submitted 16 September, 2023; originally announced September 2023.

    Comments: CIKM 2023,6 pages

  46. arXiv:2309.07369  [pdf, other

    eess.AS cs.CL cs.SD

    Hybrid Attention-based Encoder-decoder Model for Efficient Language Model Adaptation

    Authors: Shaoshi Ling, Guoli Ye, Rui Zhao, Yifan Gong

    Abstract: Attention-based encoder-decoder (AED) speech recognition model has been widely successful in recent years. However, the joint optimization of acoustic model and language model in end-to-end manner has created challenges for text adaptation. In particular, effectively, quickly and inexpensively adapting text has become a primary concern for deploying AED systems in industry. To address this issue,… ▽ More

    Submitted 13 September, 2023; originally announced September 2023.

  47. arXiv:2309.01875  [pdf, other

    cs.CV cs.LG cs.MM cs.PF eess.IV

    Gradient Domain Diffusion Models for Image Synthesis

    Authors: Yuanhao Gong

    Abstract: Diffusion models are getting popular in generative image and video synthesis. However, due to the diffusion process, they require a large number of steps to converge. To tackle this issue, in this paper, we propose to perform the diffusion process in the gradient domain, where the convergence becomes faster. There are two reasons. First, thanks to the Poisson equation, the gradient domain is mathe… ▽ More

    Submitted 4 September, 2023; originally announced September 2023.

  48. On the improved dynamics approach in loop quantum black holes

    Authors: Hongchao Zhang, Wen-Cong Gan, Yungui Gong, Anzhong Wang

    Abstract: In this paper, we consider the Böhmer-Vandersloot (BV) model of loop quantum black holes obtained from the improved dynamics approach. We adopt the Saini-Singh gauge, in which it was found analytically that the BV spacetime is geodesically complete. We show that black/white hole horizons do not exist in this geodesically complete spacetime. Instead, there exists only an infinite number of transiti… ▽ More

    Submitted 6 March, 2024; v1 submitted 29 August, 2023; originally announced August 2023.

    Comments: 3 figures and no tables. Version published in Commun. Theor. Phys. 76 (2024) 035401. arXiv admin note: text overlap with arXiv:2212.14535

    Journal ref: Commun. Theor. Phys. 76 (2024) 035401

  49. Protonated hydrogen cyanide as a tracer of pristine molecular gas

    Authors: Y. Gong, F. J. Du, C. Henkel, A. M. Jacob, A. Belloche, J. Z. Wang, K. M. Menten, W. Yang, D. H. Quan, C. T. Bop, G. N. Ortiz-León, X. D. Tang, M. R. Rugel, S. Liu

    Abstract: Protonated hydrogen cyanide, HCNH$^{+}$, plays a fundamental role in astrochemistry because it is an intermediary in gas-phase ion-neutral reactions within cold molecular clouds. However, the impact of the environment on the chemistry of HCNH$^{+}$ remains poorly understood. With the IRAM-30 m and APEX-12 m observations, we report the first robust distribution of HCNH$^{+}$ in the Serpens filament… ▽ More

    Submitted 29 August, 2023; originally announced August 2023.

    Comments: 25 pages, 26 figures, accepted for publication in A&A

    Journal ref: A&A 679, A39 (2023)

  50. arXiv:2308.13690  [pdf, other

    astro-ph.HE astro-ph.CO gr-qc

    Including higher harmonics in gravitational-wave parameter estimation and cosmological implications for LISA

    Authors: Yi Gong, Zhoujian Cao, Junjie Zhao, Li**g Shao

    Abstract: Massive black holes (MBHs) are crucial in sha** their host galaxies. How the MBH co-evolves with its host galaxy is a pressing problem in astrophysics and cosmology. The valuable information carried by the binary MBH is encoded in the gravitational waves (GWs), which will be detectable by the space-borne GW detector LISA. In the GW data analysis, usually, only the dominant $(2,2)$ mode of the GW… ▽ More

    Submitted 25 August, 2023; originally announced August 2023.

    Comments: 14 pages, 11 figures, 4 tables; accepted by Physical Review D

    Journal ref: Phys. Rev. D 108 (2023) 064046