Showing 51–100 of 1,237 results for author: Hu, C

  1. arXiv:2404.13582  [pdf, other]

    nucl-th

    Light and hyper nuclei formation at $\sqrt{s_{\text{NN}}} =$ 3 GeV Au+Au collisions using Wigner coalescence approach

    Authors: L. K. Liu, C. L. Hu, X. H. He, S. S. Shi, G. N. Xie

    Abstract: The production of light nuclei and hyper-nuclei in heavy-ion collisions, particularly at high baryon density, is crucial for understanding the dynamical evolution of the collision system and exploring the internal state of nuclear matter in compact stars. Although it remains a topic of ongoing debate, an improved theoretical understanding is needed. In this work, production of light nuclei ($d$, $t$,…

    Submitted 21 April, 2024; originally announced April 2024.

  2. arXiv:2404.13190  [pdf, other]

    quant-ph cond-mat.other

    Anomalous Long-Distance Coherence in Critically-Driven Cavity Magnonics

    Authors: Ying Yang, Jiguang Yao, Yang Xiao, Pak-Tik Fong, Hoi-Kwan Lau, C. -M. Hu

    Abstract: Developing quantum networks necessitates coherently connecting distant systems via remote strong coupling. Here, we demonstrate long-distance coherence in cavity magnonics operating in the linear regime. By locally setting the cavity near critical coupling with travelling photons, non-local magnon-photon coherence is established via strong coupling over a 2-meter distance. We observe two anomalies…

    Submitted 19 April, 2024; originally announced April 2024.

    Comments: 7 pages, 4 figures

    Journal ref: Physical Review Letters 132, 206902 (2024)

  3. arXiv:2404.12769  [pdf]

    eess.SY

    Towards Accurate and Efficient Sorting of Retired Lithium-ion Batteries: A Data Driven Based Electrode Aging Assessment Approach

    Authors: Ruohan Guo, Feng Wang, Cungang Hu, Weixiang Shen

    Abstract: Retired batteries (RBs) for second-life applications offer promising economic and environmental benefits. However, accurate and efficient sorting of RBs with discrepant characteristics persists as a pressing challenge. In this study, we introduce a data-driven electrode aging assessment approach to address this concern. To this end, 15 feature points are extracted from battery op…

    Submitted 19 April, 2024; originally announced April 2024.

    Comments: 40 pages, 25 figures

  4. arXiv:2404.11836  [pdf, other]

    eess.SP

    AI-Empowered RIS-Assisted Networks: CV-Enabled RIS Selection and DNN-Enabled Transmission

    Authors: Conggang Hu, Yang Lu, Hongyang Du, Mi Yang, Bo Ai, Dusit Niyato

    Abstract: This paper investigates artificial intelligence (AI) empowered schemes for reconfigurable intelligent surface (RIS) assisted networks from the perspective of fast implementation. We formulate a weighted sum-rate maximization problem for a multi-RIS-assisted network. To avoid the huge channel estimation overhead of activating all RISs, we propose a computer vision (CV) enabled RIS selection scheme ba…

    Submitted 17 April, 2024; originally announced April 2024.

  5. arXiv:2404.11380  [pdf]

    physics.app-ph physics.optics

    Non-hermitian magnonic knobbing between electromagnetically induced reflection and transparancy

    Authors: Youcai Han, Changhao Meng, Ze** Rao, Jie Qian, Yiming Lv, Li** Zhu, CanMing Hu, Zhenghua An

    Abstract: Manipulation of wave propagation through open resonant systems has attracted tremendous interest. When accessible to the open system, the system under study is prone to tempering to out of equilibrium, and a lack of reciprocity is the rule rather than the exception. Open systems correspond to non-hermitian Hamiltonians with very unique properties such as resulting exceptional points and ideal isol…

    Submitted 17 April, 2024; originally announced April 2024.

  6. arXiv:2404.09544  [pdf, other]

    cs.LG cs.AI

    GNNavigator: Towards Adaptive Training of Graph Neural Networks via Automatic Guideline Exploration

    Authors: Tong Qiao, Jianlei Yang, Yingjie Qi, Ao Zhou, Chen Bai, Bei Yu, Weisheng Zhao, Chunming Hu

    Abstract: Graph Neural Networks (GNNs) have recently achieved significant success in many applications. However, balancing GNN training runtime cost, memory consumption, and attainable accuracy for various applications is non-trivial. Previous training methodologies suffer from inferior adaptability and lack a unified training optimization solution. To address the problem, this work proposes GNNavigator, an adaptive G…

    Submitted 15 April, 2024; originally announced April 2024.

    Comments: Accepted by DAC'24

  7. arXiv:2404.09185  [pdf, other]

    cond-mat.str-el

    Robust spin order and fragile charge order in Na0.5CoO2 as revealed by time-resolved terahertz spectroscopy

    Authors: X. Y. Zhou, S. J. Zhang, D. Wu, H. Wang, B. H. Li, S. F. Wu, Q. M. Liu, T. C. Hu, R. S. Li, J. Y. Yuan, S. X. Xu, Q. Wu, L. Yue, T. Dong, N. L. Wang

    Abstract: Near-infrared (NIR) pump-terahertz (THz) probe spectroscopy is used to investigate the charge and spin excitations in a strongly correlated electron compound Na0.5CoO2. This compound exhibits a coexistence of various charge and spin orders arising from intricate interactions among charge, spin, and orbital degrees of freedom. NIR pulses create significantly diverse effects on the charge and spin or…

    Submitted 14 April, 2024; originally announced April 2024.

  8. arXiv:2404.08943  [pdf, other]

    math.OC eess.SY

    A Novel State-Centric Necessary Condition for Time-Optimal Control of Controllable Linear Systems Based on Augmented Switching Laws

    Authors: Yunan Wang, Chuxiong Hu, Yujie Lin, Zeyang Li, Shize Lin, Suqin He

    Abstract: Most existing necessary conditions for optimal control based on adjoining methods require both state information and costate information, yet the lack of costates for a given feasible trajectory in practice impedes the determination of optimality. This paper establishes a novel theoretical framework for time-optimal control of controllable linear systems, proposing the augmented switching law that…

    Submitted 13 April, 2024; originally announced April 2024.

  9. arXiv:2404.08706  [pdf, other]

    cs.AI

    Game Generation via Large Language Models

    Authors: Chengpeng Hu, Yunlong Zhao, Jialin Liu

    Abstract: Recently, the emergence of large language models (LLMs) has unlocked new opportunities for procedural content generation. However, recent attempts mainly focus on level generation for specific games with defined game rules such as Super Mario Bros. and Zelda. This paper investigates the game generation via LLMs. Based on video game description language, this paper proposes an LLM-based framework t…

    Submitted 29 May, 2024; v1 submitted 11 April, 2024; originally announced April 2024.

    Comments: 2024 IEEE Conference on Games

  10. arXiv:2404.08382  [pdf, other]

    cs.CL cs.AI

    Look at the Text: Instruction-Tuned Language Models are More Robust Multiple Choice Selectors than You Think

    Authors: Xinpeng Wang, Chengzhi Hu, Bolei Ma, Paul Röttger, Barbara Plank

    Abstract: Multiple choice questions (MCQs) are commonly used to evaluate the capabilities of large language models (LLMs). One common way to evaluate the model response is to rank the candidate answers based on the log probability of the first token prediction. An alternative way is to examine the text output. Prior work has shown that first token probabilities lack robustness to changes in MCQ phrasing, an…

    Submitted 12 April, 2024; originally announced April 2024.

  11. arXiv:2404.07577  [pdf, other]

    cs.LG eess.SP

    Generating Comprehensive Lithium Battery Charging Data with Generative AI

    Authors: Lidang Jiang, Changyan Hu, Sibei Ji, Hang Zhao, Junxiong Chen, Ge He

    Abstract: In optimizing performance and extending the lifespan of lithium batteries, accurate state prediction is pivotal. Traditional regression and classification methods have achieved some success in battery state prediction. However, the efficacy of these data-driven approaches heavily relies on the availability and quality of public datasets. Additionally, generating electrochemical data predominantly…

    Submitted 11 April, 2024; originally announced April 2024.

  12. arXiv:2404.07343  [pdf, other]

    astro-ph.GA

    Monitoring AGNs with H$β$ Asymmetry. IV. First Reverberation Mapping Results of 14 AGNs

    Authors: T. E. Zastrocky, Michael S. Brotherton, Pu Du, Jacob N. McLane, Kianna A. Olson, D. A. Dale, H. A. Kobulnicky, Jaya Maithil, My L. Nguyen, William T. Chick, David H. Kasper, Derek Hand, C. Adelman, Z. Carter, G. Murphree, M. Oeur, T. Roth, S. Schonsberg, M. J. Caradonna, J. Favro, A. J. Ferguson, I. M. Gonzalez, L. M. Hadding, H. D. Hagler, C. J. Rogers , et al. (19 additional authors not shown)

    Abstract: We report first-time reverberation mapping results for 14 AGNs from the ongoing Monitoring AGNs with H$β$ Asymmetry campaign (MAHA). These results utilize optical spectra obtained with the Long Slit Spectrograph on the Wyoming Infrared 2.3m Telescope between 2017 November-2023 May. MAHA combines long-duration monitoring with high cadence. We report results from multiple observing seasons for 9 of…

    Submitted 10 April, 2024; originally announced April 2024.

    Comments: 35 pages, 19 figures, accepted for publication in ApJ Supplement

  13. arXiv:2404.07157  [pdf, other]

    cond-mat.str-el cond-mat.mes-hall

    Local probe of bulk and edge states in a fractional Chern insulator

    Authors: Zhurun Ji, Heonjoon Park, Mark E. Barber, Chaowei Hu, Kenji Watanabe, Takashi Taniguchi, Jiun-Haw Chu, Xiaodong Xu, Zhi-xun Shen

    Abstract: Fractional quantum Hall effect (FQHE) is a prime example of topological quantum many-body phenomena, arising from the interplay between strong electron correlation, topological order, and time reversal symmetry breaking. Recently, a lattice analog of FQHE at zero magnetic field has been observed, confirming the existence of a zero-field fractional Chern insulator (FCI). Despite this, the bulk-edge…

    Submitted 10 April, 2024; originally announced April 2024.

  14. arXiv:2404.05829  [pdf, other]

    cs.CL cs.AI cs.LG

    SambaLingo: Teaching Large Language Models New Languages

    Authors: Zoltan Csaki, Bo Li, Jonathan Li, Qiantong Xu, Pian Pawakapan, Leon Zhang, Yun Du, Hengyu Zhao, Changran Hu, Urmish Thakker

    Abstract: Despite the widespread availability of LLMs, there remains a substantial gap in their capabilities and availability across diverse languages. One approach to address these issues has been to take an existing pre-trained LLM and continue to train it on new languages. While prior works have experimented with language adaptation, many questions around best practices and methodology have not been cove…

    Submitted 8 April, 2024; originally announced April 2024.

    Comments: 23 pages

  15. arXiv:2404.05605  [pdf, other]

    cs.LG cs.AI

    Graph Neural Networks Automated Design and Deployment on Device-Edge Co-Inference Systems

    Authors: Ao Zhou, Jianlei Yang, Tong Qiao, Yingjie Qi, Zhi Yang, Weisheng Zhao, Chunming Hu

    Abstract: The key to the device-edge co-inference paradigm is to partition models into computation-friendly and computation-intensive parts across the device and the edge, respectively. However, for Graph Neural Networks (GNNs), we find that simply partitioning without altering their structures can hardly achieve the full potential of the co-inference paradigm due to various computational-communication overhead…

    Submitted 8 April, 2024; originally announced April 2024.

    Comments: Accepted by DAC'24

  16. arXiv:2404.05158  [pdf, ps, other]

    quant-ph cond-mat.mes-hall physics.optics

    Quantum and Classical Two-photon Interference of Single Photons with Ultralong Coherence Time

    Authors: Manman Wang, Yanfeng Li, Hanqing Liu, Haiqiao Ni, Zhichuan Niu, Xiaogang Wei, Renfu Yang, Chengyong Hu

    Abstract: Two-photon interference (TPI) is a fundamental phenomenon in quantum optics and plays a crucial role in quantum information science and technology. TPI is commonly considered as quantum interference with an upper bound of $100\%$ for both the TPI visibility and the beat visibility in contrast to its classical counterpart with a maximum visibility of $50\%$. However, this is not always the case. He…

    Submitted 7 April, 2024; originally announced April 2024.

    Comments: 7 pages, 4 figures, Comments are welcome

  17. arXiv:2404.04801  [pdf, ps, other]

    astro-ph.IM astro-ph.HE

    LHAASO-KM2A detector simulation using Geant4

    Authors: Zhen Cao, F. Aharonian, Q. An, Axikegu, Y. X. Bai, Y. W. Bao, D. Bastieri, X. J. Bi, Y. J. Bi, J. T. Cai, Q. Cao, W. Y. Cao, Zhe Cao, J. Chang, J. F. Chang, A. M. Chen, E. S. Chen, Liang Chen, Lin Chen, Long Chen, M. J. Chen, M. L. Chen, Q. H. Chen, S. H. Chen, S. Z. Chen , et al. (254 additional authors not shown)

    Abstract: KM2A is one of the main sub-arrays of LHAASO, working on gamma ray astronomy and cosmic ray physics at energies above 10 TeV. Detector simulation is the important foundation for estimating detector performance and data analysis. It is a big challenge to simulate the KM2A detector in the framework of Geant4 due to the need to track numerous photons from a large number of detector units (>6000) with…

    Submitted 7 April, 2024; originally announced April 2024.

  18. Physics-Informed Machine Learning for Battery Degradation Diagnostics: A Comparison of State-of-the-Art Methods

    Authors: Sina Navidi, Adam Thelen, Tingkai Li, Chao Hu

    Abstract: Monitoring the health of lithium-ion batteries' internal components as they age is crucial for optimizing cell design and usage control strategies. However, quantifying component-level degradation typically involves aging many cells and destructively analyzing them throughout the aging test, limiting the scope of quantifiable degradation to the test conditions and duration. Fortunately, recent adv…

    Submitted 5 April, 2024; originally announced April 2024.

    Comments: It's an unformatted version of the paper titled 'Physics-Informed Machine Learning for Battery Degradation Diagnostics: A Comparison of State-of-the-Art Methods,' published in Energy Storage Materials, Volume 68, 103343. This version includes an acknowledgment section, which is not present in the journal-published version. Please cite the journal version when you refer to this study

    Journal ref: Energy Storage Materials (2024): 103343

  19. arXiv:2403.19837  [pdf, other]

    cs.LG cs.AI cs.CL cs.CV cs.LO

    Concept-based Analysis of Neural Networks via Vision-Language Models

    Authors: Ravi Mangal, Nina Narodytska, Divya Gopinath, Boyue Caroline Hu, Anirban Roy, Susmit Jha, Corina Pasareanu

    Abstract: The analysis of vision-based deep neural networks (DNNs) is highly desirable but it is very challenging due to the difficulty of expressing formal specifications for vision tasks and the lack of efficient verification procedures. In this paper, we propose to leverage emerging multimodal, vision-language, foundation models (VLMs) as a lens through which we can reason about vision models. VLMs have…

    Submitted 10 April, 2024; v1 submitted 28 March, 2024; originally announced March 2024.

  20. arXiv:2403.19754  [pdf, other]

    cs.CL

    GOLD: Generalized Knowledge Distillation via Out-of-Distribution-Guided Language Data Generation

    Authors: Mohsen Gholami, Mohammad Akbari, Cindy Hu, Vaden Masrani, Z. Jane Wang, Yong Zhang

    Abstract: Knowledge distillation from LLMs is essential for the efficient deployment of language models. Prior works have proposed data generation using LLMs for preparing distilled models. We argue that generating data with LLMs is prone to sampling mainly from the center of original content distribution. This limitation hinders the distilled model from learning the true underlying data distribution and to…

    Submitted 28 March, 2024; originally announced March 2024.

  21. arXiv:2403.17675  [pdf, other]

    math.OC eess.SY

    Chattering Phenomena in Time-Optimal Control for High-Order Chain-of-Integrators Systems with Full State Constraints

    Authors: Yunan Wang, Chuxiong Hu, Zeyang Li, Yujie Lin, Shize Lin, Suqin He

    Abstract: Time-optimal control for high-order chain-of-integrators systems with full state constraints remains an open and challenging problem in the optimal control theory domain. The behaviors of optimal control in high-order problems lack precise characterization, and even the existence of the chattering phenomenon remains unknown and overlooked. This paper establishes a theoretical framework for cha…

    Submitted 29 March, 2024; v1 submitted 26 March, 2024; originally announced March 2024.

  22. arXiv:2403.17253  [pdf, ps, other]

    quant-ph cond-mat.mes-hall physics.atom-ph physics.optics

    Convert laser light into single photons via interference

    Authors: Yanfeng Li, Manman Wang, Guoqi Huang, Li Liu, Wenyan Wang, Weijie Ji, Hanqing Liu, Xiangbin Su, Shulun Li, Deyan Dai, Xiangjun Shang, Haiqiao Ni, Zhichuan Niu, Chengyong Hu

    Abstract: Laser light possesses perfect coherence, but cannot be attenuated to single photons via linear optics. An elegant route to convert laser light into single photons is based on photon blockade in a cavity with a single atom in the strong coupling regime. However, the single-photon purity achieved by this method remains relatively low. Here we propose an interference-based approach where laser light…

    Submitted 25 March, 2024; originally announced March 2024.

    Comments: Comments are welcome

  23. arXiv:2403.16040  [pdf, ps, other]

    hep-ph

    General One-loop Generating Function by IBP relations

    Authors: Bo Feng, Chang Hu, Jiyuan Shen, Yaobo Zhang

    Abstract: In this paper we have studied the most general generating function of reduction for one loop integrals with arbitrary tensor structure in numerator and arbitrary power distribution of propagators in denominator. Using IBP relations, we have established the partial differential equations for these generating functions and solved them analytically. These results provide useful guidance for applying…

    Submitted 24 March, 2024; originally announced March 2024.

    Comments: 50 pages

  24. arXiv:2403.14301  [pdf, other]

    physics.optics physics.app-ph

    Picotesla-sensitivity microcavity optomechanical magnetometry

    Authors: Zhi-Gang Hu, Yi-Meng Gao, Jian-Fei Liu, Hao Yang, Min Wang, Yuechen Lei, Xin Zhou, **cheng Li, Xuening Cao, ****g Liang, Chao-Qun Hu, Zhilin Li, Yong-Chang Lau, Jian-Wang Cai, Bei-Bei Li

    Abstract: Cavity optomechanical systems have enabled precision sensing of magnetic fields, by leveraging the optical resonance-enhanced readout and mechanical resonance-enhanced response. Previous studies have successfully achieved scalable and reproducible microcavity optomechanical magnetometry (MCOM) by incorporating Terfenol-D thin films into high-quality ($Q$) factor whispering gallery mode (WGM) micro…

    Submitted 21 March, 2024; originally announced March 2024.

  25. arXiv:2403.14077  [pdf, other]

    cs.AI cs.CR

    Can ChatGPT Detect DeepFakes? A Study of Using Multimodal Large Language Models for Media Forensics

    Authors: Shan Jia, Reilin Lyu, Kangran Zhao, Yize Chen, Zhiyuan Yan, Yan Ju, Chuanbo Hu, Xin Li, Baoyuan Wu, Siwei Lyu

    Abstract: DeepFakes, which refer to AI-generated media content, have become an increasing concern due to their use as a means for disinformation. Detecting DeepFakes is currently solved with programmed machine learning algorithms. In this work, we investigate the capabilities of multimodal large language models (LLMs) in DeepFake detection. We conducted qualitative and quantitative experiments to demonstrat…

    Submitted 11 June, 2024; v1 submitted 20 March, 2024; originally announced March 2024.

  26. arXiv:2403.12631  [pdf]

    cs.RO cs.AI

    PointGrasp: Point Cloud-based Grasping for Tendon-driven Soft Robotic Glove Applications

    Authors: Chen Hu, Shirui Lyu, Eojin Rho, Daekyum Kim, Shan Luo, Letizia Gionfrida

    Abstract: Controlling hand exoskeletons to assist individuals with grasping tasks poses a challenge due to the difficulty in understanding user intentions. We propose that most daily grasping tasks during activities of daily living (ADL) can be deduced by analyzing object geometries (simple and complex) from 3D point clouds. The study introduces PointGrasp, a real-time system designed for identifying househ…

    Submitted 19 March, 2024; originally announced March 2024.

    Comments: 6 pages, 8 figures, conference

    ACM Class: I.2; I.4

  27. arXiv:2403.12373  [pdf, other]

    cs.CL

    RankPrompt: Step-by-Step Comparisons Make Language Models Better Reasoners

    Authors: Chi Hu, Yuan Ge, Xiangnan Ma, Hang Cao, Qiang Li, Yonghua Yang, Tong Xiao, Jingbo Zhu

    Abstract: Large Language Models (LLMs) have achieved impressive performance across various reasoning tasks. However, even state-of-the-art LLMs such as ChatGPT are prone to logical errors during their reasoning processes. Existing solutions, such as deploying task-specific verifiers or voting over multiple reasoning paths, either require extensive human annotations or fail in scenarios with inconsistent res…

    Submitted 22 March, 2024; v1 submitted 18 March, 2024; originally announced March 2024.

    Comments: LREC-Coling 2024 Long Paper

  28. arXiv:2403.11465  [pdf]

    cond-mat.mes-hall cond-mat.mtrl-sci

    Ultra-Long Homochiral Graphene Nanoribbons Grown Within h-BN Stacks for High-Performance Electronics

    Authors: Bosai Lyu, Jiajun Chen, Sen Wang, Shuo Lou, Peiyue Shen, Jingxu Xie, Lu Qiu, Izaac Mitchell, Can Li, Cheng Hu, Xianliang Zhou, Kenji Watanabe, Takashi Taniguchi, Xiaoqun Wang, Jinfeng Jia, Qi Liang, Guorui Chen, Tingxin Li, Shiyong Wang, Wengen Ouyang, Oded Hod, Feng Ding, Michael Urbakh, Zhiwen Shi

    Abstract: Van der Waals encapsulation of two-dimensional materials within hexagonal boron nitride (h-BN) stacks has proven to be a promising way to create ultrahigh-performance electronic devices. However, contemporary approaches for achieving van der Waals encapsulation, which involve artificial layer stacking using mechanical transfer techniques, are difficult to control, prone to contamination, and unsca…

    Submitted 18 March, 2024; originally announced March 2024.

  29. Measurements of All-Particle Energy Spectrum and Mean Logarithmic Mass of Cosmic Rays from 0.3 to 30 PeV with LHAASO-KM2A

    Authors: The LHAASO Collaboration, Zhen Cao, F. Aharonian, Q. An, A. Axikegu, Y. X. Bai, Y. W. Bao, D. Bastieri, X. J. Bi, Y. J. Bi, J. T. Cai, Q. Cao, W. Y. Cao, Zhe Cao, J. Chang, J. F. Chang, A. M. Chen, E. S. Chen, Liang Chen, Lin Chen, Long Chen, M. J. Chen, M. L. Chen, Q. H. Chen, S. H. Chen , et al. (256 additional authors not shown)

    Abstract: We present the measurements of all-particle energy spectrum and mean logarithmic mass of cosmic rays in the energy range of 0.3-30 PeV using data collected from LHAASO-KM2A between September 2021 and December 2022, which is based on a nearly composition-independent energy reconstruction method, achieving unprecedented accuracy. Our analysis reveals the position of the knee at…

    Submitted 26 March, 2024; v1 submitted 15 March, 2024; originally announced March 2024.

    Comments: 8 pages, 3 figures

    Journal ref: Physical Review Letters 132, 131002 (2024)

  30. arXiv:2403.05530  [pdf, other]

    cs.CL cs.AI

    Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context

    Authors: Gemini Team, Petko Georgiev, Ving Ian Lei, Ryan Burnell, Libin Bai, Anmol Gulati, Garrett Tanzer, Damien Vincent, Zhufeng Pan, Shibo Wang, Soroosh Mariooryad, Yifan Ding, Xinyang Geng, Fred Alcober, Roy Frostig, Mark Omernick, Lexi Walker, Cosmin Paduraru, Christina Sorokin, Andrea Tacchetti, Colin Gaffney, Samira Daruki, Olcan Sercinoglu, Zach Gleicher, Juliette Love , et al. (1092 additional authors not shown)

    Abstract: In this report, we introduce the Gemini 1.5 family of models, representing the next generation of highly compute-efficient multimodal models capable of recalling and reasoning over fine-grained information from millions of tokens of context, including multiple long documents and hours of video and audio. The family includes two new models: (1) an updated Gemini 1.5 Pro, which exceeds the February…

    Submitted 14 June, 2024; v1 submitted 8 March, 2024; originally announced March 2024.

  31. arXiv:2403.04158  [pdf, other]

    cs.CL cs.AI

    DA-Net: A Disentangled and Adaptive Network for Multi-Source Cross-Lingual Transfer Learning

    Authors: Ling Ge, Chunming Hu, Guanghui Ma, Jihong Liu, Hong Zhang

    Abstract: Multi-Source cross-lingual transfer learning deals with the transfer of task knowledge from multiple labelled source languages to an unlabeled target language under the language shift. Existing methods typically focus on weighting the predictions produced by language-specific classifiers of different sources that follow a shared encoder. However, all source languages share the same encoder, which…

    Submitted 6 March, 2024; originally announced March 2024.

    Comments: AAAI 2024

  32. arXiv:2403.03954  [pdf, other]

    cs.RO cs.CV cs.LG

    3D Diffusion Policy: Generalizable Visuomotor Policy Learning via Simple 3D Representations

    Authors: Yanjie Ze, Gu Zhang, Kangning Zhang, Chenyuan Hu, Muhan Wang, Huazhe Xu

    Abstract: Imitation learning provides an efficient way to teach robots dexterous skills; however, learning complex skills robustly and generalizably usually consumes large amounts of human demonstrations. To tackle this challenging problem, we present 3D Diffusion Policy (DP3), a novel visual imitation learning approach that incorporates the power of 3D visual representations into diffusion policies, a cl…

    Submitted 8 June, 2024; v1 submitted 6 March, 2024; originally announced March 2024.

    Comments: Published at Robotics: Science and Systems (RSS) 2024. Videos, code, and data: https://3d-diffusion-policy.github.io

  33. arXiv:2403.02743  [pdf]

    physics.flu-dyn

    Spectral effects of radiating gases on the ignition in a multiswirl staged model combustor using full-spectrum k distribution method -- A Large Eddy Simulation Investigation

    Authors: Hongyuan Di, Chaojun Wang, Chuanlong Hu, Xiao Liu, Lixin Yang

    Abstract: Radiative heat transfer has been proven to be important during the ignition process in gas turbines. The radiating gases (CO2, H2O, CO) generated during combustion may display strong spectral, or nongray, behavior, which is difficult to both characterize and calculate. In this work, both the full-spectrum k-distribution (FSK) and weighted-sum-of-gray-gases (WSGG) methods, along with the Dynamic-thi…

    Submitted 12 April, 2024; v1 submitted 5 March, 2024; originally announced March 2024.

  34. arXiv:2403.01570  [pdf, other]

    cs.CL cs.LG

    SERVAL: Synergy Learning between Vertical Models and LLMs towards Oracle-Level Zero-shot Medical Prediction

    Authors: Jiahuan Yan, Jintai Chen, Chaowen Hu, Bo Zheng, Yaojun Hu, Jimeng Sun, Jian Wu

    Abstract: Recent development of large language models (LLMs) has exhibited impressive zero-shot proficiency on generic and common sense questions. However, LLMs' application on domain-specific vertical questions still lags behind, primarily due to the humiliation problems and deficiencies in vertical knowledge. Furthermore, the vertical data annotation process often requires labor-intensive expert involveme…

    Submitted 16 March, 2024; v1 submitted 3 March, 2024; originally announced March 2024.

  35. arXiv:2402.19401  [pdf, other]

    cs.CV

    Assessing Visually-Continuous Corruption Robustness of Neural Networks Relative to Human Performance

    Authors: Huakun Shen, Boyue Caroline Hu, Krzysztof Czarnecki, Lina Marsso, Marsha Chechik

    Abstract: While Neural Networks (NNs) have surpassed human accuracy in image classification on ImageNet, they often lack robustness against image corruption, i.e., corruption robustness. Yet such robustness is seemingly effortless for human perception. In this paper, we propose visually-continuous corruption robustness (VCR) -- an extension of corruption robustness to allow assessing it over the wide and co…

    Submitted 29 February, 2024; originally announced February 2024.

  36. arXiv:2402.18191  [pdf, other]

    cs.CL

    Clustering and Ranking: Diversity-preserved Instruction Selection through Expert-aligned Quality Estimation

    Authors: Yuan Ge, Yilun Liu, Chi Hu, Weibin Meng, Shimin Tao, Xiaofeng Zhao, Hongxia Ma, Li Zhang, Hao Yang, Tong Xiao

    Abstract: With contributions from the open-source community, a vast amount of instruction tuning (IT) data has emerged. Given the significant resource allocation required by training and evaluating models, it is advantageous to have an efficient method for selecting high-quality IT data. However, existing methods for instruction data selection have limitations such as relying on fragile external APIs, being…

    Submitted 28 February, 2024; originally announced February 2024.

  37. arXiv:2402.14499  [pdf, other]

    cs.CL

    "My Answer is C": First-Token Probabilities Do Not Match Text Answers in Instruction-Tuned Language Models

    Authors: Xinpeng Wang, Bolei Ma, Chengzhi Hu, Leon Weber-Genzel, Paul Röttger, Frauke Kreuter, Dirk Hovy, Barbara Plank

    Abstract: The open-ended nature of language generation makes the evaluation of autoregressive large language models (LLMs) challenging. One common evaluation approach uses multiple-choice questions (MCQ) to limit the response space. The model is then evaluated by ranking the candidate answers by the log probability of the first token prediction. However, first-tokens may not consistently reflect the final r…

    Submitted 4 July, 2024; v1 submitted 22 February, 2024; originally announced February 2024.

    Comments: ACL 2024 Findings

  38. arXiv:2402.10987  [pdf, other]

    cs.CL cs.AI

    WilKE: Wise-Layer Knowledge Editor for Lifelong Knowledge Editing

    Authors: Chenhui Hu, Pengfei Cao, Yubo Chen, Kang Liu, Jun Zhao

    Abstract: Knowledge editing aims to rectify inaccuracies in large language models (LLMs) without costly retraining for outdated or erroneous knowledge. However, current knowledge editing methods primarily focus on single editing, failing to meet the requirements for lifelong editing. This study reveals a performance degradation encountered by knowledge editing in lifelong editing, characterized by toxicity…

    Submitted 5 June, 2024; v1 submitted 16 February, 2024; originally announced February 2024.

    Comments: To be published in ACL Findings 2024

  39. arXiv:2402.10476  [pdf, other]

    cs.CV

    Spike-EVPR: Deep Spiking Residual Network with Cross-Representation Aggregation for Event-Based Visual Place Recognition

    Authors: Chenming Hu, Zheng Fang, Kuanxu Hou, Delei Kong, Junjie Jiang, Hao Zhuang, Mingyuan Sun, Xinjie Huang

    Abstract: Event cameras have been successfully applied to visual place recognition (VPR) tasks by using deep artificial neural networks (ANNs) in recent years. However, previously proposed deep ANN architectures are often unable to harness the abundant temporal information presented in event streams. In contrast, deep spiking networks exhibit more intricate spatiotemporal dynamics and are inherently well-su…

    Submitted 16 February, 2024; originally announced February 2024.

    Comments: 14 pages, 10 figures

  40. Rapid spin changes around a magnetar fast radio burst

    Authors: Chin-Ping Hu, Takuto Narita, Teruaki Enoto, George Younes, Zorawar Wadiasingh, Matthew G. Baring, Wynn C. G. Ho, Sebastien Guillot, Paul S. Ray, Tolga Guver, Kaustubh Rajwade, Zaven Arzoumanian, Chryssa Kouveliotou, Alice K. Harding, Keith C. Gendreau

    Abstract: Magnetars are neutron stars with extremely high magnetic fields that exhibit various X-ray phenomena such as sporadic sub-second bursts, long-term persistent flux enhancements, and variable rates of rotation period change. In 2020, a fast radio burst (FRB), akin to cosmological millisecond-duration radio bursts, was detected from the Galactic magnetar SGR 1935+2154, confirming the long-suspected a…

    Submitted 14 February, 2024; originally announced February 2024.

    Comments: 46 pages, 9 figures, 4 tables, a submitted version of Nature 626, 500 (https://www.nature.com/articles/s41586-023-07012-5)

  41. arXiv:2402.05733  [pdf, other

    cs.CL

    TimeArena: Shaping Efficient Multitasking Language Agents in a Time-Aware Simulation

    Authors: Yikai Zhang, Siyu Yuan, Caiyu Hu, Kyle Richardson, Yanghua Xiao, Jiangjie Chen

    Abstract: Despite remarkable advancements in emulating human-like behavior through Large Language Models (LLMs), current textual simulations do not adequately address the notion of time. To this end, we introduce TimeArena, a novel textual simulated environment that incorporates complex temporal dynamics and constraints that better reflect real-life planning scenarios. In TimeArena, agents are asked to comp…

    Submitted 8 February, 2024; originally announced February 2024.

    Comments: Work in progress

  42. arXiv:2402.00320  [pdf

    eess.IV

    DARCS: Memory-Efficient Deep Compressed Sensing Reconstruction for Acceleration of 3D Whole-Heart Coronary MR Angiography

    Authors: Zhihao Xue, Fan Yang, Juan Gao, Zhuo Chen, Hao Peng, Chao Zou, Hang **, Chenxi Hu

    Abstract: Three-dimensional coronary magnetic resonance angiography (CMRA) demands reconstruction algorithms that can significantly suppress the artifacts from a heavily undersampled acquisition. While unrolling-based deep reconstruction methods have achieved state-of-the-art performance on 2D image reconstruction, their application to 3D reconstruction is hindered by the large amount of memory needed to tr…

    Submitted 2 February, 2024; v1 submitted 31 January, 2024; originally announced February 2024.

    Comments: 10 pages, 8 figures

  43. arXiv:2401.16758  [pdf, other

    astro-ph.HE physics.geo-ph

    Similarity to earthquakes again: periodic radio pulses of the magnetar SGR 1935+2154 are accompanied by aftershocks like fast radio bursts

    Authors: Yuya Tsuzuki, Tomonori Totani, Chin-Ping Hu, Teruaki Enoto

    Abstract: It was recently discovered that the time correlations of repeating fast radio bursts (FRBs) are similar to earthquake aftershocks. Motivated by the association between FRBs and magnetars, here we report correlation function analyses in the time-energy space for the 563 periodic radio pulses and the 579 X-ray short bursts from the magnetar SGR 1935+2154, which is known to have generated FRBs. Altho…

    Submitted 9 April, 2024; v1 submitted 30 January, 2024; originally announced January 2024.

    Comments: 9 pages, 7 figures. Accepted by MNRAS

  44. arXiv:2401.16723  [pdf, other

    q-fin.RM

    Improving Business Insurance Loss Models by Leveraging InsurTech Innovation

    Authors: Zhiyu Quan, Changyue Hu, Panyi Dong, Emiliano A. Valdez

    Abstract: Recent transformative and disruptive advancements in the insurance industry have embraced various InsurTech innovations. In particular, with the rapid progress in data science and computational capabilities, InsurTech is able to integrate a multitude of emerging data sources, shedding light on opportunities to enhance risk classification and claims management. This paper presents a groundbreaking…

    Submitted 29 January, 2024; originally announced January 2024.

  45. arXiv:2401.16102  [pdf, other

    cs.LG cs.AI cs.CE

    Flexible Parallel Neural Network Architecture Model for Early Prediction of Lithium Battery Life

    Authors: Lidang Jiang, Zhuoxiang Li, Changyan Hu, Qingsong Huang, Ge He

    Abstract: The early prediction of battery life (EPBL) is vital for enhancing the efficiency and extending the lifespan of lithium batteries. Traditional models with fixed architectures often encounter underfitting or overfitting issues due to the diverse data distributions in different EPBL tasks. An interpretable deep learning model of flexible parallel neural network (FPNN) is proposed, which includes an…

    Submitted 29 January, 2024; originally announced January 2024.

  46. arXiv:2401.15703  [pdf, other

    stat.ME stat.AP

    A Bayesian multivariate extreme value mixture model

    Authors: Chenglei Hu, Ben Swallow, Daniela Castro-Camilo

    Abstract: Impact assessment of natural hazards requires the consideration of both extreme and non-extreme events. Extensive research has been conducted on the joint modeling of bulk and tail in univariate settings; however, the corresponding body of research in the context of multivariate analysis is comparatively scant. This study extends the univariate joint modeling of bulk and tail to the multivariate f…

    Submitted 28 January, 2024; originally announced January 2024.

    Comments: 34 pages, 7 figures

  47. arXiv:2401.14699  [pdf, other

    cond-mat.str-el

    Quantum Oscillations Measurement of the Heavy Electron Mass near the van Hove Singularity in a Kagome Metal

    Authors: Elliott Rosenberg, Jonathan DeStefano, Yongbin Lee, Chaowei Hu, Yue Shi, David Graf, Shermane M. Benjamin, Liqin Ke, Jiun-Haw Chu

    Abstract: Kagome metals with the Fermi energy tuned near the van Hove singularities (vHss) have shown to host exotic phases including unconventional superconductivity and a chiral flux phase arising from a charge density wave. However, most quantum oscillations studies of the electronic structure of kagome metals focus on compounds which electronically or magnetically order, obscuring the unperturbed vHs. H…

    Submitted 26 January, 2024; originally announced January 2024.

  48. arXiv:2401.11500  [pdf, other

    cs.RO cs.AI eess.SY

    Integration of Large Language Models in Control of EHD Pumps for Precise Color Synthesis

    Authors: Yanhong Peng, Ceng Zhang, Chenlong Hu, Zebing Mao

    Abstract: This paper presents an innovative approach to integrating Large Language Models (LLMs) with Arduino-controlled Electrohydrodynamic (EHD) pumps for precise color synthesis in automation systems. We propose a novel framework that employs fine-tuned LLMs to interpret natural language commands and convert them into specific operational instructions for EHD pump control. This approach aims to enhance u…

    Submitted 21 January, 2024; originally announced January 2024.

  49. arXiv:2401.11181  [pdf, other

    cs.DC

    Inference without Interference: Disaggregate LLM Inference for Mixed Downstream Workloads

    Authors: Cunchen Hu, Heyang Huang, Liangliang Xu, Xusheng Chen, Jiang Xu, Shuang Chen, Hao Feng, Chenxi Wang, Sa Wang, Yungang Bao, Ninghui Sun, Yizhou Shan

    Abstract: Transformer-based large language model (LLM) inference serving is now the backbone of many cloud services. LLM inference consists of a prefill phase and a decode phase. However, existing LLM deployment practices often overlook the distinct characteristics of these phases, leading to significant interference. To mitigate interference, our insight is to carefully schedule and group inference request…

    Submitted 20 January, 2024; originally announced January 2024.

  50. arXiv:2401.00283  [pdf, other

    cs.IT eess.SP

    Near-Space Communications: the Last Piece of 6G Space-Air-Ground-Sea Integrated Network Puzzle

    Authors: Hongshan Liu, Tong Qin, Zhen Gao, Tianqi Mao, Keke Ying, Ziwei Wan, Li Qiao, Rui Na, Zhongxiang Li, Chun Hu, Yikun Mei, Tuan Li, Guanghui Wen, Lei Chen, Zhonghuai Wu, Ruiqi Liu, Gaojie Chen, Shuo Wang, Dezhi Zheng

    Abstract: This article presents a comprehensive study on the emerging near-space communications (NS-COM) within the context of space-air-ground-sea integrated network (SAGSIN). Specifically, we firstly explore the recent technical developments of NS-COM, followed by the discussions about motivations behind integrating NS-COM into SAGSIN. To further demonstrate the necessity of NS-COM, a comparative analysis…

    Submitted 4 March, 2024; v1 submitted 30 December, 2023; originally announced January 2024.

    Comments: 28 pages, 8 figures, 2 tables