Showing 51–100 of 1,041 results for author: Zha, D

  1. arXiv:2404.17684 [pdf, other]

    cs.RO cs.LG

    Generalize by Touching: Tactile Ensemble Skill Transfer for Robotic Furniture Assembly

    Authors: Haohong Lin, Radu Corcodel, Ding Zhao

    Abstract: Furniture assembly remains an unsolved problem in robotic manipulation due to its long task horizon and nongeneralizable operations plan. This paper presents the Tactile Ensemble Skill Transfer (TEST) framework, a pioneering offline reinforcement learning (RL) approach that incorporates tactile feedback in the control loop. TEST's core design is to learn a skill transition model for high-level pla…

    Submitted 26 April, 2024; originally announced April 2024.

  2. arXiv:2404.16425 [pdf, other]

    astro-ph.HE

    Soft X-ray prompt emission from a high-redshift gamma-ray burst EP240315a

    Authors: Y. Liu, H. Sun, D. Xu, D. S. Svinkin, J. Delaunay, N. R. Tanvir, H. Gao, C. Zhang, Y. Chen, X. -F. Wu, B. Zhang, W. Yuan, J. An, G. Bruni, D. D. Frederiks, G. Ghirlanda, J. -W. Hu, A. Li, C. -K. Li, J. -D. Li, D. B. Malesani, L. Piro, G. Raman, R. Ricci, E. Troja, et al. (170 additional authors not shown)

    Abstract: Long gamma-ray bursts (GRBs) are believed to originate from core collapse of massive stars. High-redshift GRBs can probe the star formation and reionization history of the early universe, but their detection remains rare. Here we report the detection of a GRB triggered in the 0.5--4 keV band by the Wide-field X-ray Telescope (WXT) on board the Einstein Probe (EP) mission, designated as EP240315a,…

    Submitted 25 April, 2024; originally announced April 2024.

    Comments: 41 pages, 8 figures, 7 tables

  3. arXiv:2404.16299 [pdf, ps, other]

    gr-qc

    Conformal transformation of f(Q) gravity and its cosmological perturbations

    Authors: Dehao Zhao

    Abstract: Symmetric teleparallel gravity (STG) is a gravity theory which takes non-metricity tensor to describe gravity effects. In the STG framework, we study the conformal equivalent scalar-tensor theory of f(Q) model and calculate the cosmological linear perturbations of the conformal transformed action. We confirm the result already present in references that f(Q) gravity shows different degrees of free…

    Submitted 24 April, 2024; originally announced April 2024.

    Comments: 14 pages, no figure

  4. arXiv:2404.14934 [pdf, other]

    cs.MM cs.CV cs.HC

    G3R: Generating Rich and Fine-grained mmWave Radar Data from 2D Videos for Generalized Gesture Recognition

    Authors: Kaikai Deng, Dong Zhao, Wenxin Zheng, Yue Ling, Kangwen Yin, Huadong Ma

    Abstract: Millimeter wave radar is gaining traction recently as a promising modality for enabling pervasive and privacy-preserving gesture recognition. However, the lack of rich and fine-grained radar datasets hinders progress in developing generalized deep learning models for gesture recognition across various user postures (e.g., standing, sitting), positions, and scenes. To remedy this, we resort to desi…

    Submitted 23 April, 2024; originally announced April 2024.

    Comments: 18 pages, 29 figures

  5. DeviceRadar: Online IoT Device Fingerprinting in ISPs using Programmable Switches

    Authors: Ruoyu Li, Qing Li, Tao Lin, Qingsong Zou, Dan Zhao, Yucheng Huang, Gareth Tyson, Guorui Xie, Yong Jiang

    Abstract: Device fingerprinting can be used by Internet Service Providers (ISPs) to identify vulnerable IoT devices for early prevention of threats. However, due to the wide deployment of middleboxes in ISP networks, some important data, e.g., 5-tuples and flow statistics, are often obscured, rendering many existing approaches invalid. It is further challenged by the high-speed traffic of hundreds of teraby…

    Submitted 19 April, 2024; originally announced April 2024.

    Comments: Submitted to IEEE/ACM Transactions on Networking (ToN)

  6. arXiv:2404.12022 [pdf, other]

    cs.CL

    Parallel Decoding via Hidden Transfer for Lossless Large Language Model Acceleration

    Authors: Pengfei Wu, Jiahao Liu, Zhuocheng Gong, Qifan Wang, **peng Li, **gang Wang, Xunliang Cai, Dongyan Zhao

    Abstract: Large language models (LLMs) have recently shown remarkable performance across a wide range of tasks. However, the substantial number of parameters in LLMs contributes to significant latency during model inference. This is particularly evident when utilizing autoregressive decoding methods, which generate one token in a single forward process, thereby not fully capitalizing on the parallel computi…

    Submitted 18 April, 2024; originally announced April 2024.

  7. arXiv:2404.10610 [pdf, other]

    cs.CR

    Shining Light into the Tunnel: Understanding and Classifying Network Traffic of Residential Proxies

    Authors: Ronghong Huang, Dongfang Zhao, Xianghang Mi, Xiaofeng Wang

    Abstract: Emerging in recent years, residential proxies (RESIPs) feature multiple unique characteristics when compared with traditional network proxies (e.g., commercial VPNs), particularly, the deployment in residential networks rather than data center networks, the worldwide distribution in tens of thousands of cities and ISPs, and the large scale of millions of exit nodes. All these factors allow RESIP u…

    Submitted 30 April, 2024; v1 submitted 16 April, 2024; originally announced April 2024.

  8. arXiv:2404.10004 [pdf]

    cs.LG physics.soc-ph stat.AP

    A Strategy Transfer and Decision Support Approach for Epidemic Control in Experience Shortage Scenarios

    Authors: X. Xiao, P. Chen, X. Cao, K. Liu, L. Deng, D. Zhao, Z. Chen, Q. Deng, F. Yu, H. Zhang

    Abstract: Epidemic outbreaks can cause critical health concerns and severe global economic crises. For countries or regions with new infectious disease outbreaks, it is essential to generate preventive strategies by learning lessons from others with similar risk profiles. A Strategy Transfer and Decision Support Approach (STDSA) is proposed based on the profile similarity evaluation. There are four steps in…

    Submitted 9 April, 2024; originally announced April 2024.

    Comments: 20 pages, 9 figures

  9. arXiv:2404.05649 [pdf]

    physics.optics physics.app-ph

    Realization of a three-dimensional photonic higher-order topological insulator

    Authors: Ziyao Wang, Yan Meng, Bei Yan, Dong Zhao, Linyun Yang, **g-Ming Chen, Min-Qi Cheng, Tao Xiao, Perry ** Shum, Gui-Geng Liu, Yihao Yang, Hongsheng Chen, Xiang Xi, Zhen-Xiao Zhu, Biye Xie, Zhen Gao

    Abstract: The discovery of photonic higher-order topological insulators (HOTIs) has significantly expanded our understanding of band topology and provided unprecedented lower-dimensional topological boundary states for robust photonic devices. However, due to the vectorial and leaky nature of electromagnetic waves, it is challenging to discover three-dimensional (3D) topological photonic systems and photoni…

    Submitted 8 April, 2024; originally announced April 2024.

    Comments: 23 pages, 4 figures

  10. arXiv:2404.05609 [pdf, other]

    math.OC eess.SY

    Feedback Stability Under Mixed Gain and Phase Uncertainty

    Authors: Jia** Liang, Di Zhao, Li Qiu

    Abstract: In this study, we investigate the robust feedback stability problem for multiple-input-multiple-output linear time-invariant systems involving sectored-disk uncertainty, namely, dynamic uncertainty subject to simultaneous gain and phase constraints. This problem is thereby called a sectored-disk problem. Employing a frequency-wise analysis approach, we derive a fundamental static matrix problem th…

    Submitted 8 April, 2024; originally announced April 2024.

  11. Panoptic Perception: A Novel Task and Fine-grained Dataset for Universal Remote Sensing Image Interpretation

    Authors: Danpei Zhao, Bo Yuan, Ziqiang Chen, Tian Li, Zhuoran Liu, Wentao Li, Yue Gao

    Abstract: Current remote-sensing interpretation models often focus on a single task such as detection, segmentation, or caption. However, the task-specific designed models are unattainable to achieve the comprehensive multi-level interpretation of images. The field also lacks support for multi-task joint interpretation datasets. In this paper, we propose Panoptic Perception, a novel task and a new fine-grai…

    Submitted 25 April, 2024; v1 submitted 6 April, 2024; originally announced April 2024.

    Journal ref: IEEE Transactions on Geoscience and Remote Sensing, 2024

  12. arXiv:2404.00064 [pdf, other]

    hep-ph

    Confronting CP symmetry of order 4 with experimental data

    Authors: Igor P. Ivanov, Duanyang Zhao

    Abstract: CP4 3HDM is a three-Higgs-doublet model based on a $CP$ symmetry of order 4 (CP4). It is the minimal model incorporating CP4 without leading to accidental symmetries or running into immediate conflict with experiment. Imposing CP4 on the lagrangian induces remarkably tight connections between the scalar and Yukawa sectors, including the unavoidable tree-level flavor-changing neutral couplings (FCN…

    Submitted 28 March, 2024; originally announced April 2024.

    Comments: 14 pages, 3 figures, Proceedings of the "Workshop on the Standard Model and Beyond" within the Corfu Summer Institute 2023, Corfu, Greece, August 27 - September 7, 2023

    Journal ref: PoS(CORFU2023)086

  13. arXiv:2403.20173 [pdf, other]

    cs.CV

    MCNet: A crowd denstity estimation network based on integrating multiscale attention module

    Authors: Qiang Guo, Rubo Zhang, Di Zhao

    Abstract: Aiming at the metro video surveillance system has not been able to effectively solve the metro crowd density estimation problem, a Metro Crowd density estimation Network (called MCNet) is proposed to automatically classify crowd density level of passengers. Firstly, an Integrating Multi-scale Attention (IMA) module is proposed to enhance the ability of the plain classifiers to extract semantic cro…

    Submitted 29 March, 2024; originally announced March 2024.

  14. arXiv:2403.19725 [pdf, other]

    cs.CL cs.AI cs.LG

    MUGC: Machine Generated versus User Generated Content Detection

    Authors: Yaqi Xie, Anjali Rawal, Yu**g Cen, Dixuan Zhao, Sunil K Narang, Shanu Sushmita

    Abstract: As advanced modern systems like deep neural networks (DNNs) and generative AI continue to enhance their capabilities in producing convincing and realistic content, the need to distinguish between user-generated and machine generated content is becoming increasingly evident. In this research, we undertake a comparative evaluation of eight traditional machine-learning algorithms to distinguish betwe…

    Submitted 28 March, 2024; originally announced March 2024.

    Comments: 11 pages, 16 figures

  15. arXiv:2403.19248 [pdf, other]

    cs.CR cs.NI

    Genos: General In-Network Unsupervised Intrusion Detection by Rule Extraction

    Authors: Ruoyu Li, Qing Li, Yu Zhang, Dan Zhao, Xi Xiao, Yong Jiang

    Abstract: Anomaly-based network intrusion detection systems (A-NIDS) use unsupervised models to detect unforeseen attacks. However, existing A-NIDS solutions suffer from low throughput, lack of interpretability, and high maintenance costs. Recent in-network intelligence (INI) exploits programmable switches to offer line-rate deployment of NIDS. Nevertheless, current in-network NIDS are either model-specific…

    Submitted 28 March, 2024; originally announced March 2024.

    Comments: accepted by IEEE International Conference on Computer Communications (INFOCOM 2024)

  16. arXiv:2403.18197 [pdf, other]

    cs.RO

    LocoMan: Advancing Versatile Quadrupedal Dexterity with Lightweight Loco-Manipulators

    Authors: Changyi Lin, Xingyu Liu, Yuxiang Yang, Yaru Niu, Wenhao Yu, Tingnan Zhang, Jie Tan, Byron Boots, Ding Zhao

    Abstract: Quadrupedal robots have emerged as versatile agents capable of locomoting and manipulating in complex environments. Traditional designs typically rely on the robot's inherent body parts or incorporate top-mounted arms for manipulation tasks. However, these configurations may limit the robot's operational dexterity, efficiency and adaptability, particularly in cluttered or constrained spaces. In th…

    Submitted 26 March, 2024; originally announced March 2024.

    Comments: Project page: https://linchangyi1.github.io/LocoMan

  17. arXiv:2403.17354 [pdf]

    cond-mat.mtrl-sci

    Large topological Hall effect arising from spin reorientation in kagome magnet Fe3Ge

    Authors: Zixuan Zhang, Mingyue Zhao, Li Ma, Guoke Li, Congmian Zhen, Dewei Zhao, Denglu Hou

    Abstract: Materials systems with spin chirality can provide ultra-high-density, ultra-fast, and ultralow-power information carriers for digital transformation. These material systems include magnetic skyrmions, chiral domain walls, spin reorientation, and so on. The topological Hall effect (THE) has been identified as the most convenient and effective tool for detecting the presence of spin chirality in thes…

    Submitted 25 March, 2024; originally announced March 2024.

  18. arXiv:2403.13208 [pdf, other]

    cs.RO

    CaDRE: Controllable and Diverse Generation of Safety-Critical Driving Scenarios using Real-World Trajectories

    Authors: Peide Huang, Wenhao Ding, Jonathan Francis, Bingqing Chen, Ding Zhao

    Abstract: Simulation is an indispensable tool in the development and testing of autonomous vehicles (AVs), offering an efficient and safe alternative to road testing by allowing the exploration of a wide range of scenarios. Despite its advantages, a significant challenge within simulation-based testing is the generation of safety-critical scenarios, which are essential to ensure that AVs can handle rare but…

    Submitted 19 March, 2024; originally announced March 2024.

  19. arXiv:2403.12969 [pdf, other]

    cs.LG quant-ph

    Entangling Machine Learning with Quantum Tensor Networks

    Authors: Constantijn van der Poel, Dan Zhao

    Abstract: This paper examines the use of tensor networks, which can efficiently represent high-dimensional quantum states, in language modeling. It is a distillation and continuation of the work done in (van der Poel, 2023). To do so, we will abstract the problem down to modeling Motzkin spin chains, which exhibit long-range correlations reminiscent of those found in language. The Matrix Product State (MPS)…

    Submitted 8 January, 2024; originally announced March 2024.

    Comments: See source code at https://github.com/ConstantijnvdP/eidolon

  20. arXiv:2403.11439 [pdf, other]

    cs.CL

    StyleChat: Learning Recitation-Augmented Memory in LLMs for Stylized Dialogue Generation

    Authors: **peng Li, Zekai Zhang, Quan Tu, Xin Cheng, Dongyan Zhao, Rui Yan

    Abstract: Large Language Models (LLMs) demonstrate superior performance in generative scenarios and have attracted widespread attention. Among them, stylized dialogue generation is essential in the context of LLMs for building intelligent and engaging dialogue agent. However the ability of LLMs is data-driven and limited by data bias, leading to poor performance on specific tasks. In particular, stylized di…

    Submitted 17 March, 2024; originally announced March 2024.

  21. arXiv:2403.10228 [pdf, other]

    cs.CV cs.AI cs.CL

    HawkEye: Training Video-Text LLMs for Grounding Text in Videos

    Authors: Yueqian Wang, Xiaojun Meng, Jianxin Liang, Yuxuan Wang, Qun Liu, Dongyan Zhao

    Abstract: Video-text Large Language Models (video-text LLMs) have shown remarkable performance in answering questions and holding conversations on simple videos. However, they perform almost the same as random on grounding text queries in long and complicated videos, having little ability to understand and reason about temporal information, which is the most fundamental difference between videos and images.…

    Submitted 15 March, 2024; originally announced March 2024.

  22. arXiv:2403.09971 [pdf, other]

    cs.RO

    Advancing Object Goal Navigation Through LLM-enhanced Object Affinities Transfer

    Authors: Mengying Lin, Yaran Chen, Dongbin Zhao, Zhaoran Wang

    Abstract: In object goal navigation, agents navigate towards objects identified by category labels using visual and spatial information. Previously, solely network-based methods typically rely on historical data for object affinities estimation, lacking adaptability to new environments and unseen targets. Simultaneously, employing Large Language Models (LLMs) for navigation as either planners or agents, tho…

    Submitted 14 March, 2024; originally announced March 2024.

  23. arXiv:2403.09933 [pdf, other]

    cs.RO

    Design and Control Co-Optimization for Automated Design Iteration of Dexterous Anthropomorphic Soft Robotic Hands

    Authors: Pragna Mannam, Xingyu Liu, Ding Zhao, Jean Oh, Nancy Pollard

    Abstract: We automate soft robotic hand design iteration by co-optimizing design and control policy for dexterous manipulation skills in simulation. Our design iteration pipeline combines genetic algorithms and policy transfer to learn control policies for nearly 400 hand designs, testing grasp quality under external force disturbances. We validate the optimized designs in the real world through teleoperati…

    Submitted 25 June, 2024; v1 submitted 14 March, 2024; originally announced March 2024.

    Journal ref: IEEE-RAS International Conference on Soft Robotics (RoboSoft) 2024

  24. Discrete Opial type inequalities for interval-valued functions

    Authors: Dafang Zhao, Xuexiao You, Delfim F. M. Torres

    Abstract: We introduce the forward (backward) gH-difference operator of interval sequences, and establish some new discrete Opial type inequalities for interval-valued functions. Further, we obtain generalizations of classical discrete Opial type inequalities. Some examples are presented to illustrate our results.

    Submitted 22 November, 2023; originally announced March 2024.

    Comments: This is a preprint of a paper whose final and definite form is published in Mathematical Inequalities & Applications

    MSC Class: 26D15; 26E50; 65G30

    Journal ref: Math. Inequal. Appl. 26 (2023), no. 4, 811--826

  25. arXiv:2403.06408 [pdf, other]

    cs.LG cs.AI

    What Makes Quantization for Large Language Models Hard? An Empirical Study from the Lens of Perturbation

    Authors: Zhuocheng Gong, Jiahao Liu, **gang Wang, Xunliang Cai, Dongyan Zhao, Rui Yan

    Abstract: Quantization has emerged as a promising technique for improving the memory and computational efficiency of large language models (LLMs). Though the trade-off between performance and efficiency is well-known, there is still much to be learned about the relationship between quantization and LLM performance. To shed light on this relationship, we propose a new perspective on quantization, viewing it…

    Submitted 10 March, 2024; originally announced March 2024.

  26. arXiv:2403.05169 [pdf, ps, other]

    math.CO

    Bivariate $Q$-polynomial structures for the nonbinary Johnson scheme and the association scheme obtained from attenuated spaces

    Authors: Eiichi Bannai, Hirotake Kurihara, Da Zhao, Yan Zhu

    Abstract: The study of $P$-polynomial association schemes (distance-regular graphs) and $Q$-polynomial association schemes, and in particular $P$- and $Q$-polynomial association schemes, has been a central theme not only in the theory of association schemes but also in the whole study of algebraic combinatorics in general. Leonard's theorem (1982) says that the spherical functions (or the character tables)…

    Submitted 8 March, 2024; originally announced March 2024.

    Comments: 31 pages, no figure

    MSC Class: 05E30; 20C15

  27. arXiv:2403.04553 [pdf, other]

    cs.DC cs.LG

    Improvements & Evaluations on the MLCommons CloudMask Benchmark

    Authors: Varshitha Chennamsetti, Laiba Mehnaz, Dan Zhao, Banani Ghosh, Sergey V. Samsonau

    Abstract: In this paper, we report the performance benchmarking results of deep learning models on MLCommons' Science cloud-masking benchmark using a high-performance computing cluster at New York University (NYU): NYU Greene. MLCommons is a consortium that develops and maintains several scientific benchmarks that can benefit from developments in AI. We provide a description of the cloud-masking benchmark t…

    Submitted 7 March, 2024; originally announced March 2024.

    Comments: arXiv admin note: text overlap with arXiv:2401.08636

  28. arXiv:2403.04365 [pdf, other]

    cs.NI cs.SE

    DV-Hop localization based on Distance Estimation using Multinode and Hop Loss in WSNs

    Authors: Penghong Wang, Xingtao Wang, Wenrui Li, Xiaopeng Fan, Debin Zhao

    Abstract: Location awareness is a critical issue in wireless sensor network applications. For more accurate location estimation, the two issues should be considered extensively: 1) how to sufficiently utilize the connection information between multiple nodes and 2) how to select a suitable solution from multiple solutions obtained by the Euclidean distance loss. In this paper, a DV-Hop localization based on…

    Submitted 7 March, 2024; originally announced March 2024.

  29. arXiv:2403.04296 [pdf, other]

    quant-ph

    Variational quantum eigensolver with linear depth problem-inspired ansatz for solving portfolio optimization in finance

    Authors: Shengbin Wang, Peng Wang, Guihui Li, Shubin Zhao, Dongyi Zhao, **g Wang, Yuan Fang, Menghan Dou, Yongjian Gu, Yu-Chun Wu, Guo-** Guo

    Abstract: Great efforts have been dedicated in recent years to explore practical applications for noisy intermediate-scale quantum (NISQ) computers, which is a fundamental and challenging problem in quantum computing. As one of the most promising methods, the variational quantum eigensolver (VQE) has been extensively studied. In this paper, VQE is applied to solve portfolio optimization problems in finance…

    Submitted 7 March, 2024; originally announced March 2024.

    Comments: 21 pages, 20 figures

  30. arXiv:2403.03788 [pdf, other]

    cs.CL

    PPTC-R benchmark: Towards Evaluating the Robustness of Large Language Models for PowerPoint Task Completion

    Authors: Zekai Zhang, Yiduo Guo, Yaobo Liang, Dongyan Zhao, Nan Duan

    Abstract: The growing dependence on Large Language Models (LLMs) for finishing user instructions necessitates a comprehensive understanding of their robustness to complex task completion in real-world situations. To address this critical need, we propose the PowerPoint Task Completion Robustness benchmark (PPTC-R) to measure LLMs' robustness to the user PPT task instruction and software version. Specificall…

    Submitted 6 March, 2024; originally announced March 2024.

    Comments: LLM evaluation, Multi-turn, Multi-language, Multi-modal benchmark

  31. arXiv:2403.03068 [pdf, other]

    quant-ph physics.atom-ph

    Enhancing single-atom loading in tightly confined dipole traps with ancillary dipole beam

    Authors: Guang-Jie Chen, Zhu-Bo Wang, Chenyue Gu, Dong Zhao, Ji-Zhe Zhang, Yan-Lei Zhang, Chun-Hua Dong, Kun Huang, Guang-Can Guo, Chang-Ling Zou

    Abstract: Single atoms trapped in tightly focused optical dipole traps provide an excellent experimental platform for quantum computing, precision measurement, and fundamental physics research. In this work, we propose and demonstrate a novel approach to enhancing the loading of single atoms by introducing a weak ancillary dipole beam. The loading rate of single atoms in a dipole trap can be significantly i…

    Submitted 5 March, 2024; originally announced March 2024.

    Comments: 10 pages, 7 figures

  32. arXiv:2403.01451 [pdf, other]

    cs.CR cs.DB cs.LG

    Enhancing Data Provenance and Model Transparency in Federated Learning Systems -- A Database Approach

    Authors: Michael Gu, Ramasoumya Naraparaju, Dongfang Zhao

    Abstract: Federated Learning (FL) presents a promising paradigm for training machine learning models across decentralized edge devices while preserving data privacy. Ensuring the integrity and traceability of data across these distributed environments, however, remains a critical challenge. The ability to create transparent artificial intelligence, such as detailing the training process of a machine learnin…

    Submitted 3 March, 2024; originally announced March 2024.

    Comments: 14 pages, 16 figures

  33. arXiv:2402.19111 [pdf, other]

    eess.IV cs.CV

    Deep Network for Image Compressed Sensing Coding Using Local Structural Sampling

    Authors: Wenxue Cui, Xingtao Wang, Xiaopeng Fan, Shaohui Liu, Xinwei Gao, Debin Zhao

    Abstract: Existing image compressed sensing (CS) coding frameworks usually solve an inverse problem based on measurement coding and optimization-based image reconstruction, which still exist the following two challenges: 1) The widely used random sampling matrix, such as the Gaussian Random Matrix (GRM), usually leads to low measurement coding efficiency. 2) The optimization-based reconstruction methods gen…

    Submitted 29 February, 2024; originally announced February 2024.

    Comments: Accepted by ACM Transactions on Multimedia Computing Communications and Applications (TOMM)

  34. arXiv:2402.18784 [pdf, other]

    cs.AI q-bio.NC

    Brain-inspired and Self-based Artificial Intelligence

    Authors: Yi Zeng, Feifei Zhao, Yuxuan Zhao, Dongcheng Zhao, Enmeng Lu, Qian Zhang, Yuwei Wang, Hui Feng, Zhuoya Zhao, Jihang Wang, Qingqun Kong, Yinqian Sun, Yang Li, Guobin Shen, Bing Han, Yiting Dong, Wenxuan Pan, Xiang He, Aorigele Bao, ** Wang

    Abstract: The question "Can machines think?" and the Turing Test to assess whether machines could achieve human-level intelligence is one of the roots of AI. With the philosophical argument "I think, therefore I am", this paper challenges the idea of a "thinking machine" supported by current AIs since there is no sense of self in them. Current artificial intelligence is only seemingly intelligent information…

    Submitted 28 February, 2024; originally announced February 2024.

  35. arXiv:2402.18593 [pdf, other]

    cs.AR cs.AI cs.DC

    Sustainable Supercomputing for AI: GPU Power Capping at HPC Scale

    Authors: Dan Zhao, Siddharth Samsi, Joseph McDonald, Baolin Li, David Bestor, Michael Jones, Devesh Tiwari, Vijay Gadepally

    Abstract: As research and deployment of AI grows, the computational burden to support and sustain its progress inevitably does too. To train or fine-tune state-of-the-art models in NLP, computer vision, etc., some form of AI hardware acceleration is virtually a requirement. Recent large language models require considerable resources to train and deploy, resulting in significant energy usage, potential carbo…

    Submitted 24 February, 2024; originally announced February 2024.

  36. A Search for Radio Pulsars in Supernova Remnants Using FAST with One Pulsar Discovered

    Authors: Zhen Zhang, Wen-Ming Yan, Jian-** Yuan, Na Wang, Jun-Tao Bai, Zhi-Gang Wen, Bao-Da Li, **-Tao Xie, De Zhao, Yu-Bin Wang, Nan-Nan Zhai

    Abstract: We report on the results of a search for radio pulsars in five supernova remnants (SNRs) with FAST. The observations were made using the 19-beam receiver in the Snapshot mode. The integration time for each pointing is 10 min. We discovered a new pulsar PSR J1845$-$0306 which has a spin period of 983.6 ms and a dispersion measure of 444.6$\pm$2.0 cm$^{-3}$ pc in observations of SNR G29.6+0.1. To ju…

    Submitted 27 February, 2024; originally announced February 2024.

    Comments: 6 pages, 2 figures, 2 tables published in CPL

    Journal ref: Chin. Phys. Lett. 2024, 41 (2): 029701 February 2024

  37. arXiv:2402.17304 [pdf, ps, other]

    cs.CL cs.AI

    Probing Multimodal Large Language Models for Global and Local Semantic Representations

    Authors: Mingxu Tao, Quzhe Huang, Kun Xu, Liwei Chen, Yansong Feng, Dongyan Zhao

    Abstract: The advancement of Multimodal Large Language Models (MLLMs) has greatly accelerated the development of applications in understanding integrated texts and images. Recent works leverage image-caption datasets to train MLLMs, achieving state-of-the-art performance on image-to-text tasks. However, there are few studies exploring which layers of MLLMs make the most effort to the global image informatio…

    Submitted 26 March, 2024; v1 submitted 27 February, 2024; originally announced February 2024.

    Comments: Accepted by LREC-COLING 2024 as a short paper (Camera Ready)

  38. arXiv:2402.16313 [pdf, other]

    cs.CL cs.AI

    Chain-of-Discussion: A Multi-Model Framework for Complex Evidence-Based Question Answering

    Authors: Mingxu Tao, Dongyan Zhao, Yansong Feng

    Abstract: Open-ended question answering requires models to find appropriate evidence to form well-reasoned, comprehensive and helpful answers. In practical applications, models also need to engage in extended discussions on potential scenarios closely relevant to the question. With augmentation of retrieval module, open-source Large Language Models (LLMs) can produce coherent answers often with different fo…

    Submitted 26 February, 2024; originally announced February 2024.

    Comments: Under review

  39. arXiv:2402.16050 [pdf, other]

    cs.CV cs.CL

    LSTP: Language-guided Spatial-Temporal Prompt Learning for Long-form Video-Text Understanding

    Authors: Yuxuan Wang, Yueqian Wang, Pengfei Wu, Jianxin Liang, Dongyan Zhao, Zilong Zheng

    Abstract: Despite progress in video-language modeling, the computational challenge of interpreting long-form videos in response to task-specific linguistic queries persists, largely due to the complexity of high-dimensional video data and the misalignment between language and visual cues over space and time. To tackle this issue, we introduce a novel approach called Language-guided Spatial-Temporal Prompt L…

    Submitted 25 February, 2024; originally announced February 2024.

  40. arXiv:2402.15999 [pdf]

    cond-mat.str-el cond-mat.mes-hall cond-mat.mtrl-sci

    Revelation of new magnetic domain wall category in the itinerant antiferromagnet Chromium

    Authors: Yining Hu, Xu Wang, Chen Chen, Qingle Zhang, Dongming Zhao, Tianzhen Zhang, Chenxi Wang, Donglai Feng, Tong Zhang

    Abstract: Conventional magnetic domain walls are characterized by the reorientation of local moments. However, what occurs at the boundary of itinerant magnets is largely unknown. Here using spin-sensitive scanning tunneling microscopy, we investigated the microscopic boundaries of spin-density-wave (SDW) state in a prototypical itinerant anti-ferromagnet of Cr. We find at the boundary of two incommensurate…

    Submitted 25 February, 2024; originally announced February 2024.

    Comments: 19 pages, 10 figures, supplementary materials included

  41. Simulation Studies for the First Pathfinder of the CATCH Space Mission

    Authors: Yiming Huang, Juan Zhang, Lian Tao, Zhengwei Li, Donghua Zhao, Qian-Qing Yin, Xiangyang Wen, **gyu Xiao, Chen Zhang, Shuang-Nan Zhang, Shaolin Xiong, Qingcui Bu, Jirong Cang, Dezhi Cao, Wen Chen, Siran Ding, Min Gao, Yang Gao, Shu** Hou, Li** Jia, Ge **, Dalin Li, **song Li, Pan** Li, Yajun Li , et al. (20 additional authors not shown)

    Abstract: The Chasing All Transients Constellation Hunters (CATCH) space mission is an intelligent constellation consisting of 126 micro-satellites in three types (A, B, and C), designed for X-ray observation with the objective of studying the dynamic universe. Currently, we are actively developing the first Pathfinder (CATCH-1) for the CATCH mission, specifically for type-A satellites. CATCH-1 is equipped…

    Submitted 23 February, 2024; originally announced February 2024.

  42. arXiv:2402.13628  [pdf, other]

    cs.LG eess.SP

    Improving Building Temperature Forecasting: A Data-driven Approach with System Scenario Clustering

    Authors: Dafang Zhao, Zheng Chen, Zhengmao Li, Xiaolei Yuan, Ittetsu Taniguchi

    Abstract: Heat, Ventilation and Air Conditioning (HVAC) systems play a critical role in maintaining a comfortable thermal environment and cost approximately 40% of primary energy usage in the building sector. For smart energy management in buildings, usage patterns and their resulting profiles allow the improvement of control systems with prediction capabilities. However, for large-scale HVAC system managem…

    Submitted 21 February, 2024; originally announced February 2024.

    Comments: Accepted for publication at IEEE PES GM 2024

  43. arXiv:2402.12728  [pdf, other]

    cs.CV cs.AI cs.CL cs.IR cs.LG

    Modality-Aware Integration with Large Language Models for Knowledge-based Visual Question Answering

    Authors: Junnan Dong, Qinggang Zhang, Huachi Zhou, Daochen Zha, Pai Zheng, Xiao Huang

    Abstract: Knowledge-based visual question answering (KVQA) has been extensively studied to answer visual questions with external knowledge, e.g., knowledge graphs (KGs). While several attempts have been proposed to leverage large language models (LLMs) as an implicit knowledge source, it remains challenging since LLMs may generate hallucinations. Moreover, multiple knowledge sources, e.g., images, KGs and L…

    Submitted 2 March, 2024; v1 submitted 20 February, 2024; originally announced February 2024.

    Comments: 8 pages, 3 figures and 1 page appendix; The processed graphs and codes will be available

  44. arXiv:2402.07627  [pdf]

    physics.app-ph cond-mat.mtrl-sci

    Unveiling the GeI2-Assisted Oriented Growth of Perovskite Crystallite for High-Performance Flexible Sn Perovskite Solar Cells

    Authors: Huagui Lai, Selina Olthof, Shengqiang Ren, Radha K. Kothandaraman, Matthias Diethelm, Quentin Jeangros, Roland Hany, Ayodhya N. Tiwari, Dewei Zhao, Fan Fu

    Abstract: Tin perovskites are emerging as promising alternatives to their lead-based counterparts for high-performance and flexible perovskite solar cells (PSCs). However, their rapid crystallization often leads to inadequate film quality and poor device performance. In this study, the role of GeI2 as an additive is investigated for controlling the nucleation and crystallization processes of formamidinium tin…

    Submitted 12 February, 2024; originally announced February 2024.

  45. arXiv:2402.03992  [pdf, other]

    cs.LG cond-mat.mtrl-sci

    Space Group Constrained Crystal Generation

    Authors: Rui Jiao, Wenbing Huang, Yu Liu, Deli Zhao, Yang Liu

    Abstract: Crystals are the foundation of numerous scientific and industrial applications. While various learning-based approaches have been proposed for crystal generation, existing methods seldom consider the space group constraint which is crucial in describing the geometry of crystals and closely relevant to many desirable properties. However, considering space group constraint is challenging owing to it…

    Submitted 8 April, 2024; v1 submitted 6 February, 2024; originally announced February 2024.

    Comments: ICLR 2024 poster

  46. arXiv:2402.02163  [pdf]

    physics.app-ph

    Exceptional point-based ultrasensitive surface acoustic wave gas sensor

    Authors: Xingyu Lu, Yang Yuan, Fa Chen, Xiaoxiao Hou, Yanlong Guo, Leonhard Reindl, Wei Luo, Degang Zhao

    Abstract: Exceptional points (EPs) refer to degeneracies in non-Hermitian systems where two or more eigenvalues and their corresponding eigenvectors coalesce. Recently, there has been growing interest in harnessing EPs to enhance the responsivity of sensors. Significant improvements in the sensitivity of sensors in optics and electronics have been developed. In this work, we present a novel ultrasensitive s…

    Submitted 3 February, 2024; originally announced February 2024.

  47. arXiv:2402.01115  [pdf, other]

    cs.CL eess.SP

    Interpretation of Intracardiac Electrograms Through Textual Representations

    Authors: William Jongwon Han, Diana Gomez, Avi Alok, Chaojing Duan, Michael A. Rosenberg, Douglas Weber, Emerson Liu, Ding Zhao

    Abstract: Understanding the irregular electrical activity of atrial fibrillation (AFib) has been a key challenge in electrocardiography. For serious cases of AFib, catheter ablations are performed to collect intracardiac electrograms (EGMs). EGMs offer intricately detailed and localized electrical activity of the heart and are an ideal modality for interpretable cardiac studies. Recent advancements in artif…

    Submitted 11 April, 2024; v1 submitted 1 February, 2024; originally announced February 2024.

    Comments: 18 pages, 9 figures; Accepted to CHIL 2024

    ACM Class: I.2.7; J.3

  48. arXiv:2402.00738  [pdf, other]

    cs.AI

    FM3Q: Factorized Multi-Agent MiniMax Q-Learning for Two-Team Zero-Sum Markov Game

    Authors: Guangzheng Hu, Yuanheng Zhu, Haoran Li, Dongbin Zhao

    Abstract: Many real-world applications involve some agents that fall into two teams, with payoffs that are equal within the same team but of opposite sign across the opponent team. The so-called two-team zero-sum Markov games (2t0sMGs) can be resolved with reinforcement learning in recent years. However, existing methods are thus inefficient in light of insufficient consideration of intra-team credit assign…

    Submitted 1 February, 2024; originally announced February 2024.

  49. arXiv:2402.00449  [pdf, other]

    cs.NE

    Parallel Spiking Unit for Efficient Training of Spiking Neural Networks

    Authors: Yang Li, Yinqian Sun, Xiang He, Yiting Dong, Dongcheng Zhao, Yi Zeng

    Abstract: Efficient parallel computing has become a pivotal element in advancing artificial intelligence. Yet, the deployment of Spiking Neural Networks (SNNs) in this domain is hampered by their inherent sequential computational dependency. This constraint arises from the need for each time step's processing to rely on the preceding step's outcomes, significantly impeding the adaptability of SNN models to…

    Submitted 7 June, 2024; v1 submitted 1 February, 2024; originally announced February 2024.

  50. arXiv:2402.00401  [pdf, other]

    hep-ph hep-ex

    Higgs boson pair production and decay at NLO in QCD: the $b\bar{b}\gamma\gamma$ final state

    Authors: Hai Tao Li, Zong-Guo Si, Jian Wang, Xiao Zhang, Dan Zhao

    Abstract: The Higgs boson pair production at the LHC provides a probe to the Higgs boson self-coupling. The higher-order QCD corrections in this process are sizable and must be taken into account in comparison with data. Due to the small cross section, it is necessary to consider at least one of the Higgs bosons decaying to bottom quarks. The QCD corrections to the decay processes would also be important in…

    Submitted 1 February, 2024; originally announced February 2024.

    Comments: 19 pages, 4 figures