Skip to main content

Showing 51–100 of 1,678 results for author: Zhao, Q

.
  1. arXiv:2405.00286  [pdf

    physics.optics

    Ultrafast Photocurrent Hysteresis in Photoferroelectric α-In2Se3

    Authors: Zhen Lei, Jiawei Chang, Qiyi Zhao, Jian Zhou, Yuanyuan Huang, Qihua Xiong, Xinlong Xu

    Abstract: The photon-electron interactions are generally volatile and the intricate multiphysics details of photoexcited carrier dynamics are not yet distinguished. How to nonvolatile control the physical state through all-optical means and clarify the intricate physical processes has been a long-term goal pursued in polar materials. Photoferroelectric α-In2Se3 holds the great potential for capturing multim… ▽ More

    Submitted 30 April, 2024; originally announced May 2024.

  2. arXiv:2404.19287  [pdf, other

    cs.CV

    Revisiting the Adversarial Robustness of Vision Language Models: a Multimodal Perspective

    Authors: Wanqi Zhou, Shuanghao Bai, Qibin Zhao, Badong Chen

    Abstract: Pretrained vision-language models (VLMs) like CLIP have shown impressive generalization performance across various downstream tasks, yet they remain vulnerable to adversarial attacks. While prior research has primarily concentrated on improving the adversarial robustness of image encoders to guard against attacks on images, the exploration of text-based and multimodal attacks has largely been over… ▽ More

    Submitted 30 April, 2024; originally announced April 2024.

    Comments: 16 pages, 14 figures

  3. arXiv:2404.18540  [pdf, other

    quant-ph

    Tunable coupling of a quantum phononic resonator to a transmon qubit with flip-chip architecture

    Authors: Xinhui Ruan, Li Li, Guihan Liang, Silu Zhao, Jia-heng Wang, Yizhou Bu, Bingjie Chen, Xiaohui Song, Xiang Li, He Zhang, **zhe Wang, Qianchuan Zhao, Kai Xu, Heng Fan, Yu-xi Liu, **g Zhang, Zhihui Peng, Zhongcheng Xiang, Dongning Zheng

    Abstract: A hybrid system with tunable coupling between phonons and qubits shows great potential for advancing quantum information processing. In this work, we demonstrate strong and tunable coupling between a surface acoustic wave (SAW) resonator and a transmon qubit based on galvanic-contact flip-chip technique. The coupling strength varies from $2π\times$7.0 MHz to -$2π\times$20.6 MHz, which is extracted… ▽ More

    Submitted 29 April, 2024; originally announced April 2024.

  4. arXiv:2404.18047  [pdf, other

    cs.RO

    LIKO: LiDAR, Inertial, and Kinematic Odometry for Bipedal Robots

    Authors: Qingrui Zhao, Mingyuan Li, Yongliang Shi, Xuechao Chen, Zhangguo Yu, Lianqiang Han, Zhenyuan Fu, **tao Zhang, Chao Li, Yuanxi Zhang, Qiang Huang

    Abstract: High-frequency and accurate state estimation is crucial for biped robots. This paper presents a tightly-coupled LiDAR-Inertial-Kinematic Odometry (LIKO) for biped robot state estimation based on an iterated extended Kalman filter. Beyond state estimation, the foot contact position is also modeled and estimated. This allows for both position and velocity updates from kinematic measurement. Addition… ▽ More

    Submitted 27 April, 2024; originally announced April 2024.

  5. arXiv:2404.16414  [pdf, ps, other

    physics.atom-ph quant-ph

    Validating a lutetium frequency reference

    Authors: Kyle J. Arnold, Scott Bustabad, Qin Qichen, Zhao Zhang, Qi Zhao, Murray D. Barrett

    Abstract: We review our progress in develo** a frequency reference with singly ionized lutetium and give estimates of the levels of inaccuracy we expect to achieve in the near future with both the $^1S_0\leftrightarrow{}^3D_1$ and $^1S_0\leftrightarrow{}^3D_2$ transitions. Based on established experimental results, we show that inaccuracies at the low $10^{-19}$ level are readily achievable for the… ▽ More

    Submitted 25 April, 2024; originally announced April 2024.

    Comments: 10 pages

  6. arXiv:2404.15000  [pdf, other

    cs.CR

    EarPass: Secure and Implicit Call Receiver Authentication Using Ear Acoustic Sensing

    Authors: Xi** Sun, **g Chen, Kun He, Zhixiang He, Ruiying Du, Yebo Feng, Qingchuan Zhao, Cong Wu

    Abstract: Private voice communication often contains sensitive information, making it critical to ensure that only authorized users have access to such calls. Unfortunately, current authentication mechanisms, such as PIN-based passwords, fingerprint recognition, and face recognition, fail to authenticate the call receiver, leaving a gap in security. To fill the gap, we present EarPass, a secure and implicit… ▽ More

    Submitted 23 April, 2024; originally announced April 2024.

  7. arXiv:2404.14443  [pdf

    cs.CL cs.AI

    Evaluation of Machine Translation Based on Semantic Dependencies and Keywords

    Authors: Kewei Yuan, Qiurong Zhao, Yang Xu, Xiao Zhang, Huansheng Ning

    Abstract: In view of the fact that most of the existing machine translation evaluation algorithms only consider the lexical and syntactic information, but ignore the deep semantic information contained in the sentence, this paper proposes a computational method for evaluating the semantic correctness of machine translations based on reference translations and incorporating semantic dependencies and sentence… ▽ More

    Submitted 20 April, 2024; originally announced April 2024.

  8. arXiv:2404.13798  [pdf, other

    cs.CV

    Enforcing Conditional Independence for Fair Representation Learning and Causal Image Generation

    Authors: Jensen Hwa, Qingyu Zhao, Aditya Lahiri, Adnan Masood, Babak Salimi, Ehsan Adeli

    Abstract: Conditional independence (CI) constraints are critical for defining and evaluating fairness in machine learning, as well as for learning unconfounded or causal representations. Traditional methods for ensuring fairness either blindly learn invariant features with respect to a protected variable (e.g., race when classifying sex from face images) or enforce CI relative to the protected attribute onl… ▽ More

    Submitted 21 April, 2024; originally announced April 2024.

    Comments: To appear at the 2024 IEEE CVPR Workshop on Fair, Data-Efficient, and Trusted Computer Vision

  9. arXiv:2404.13430  [pdf, other

    physics.chem-ph cs.LG

    React-OT: Optimal Transport for Generating Transition State in Chemical Reactions

    Authors: Chenru Duan, Guan-Horng Liu, Yuanqi Du, Tianrong Chen, Qiyuan Zhao, Haojun Jia, Carla P. Gomes, Evangelos A. Theodorou, Heather J. Kulik

    Abstract: Transition states (TSs) are transient structures that are key in understanding reaction mechanisms and designing catalysts but challenging to be captured in experiments. Alternatively, many optimization algorithms have been developed to search for TSs computationally. Yet the cost of these algorithms driven by quantum chemistry methods (usually density functional theory) is still high, posing chal… ▽ More

    Submitted 20 April, 2024; originally announced April 2024.

    Comments: 5 figures, 1 table

  10. arXiv:2404.12976  [pdf, other

    astro-ph.HE

    Insights from the Gaussian Processes Method for the FRB-associated X-ray Burst of SGR 1935+2154

    Authors: Rui**g Tang, Dahai Yan, Haiyun Zhang, Qingchang Zhao, Lian Tao, Chengkui Li, Mingyu Ge, Xiaobo Li, Qianqing Yin, Ce Cai

    Abstract: Gaussian processes method is employed to analyze the light curves of bursts detected by Insight-HXMT, NICER, and GECAM from SGR 1935+2154 between 2020 to 2022. It is found that a stochastically driven damped simple harmonic oscillator (SHO) is necessary to capture the characteristics of the X-ray bursts. Variability timescale of the X-ray bursts, corresponding to the broken frequencies in the SHO… ▽ More

    Submitted 19 June, 2024; v1 submitted 19 April, 2024; originally announced April 2024.

    Comments: 13 pages,17 figures,1 table

    MSC Class: 85-02

  11. arXiv:2404.12659  [pdf, ps, other

    cs.CL

    SOS-1K: A Fine-grained Suicide Risk Classification Dataset for Chinese Social Media Analysis

    Authors: Hongzhi Qi, Hanfei Liu, Jianqiang Li, Qing Zhao, Wei Zhai, Dan Luo, Tian Yu He, Shuo Liu, Bing Xiang Yang, Guanghui Fu

    Abstract: In the social media, users frequently express personal emotions, a subset of which may indicate potential suicidal tendencies. The implicit and varied forms of expression in internet language complicate accurate and rapid identification of suicidal intent on social media, thus creating challenges for timely intervention efforts. The development of deep learning models for suicide risk detection is… ▽ More

    Submitted 19 April, 2024; originally announced April 2024.

  12. arXiv:2404.12235  [pdf, other

    cs.CV

    Beyond Average: Individualized Visual Scanpath Prediction

    Authors: Xianyu Chen, Ming Jiang, Qi Zhao

    Abstract: Understanding how attention varies across individuals has significant scientific and societal impacts. However, existing visual scanpath models treat attention uniformly, neglecting individual differences. To bridge this gap, this paper focuses on individualized scanpath prediction (ISP), a new attention modeling task that aims to accurately predict how different individuals shift their attention… ▽ More

    Submitted 18 April, 2024; v1 submitted 18 April, 2024; originally announced April 2024.

    Comments: To appear in CVPR2024

  13. arXiv:2404.11952  [pdf, other

    hep-ph

    Generation of Ultrarelativistic Vortex Leptons with Large Orbital Angular Momenta

    Authors: Mamutjan Ababekri, Jun-Lin Zhou, Ren-Tong Guo, Yong-Zheng Ren, Yu-Han Kou, Qian Zhao, Zhong-Peng Li, Jian-Xing Li

    Abstract: Ultrarelativistic vortex leptons with intrinsic orbital angular momenta (OAM) have important applications in high energy particle physics, nuclear physics, astrophysics, etc. However, unfortunately, their generation still poses a great challenge. Here, we put forward a novel method for generating ultrarelativistic vortex positrons and electrons through nonlinear Breit-Wheeler (NBW) scattering of v… ▽ More

    Submitted 24 April, 2024; v1 submitted 18 April, 2024; originally announced April 2024.

    Comments: 5

  14. arXiv:2404.11449  [pdf, other

    cs.CL cs.LG

    AI-Enhanced Cognitive Behavioral Therapy: Deep Learning and Large Language Models for Extracting Cognitive Pathways from Social Media Texts

    Authors: Meng Jiang, Yi **g Yu, Qing Zhao, Jianqiang Li, Changwei Song, Hongzhi Qi, Wei Zhai, Dan Luo, Xiaoqin Wang, Guanghui Fu, Bing Xiang Yang

    Abstract: Cognitive Behavioral Therapy (CBT) is an effective technique for addressing the irrational thoughts stemming from mental illnesses, but it necessitates precise identification of cognitive pathways to be successfully implemented in patient care. In current society, individuals frequently express negative emotions on social media on specific topics, often exhibiting cognitive distortions, including… ▽ More

    Submitted 17 April, 2024; originally announced April 2024.

  15. arXiv:2404.09790  [pdf, other

    cs.CV

    NTIRE 2024 Challenge on Image Super-Resolution ($\times$4): Methods and Results

    Authors: Zheng Chen, Zongwei Wu, Eduard Zamfir, Kai Zhang, Yulun Zhang, Radu Timofte, Xiaokang Yang, Hongyuan Yu, Cheng Wan, Yuxin Hong, Zhijuan Huang, Yajun Zou, Yuan Huang, Jiamin Lin, Bingnan Han, Xianyu Guan, Yongsheng Yu, Daoan Zhang, Xuanwu Yin, Kunlong Zuo, **hua Hao, Kai Zhao, Kun Yuan, Ming Sun, Chao Zhou , et al. (63 additional authors not shown)

    Abstract: This paper reviews the NTIRE 2024 challenge on image super-resolution ($\times$4), highlighting the solutions proposed and the outcomes obtained. The challenge involves generating corresponding high-resolution (HR) images, magnified by a factor of four, from low-resolution (LR) inputs using prior information. The LR images originate from bicubic downsampling degradation. The aim of the challenge i… ▽ More

    Submitted 15 April, 2024; originally announced April 2024.

    Comments: NTIRE 2024 webpage: https://cvlai.net/ntire/2024. Code: https://github.com/zhengchen1999/NTIRE2024_ImageSR_x4

  16. arXiv:2404.09712  [pdf, ps, other

    nucl-th

    1/2$^-$ $α$ cluster resonances of $^{13}$C studied by the analytic continuation in the coupling constant

    Authors: Seungheon Shin, Masaaki Kimura, Bo Zhou, Qing Zhao

    Abstract: The 1/2$^-$ resonant states in $^{13}{\rm C}$ are investigated to search for the Hoyle-analog state. In order to treat the resonance states located around the 3$α+n$ threshold, the analytic continuation in the coupling constant (ACCC) has been combined with the real-time evolution method (REM). The properties of the 1/2$^-$ resonance states such as the radii and monopole transition probabilities a… ▽ More

    Submitted 15 April, 2024; originally announced April 2024.

  17. arXiv:2404.09149  [pdf, other

    eess.SY cs.NE math.NA

    Heuristic Solution to Joint Deployment and Beamforming Design for STAR-RIS Aided Networks

    Authors: Bai Yan, Qi Zhao, ** Zhang, J. Andrew Zhang

    Abstract: This paper tackles the deployment challenges of Simultaneous Transmitting and Reflecting Reconfigurable Intelligent Surface (STAR-RIS) in communication systems. Unlike existing works that use fixed deployment setups or solely optimize the location, this paper emphasizes the joint optimization of the location and orientation of STAR-RIS. This enables searching across all user grou** possibilities… ▽ More

    Submitted 14 April, 2024; originally announced April 2024.

    Comments: 30 pages

  18. arXiv:2404.08921  [pdf, other

    cs.CV

    PNeRV: Enhancing Spatial Consistency via Pyramidal Neural Representation for Videos

    Authors: Qi Zhao, M. Salman Asif, Zhan Ma

    Abstract: The primary focus of Neural Representation for Videos (NeRV) is to effectively model its spatiotemporal consistency. However, current NeRV systems often face a significant issue of spatial inconsistency, leading to decreased perceptual quality. To address this issue, we introduce the Pyramidal Neural Representation for Videos (PNeRV), which is built on a multi-scale information connection and comp… ▽ More

    Submitted 13 April, 2024; originally announced April 2024.

  19. arXiv:2404.08917  [pdf, other

    cs.CV

    MAProtoNet: A Multi-scale Attentive Interpretable Prototypical Part Network for 3D Magnetic Resonance Imaging Brain Tumor Classification

    Authors: Binghua Li, Jie Mao, Zhe Sun, Chao Li, Qibin Zhao, Toshihisa Tanaka

    Abstract: Automated diagnosis with artificial intelligence has emerged as a promising area in the realm of medical imaging, while the interpretability of the introduced deep neural networks still remains an urgent concern. Although contemporary works, such as XProtoNet and MProtoNet, has sought to design interpretable prediction models for the issue, the localization precision of their resulting attribution… ▽ More

    Submitted 13 April, 2024; originally announced April 2024.

  20. arXiv:2404.07019  [pdf, other

    physics.optics nlin.CD quant-ph

    Chiral Chaos Enhanced Sensing

    Authors: Yun-Qiu Ge, Zhe Wang, Qian-Chuan Zhao, **g Zhang, Yu-xi Liu

    Abstract: Chirality refers to the property that an object and its mirror image cannot overlap each other by spatial rotation and translation, and can be found in various research fields. We here propose chiral chaos and construct a chiral chaotic device via coupled whispering gallery mode resonators, where routes to chaos exhibit pronounced chirality for two opposite pum** directions. The mechanism respon… ▽ More

    Submitted 10 April, 2024; originally announced April 2024.

  21. arXiv:2404.06477  [pdf, other

    cs.PL cs.LO

    Mechanised Hypersafety Proofs about Structured Data: Extended Version

    Authors: Vladimir Gladshtein, Qiyuan Zhao, Willow Ahrens, Saman Amarasinghe, Ilya Sergey

    Abstract: Arrays are a fundamental abstraction to represent collections of data. It is often possible to exploit structural properties of the data stored in an array (e.g., repetition or sparsity) to develop a specialised representation optimised for space efficiency. Formally reasoning about correctness of manipulations with such structured data is challenging, as they are often composed of multiple loops… ▽ More

    Submitted 9 April, 2024; originally announced April 2024.

    Comments: Extended version of the paper accepted at PLDI'24

  22. arXiv:2404.05892  [pdf, other

    cs.CL cs.AI

    Eagle and Finch: RWKV with Matrix-Valued States and Dynamic Recurrence

    Authors: Bo Peng, Daniel Goldstein, Quentin Anthony, Alon Albalak, Eric Alcaide, Stella Biderman, Eugene Cheah, Xingjian Du, Teddy Ferdinan, Haowen Hou, Przemysław Kazienko, Kranthi Kiran GV, Jan Kocoń, Bartłomiej Koptyra, Satyapriya Krishna, Ronald McClelland Jr., Niklas Muennighoff, Fares Obeid, Atsushi Saito, Guangyu Song, Haoqin Tu, Stanisław Woźniak, Ruichong Zhang, Bingchen Zhao, Qihang Zhao , et al. (3 additional authors not shown)

    Abstract: We present Eagle (RWKV-5) and Finch (RWKV-6), sequence models improving upon the RWKV (RWKV-4) architecture. Our architectural design advancements include multi-headed matrix-valued states and a dynamic recurrence mechanism that improve expressivity while maintaining the inference efficiency characteristics of RNNs. We introduce a new multilingual corpus with 1.12 trillion tokens and a fast tokeni… ▽ More

    Submitted 10 April, 2024; v1 submitted 8 April, 2024; originally announced April 2024.

  23. arXiv:2404.05165  [pdf

    physics.app-ph cond-mat.mtrl-sci

    Zincophilic armor: Phytate ammonium as a multifunctional additive for enhanced performance in aqueous zinc-ion batteries

    Authors: Fangyuan Xiao, Xiaoke Wang, Kaitong Sun, Qian Zhao, Cui** Han, Hai-Feng Li

    Abstract: Corrosion and the formation of by-products resulting from parasitic side reactions, as well as random dendrite growth, pose significant challenges for aqueous zinc-ion batteries (AZIBs). In this study, phytate ammonium is introduced into the traditional dilute Zinc sulfate electrolyte as a multi-functional additive. Leveraging the inherent zincophilic nature of the phytic anion, a protective layer… ▽ More

    Submitted 7 April, 2024; originally announced April 2024.

  24. arXiv:2404.04421  [pdf, other

    cs.GR cs.CV

    PhysAvatar: Learning the Physics of Dressed 3D Avatars from Visual Observations

    Authors: Yang Zheng, Qingqing Zhao, Guandao Yang, Wang Yifan, Donglai Xiang, Florian Dubost, Dmitry Lagun, Thabo Beeler, Federico Tombari, Leonidas Guibas, Gordon Wetzstein

    Abstract: Modeling and rendering photorealistic avatars is of crucial importance in many applications. Existing methods that build a 3D avatar from visual observations, however, struggle to reconstruct clothed humans. We introduce PhysAvatar, a novel framework that combines inverse rendering with inverse physics to automatically estimate the shape and appearance of a human from multi-view video data along w… ▽ More

    Submitted 9 April, 2024; v1 submitted 5 April, 2024; originally announced April 2024.

    Comments: Project Page: https://qingqing-zhao.github.io/PhysAvatar

  25. arXiv:2404.03741  [pdf, other

    cs.RO

    A High-Fidelity Simulation Framework for Gras** Stability Analysis in Human Casualty Manipulation

    Authors: Qianwen Zhao, Rajarshi Roy, Chad Spurlock, Kevin Lister, Long Wang

    Abstract: Recently, there has been a growing interest in rescue robots due to their vital role in addressing emergency scenarios and providing crucial support in challenging or hazardous situations where human intervention is difficult. However, very few of these robots are capable of actively engaging with humans and undertaking physical manipulation tasks. This limitation is largely attributed to the abse… ▽ More

    Submitted 4 April, 2024; originally announced April 2024.

    Comments: 8 pages, revision submitted to IEEE RA-L, under review

  26. arXiv:2404.03162  [pdf, other

    cs.CR

    LTRDetector: Exploring Long-Term Relationship for Advanced Persistent Threats Detection

    Authors: Xiaoxiao Liu, Fan Xu, Nan Wang, Qinxin Zhao, Dalin Zhang, Xibin Zhao, Jiqiang Liu

    Abstract: Advanced Persistent Threat (APT) is challenging to detect due to prolonged duration, infrequent occurrence, and adept concealment techniques. Existing approaches primarily concentrate on the observable traits of attack behaviors, neglecting the intricate relationships formed throughout the persistent attack lifecycle. Thus, we present an innovative APT detection framework named LTRDetector, implem… ▽ More

    Submitted 3 April, 2024; originally announced April 2024.

  27. arXiv:2404.03066  [pdf, other

    cs.MA cs.NI math.DS

    Traffic Divergence Theory: An Analysis Formalism for Dynamic Networks

    Authors: Matin Macktoobian, Zhan Shu, Qing Zhao

    Abstract: Traffic dynamics is universally crucial in analyzing and designing almost any network. This article introduces a novel theoretical approach to analyzing network traffic dynamics. This theory's machinery is based on the notion of traffic divergence, which captures the flow (im)balance of network nodes and links. It features various analytical probes to investigate both spatial and temporal traffic… ▽ More

    Submitted 3 April, 2024; originally announced April 2024.

    Journal ref: IEEE Access, 2024

  28. arXiv:2404.01754  [pdf, other

    cs.SE cs.AI

    Peer-aided Repairer: Empowering Large Language Models to Repair Advanced Student Assignments

    Authors: Qianhui Zhao, Fang Liu, Li Zhang, Yang Liu, Zhen Yan, Zhenghao Chen, Yufei Zhou, **g Jiang, Ge Li

    Abstract: Automated generation of feedback on programming assignments holds significant benefits for programming education, especially when it comes to advanced assignments. Automated Program Repair techniques, especially Large Language Model based approaches, have gained notable recognition for their potential to fix introductory assignments. However, the programs used for evaluation are relatively simple.… ▽ More

    Submitted 2 April, 2024; originally announced April 2024.

    Comments: On-going work

  29. arXiv:2403.17712  [pdf, other

    cs.CV

    Invisible Gas Detection: An RGB-Thermal Cross Attention Network and A New Benchmark

    Authors: Jue Wang, Yuxiang Lin, Qi Zhao, Dong Luo, Shuaibao Chen, Wei Chen, Xiaojiang Peng

    Abstract: The widespread use of various chemical gases in industrial processes necessitates effective measures to prevent their leakage during transportation and storage, given their high toxicity. Thermal infrared-based computer vision detection techniques provide a straightforward approach to identify gas leakage areas. However, the development of high-quality algorithms has been challenging due to the lo… ▽ More

    Submitted 26 March, 2024; originally announced March 2024.

  30. arXiv:2403.17297  [pdf, other

    cs.CL cs.AI

    InternLM2 Technical Report

    Authors: Zheng Cai, Maosong Cao, Haojiong Chen, Kai Chen, Keyu Chen, Xin Chen, Xun Chen, Zehui Chen, Zhi Chen, Pei Chu, Xiaoyi Dong, Haodong Duan, Qi Fan, Zhaoye Fei, Yang Gao, Jiaye Ge, Chenya Gu, Yuzhe Gu, Tao Gui, Aijia Guo, Qipeng Guo, Conghui He, Yingfan Hu, Ting Huang, Tao Jiang , et al. (75 additional authors not shown)

    Abstract: The evolution of Large Language Models (LLMs) like ChatGPT and GPT-4 has sparked discussions on the advent of Artificial General Intelligence (AGI). However, replicating such advancements in open-source models has been challenging. This paper introduces InternLM2, an open-source LLM that outperforms its predecessors in comprehensive evaluations across 6 dimensions and 30 benchmarks, long-context m… ▽ More

    Submitted 25 March, 2024; originally announced March 2024.

  31. arXiv:2403.17235  [pdf, ps, other

    eess.SY

    A Discrete-Time Least-Squares Adaptive State Tracking Control Scheme with A Mobile-Robot System Study

    Authors: Qianhong Zhao, Gang Tao

    Abstract: This paper develops an adaptive state tracking control scheme for discrete-time systems, using the least-squares algorithm, as the new solution to the long-standing discrete-time adaptive state tracking control problem to which the Lyapunov method (well-developed for the continuous-time adaptive state tracking problem) is not applicable. The new adaptive state tracking scheme is based on a recentl… ▽ More

    Submitted 25 March, 2024; originally announced March 2024.

  32. arXiv:2403.16649  [pdf, other

    cs.AI

    CLHA: A Simple yet Effective Contrastive Learning Framework for Human Alignment

    Authors: Feiteng Fang, Liang Zhu, Min Yang, Xi Feng, **chang Hou, Qixuan Zhao, Chengming Li, Xi** Hu, Ruifeng Xu

    Abstract: Reinforcement learning from human feedback (RLHF) is a crucial technique in aligning large language models (LLMs) with human preferences, ensuring these LLMs behave in beneficial and comprehensible ways to users. However, a longstanding challenge in human alignment techniques based on reinforcement learning lies in their inherent complexity and difficulty in training. To address this challenge, we… ▽ More

    Submitted 26 March, 2024; v1 submitted 25 March, 2024; originally announced March 2024.

  33. arXiv:2403.16067  [pdf, other

    cs.CV cs.AI

    Robust Diffusion Models for Adversarial Purification

    Authors: Guang Lin, Zerui Tao, Jianhai Zhang, Toshihisa Tanaka, Qibin Zhao

    Abstract: Diffusion models (DMs) based adversarial purification (AP) has shown to be the most powerful alternative to adversarial training (AT). However, these methods neglect the fact that pre-trained diffusion models themselves are not robust to adversarial attacks as well. Additionally, the diffusion process can easily destroy semantic information and generate a high quality image but totally different f… ▽ More

    Submitted 24 May, 2024; v1 submitted 24 March, 2024; originally announced March 2024.

  34. arXiv:2403.15574  [pdf, other

    cs.AI

    SensoryT5: Infusing Sensorimotor Norms into T5 for Enhanced Fine-grained Emotion Classification

    Authors: Yuhan Xia, Qingqing Zhao, Yunfei Long, Ge Xu, Jia Wang

    Abstract: In traditional research approaches, sensory perception and emotion classification have traditionally been considered separate domains. Yet, the significant influence of sensory experiences on emotional responses is undeniable. The natural language processing (NLP) community has often missed the opportunity to merge sensory knowledge with emotion classification. To address this gap, we propose Sens… ▽ More

    Submitted 22 March, 2024; originally announced March 2024.

    Comments: Accepted by CogALex 2024 conference

  35. Tensor-force effects on nuclear matter in relativistic ab initio theory

    Authors: Sibo Wang, Hui Tong, Chencan Wang, Qiang Zhao, Peter Ring, Jie Meng

    Abstract: Within the relativistic Brueckner-Hartree-Fock theory in the full Dirac space, the tensor-force effects on infinite nuclear matter are elucidated by subtracting the matrix elements of tensor forces from the realistic nucleon-nucleon interaction. The tensor-force effects for the binding energy per particle of symmetric nuclear matter (SNM) as well as the symmetry energy are attractive and are more… ▽ More

    Submitted 3 June, 2024; v1 submitted 21 March, 2024; originally announced March 2024.

    Comments: 5 pages, 2 figures, discussion on four-component unitary Fermi gas is updated, accepted by Science Bulletin

  36. arXiv:2403.11473  [pdf, other

    cs.CL cs.AI

    Word Order's Impacts: Insights from Reordering and Generation Analysis

    Authors: Qinghua Zhao, Jiaang Li, Lei Li, Zenghui Zhou, Junfeng Liu

    Abstract: Existing works have studied the impacts of the order of words within natural text. They usually analyze it by destroying the original order of words to create a scrambled sequence, and then comparing the models' performance between the original and scrambled sequences. The experimental results demonstrate marginal drops. Considering this findings, different hypothesis about word order is proposed,… ▽ More

    Submitted 18 March, 2024; originally announced March 2024.

  37. arXiv:2403.11405  [pdf, other

    eess.SP

    A Deep Learning Method for Beat-Level Risk Analysis and Interpretation of Atrial Fibrillation Patients during Sinus Rhythm

    Authors: Jun Lei, Yuxi Zhou, Xue Tian, Qinghao Zhao, Qi Zhang, Shijia Geng, Qingbo Wu, Shenda Hong

    Abstract: Atrial Fibrillation (AF) is a common cardiac arrhythmia. Many AF patients experience complications such as stroke and other cardiovascular issues. Early detection of AF is crucial. Existing algorithms can only distinguish ``AF rhythm in AF patients'' from ``sinus rhythm in normal individuals'' . However, AF patients do not always exhibit AF rhythm, posing a challenge for diagnosis when the AF rhyt… ▽ More

    Submitted 17 March, 2024; originally announced March 2024.

  38. arXiv:2403.11142  [pdf, other

    quant-ph

    Dynamics and Resonance Fluorescence from a Superconducting Artificial Atom Doubly Driven by Quantized and Classical Fields

    Authors: Xinhui Ruan, Jia-Heng Wang, Dong He, Pengtao Song, Shengyong Li, Qianchuan Zhao, L. M. Kuang, Jaw-Shen Tsai, Chang-Ling Zou, **g Zhang, Dongning Zheng, O. V. Astafiev, Yu-xi Liu, Zhihui Peng

    Abstract: We report an experimental demonstration of resonance fluorescence in a two-level superconducting artificial atom under two driving fields coupled to a detuned cavity. One of the fields is classical and the other is varied from quantum (vacuum fluctuations) to classical one by controlling the photon number inside the cavity. The device consists of a transmon qubit strongly coupled to a one-dimensio… ▽ More

    Submitted 17 March, 2024; originally announced March 2024.

  39. arXiv:2403.11101  [pdf, other

    cs.CV

    Hierarchical Generative Network for Face Morphing Attacks

    Authors: Zuyuan He, Zongyong Deng, Qiaoyun He, Qijun Zhao

    Abstract: Face morphing attacks circumvent face recognition systems (FRSs) by creating a morphed image that contains multiple identities. However, existing face morphing attack methods either sacrifice image quality or compromise the identity preservation capability. Consequently, these attacks fail to bypass FRSs verification well while still managing to deceive human observers. These methods typically rel… ▽ More

    Submitted 17 March, 2024; originally announced March 2024.

    Comments: Accepted by FG2024

  40. arXiv:2403.10831  [pdf, other

    cs.CV

    DUE: Dynamic Uncertainty-Aware Explanation Supervision via 3D Imputation

    Authors: Qilong Zhao, Yifei Zhang, Mengdan Zhu, Siyi Gu, Yuyang Gao, Xiaofeng Yang, Liang Zhao

    Abstract: Explanation supervision aims to enhance deep learning models by integrating additional signals to guide the generation of model explanations, showcasing notable improvements in both the predictability and explainability of the model. However, the application of explanation supervision to higher-dimensional data, such as 3D medical images, remains an under-explored domain. Challenges associated wit… ▽ More

    Submitted 16 March, 2024; originally announced March 2024.

    Comments: 9 pages,6 figures

  41. arXiv:2403.10481  [pdf, other

    eess.IV eess.SP

    Tensor Star Decomposition

    Authors: Wuyang Zhou, Yu-Bang Zheng, Qibin Zhao, Danilo Mandic

    Abstract: A novel tensor decomposition framework, termed Tensor Star (TS) decomposition, is proposed which represents a new type of tensor network decomposition based on tensor contractions. This is achieved by connecting the core tensors in a ring shape, whereby the core tensors act as skip connections between the factor tensors and allow for direct correlation characterisation between any two arbitrary di… ▽ More

    Submitted 15 March, 2024; originally announced March 2024.

  42. arXiv:2403.09037  [pdf, other

    cs.CV cs.CL

    The First to Know: How Token Distributions Reveal Hidden Knowledge in Large Vision-Language Models?

    Authors: Qinyu Zhao, Ming Xu, Kartik Gupta, Akshay Asthana, Liang Zheng, Stephen Gould

    Abstract: Large vision-language models (LVLMs), designed to interpret and respond to human instructions, occasionally generate hallucinated or harmful content due to inappropriate instructions. This study uses linear probing to shed light on the hidden knowledge at the output layer of LVLMs. We demonstrate that the logit distributions of the first tokens contain sufficient information to determine whether t… ▽ More

    Submitted 13 March, 2024; originally announced March 2024.

    Comments: Under review. Project page: https://github.com/Qinyu-Allen-Zhao/LVLM-LP

  43. arXiv:2403.08334  [pdf, other

    cs.CR

    DONAPI: Malicious NPM Packages Detector using Behavior Sequence Knowledge Map**

    Authors: Cheng Huang, Nannan Wang, Ziyan Wang, Siqi Sun, Lingzi Li, Junren Chen, Qianchong Zhao, Jiaxuan Han, Zhen Yang, Lei Shi

    Abstract: With the growing popularity of modularity in software development comes the rise of package managers and language ecosystems. Among them, npm stands out as the most extensive package manager, hosting more than 2 million third-party open-source packages that greatly simplify the process of building code. However, this openness also brings security risks, as evidenced by numerous package poisoning i… ▽ More

    Submitted 13 March, 2024; originally announced March 2024.

    Comments: 18 pages, accepted for publication at USENIX Security 2024

  44. arXiv:2403.06942  [pdf, other

    eess.SY cs.LG stat.ML

    Grid Monitoring and Protection with Continuous Point-on-Wave Measurements and Generative AI

    Authors: Lang Tong, Xinyi Wang, Qing Zhao

    Abstract: Purpose This article presents a case for a next-generation grid monitoring and control system, leveraging recent advances in generative artificial intelligence (AI), machine learning, and statistical inference. Advancing beyond earlier generations of wide-area monitoring systems built upon supervisory control and data acquisition (SCADA) and synchrophasor technologies, we argue for a monitoring an… ▽ More

    Submitted 11 March, 2024; originally announced March 2024.

  45. arXiv:2403.05854  [pdf, other

    cs.CV

    LTGC: Long-tail Recognition via Leveraging LLMs-driven Generated Content

    Authors: Qihao Zhao, Yalun Dai, Hao Li, Wei Hu, Fan Zhang, Jun Liu

    Abstract: Long-tail recognition is challenging because it requires the model to learn good representations from tail categories and address imbalances across all categories. In this paper, we propose a novel generative and fine-tuning framework, LTGC, to handle long-tail recognition via leveraging generated content. Firstly, inspired by the rich implicit knowledge in large-scale models (e.g., large language… ▽ More

    Submitted 26 May, 2024; v1 submitted 9 March, 2024; originally announced March 2024.

    Comments: CVPR 2024, Oral

  46. arXiv:2403.05808  [pdf, other

    cs.CV eess.IV

    Adaptive Multi-modal Fusion of Spatially Variant Kernel Refinement with Diffusion Model for Blind Image Super-Resolution

    Authors: Junxiong Lin, Yan Wang, Zeng Tao, Boyang Wang, Qing Zhao, Haorang Wang, Xuan Tong, Xinji Mai, Yuxuan Lin, Wei Song, Jiawen Yu, Shaoqi Yan, Wenqiang Zhang

    Abstract: Pre-trained diffusion models utilized for image generation encapsulate a substantial reservoir of a priori knowledge pertaining to intricate textures. Harnessing the potential of leveraging this a priori knowledge in the context of image super-resolution presents a compelling avenue. Nonetheless, prevailing diffusion-based methodologies presently overlook the constraints imposed by degradation inf… ▽ More

    Submitted 9 March, 2024; originally announced March 2024.

  47. arXiv:2403.05743  [pdf, ps, other

    eess.SP cs.LG econ.GN

    Forecasting Electricity Market Signals via Generative AI

    Authors: Xinyi Wang, Qing Zhao, Lang Tong

    Abstract: This paper presents a generative artificial intelligence approach to probabilistic forecasting of electricity market signals, such as real-time locational marginal prices and area control error signals. Inspired by the Wiener-Kallianpur innovation representation of nonparametric time series, we propose a weak innovation autoencoder architecture and a novel deep learning algorithm that extracts the… ▽ More

    Submitted 27 June, 2024; v1 submitted 8 March, 2024; originally announced March 2024.

  48. arXiv:2403.05444  [pdf, other

    cond-mat.mtrl-sci

    Chlorine and zinc co-do** effects on the electronic structure and optical properties of γ-CuI

    Authors: Chao Li, Meicong Li, Zhuli Zhang, Qiang Zhao, Naixin Liu, Kailei Wang, Fan Zhang, ** Ouyang

    Abstract: The effects of chlorine (Cl) and zinc (Zn) co-do** on the electronic structure and optical properties of the zinc blende (γ) phase of copper iodide (γ-CuI) scintillator material are investigated by using first-principles density functional theory calculations. The band structure, density of states, dielectric function, absorption coefficients, and reflectivity were analyzed before and after dopi… ▽ More

    Submitted 8 March, 2024; originally announced March 2024.

  49. arXiv:2403.04299  [pdf, other

    cs.RO cs.AI

    LitSim: A Conflict-aware Policy for Long-term Interactive Traffic Simulation

    Authors: Haojie Xin, Xiaodong Zhang, Renzhi Tang, Songyang Yan, Qianrui Zhao, Chunze Yang, Wen Cui, Zijiang Yang

    Abstract: Simulation is pivotal in evaluating the performance of autonomous driving systems due to the advantages of high efficiency and low cost compared to on-road testing. Bridging the gap between simulation and the real world requires realistic agent behaviors. However, the existing works have the following shortcomings in achieving this goal: (1) log replay offers realistic scenarios but often leads to… ▽ More

    Submitted 1 May, 2024; v1 submitted 7 March, 2024; originally announced March 2024.

    Comments: 9 pages, 6 figures, under review

  50. arXiv:2403.04294  [pdf, other

    cs.CV

    A$^{3}$lign-DFER: Pioneering Comprehensive Dynamic Affective Alignment for Dynamic Facial Expression Recognition with CLIP

    Authors: Zeng Tao, Yan Wang, Junxiong Lin, Haoran Wang, Xinji Mai, Jiawen Yu, Xuan Tong, Ziheng Zhou, Shaoqi Yan, Qing Zhao, Liyuan Han, Wenqiang Zhang

    Abstract: The performance of CLIP in dynamic facial expression recognition (DFER) task doesn't yield exceptional results as observed in other CLIP-based classification tasks. While CLIP's primary objective is to achieve alignment between images and text in the feature space, DFER poses challenges due to the abstract nature of text and the dynamic nature of video, making label representation limited and perf… ▽ More

    Submitted 7 March, 2024; originally announced March 2024.