Skip to main content

Showing 51–100 of 1,406 results for author: Han, Z

.
  1. arXiv:2404.18930  [pdf, other

    cs.CV

    Hallucination of Multimodal Large Language Models: A Survey

    Authors: Zechen Bai, Pichao Wang, Tianjun Xiao, Tong He, Zongbo Han, Zheng Zhang, Mike Zheng Shou

    Abstract: This survey presents a comprehensive analysis of the phenomenon of hallucination in multimodal large language models (MLLMs), also known as Large Vision-Language Models (LVLMs), which have demonstrated significant advancements and remarkable abilities in multimodal tasks. Despite these promising developments, MLLMs often generate outputs that are inconsistent with the visual content, a challenge k… ▽ More

    Submitted 29 April, 2024; originally announced April 2024.

    Comments: 140 references

  2. arXiv:2404.18077  [pdf, other

    cs.NI cs.LG

    Generative AI for Low-Carbon Artificial Intelligence of Things

    Authors: **bo Wen, Ruichen Zhang, Dusit Niyato, Jiawen Kang, Hongyang Du, Yang Zhang, Zhu Han

    Abstract: By integrating Artificial Intelligence (AI) with the Internet of Things (IoT), Artificial Intelligence of Things (AIoT) has revolutionized many fields. However, AIoT is facing the challenges of energy consumption and carbon emissions due to the continuous advancement of mobile technology. Fortunately, Generative AI (GAI) holds immense potential to reduce carbon emissions of AIoT due to its excelle… ▽ More

    Submitted 28 April, 2024; originally announced April 2024.

  3. arXiv:2404.16258  [pdf, ps, other

    math.AG

    Central charges in local mirror symmetry via hypergeometric duality

    Authors: Zengrui Han

    Abstract: We apply the better-behaved GKZ hypergeometric systems to study toric Calabi-Yau Deligne-Mumford stacks and their Hori-Vafa mirrors given by affine hypersurfaces in algebraic tori. We show the equality between A-brane and B-brane central charges, in terms of period integrals and hypergeometric series respectively. This settles a conjecture of Hosono, which could also be considered as a generalizat… ▽ More

    Submitted 14 May, 2024; v1 submitted 24 April, 2024; originally announced April 2024.

    Comments: 32 pages. Comparison with related results and references added; typos fixed; exposition improved

    MSC Class: 14M25; 14J32; 33C99

  4. arXiv:2404.14778  [pdf, other

    cs.IT eess.SP

    Channel Estimation for Optical Intelligent Reflecting Surface-Assisted VLC System: A Joint Space-Time Sampling Approach

    Authors: Shiyuan Sun, Fang Yang, Weidong Mei, Jian Song, Zhu Han, Rui Zhang

    Abstract: Optical intelligent reflecting surface (OIRS) has attracted increasing attention due to its capability of overcoming signal blockages in visible light communication (VLC), an emerging technology for the next-generation advanced transceivers. However, current works on OIRS predominantly assume known channel state information (CSI), which is essential to practical OIRS configuration. To bridge such… ▽ More

    Submitted 23 April, 2024; originally announced April 2024.

  5. arXiv:2404.14706  [pdf, other

    cs.IT eess.SP

    Channel Estimation for Optical IRS-Assisted VLC System via Spatial Coherence

    Authors: Shiyuan Sun, Fang Yang, Weidong Mei, Jian Song, Zhu Han, Rui Zhang

    Abstract: Optical intelligent reflecting surface (OIRS) has been considered a promising technology for visible light communication (VLC) by constructing visual line-of-sight propagation paths to address the signal blockage issue. However, the existing works on OIRSs are mostly based on perfect channel state information (CSI), whose acquisition appears to be challenging due to the passive nature of the OIRS.… ▽ More

    Submitted 22 April, 2024; originally announced April 2024.

  6. arXiv:2404.14140  [pdf, other

    eess.SP

    Generative Artificial Intelligence Assisted Wireless Sensing: Human Flow Detection in Practical Communication Environments

    Authors: Jiacheng Wang, Hongyang Du, Dusit Niyato, Zehui Xiong, Jiawen Kang, Bo Ai, Zhu Han, Dong In Kim

    Abstract: Groundbreaking applications such as ChatGPT have heightened research interest in generative artificial intelligence (GAI). Essentially, GAI excels not only in content generation but also in signal processing, offering support for wireless sensing. Hence, we introduce a novel GAI-assisted human flow detection system (G-HFD). Rigorously, G-HFD first uses channel state information (CSI) to estimate t… ▽ More

    Submitted 22 April, 2024; originally announced April 2024.

  7. arXiv:2404.14131  [pdf, ps, other

    gr-qc astro-ph.HE hep-th

    Possible signatures of higher dimension in thin accretion disk around brane world black hole

    Authors: Ailin Liu, Tong-Yu He, Ming Liu, Zhan-Wen Han, Rong-Jia Yang

    Abstract: We probe deeply into the characteristics of thin accretion disk surrounding black hole within the brane world paradigm. We investigate how model parameters affect the physical properties of the disk. Our findings indicate that as the tidal charge parameter inherited from the higher dimension increases, the energy flux, the radiation temperature, the spectral cutoff frequency, the spectral luminosi… ▽ More

    Submitted 22 April, 2024; originally announced April 2024.

    Comments: 16 pages, 16 figures

  8. arXiv:2404.13816  [pdf, other

    cs.CV

    Neural Radiance Field in Autonomous Driving: A Survey

    Authors: Lei He, Leheng Li, Wenchao Sun, Zeyu Han, Yichen Liu, Sifa Zheng, Jianqiang Wang, Keqiang Li

    Abstract: Neural Radiance Field (NeRF) has garnered significant attention from both academia and industry due to its intrinsic advantages, particularly its implicit representation and novel view synthesis capabilities. With the rapid advancements in deep learning, a multitude of methods have emerged to explore the potential applications of NeRF in the domain of Autonomous Driving (AD). However, a conspicuou… ▽ More

    Submitted 26 April, 2024; v1 submitted 21 April, 2024; originally announced April 2024.

  9. arXiv:2404.13689  [pdf, other

    math.AP

    Stability of the Abstract Thermoelastic System with Singularity

    Authors: Chenxi Deng, Zhong-Jie Han, Zhaobin Kuang, Qiong Zhang

    Abstract: In this paper, we analyze an abstract thermoelastic system, where the heat conduction follows the Cattaneo law. Zero becomes a spectrum point of the system operator when the coupling and thermal dam** parameters of the system satisfy specific conditions. We obtain the decay rates of solutions to the system with or without the inertial term. Furthermore, the decay rate of the system without inert… ▽ More

    Submitted 21 April, 2024; originally announced April 2024.

    Comments: 15 pages

    MSC Class: 35Q74; 74F05

  10. arXiv:2404.12666  [pdf, other

    cs.DC cs.CR cs.ET

    A Survey on Federated Analytics: Taxonomy, Enabling Techniques, Applications and Open Issues

    Authors: Zibo Wang, Haichao Ji, Yifei Zhu, Dan Wang, Zhu Han

    Abstract: The escalating influx of data generated by networked edge devices, coupled with the growing awareness of data privacy, has promoted a transformative shift in computing paradigms from centralized data processing to privacy-preserved distributed data processing. Federated analytics (FA) is an emerging technique to support collaborative data analytics among diverse data owners without centralizing th… ▽ More

    Submitted 19 April, 2024; originally announced April 2024.

    Comments: This survey has been submitted to IEEE Communications Surveys & Tutorials

  11. arXiv:2404.12154  [pdf, other

    cs.CV

    StyleBooth: Image Style Editing with Multimodal Instruction

    Authors: Zhen Han, Chaojie Mao, Zeyinzi Jiang, Yulin Pan, **gfeng Zhang

    Abstract: Given an original image, image editing aims to generate an image that align with the provided instruction. The challenges are to accept multimodal inputs as instructions and a scarcity of high-quality training data, including crucial triplets of source/target image pairs and multimodal (text and image) instructions. In this paper, we focus on image style editing and present StyleBooth, a method th… ▽ More

    Submitted 18 April, 2024; originally announced April 2024.

  12. arXiv:2404.11950  [pdf, other

    cond-mat.str-el cond-mat.supr-con

    Pair density waves in the strong-coupling two-dimensional Holstein-Hubbard model: a variational Monte Carlo study

    Authors: Jiucai Wang, Wen Sun, Hao-Xin Wang, Zhaoyu Han, Steven A. Kivelson, Hong Yao

    Abstract: A robust theory of the mechanism of pair density wave (PDW) superconductivity (i.e. where Cooper pairs have nonzero center of mass momentum) remains elusive. Here we explore the triangular lattice $t$-$J$-$V$ model, a low-energy effective theory derived from the strong-coupling limit of the Holstein-Hubbard model, by large-scale variational Monte Carlo simulations. When the electron density is suf… ▽ More

    Submitted 18 April, 2024; originally announced April 2024.

    Comments: 4.5 pages, 4 figures, 2 tables

  13. arXiv:2404.10343  [pdf, other

    cs.CV eess.IV

    The Ninth NTIRE 2024 Efficient Super-Resolution Challenge Report

    Authors: Bin Ren, Yawei Li, Nancy Mehta, Radu Timofte, Hongyuan Yu, Cheng Wan, Yuxin Hong, Bingnan Han, Zhuoyuan Wu, Yajun Zou, Yuqing Liu, Jizhe Li, Keji He, Chao Fan, Heng Zhang, Xiaolin Zhang, Xuanwu Yin, Kunlong Zuo, Bohao Liao, Peizhe Xia, Long Peng, Zhibo Du, Xin Di, Wangkai Li, Yang Wang , et al. (109 additional authors not shown)

    Abstract: This paper provides a comprehensive review of the NTIRE 2024 challenge, focusing on efficient single-image super-resolution (ESR) solutions and their outcomes. The task of this challenge is to super-resolve an input image with a magnification factor of x4 based on pairs of low and corresponding high-resolution images. The primary objective is to develop networks that optimize various aspects such… ▽ More

    Submitted 25 June, 2024; v1 submitted 16 April, 2024; originally announced April 2024.

    Comments: The report paper of NTIRE2024 Efficient Super-resolution, accepted by CVPRW2024

  14. arXiv:2404.09699  [pdf, other

    cs.GT

    Generative AI for Game Theory-based Mobile Networking

    Authors: Long He, Geng Sun, Dusit Niyato, Hongyang Du, Fang Mei, Jiawen Kang, Mérouane Debbah, and Zhu Han

    Abstract: With the continuous advancement of network technology, various emerging complex networking optimization problems opened up a wide range of applications utilizating of game theory. However, since game theory is a mathematical framework, game theory-based solutions often require the experience and knowledge of human experts. Recently, the remarkable advantages exhibited by generative artificial inte… ▽ More

    Submitted 15 April, 2024; originally announced April 2024.

  15. arXiv:2404.09079  [pdf, ps, other

    math.AP

    Compactness results for a Dirichlet energy of nonlocal gradient with applications

    Authors: Zhaolong Han, Tadele Mengesha, Xiaochuan Tian

    Abstract: We prove two compactness results for function spaces with finite Dirichlet energy of half-space nonlocal gradients. In each of these results, we provide sufficient conditions on a sequence of kernel functions that guarantee the asymptotic compact embedding of the associated nonlocal function spaces into the class of square-integrable functions. Moreover, we will demonstrate that the sequence of no… ▽ More

    Submitted 13 April, 2024; originally announced April 2024.

  16. arXiv:2404.08160  [pdf, other

    cs.CR

    A Survey on Security of Ultra/Hyper Reliable Low Latency Communication: Recent Advancements, Challenges, and Future Directions

    Authors: Annapurna Pradhan, Susmita Das, Md. Jalil Piran, Zhu Han

    Abstract: Ultra-reliable low latency communication (URLLC) is an innovative service offered by fifth-generation (5G) wireless systems. URLLC enables various mission-critical applications by facilitating reliable and low-latency signal transmission to support extreme Quality of Service (QoS) requirements. Apart from reliability and latency, ensuring secure data transmission for URLLC has been a prominent iss… ▽ More

    Submitted 11 April, 2024; originally announced April 2024.

  17. arXiv:2404.07477  [pdf, ps, other

    eess.SP

    Integrated Sensing and Communication Under DISCO Physical-Layer Jamming Attacks

    Authors: Huan Huang, Hongliang Zhang, Weidong Mei, Jun Li, Yi Cai, A. Lee Swindlehurst, Zhu Han

    Abstract: Integrated sensing and communication (ISAC) systems traditionally presuppose that sensing and communication (S&C) channels remain approximately constant during their coherence time. However, a "DISCO" reconfigurable intelligent surface (DRIS), i.e., an illegitimate RIS with random, time-varying reflection properties that acts like a "disco ball," introduces a paradigm shift that enables active cha… ▽ More

    Submitted 11 April, 2024; originally announced April 2024.

    Comments: This paper has been submitted for possible publication. For the code of the DISCO RIS is available on Github (https://github.com/huanhuan1799/Disco-Intelligent-Reflecting-Surfaces-Active-Channel-Aging-for-Fully-Passive-Jamming-Attacks)

  18. arXiv:2404.06851  [pdf, other

    cs.CV

    UDiFF: Generating Conditional Unsigned Distance Fields with Optimal Wavelet Diffusion

    Authors: Junsheng Zhou, Weiqi Zhang, Baorui Ma, Kanle Shi, Yu-Shen Liu, Zhizhong Han

    Abstract: Diffusion models have shown remarkable results for image generation, editing and inpainting. Recent works explore diffusion models for 3D shape generation with neural implicit functions, i.e., signed distance function and occupancy function. However, they are limited to shapes with closed surfaces, which prevents them from generating diverse 3D real-world contents containing open surfaces. In this… ▽ More

    Submitted 10 April, 2024; originally announced April 2024.

    Comments: To appear at CVPR2024. Project page: https://weiqi-zhang.github.io/UDiFF

  19. arXiv:2404.06765  [pdf, other

    eess.SP

    Harnessing the Power of AI-Generated Content for Semantic Communication

    Authors: Yiru Wang, Wanting Yang, Zehui Xiong, Yu** Zhao, Tony Q. S. Quek, Zhu Han

    Abstract: Semantic Communication (SemCom) is envisaged as the next-generation paradigm to address challenges stemming from the conflicts between the increasing volume of transmission data and the scarcity of spectrum resources. However, existing SemCom systems face drawbacks, such as low explainability, modality rigidity, and inadequate reconstruction functionality. Recognizing the transformative capabiliti… ▽ More

    Submitted 10 April, 2024; originally announced April 2024.

  20. arXiv:2404.06148  [pdf, other

    astro-ph.SR

    The radius variations of accreting main sequence stars and mass transfer instability

    Authors: Zi-Qi Zhao, Zhen-Wei Li, Lin Xiao, Hong-Wei Ge, Zhan-Wen Han

    Abstract: Many previous works studied the dynamical timescale mass transfer stability criteria based on the donor response with neglecting the stellar structure of the accretor. In this letter, we investigate the radial response of accretors with mass accumulation and its effect on the binary mass transfer stability. We perform a series of detailed stellar evolution simulations with different types of accre… ▽ More

    Submitted 12 April, 2024; v1 submitted 9 April, 2024; originally announced April 2024.

    Comments: 7 pages,9 figures, accepted for publication in MNRAS

  21. arXiv:2404.06046  [pdf, other

    nucl-ex nucl-th

    Nuclear charge radii of germanium isotopes around $N$ = 40

    Authors: S. J. Wang, A. Kanellakopoulos, X. F. Yang, S. W. Bai, J. Billowes, M. L. Bissell, K. Blaum, B. Cheal, C. S. Devlin, R. F. Garcia Ruiz, J. Z. Han, H. Heylen, S. Kaufmann, K. Konig, A. Koszorus, S. Lechner, S. Malbrunot-Ettenauer, W. Nazarewicz, R. Neugart, G. Neyens, W. Nortershauser, T. Ratajczyk, P. -G. Reinhard, L. V. Rodrıguez, S. Sels , et al. (4 additional authors not shown)

    Abstract: Collinear laser spectroscopy measurements were performed on $^{68-74}$Ge isotopes ($Z = 32$) at ISOLDE-CERN, by probing the $4s^2 4p^2 \, ^3\!P_1 \rightarrow 4s^2 4p 5s \, ^3\!P_1^o$ atomic transition (269~nm) of germanium. Nuclear charge radii are determined via the measured isotope shifts, revealing a larger local variation than the neighboring isotopic chains. Nuclear density functional theory… ▽ More

    Submitted 9 April, 2024; originally announced April 2024.

    Comments: 6 pages,5 figures

  22. arXiv:2404.05050  [pdf, other

    cs.HC

    Co-design Accessible Public Robots: Insights from People with Mobility Disability, Robotic Practitioners and Their Collaborations

    Authors: Howard Ziyu Han, Franklin Mingzhe Li, Alesandra Baca Vazquez, Daragh Byrne, Nikolas Martelaro, Sarah E Fox

    Abstract: Sidewalk robots are increasingly common across the globe. Yet, their operation on public paths poses challenges for people with mobility disabilities (PwMD) who face barriers to accessibility, such as insufficient curb cuts. We interviewed 15 PwMD to understand how they perceive sidewalk robots. Findings indicated that PwMD feel they have to compete for space on the sidewalk when robots are introd… ▽ More

    Submitted 7 April, 2024; originally announced April 2024.

  23. arXiv:2404.04835  [pdf, other

    astro-ph.SR astro-ph.HE

    A born ultramassive white dwarf-hot subdwarf super-Chandrasekhar candidate

    Authors: Changqing Luo, Jiao Li, Chuanjie Zheng, Dongdong Liu, Zhenwei Li, Yang** Luo, Peter Nemeth, Bo Zhang, Bo Wang, Song Wang, Yu Bai, Qingzheng Li, Pei Wang, Zhanwen Han, Jifeng Liu, Yang Huang, Xuefei Chen, Chao Liu

    Abstract: Although supernovae is a well-known endpoint of an accreting white dwarf, alternative theoretical possibilities has been discussing broadly, such as the accretion-induced collapse (AIC) event as the endpoint of oxygen-neon (ONe) white dwarfs, either accreting up to or merging to excess the Chandrasekhar limit (the maximum mass of a stable white dwarf). AIC is an important channel to form neutron s… ▽ More

    Submitted 7 April, 2024; originally announced April 2024.

    Comments: 25 pages, 14 figures

  24. arXiv:2404.04830  [pdf

    cond-mat.mes-hall

    Magneto-Induced Topological Phase Transition in Inverted InAs/GaSb Bilayers

    Authors: Zhongdong Han, Tingxin Li, Long Zhang, Rui-Rui Du

    Abstract: We report a magneto-induced topological phase transition in inverted InAs/GaSb bilayers from a quantum spin Hall insulator to a normal insulator. We utilize a dual-gated Corbino device in which the degree of band inversion, or equivalently the electron and hole densities, can be continuously tuned. We observe a topological phase transition around the magnetic field where a band crossing occurs, th… ▽ More

    Submitted 7 April, 2024; originally announced April 2024.

    Comments: 15+15 pages, 4+9 figures

  25. arXiv:2404.03866  [pdf, other

    astro-ph.SR astro-ph.IM

    Derivative Spectroscopy and its Application at Detecting the Weak Emission/Absorption Lines

    Authors: Lihuan Yu, Jiangdan Li, **liang Wang, Jiajia Li, Jiao Li, Qiang Xi, Zhanwen Han

    Abstract: The development of spectroscopic survey telescopes like Large Sky Area Multi-Object Fiber Spectroscopic Telescope (LAMOST), Apache Point Observatory Galactic Evolution Experiment and Sloan Digital Sky Survey has opened up unprecedented opportunities for stellar classification. Specific types of stars, such as early-type emission-line stars and those with stellar winds, can be distinguished by the… ▽ More

    Submitted 4 April, 2024; originally announced April 2024.

  26. arXiv:2404.03411  [pdf, ps, other

    cs.LG cs.CL cs.CR

    Red Teaming GPT-4V: Are GPT-4V Safe Against Uni/Multi-Modal Jailbreak Attacks?

    Authors: Shuo Chen, Zhen Han, Bailan He, Zifeng Ding, Wenqian Yu, Philip Torr, Volker Tresp, **dong Gu

    Abstract: Various jailbreak attacks have been proposed to red-team Large Language Models (LLMs) and revealed the vulnerable safeguards of LLMs. Besides, some methods are not limited to the textual modality and extend the jailbreak attack to Multimodal Large Language Models (MLLMs) by perturbing the visual input. However, the absence of a universal evaluation benchmark complicates the performance reproductio… ▽ More

    Submitted 4 April, 2024; originally announced April 2024.

    Comments: technical report

  27. arXiv:2404.02163  [pdf, other

    cs.IT

    FastqZip: An Improved Reference-Based Genome Sequence Lossy Compression Framework

    Authors: Yuanjian Liu, Huihao Luo, Zhijun Han, Yao Hu, Yehui Yang, Kyle Chard, Sheng Di, Ian Foster, Jiesheng Wu

    Abstract: Storing and archiving data produced by next-generation sequencing (NGS) is a huge burden for research institutions. Reference-based compression algorithms are effective in dealing with these data. Our work focuses on compressing FASTQ format files with an improved reference-based compression algorithm to achieve a higher compression ratio than other state-of-the-art algorithms. We propose FastqZip… ▽ More

    Submitted 22 February, 2024; originally announced April 2024.

  28. arXiv:2404.01954  [pdf, other

    cs.CL cs.AI

    HyperCLOVA X Technical Report

    Authors: Kang Min Yoo, Jaegeun Han, Sookyo In, Heewon Jeon, Jisu Jeong, Jaewook Kang, Hyunwook Kim, Kyung-Min Kim, Munhyong Kim, Sungju Kim, Donghyun Kwak, Hanock Kwak, Se Jung Kwon, Bado Lee, Dongsoo Lee, Gichang Lee, Jooho Lee, Baeseong Park, Seong** Shin, Joonsang Yu, Seolki Baek, Sumin Byeon, Eungsup Cho, Dooseok Choe, Jeesung Han , et al. (371 additional authors not shown)

    Abstract: We introduce HyperCLOVA X, a family of large language models (LLMs) tailored to the Korean language and culture, along with competitive capabilities in English, math, and coding. HyperCLOVA X was trained on a balanced mix of Korean, English, and code data, followed by instruction-tuning with high-quality human-annotated datasets while abiding by strict safety guidelines reflecting our commitment t… ▽ More

    Submitted 13 April, 2024; v1 submitted 2 April, 2024; originally announced April 2024.

    Comments: 44 pages; updated authors list and fixed author names

  29. arXiv:2404.01158  [pdf, other

    cs.CL cs.RO

    Dialogue with Robots: Proposals for Broadening Participation and Research in the SLIVAR Community

    Authors: Casey Kennington, Malihe Alikhani, Heather Pon-Barry, Katherine Atwell, Yonatan Bisk, Daniel Fried, Felix Gervits, Zhao Han, Mert Inan, Michael Johnston, Raj Korpan, Diane Litman, Matthew Marge, Cynthia Matuszek, Ross Mead, Shiwali Mohan, Raymond Mooney, Natalie Parde, Jivko Sinapov, Angela Stewart, Matthew Stone, Stefanie Tellex, Tom Williams

    Abstract: The ability to interact with machines using natural human language is becoming not just commonplace, but expected. The next step is not just text interfaces, but speech interfaces and not just with computers, but with all machines including robots. In this paper, we chronicle the recent history of this growing field of spoken dialogue with robots and offer the community three proposals, the first… ▽ More

    Submitted 1 April, 2024; originally announced April 2024.

    Comments: NSF Report on the "Dialogue with Robots" Workshop held in Pittsburg, PA, April 2023

  30. From Learning to Analytics: Improving Model Efficacy with Goal-Directed Client Selection

    Authors: **gwen Tong, Zhenzhen Chen, Liqun Fu, Jun Zhang, Zhu Han

    Abstract: Federated learning (FL) is an appealing paradigm for learning a global model among distributed clients while preserving data privacy. Driven by the demand for high-quality user experiences, evaluating the well-trained global model after the FL process is crucial. In this paper, we propose a closed-loop model analytics framework that allows for effective evaluation of the trained global model using… ▽ More

    Submitted 30 March, 2024; originally announced April 2024.

    Comments: This work was partly presented at IEEE ICC 2022

    MSC Class: 14J60 ACM Class: I.2.7

  31. arXiv:2404.00323  [pdf, other

    cs.CV cs.LG

    CLIP-driven Outliers Synthesis for few-shot OOD detection

    Authors: Hao Sun, Rundong He, Zhongyi Han, Zhicong Lin, Yongshun Gong, Yilong Yin

    Abstract: Few-shot OOD detection focuses on recognizing out-of-distribution (OOD) images that belong to classes unseen during training, with the use of only a small number of labeled in-distribution (ID) images. Up to now, a mainstream strategy is based on large-scale vision-language models, such as CLIP. However, these methods overlook a crucial issue: the lack of reliable OOD supervision information, whic… ▽ More

    Submitted 30 March, 2024; originally announced April 2024.

    Comments: 9 pages,5 figures

  32. arXiv:2403.19534  [pdf, other

    cs.CV

    Locate, Assign, Refine: Taming Customized Image Inpainting with Text-Subject Guidance

    Authors: Yulin Pan, Chaojie Mao, Zeyinzi Jiang, Zhen Han, **gfeng Zhang

    Abstract: Prior studies have made significant progress in image inpainting guided by either text or subject image. However, the research on editing with their combined guidance is still in the early stages. To tackle this challenge, we present LAR-Gen, a novel approach for image inpainting that enables seamless inpainting of masked scene images, incorporating both the textual prompts and specified subjects.… ▽ More

    Submitted 28 March, 2024; originally announced March 2024.

    Comments: 22 pages, 14 figures

  33. arXiv:2403.19432  [pdf, other

    cs.CL cs.AI

    Uncovering Misattributed Suicide Causes through Annotation Inconsistency Detection in Death Investigation Notes

    Authors: Song Wang, Yiliang Zhou, Ziqiang Han, Cui Tao, Yunyu Xiao, Ying Ding, Joydeep Ghosh, Yifan Peng

    Abstract: Data accuracy is essential for scientific research and policy development. The National Violent Death Reporting System (NVDRS) data is widely used for discovering the patterns and causes of death. Recent studies suggested the annotation inconsistencies within the NVDRS and the potential impact on erroneous suicide-cause attributions. We present an empirical Natural Language Processing (NLP) approa… ▽ More

    Submitted 29 March, 2024; v1 submitted 28 March, 2024; originally announced March 2024.

    Comments: 19 pages, 6 figures

  34. arXiv:2403.17326  [pdf

    cond-mat.mtrl-sci

    Unveiling the origin of unconventional moire ferroelectricity

    Authors: Ruirui Niu, Zhuoxian Li, Xiangyan Han, Qianling Liu, Zhuangzhuang Qu, Zhiyu Wang, Chunrui Han, Kenji Watanabe, Takashi Taniguchi, Kaihui Liu, **hai Mao, Wu Shi, Bo Peng, Zheng Vitto Han, Zizhao Gan, Jianming Lu

    Abstract: Interfacial ferroelectricity emerges in heterostructures consisting of nonpolar van der Waals (vdW) layers, greatly expanding the scope of two dimensional ferroelectrics. In particular, the unconventional moire ferroelectricity observed in bilayer graphene/boron nitride (BN) heterostructures, exhibits promising functionalities with topological current, superconductivity and synaptic responses. How… ▽ More

    Submitted 25 March, 2024; originally announced March 2024.

  35. arXiv:2403.16003  [pdf, other

    cs.CV cs.AI

    Diverse Representation Embedding for Lifelong Person Re-Identification

    Authors: Shiben Liu, Huijie Fan, Qiang Wang, Xiai Chen, Zhi Han, Yandong Tang

    Abstract: Lifelong Person Re-Identification (LReID) aims to continuously learn from successive data streams, matching individuals across multiple cameras. The key challenge for LReID is how to effectively preserve old knowledge while incrementally learning new information, which is caused by task-level domain gaps and limited old task datasets. Existing methods based on CNN backbone are insufficient to expl… ▽ More

    Submitted 2 April, 2024; v1 submitted 24 March, 2024; originally announced March 2024.

    Comments: 11 pages,7 Tables,3 Figures

  36. arXiv:2403.14608  [pdf, other

    cs.LG

    Parameter-Efficient Fine-Tuning for Large Models: A Comprehensive Survey

    Authors: Zeyu Han, Chao Gao, **yang Liu, Jeff Zhang, Sai Qian Zhang

    Abstract: Large models represent a groundbreaking advancement in multiple application fields, enabling remarkable achievements across various tasks. However, their unprecedented scale comes with significant computational costs. These models, often consisting of billions of parameters, require vast amounts of computational resources for execution. Especially, the expansive scale and computational demands pos… ▽ More

    Submitted 29 April, 2024; v1 submitted 21 March, 2024; originally announced March 2024.

    Comments: 24 pages, 12 figures

  37. arXiv:2403.12771  [pdf, other

    astro-ph.SR

    TYC 3340-2437-1: A Quadruple System with A Massive Star

    Authors: Jiao Li, Chao Liu, Changqing Luo, Bo Zhang, Jiang-Dan Li, Jia-Dong Li, Zhan-Wen Han, Xue-Fei Chen, Lu-Qian Wang, Min Fang, Li-Feng Xing, Xi-Liang Zhang, Chichuan **

    Abstract: Hierarchical massive quadruple systems are ideal laboratories for examining the theories of star formation, dynamical evolution, and stellar evolution. The successive mergers of hierarchical quadruple systems might explain the mass gap between neutron stars and black holes. Looking for light curves of O-type binaries identified by LAMOST, we find a (2+2) quadruple system: TYC 3340-2437-1, located… ▽ More

    Submitted 19 March, 2024; originally announced March 2024.

  38. arXiv:2403.12361  [pdf

    cond-mat.mtrl-sci cond-mat.mes-hall cond-mat.other physics.app-ph

    Multi-State, Ultra-thin, BEOL-Compatible AlScN Ferroelectric Diodes

    Authors: Kwan-Ho Kim, Zirun Han, Yinuo Zhang, Pariasadat Musavigharavi, Jeffrey Zheng, Dhiren K. Pradhan, Eric A. Stach, Roy H. Olsson III, Deep Jariwala

    Abstract: The growth in data generation necessitates efficient data processing technologies to address the von Neumann bottleneck in conventional computer architecture. Memory-driven computing, which integrates non-volatile memory (NVM) devices in a 3D stack, is gaining attention, with CMOS back-end-of-line (BEOL) compatible ferroelectric (FE) diodes being ideal due to their two-terminal design and inherent… ▽ More

    Submitted 18 March, 2024; originally announced March 2024.

  39. arXiv:2403.11071  [pdf, other

    eess.SP cs.IT

    Wavenumber Domain Sparse Channel Estimation in Holographic MIMO

    Authors: Xufeng Guo, Yuanbin Chen, Ying Wang, Zhaocheng Wang, Zhu Han

    Abstract: In this paper, we investigate the sparse channel estimation in holographic multiple-input multiple-output (HMIMO) systems. The conventional angular-domain representation fails to capture the continuous angular power spectrum characterized by the spatially-stationary electromagnetic random field, thus leading to the ambiguous detection of the significant angular power, which is referred to as the p… ▽ More

    Submitted 16 March, 2024; originally announced March 2024.

    Comments: This paper has been accepted in 2024 ICC

  40. arXiv:2403.08931  [pdf, ps, other

    eess.SY

    Unleashing the True Power of Age-of-Information: Service Aggregation in Connected and Autonomous Vehicles

    Authors: Anik Mallik, Dawei Chen, Kyungtae Han, Jiang Xie, Zhu Han

    Abstract: Connected and autonomous vehicles (CAVs) rely heavily upon time-sensitive information update services to ensure the safety of people and assets, and satisfactory entertainment applications. Therefore, the freshness of information is a crucial performance metric for CAV services. However, information from roadside sensors and nearby vehicles can get delayed in transmission due to the high mobility… ▽ More

    Submitted 13 March, 2024; originally announced March 2024.

    Comments: 6 pages, 8 figures, to appear in the Proceedings of IEEE International Conference on Communications (IEEE ICC, 9-13 June 2024, Denver, CO, USA)

  41. arXiv:2403.07252  [pdf, ps, other

    math.RT

    Serre functors and complete torsion pairs

    Authors: Zhe Han, ** He

    Abstract: Given a torsion pair $(\mathcal{T},\mathcal{F})$ in an abelian category $\mathcal{A}$, there is a t-structure $(\mathcal{U}_\mathcal{T},\mathcal{V}_\mathcal{T})$ determined by $\mathcal{T}$ on the derived category $D^b(\mathcal{A})$. The existence of derived equivalence between heart $\mathcal{B}$ of the t-structure and $\mathcal{A}$ which naturally extends the embedding… ▽ More

    Submitted 11 March, 2024; originally announced March 2024.

    Comments: 18pages

  42. arXiv:2403.06927  [pdf

    physics.optics

    Effective multiband synthetic four-wave mixing by cascading quadratic processes

    Authors: Li Chen, Zheng Ge, Su-Jian Niu, Yin-Hai Li, Zhao-Qi-Zhi Han, Yue-Wei Song, Wu-Zhen Li, Ren-Hui Chen, Ming-Yuan Gao, Meng-Yu Xie, Zhi-Yuan Zhou, Bao-Sen Shi

    Abstract: Four wave mixing (FWM) is an important way to generate supercontinuum and frequency combs in the mid-infrared band. Here, we obtain simultaneous synthetic FWM in the visible and mid-infrared bands by cascading quadratic nonlinear processes in a periodically poled lithium niobate crystal (PPLN), which has a 110dB(at 3000nm) higher conversion efficiency than the FWM directly generated by third-order… ▽ More

    Submitted 11 March, 2024; originally announced March 2024.

  43. arXiv:2403.06388  [pdf, other

    cs.CR cs.LG

    A Zero Trust Framework for Realization and Defense Against Generative AI Attacks in Power Grid

    Authors: Md. Shirajum Munir, Sravanthi Proddatoori, Manjushree Muralidhara, Walid Saad, Zhu Han, Sachin Shetty

    Abstract: Understanding the potential of generative AI (GenAI)-based attacks on the power grid is a fundamental challenge that must be addressed in order to protect the power grid by realizing and validating risk in new attack vectors. In this paper, a novel zero trust framework for a power grid supply chain (PGSC) is proposed. This framework facilitates early detection of potential GenAI-driven attack vect… ▽ More

    Submitted 10 March, 2024; originally announced March 2024.

    Comments: Accepted article by IEEE International Conference on Communications (ICC 2024), Copyright 2024 IEEE

  44. arXiv:2403.05826  [pdf, other

    cs.NI eess.SP

    Cached Model-as-a-Resource: Provisioning Large Language Model Agents for Edge Intelligence in Space-air-ground Integrated Networks

    Authors: Minrui Xu, Dusit Niyato, Hongliang Zhang, Jiawen Kang, Zehui Xiong, Shiwen Mao, Zhu Han

    Abstract: Edge intelligence in space-air-ground integrated networks (SAGINs) can enable worldwide network coverage beyond geographical limitations for users to access ubiquitous and low-latency intelligence services. Facing global coverage and complex environments in SAGINs, edge intelligence can provision approximate large language models (LLMs) agents for users via edge servers at ground base stations (BS… ▽ More

    Submitted 31 May, 2024; v1 submitted 9 March, 2024; originally announced March 2024.

  45. arXiv:2403.05793  [pdf, ps, other

    eess.SP

    Performance Bounds for Passive Sensing in Asynchronous ISAC Systems -- Appendices

    Authors: **gbo Zhao, Zhaoming Lu, J. Andrew Zhang, Weicai Li, Yifeng Xiong, Zijun Han, Xiangming Wen, Tao Gu

    Abstract: This document contains the appendices for our paper titled ``Performance Bounds for Passive Sensing in Asynchronous ISAC Systems." The appendices include rigorous derivations of key formulas, detailed proofs of the theorems and propositions introduced in the paper, and details of the algorithm tested in the numerical simulation for validation. These appendices aim to support and elaborate on the f… ▽ More

    Submitted 29 March, 2024; v1 submitted 8 March, 2024; originally announced March 2024.

    Comments: 5 pages

  46. arXiv:2403.05567  [pdf, other

    cs.HC

    A Unified Framework for Underwater Metaverse with Optical Perception

    Authors: **gyang Cao, Mu Zhou, Jiacheng Wang, Guangyuan Liu, Dusit Niyato, Shiwen Mao, Zhu Han, Jiawen Kang

    Abstract: With the advancement of AI technology and increasing attention to deep-sea exploration, the underwater Metaverse is gradually emerging. This paper explores the concept of underwater Metaverse, emerging virtual reality systems and services aimed at simulating and enhancing virtual experience of marine environments. First, we discuss potential applications of underwater Metaverse in underwater scien… ▽ More

    Submitted 20 February, 2024; originally announced March 2024.

  47. arXiv:2403.02977  [pdf, other

    cs.RO

    Fast Iterative Region Inflation for Computing Large 2-D/3-D Convex Regions of Obstacle-Free Space

    Authors: Qianhao Wang, Zhepei Wang, Mingyang Wang, Jialin Ji, Zhichao Han, Tianyue Wu, Rui **, Yuman Gao, Chao Xu, Fei Gao

    Abstract: Convex polytopes have compact representations and exhibit convexity, which makes them suitable for abstracting obstacle-free spaces from various environments. Existing methods for generating convex polytopes always struggle to strike a balance between two requirements, producing high-quality polytope and efficiency. Moreover, another crucial requirement for convex polytopes to accurately contain c… ▽ More

    Submitted 6 June, 2024; v1 submitted 5 March, 2024; originally announced March 2024.

  48. arXiv:2402.18404  [pdf

    quant-ph physics.optics

    Polarization entanglement by two simultaneous backward phase-matching processes in a single crystal

    Authors: Ming-Yuan Gao, Yin-Hai Li, Zhao-Qi-Zhi Han, Qiang Zhou, Guang-Can Guo, Zhi-Yuan Zhou, Bao-Sen Shi

    Abstract: Entanglement enables many promising applications in quantum technology. Devising new generation methods and harnessing entanglement are prerequisites for practical applications. Here we realize a distinct polarization-entangled source by simultaneously achieving type-0 and type-I backward quasi-phase matching (BQPM) through spontaneous parametric down-conversion in a single bulk crystal, which is… ▽ More

    Submitted 28 February, 2024; originally announced February 2024.

  49. arXiv:2402.17401  [pdf

    quant-ph physics.optics

    Quantum entanglement enabled ellipsometer for phase retardance measurement

    Authors: Meng-Yu Xie, Su-Jian Niu, Yin-Hai Li, Zheng Ge, Ming-Yuan Gao, Zhao-Qi-Zhi Han, Ren-Hui Chen, Zhi-Yuan Zhou, Bao-Sen Shi

    Abstract: An ellipsometer is a vital precision tool used for measuring optical parameters with wide applications in many fields, including accurate measurements in film thickness, optical constants, structural profiles, etc. However, the precise measurement of photosensitive materials meets huge obstacles because of the excessive input photons, therefore the requirement of enhancing detection accuracy under… ▽ More

    Submitted 27 February, 2024; originally announced February 2024.

    Comments: 13 pages, 5 figures. This work has been submitted for possible publication

  50. arXiv:2402.14899  [pdf, other

    cs.CV cs.AI cs.CR cs.LG

    Stop Reasoning! When Multimodal LLMs with Chain-of-Thought Reasoning Meets Adversarial Images

    Authors: Zefeng Wang, Zhen Han, Shuo Chen, Fan Xue, Zifeng Ding, Xun Xiao, Volker Tresp, Philip Torr, **dong Gu

    Abstract: Recently, Multimodal LLMs (MLLMs) have shown a great ability to understand images. However, like traditional vision models, they are still vulnerable to adversarial images. Meanwhile, Chain-of-Thought (CoT) reasoning has been widely explored on MLLMs, which not only improves model's performance, but also enhances model's explainability by giving intermediate reasoning steps. Nevertheless, there is… ▽ More

    Submitted 18 March, 2024; v1 submitted 22 February, 2024; originally announced February 2024.