Skip to main content

Showing 101–150 of 2,982 results for author: Huang, H

.
  1. arXiv:2405.07472  [pdf, other

    cs.CV

    GaussianVTON: 3D Human Virtual Try-ON via Multi-Stage Gaussian Splatting Editing with Image Prompting

    Authors: Haodong Chen, Yongle Huang, Haojian Huang, Xiangsheng Ge, Dian Shao

    Abstract: The increasing prominence of e-commerce has underscored the importance of Virtual Try-On (VTON). However, previous studies predominantly focus on the 2D realm and rely heavily on extensive data for training. Research on 3D VTON primarily centers on garment-body shape compatibility, a topic extensively covered in 2D VTON. Thanks to advances in 3D scene editing, a 2D diffusion model has now been ada… ▽ More

    Submitted 23 May, 2024; v1 submitted 13 May, 2024; originally announced May 2024.

    Comments: On-going work

  2. arXiv:2405.07303  [pdf, other

    hep-ex hep-ph physics.ins-det

    Search for solar axions by Primakoff effect with the full dataset of the CDEX-1B Experiment

    Authors: L. T. Yang, S. K. Liu, Q. Yue, K. J. Kang, Y. J. Li, H. P. An, Greeshma C., J. P. Chang, Y. H. Chen, J. P. Cheng, W. H. Dai, Z. Deng, C. H. Fang, X. P. Geng, H. Gong, Q. J. Guo, T. Guo, X. Y. Guo, L. He, J. R. He, J. W. Hu, H. X. Huang, T. C. Huang, L. Jiang, S. Karmakar , et al. (61 additional authors not shown)

    Abstract: We present the first limit on $g_{Aγ}$ coupling constant using the Bragg-Primakoff conversion based on an exposure of 1107.5 kg days of data from the CDEX-1B experiment at the China **** Underground Laboratory. The data are consistent with the null signal hypothesis, and no excess signals are observed. Limits of the coupling $g_{Aγ}<2.08\times10^{-9}$ GeV$^{-1}$ (95\% C.L.) are derived for axio… ▽ More

    Submitted 12 May, 2024; originally announced May 2024.

    Comments: 7 pages, 5 figures

  3. Bremsstrahlung of 5-25 keV electrons incident on MoSi$_2$, TiB$_2$ and ZrB$_2$ thick solid conductive compounds

    Authors: Heng Zhang, Zhu An, **gjun Zhu, Hong Huang

    Abstract: Absolute measurements were conducted to study the bremsstrahlung emission from ~5-25 keV electrons incident on three thick solid conductive compounds of MoSi$_2$, TiB$_2$ and ZrB$_2$. The additivity approximation was applied in the Monte Carlo PENELOPE simulations for compounds and mixtures. The results showed that in general the experimental bremsstrahlung spectra were in good agreement with the… ▽ More

    Submitted 12 May, 2024; originally announced May 2024.

  4. arXiv:2405.04883  [pdf, other

    cs.CV cs.AI cs.LG

    FreeBind: Free Lunch in Unified Multimodal Space via Knowledge Fusion

    Authors: Zehan Wang, Ziang Zhang, Xize Cheng, Rongjie Huang, Lu** Liu, Zhenhui Ye, Haifeng Huang, Yang Zhao, Tao **, Peng Gao, Zhou Zhao

    Abstract: Unified multi-model representation spaces are the foundation of multimodal understanding and generation. However, the billions of model parameters and catastrophic forgetting problems make it challenging to further enhance pre-trained unified spaces. In this work, we propose FreeBind, an idea that treats multimodal representation spaces as basic units, and freely augments pre-trained unified space… ▽ More

    Submitted 10 May, 2024; v1 submitted 8 May, 2024; originally announced May 2024.

    Comments: Accepted by ICML 2024. The code and checkpoints will be released at https://github.com/zehanwang01/FreeBind

  5. arXiv:2405.04686  [pdf

    physics.app-ph physics.optics

    Ultrafast dynamics of wavelength-sensitive magnons in unconventional compensated semiconducting antiferromagnet

    Authors: Hanshen Huang, Tao Qu, Yang Cheng, Lixuan Tai, Christopher Eckberg, Quanjun Pan, Abdullah Alrasheed, Su Kong Chong, Bingqian Dai, Yaochen Li, Qingyuan Shu, Chao-Yao Yang, Jie-Xiang Yu, Gen Yin, Kang L. Wang

    Abstract: Antiferromagnet is a promising candidate for the next generation spintronic devices, benefiting from its ultrafast dynamics and spontaneous zero stray field. However, the understanding of their ultrafast spin behaviors is lacking due to the challenges of controlling/detecting the quenched net magnetization. Unconventional compensated semiconducting antiferromagnets present strong time-reversal sym… ▽ More

    Submitted 7 May, 2024; originally announced May 2024.

  6. arXiv:2405.04093  [pdf, other

    cs.CV cs.AI

    DCNN: Dual Cross-current Neural Networks Realized Using An Interactive Deep Learning Discriminator for Fine-grained Objects

    Authors: Da Fu, Mingfei Rong, Eun-Hu Kim, Hao Huang, Witold Pedrycz

    Abstract: Accurate classification of fine-grained images remains a challenge in backbones based on convolutional operations or self-attention mechanisms. This study proposes novel dual-current neural networks (DCNN), which combine the advantages of convolutional operations and self-attention mechanisms to improve the accuracy of fine-grained image classification. The main novel design features for construct… ▽ More

    Submitted 7 May, 2024; originally announced May 2024.

  7. arXiv:2405.04065  [pdf, other

    cs.CL

    FlashBack:Efficient Retrieval-Augmented Language Modeling for Long Context Inference

    Authors: Runheng Liu, Xingchen Xiao, Heyan Huang, Zewen Chi, Zhi**g Wu

    Abstract: Retrieval-Augmented Language Modeling (RALM) by integrating large language models (LLM) with relevant documents from an external corpus is a proven method for enabling the LLM to generate information beyond the scope of its pre-training corpus. Previous work utilizing retrieved content by simply prepending it to the input poses a high runtime issue, which degrades the inference efficiency of the L… ▽ More

    Submitted 16 May, 2024; v1 submitted 7 May, 2024; originally announced May 2024.

    Comments: 14 pages

  8. arXiv:2405.03969  [pdf, other

    cs.RO

    Speak the Same Language: Global LiDAR Registration on BIM Using Pose Hough Transform

    Authors: Zhijian Qiao, Haoming Huang, Chuhao Liu, Shaojie Shen, Fumin Zhang, Huan Yin

    Abstract: The construction and robotic sensing data originate from disparate sources and are associated with distinct frames of reference. The primary objective of this study is to align LiDAR point clouds with building information modeling (BIM) using a global point cloud registration approach, aimed at establishing a shared understanding between the two modalities, i.e., ``speak the same language''. To ac… ▽ More

    Submitted 6 May, 2024; originally announced May 2024.

    Comments: 12 pages, 10 figures

  9. LGTM: Local-to-Global Text-Driven Human Motion Diffusion Model

    Authors: Haowen Sun, Ruikun Zheng, Haibin Huang, Chongyang Ma, Hui Huang, Ruizhen Hu

    Abstract: In this paper, we introduce LGTM, a novel Local-to-Global pipeline for Text-to-Motion generation. LGTM utilizes a diffusion-based architecture and aims to address the challenge of accurately translating textual descriptions into semantically coherent human motion in computer animation. Specifically, traditional methods often struggle with semantic discrepancies, particularly in aligning specific m… ▽ More

    Submitted 6 May, 2024; originally announced May 2024.

    Comments: 9 pages,7 figures, SIGGRAPH 2024

  10. arXiv:2405.03221  [pdf, other

    cs.CV cs.GR cs.LG

    Spatial and Surface Correspondence Field for Interaction Transfer

    Authors: Zeyu Huang, Honghao Xu, Haibin Huang, Chongyang Ma, Hui Huang, Ruizhen Hu

    Abstract: In this paper, we introduce a new method for the task of interaction transfer. Given an example interaction between a source object and an agent, our method can automatically infer both surface and spatial relationships for the agent and target objects within the same category, yielding more accurate and valid transfers. Specifically, our method characterizes the example interaction using a combin… ▽ More

    Submitted 6 May, 2024; originally announced May 2024.

    Comments: Accepted to SIGGRAPH 2024, project page at https://vcc.tech/research/2024/InterTransfer

  11. arXiv:2405.02982  [pdf, other

    cs.CV

    Paintings and Drawings Aesthetics Assessment with Rich Attributes for Various Artistic Categories

    Authors: Xin **, Qianqian Qiao, Yi Lu, Shan Gao, Heng Huang, Guangdong Li

    Abstract: Image aesthetic evaluation is a highly prominent research domain in the field of computer vision. In recent years, there has been a proliferation of datasets and corresponding evaluation methodologies for assessing the aesthetic quality of photographic works, leading to the establishment of a relatively mature research environment. However, in contrast to the extensive research in photographic aes… ▽ More

    Submitted 5 May, 2024; originally announced May 2024.

  12. arXiv:2405.01102  [pdf, other

    cs.LG cs.AI

    Less is More: on the Over-Globalizing Problem in Graph Transformers

    Authors: Yujie Xing, Xiao Wang, Yibo Li, Hai Huang, Chuan Shi

    Abstract: Graph Transformer, due to its global attention mechanism, has emerged as a new tool in dealing with graph-structured data. It is well recognized that the global attention mechanism considers a wider receptive field in a fully connected graph, leading many to believe that useful information can be extracted from all the nodes. In this paper, we challenge this belief: does the globalizing property a… ▽ More

    Submitted 24 May, 2024; v1 submitted 2 May, 2024; originally announced May 2024.

    Comments: Accepted by ICML 2024 (Camera-Ready)

  13. arXiv:2405.00749  [pdf, other

    cs.CV cs.LG

    More is Better: Deep Domain Adaptation with Multiple Sources

    Authors: Sicheng Zhao, Hui Chen, Hu Huang, Pengfei Xu, Guiguang Ding

    Abstract: In many practical applications, it is often difficult and expensive to obtain large-scale labeled data to train state-of-the-art deep neural networks. Therefore, transferring the learned knowledge from a separate, labeled source domain to an unlabeled or sparsely labeled target domain becomes an appealing alternative. However, direct transfer often results in significant performance decay due to d… ▽ More

    Submitted 30 April, 2024; originally announced May 2024.

    Comments: Accepted by IJCAI 2024. arXiv admin note: text overlap with arXiv:2002.12169

  14. arXiv:2405.00393  [pdf, other

    cs.CR

    Inferring State Machine from the Protocol Implementation via Large Language Model

    Authors: Haiyang Wei, Zhengjie Du, Haohui Huang, Yue Liu, Guang Cheng, Linzhang Wang, Bing Mao

    Abstract: State machines play a pivotal role in augmenting the efficacy of protocol analyzing to unveil more vulnerabilities. However, the task of inferring state machines from network protocol implementations presents significant challenges. Traditional methods based on dynamic analysis often overlook crucial state transitions due to limited coverage, while static analysis faces difficulties with complex c… ▽ More

    Submitted 14 June, 2024; v1 submitted 1 May, 2024; originally announced May 2024.

  15. arXiv:2405.00125  [pdf, other

    astro-ph.CO

    Neural network based emulation of galaxy power spectrum covariances -- A reanalysis of BOSS DR12 data

    Authors: Joseph Adamo, Hung-** Huang, Tim Eifler

    Abstract: We train neural networks to quickly generate redshift-space galaxy power spectrum covariances from a given parameter set (cosmology and galaxy bias). This covariance emulator utilizes a combination of traditional fully-connected network layers and transformer architecture to accurately predict covariance matrices for the high redshift, north galactic cap sample of the BOSS DR12 galaxy catalog. We… ▽ More

    Submitted 30 April, 2024; originally announced May 2024.

    Comments: 11 pages, 8 figures, to be submitted to Physical Review D

  16. arXiv:2404.19368  [pdf, other

    cs.SE

    Exploring Multi-Lingual Bias of Large Code Models in Code Generation

    Authors: Chaozheng Wang, Zongjie Li, Cuiyun Gao, Wenxuan Wang, Ting Peng, Hailiang Huang, Yuetang Deng, Shuai Wang, Michael R. Lyu

    Abstract: Code generation aims to synthesize code and fulfill functional requirements based on natural language (NL) specifications, which can greatly improve development efficiency. In the era of large language models (LLMs), large code models (LCMs) have been recently proposed to generate source code. LCMs can generate highly feasible solutions for programming problems described in natural language. Despi… ▽ More

    Submitted 30 April, 2024; originally announced April 2024.

    Comments: 12 pages

  17. arXiv:2404.18753  [pdf, ps, other

    math.GR math.CO

    Fixers and derangements of finite permutation groups

    Authors: Hong Yi Huang, Cai Heng Li, Yi Lin Xie

    Abstract: Let $G\leqslant\mathrm{Sym}(Ω)$ be a finite transitive permutation group with point stabiliser $H$. We say that a subgroup $K$ of $G$ is a fixer if every element of $K$ has fixed points, and we say that $K$ is large if $|K| \geqslant |H|$. There is a special interest in studying large fixers due to connections with Erdős-Ko-Rado type problems. In this paper, we classify up to conjugacy the large f… ▽ More

    Submitted 29 April, 2024; originally announced April 2024.

    Comments: 40 pages

  18. arXiv:2404.18644  [pdf, other

    quant-ph

    Low-Overhead Defect-Adaptive Surface Code with Bandage-Like Super-Stabilizers

    Authors: Zuolin Wei, Tan He, Yangsen Ye, Dachao Wu, Yiming Zhang, Youwei Zhao, Wei** Lin, He-Liang Huang, Xiaobo Zhu, Jian-Wei Pan

    Abstract: To make practical quantum algorithms work, large-scale quantum processors protected by error-correcting codes are required to resist noise and ensure reliable computational outcomes. However, a major challenge arises from defects in processor fabrication, as well as occasional losses or cosmic rays during the computing process, all of which can lead to qubit malfunctions and disrupt error-correcti… ▽ More

    Submitted 29 April, 2024; originally announced April 2024.

  19. arXiv:2404.18202  [pdf, other

    cs.AI cs.MM

    WorldGPT: Empowering LLM as Multimodal World Model

    Authors: Zhiqi Ge, Hongzhe Huang, Mingze Zhou, Juncheng Li, Guoming Wang, Siliang Tang, Yueting Zhuang

    Abstract: World models are progressively being employed across diverse fields, extending from basic environment simulation to complex scenario construction. However, existing models are mainly trained on domain-specific states and actions, and confined to single-modality state representations. In this paper, We introduce WorldGPT, a generalist world model built upon Multimodal Large Language Model (MLLM). W… ▽ More

    Submitted 28 April, 2024; originally announced April 2024.

  20. arXiv:2404.17238  [pdf, other

    cs.IR

    TruthSR: Trustworthy Sequential Recommender Systems via User-generated Multimodal Content

    Authors: Meng Yan, Haibin Huang, Ying Liu, Juan Zhao, Xiyue Gao, Cai Xu, Ziyu Guan, Wei Zhao

    Abstract: Sequential recommender systems explore users' preferences and behavioral patterns from their historically generated data. Recently, researchers aim to improve sequential recommendation by utilizing massive user-generated multi-modal content, such as reviews, images, etc. This content often contains inevitable noise. Some studies attempt to reduce noise interference by suppressing cross-modal incon… ▽ More

    Submitted 26 April, 2024; originally announced April 2024.

  21. arXiv:2404.17169  [pdf, other

    cs.LG cs.CY

    FairGT: A Fairness-aware Graph Transformer

    Authors: Renqiang Luo, Huafei Huang, Shuo Yu, Xiuzhen Zhang, Feng Xia

    Abstract: The design of Graph Transformers (GTs) generally neglects considerations for fairness, resulting in biased outcomes against certain sensitive subgroups. Since GTs encode graph information without relying on message-passing mechanisms, conventional fairness-aware graph learning methods cannot be directly applicable to address these issues. To tackle this challenge, we propose FairGT, a Fairness-awa… ▽ More

    Submitted 26 April, 2024; originally announced April 2024.

    Journal ref: IJCAI2024

  22. arXiv:2404.16027  [pdf, other

    cs.RO

    ORBIT-Surgical: An Open-Simulation Framework for Learning Surgical Augmented Dexterity

    Authors: Qinxi Yu, Masoud Moghani, Karthik Dharmarajan, Vincent Schorp, William Chung-Ho Panitch, **gzhou Liu, Kush Hari, Huang Huang, Mayank Mittal, Ken Goldberg, Animesh Garg

    Abstract: Physics-based simulations have accelerated progress in robot learning for driving, manipulation, and locomotion. Yet, a fast, accurate, and robust surgical simulation environment remains a challenge. In this paper, we present ORBIT-Surgical, a physics-based surgical robot simulation framework with photorealistic rendering in NVIDIA Omniverse. We provide 14 benchmark surgical tasks for the da Vinci… ▽ More

    Submitted 24 April, 2024; originally announced April 2024.

  23. arXiv:2404.15789  [pdf, other

    cs.CV

    MotionMaster: Training-free Camera Motion Transfer For Video Generation

    Authors: Teng Hu, Jiangning Zhang, Ran Yi, Yating Wang, Hongrui Huang, Jieyu Weng, Yabiao Wang, Lizhuang Ma

    Abstract: The emergence of diffusion models has greatly propelled the progress in image and video generation. Recently, some efforts have been made in controllable video generation, including text-to-video generation and video motion control, among which camera motion control is an important topic. However, existing camera motion control methods rely on training a temporal camera module, and necessitate sub… ▽ More

    Submitted 30 April, 2024; v1 submitted 24 April, 2024; originally announced April 2024.

  24. arXiv:2404.14771  [pdf, other

    cs.SD cs.AI

    Music Style Transfer With Diffusion Model

    Authors: Hong Huang, Yuyi Wang, Luyao Li, Jun Lin

    Abstract: Previous studies on music style transfer have mainly focused on one-to-one style conversion, which is relatively limited. When considering the conversion between multiple styles, previous methods required designing multiple modes to disentangle the complex style of the music, resulting in large computational costs and slow audio generation. The existing music style transfer methods generate spectr… ▽ More

    Submitted 23 April, 2024; originally announced April 2024.

    Comments: 8 pages, 6 figures, ICMC 2023

    Journal ref: International Computer Music Conference (ICMC 2023) pp. 40-47, October 2023

  25. arXiv:2404.14753  [pdf, ps, other

    hep-ph nucl-th

    Investigating $Ξ$ resonances from pentaquark perspective

    Authors: Ye Yan, Qi Huang, Xinmei Zhu, Hongxia Huang, Jialun **

    Abstract: We have investigated the $qss\bar{q}q$ ($q = u$ or $d$) system to find possible pentaquark explanations for the $Ξ$ resonances. The bound state calculation is carried out within the framework of the quark delocalization color screening model. The scattering processes are also studied to examine the possible resonance states. The current results indicate that the $Ξ(1950)$ can be interpreted as… ▽ More

    Submitted 23 April, 2024; originally announced April 2024.

    Comments: 15 pages, 7 figures. arXiv admin note: text overlap with arXiv:2312.04977, arXiv:2309.15380

  26. arXiv:2404.14569  [pdf, other

    gr-qc astro-ph.IM physics.ins-det quant-ph

    LIGO operates with quantum noise below the Standard Quantum Limit

    Authors: Wenxuan Jia, Victoria Xu, Kevin Kuns, Masayuki Nakano, Lisa Barsotti, Matthew Evans, Nergis Mavalvala, Rich Abbott, Ibrahim Abouelfettouh, Rana Adhikari, Alena Ananyeva, Stephen Appert, Koji Arai, Naoki Aritomi, Stuart Aston, Matthew Ball, Stefan Ballmer, David Barker, Beverly Berger, Joseph Betzwieser, Dripta Bhattacharjee, Garilynn Billingsley, Nina Bode, Edgard Bonilla, Vladimir Bossilkov , et al. (146 additional authors not shown)

    Abstract: Precision measurements of space and time, like those made by the detectors of the Laser Interferometer Gravitational-wave Observatory (LIGO), are often confronted with fundamental limitations imposed by quantum mechanics. The Heisenberg uncertainty principle dictates that the position and momentum of an object cannot both be precisely measured, giving rise to an apparent limitation called the Stan… ▽ More

    Submitted 22 April, 2024; originally announced April 2024.

    Report number: LIGO-P2400059

  27. arXiv:2404.14566  [pdf, other

    cond-mat.supr-con cond-mat.mes-hall

    Superconducting Diode Effect in Two-dimensional Topological Insulator Edges and Josephson Junctions

    Authors: Haixuan Huang, Tatiana de Picoli, Jukka I. Väyrynen

    Abstract: The superconducting diode effect -- the dependence of critical current on its direction -- can arise from the simultaneous breaking of inversion and time-reversal symmetry in a superconductor and has gained interest for its potential applications in superconducting electronics. In this letter, we study the effect in a two-dimensional topological insulator (2D TI) in both a uniform geometry as well… ▽ More

    Submitted 22 April, 2024; originally announced April 2024.

    Comments: Submitted to Applied Physics Letters on April 8th, 2024

  28. arXiv:2404.13953  [pdf, other

    cs.CV

    360VOTS: Visual Object Tracking and Segmentation in Omnidirectional Videos

    Authors: Yinzhe Xu, Huajian Huang, Yingshu Chen, Sai-Kit Yeung

    Abstract: Visual object tracking and segmentation in omnidirectional videos are challenging due to the wide field-of-view and large spherical distortion brought by 360° images. To alleviate these problems, we introduce a novel representation, extended bounding field-of-view (eBFoV), for target localization and use it as the foundation of a general 360 tracking framework which is applicable for both omnidire… ▽ More

    Submitted 22 April, 2024; originally announced April 2024.

  29. arXiv:2404.13707  [pdf, other

    stat.ME stat.AP

    Robust inference for the unification of confidence intervals in meta-analysis

    Authors: Wei Liang, Haicheng Huang, Hongsheng Dai, Yinghui Wei

    Abstract: Traditional meta-analysis assumes that the effect sizes estimated in individual studies follow a Gaussian distribution. However, this distributional assumption is not always satisfied in practice, leading to potentially biased results. In the situation when the number of studies, denoted as K, is large, the cumulative Gaussian approximation errors from each study could make the final estimation un… ▽ More

    Submitted 21 April, 2024; originally announced April 2024.

  30. arXiv:2404.13644  [pdf, ps, other

    math.AP math-ph math.DS math.PR

    Error Estimation in the Mean-Field Limit of Kinetic Flocking Models with Local Alignments

    Authors: **huan Wang, Keyu Li, Hui Huang

    Abstract: In this paper, we present an innovative particle system characterized by moderate interactions, designed to accurately approximate kinetic flocking models that incorporate singular interaction forces and local alignment mechanisms. We establish the existence of weak solutions to the corresponding flocking equations and provide an error estimate for the mean-field limit. This is achieved through th… ▽ More

    Submitted 21 April, 2024; originally announced April 2024.

  31. arXiv:2404.13631  [pdf, other

    cs.LG cond-mat.dis-nn cond-mat.stat-mech cs.NE q-bio.NC

    Fermi-Bose Machine

    Authors: Mingshan Xie, Yuchen Wang, Hai** Huang

    Abstract: Distinct from human cognitive processing, deep neural networks trained by backpropagation can be easily fooled by adversarial examples. To design a semantically meaningful representation learning, we discard backpropagation, and instead, propose a local contrastive learning, where the representation for the inputs bearing the same label shrink (akin to boson) in hidden layers, while those of diffe… ▽ More

    Submitted 21 April, 2024; originally announced April 2024.

    Comments: 17 pages, 6 figures, a physics inspired machine without backpropagation and enhanced adversarial robustness

  32. arXiv:2404.13033  [pdf, other

    cs.CL

    Sample Design Engineering: An Empirical Study of What Makes Good Downstream Fine-Tuning Samples for LLMs

    Authors: Biyang Guo, He Wang, Wenyilin Xiao, Hong Chen, Zhuxin Lee, Songqiao Han, Hailiang Huang

    Abstract: In the burgeoning field of Large Language Models (LLMs) like ChatGPT and LLaMA, Prompt Engineering (PE) is renowned for boosting zero-shot or in-context learning (ICL) through prompt modifications. Yet, the realm of the sample design for downstream fine-tuning, crucial for task-specific LLM adaptation, is largely unexplored. This paper introduces Sample Design Engineering (SDE), a methodical appro… ▽ More

    Submitted 19 April, 2024; originally announced April 2024.

    Comments: 23 pages, 12 figures, 14 tables

  33. arXiv:2404.12768  [pdf, other

    cs.CV cs.AI cs.GR

    MixLight: Borrowing the Best of both Spherical Harmonics and Gaussian Models

    Authors: Xinlong Ji, Fangneng Zhan, Shijian Lu, Shi-Sheng Huang, Hua Huang

    Abstract: Accurately estimating scene lighting is critical for applications such as mixed reality. Existing works estimate illumination by generating illumination maps or regressing illumination parameters. However, the method of generating illumination maps has poor generalization performance and parametric models such as Spherical Harmonic (SH) and Spherical Gaussian (SG) fall short in capturing high-freq… ▽ More

    Submitted 19 April, 2024; originally announced April 2024.

  34. arXiv:2404.12242  [pdf, other

    cs.CL

    CMNEE: A Large-Scale Document-Level Event Extraction Dataset based on Open-Source Chinese Military News

    Authors: Mengna Zhu, Zijie Xu, Kaisheng Zeng, Kaiming Xiao, Mao Wang, Wenjun Ke, Hongbin Huang

    Abstract: Extracting structured event knowledge, including event triggers and corresponding arguments, from military texts is fundamental to many applications, such as intelligence analysis and decision assistance. However, event extraction in the military field faces the data scarcity problem, which impedes the research of event extraction models in this domain. To alleviate this problem, we propose CMNEE,… ▽ More

    Submitted 18 April, 2024; originally announced April 2024.

    Comments: 13 pages, 7 figures, accepted to LREC-COLING 2024

  35. arXiv:2404.11422  [pdf

    cs.LG cs.AI physics.ao-ph

    Short-term wind speed forecasting model based on an attention-gated recurrent neural network and error correction strategy

    Authors: Haojian Huang

    Abstract: The accurate wind speed series forecast is very pivotal to security of grid dispatching and the application of wind power. Nevertheless, on account of their nonlinear and non-stationary nature, their short-term forecast is extremely challenging. Therefore, this dissertation raises one short-term wind speed forecast pattern on the foundation of attention with an improved gated recurrent neural netw… ▽ More

    Submitted 22 April, 2024; v1 submitted 17 April, 2024; originally announced April 2024.

    Comments: 23 pages, 11 figures, 6 tables, Technical Report

  36. arXiv:2404.11199  [pdf, other

    q-bio.BM

    RiboDiffusion: Tertiary Structure-based RNA Inverse Folding with Generative Diffusion Models

    Authors: Han Huang, Ziqian Lin, Dongchen He, Liang Hong, Yu Li

    Abstract: RNA design shows growing applications in synthetic biology and therapeutics, driven by the crucial role of RNA in various biological processes. A fundamental challenge is to find functional RNA sequences that satisfy given structural constraints, known as the inverse folding problem. Computational approaches have emerged to address this problem based on secondary structures. However, designing RNA… ▽ More

    Submitted 17 April, 2024; originally announced April 2024.

    Comments: 15 pages

  37. arXiv:2404.10681  [pdf, other

    cs.CV

    StyleCity: Large-Scale 3D Urban Scenes Stylization with Vision-and-Text Reference via Progressive Optimization

    Authors: Yingshu Chen, Huajian Huang, Tuan-Anh Vu, Ka Chun Shum, Sai-Kit Yeung

    Abstract: Creating large-scale virtual urban scenes with variant styles is inherently challenging. To facilitate prototypes of virtual production and bypass the need for complex materials and lighting setups, we introduce the first vision-and-text-driven texture stylization system for large-scale urban scenes, StyleCity. Taking an image and text as references, StyleCity stylizes a 3D textured mesh of a larg… ▽ More

    Submitted 16 April, 2024; originally announced April 2024.

    Comments: project page: https://chenyingshu.github.io/stylecity3d/

  38. arXiv:2404.10400  [pdf, other

    gr-qc

    Phase space analysis of the evolution of the early universe in Einstein-Cartan theory

    Authors: Qihong Huang, He Huang, Bing Xu, Kaituo Zhang, Hao Chen

    Abstract: In this paper, we perform the phase space analysis to investigate the evolution of the early universe in Einstein-Cartan theory. By studying the stability of critical points in dynamical system, it is found that there exist two stable critical points which represent an expanding solution and an Einstein static solution respectively. After analyzing the phase diagram of the dynamical system, we fin… ▽ More

    Submitted 16 April, 2024; originally announced April 2024.

  39. arXiv:2404.10253  [pdf, other

    cs.DC

    Kilometer-Level Coupled Modeling Using 40 Million Cores: An Eight-Year Journey of Model Development

    Authors: Xiaohui Duan, Yuxuan Li, Zhao Liu, Bin Yang, Juepeng Zheng, Haohuan Fu, Shaoqing Zhang, Shiming Xu, Yang Gao, Wei Xue, Di Wei, Xiao**g Lv, Lifeng Yan, Haopeng Huang, Haitian Lu, Lingfeng Wan, Haoran Lin, Qixin Chang, Chenlin Li, Quanjie He, Zeyu Song, Xuantong Wang, Yangyang Yu, Xilong Fan, Zhaopeng Qu , et al. (16 additional authors not shown)

    Abstract: With current and future leading systems adopting heterogeneous architectures, adapting existing models for heterogeneous supercomputers is of urgent need for improving model resolution and reducing modeling uncertainty. This paper presents our three-week effort on porting a complex earth system model, CESM 2.2, to a 40-million-core Sunway supercomputer. Taking a non-intrusive approach that tries t… ▽ More

    Submitted 15 April, 2024; originally announced April 2024.

    Comments: 18 pages, 13 figures

  40. arXiv:2404.09793  [pdf, other

    hep-ex hep-ph physics.ins-det

    First Search for Light Fermionic Dark Matter Absorption on Electrons Using Germanium Detector in CDEX-10 Experiment

    Authors: J. X. Liu, L. T. Yang, Q. Yue, K. J. Kang, Y. J. Li, H. P. An, Greeshma C., J. P. Chang, Y. H. Chen, J. P. Cheng, W. H. Dai, Z. Deng, C. H. Fang, X. P. Geng, H. Gong, Q. J. Guo, T. Guo, X. Y. Guo, L. He, J. R. He, J. W. Hu, H. X. Huang, T. C. Huang, L. Jiang, S. Karmakar , et al. (61 additional authors not shown)

    Abstract: We present the first results of the search for sub-MeV fermionic dark matter absorbed by electron targets of Germanium using the 205.4~kg$\cdot$day data collected by the CDEX-10 experiment, with the analysis threshold of 160~eVee. No significant dark matter (DM) signals over the background are observed. Results are presented as limits on the cross section of DM--electron interaction. We present ne… ▽ More

    Submitted 15 April, 2024; originally announced April 2024.

    Comments: 6 pages, 4 figures

  41. arXiv:2404.09640  [pdf, other

    cs.CV

    CREST: Cross-modal Resonance through Evidential Deep Learning for Enhanced Zero-Shot Learning

    Authors: Haojian Huang, Xiaozhen Qiao, Zhuo Chen, Haodong Chen, Bingyu Li, Zhe Sun, Mulin Chen, Xuelong Li

    Abstract: Zero-shot learning (ZSL) enables the recognition of novel classes by leveraging semantic knowledge transfer from known to unknown categories. This knowledge, typically encapsulated in attribute descriptions, aids in identifying class-specific visual features, thus facilitating visual-semantic alignment and improving ZSL performance. However, real-world challenges such as distribution imbalances an… ▽ More

    Submitted 20 April, 2024; v1 submitted 15 April, 2024; originally announced April 2024.

    Comments: Ongoing work; 10 pages, 2 Tables, 9 Figures; Repo is available at: https://github.com/JethroJames/CREST

  42. arXiv:2404.09540  [pdf, other

    cs.CV

    Text-Driven Diverse Facial Texture Generation via Progressive Latent-Space Refinement

    Authors: Chi Wang, Junming Huang, Rong Zhang, Qi Wang, Haotian Yang, Haibin Huang, Chongyang Ma, Weiwei Xu

    Abstract: Automatic 3D facial texture generation has gained significant interest recently. Existing approaches may not support the traditional physically based rendering pipeline or rely on 3D data captured by Light Stage. Our key contribution is a progressive latent space refinement approach that can bootstrap from 3D Morphable Models (3DMMs)-based texture maps generated from facial images to generate high… ▽ More

    Submitted 15 April, 2024; originally announced April 2024.

  43. arXiv:2404.09192  [pdf, other

    cs.SD cs.AI eess.AS

    Prior-agnostic Multi-scale Contrastive Text-Audio Pre-training for Parallelized TTS Frontend Modeling

    Authors: Quanxiu Wang, Hui Huang, Mingjie Wang, Yong Dai, **zuomu Zhong, Benlai Tang

    Abstract: Over the past decade, a series of unflagging efforts have been dedicated to develo** highly expressive and controllable text-to-speech (TTS) systems. In general, the holistic TTS comprises two interconnected components: the frontend module and the backend module. The frontend excels in capturing linguistic representations from the raw text input, while the backend module converts linguistic cues… ▽ More

    Submitted 14 April, 2024; originally announced April 2024.

  44. arXiv:2404.09115  [pdf, other

    cs.CV

    GCC: Generative Calibration Clustering

    Authors: Haifeng Xia, Hai Huang, Zhengming Ding

    Abstract: Deep clustering as an important branch of unsupervised representation learning focuses on embedding semantically similar samples into the identical feature space. This core demand inspires the exploration of contrastive learning and subspace clustering. However, these solutions always rely on the basic assumption that there are sufficient and category-balanced samples for generating valid high-lev… ▽ More

    Submitted 13 April, 2024; originally announced April 2024.

  45. arXiv:2404.08145  [pdf

    physics.app-ph cond-mat.mtrl-sci

    Polar vortex hidden in twisted bilayers of paraelectric SrTiO3

    Authors: Haozhi Sha, Yixuan Zhang, Yunpeng Ma, Wei Li, Wenfeng Yang, Jizhe Cui, Qian Li, Houbing Huang, Rong Yu

    Abstract: Polar topologies, such as vortex and skyrmion, have attracted significant interest due to their unique physical properties and promising applications in high-density memory devices. Currently, most polar vortices are observed in heterostructures containing ferroelectric materials and constrained by substrates. In this study, we unravel arrays of polar vortices formed in twisted freestanding bilaye… ▽ More

    Submitted 11 April, 2024; originally announced April 2024.

  46. arXiv:2404.07477  [pdf, ps, other

    eess.SP

    Integrated Sensing and Communication Under DISCO Physical-Layer Jamming Attacks

    Authors: Huan Huang, Hongliang Zhang, Weidong Mei, Jun Li, Yi Cai, A. Lee Swindlehurst, Zhu Han

    Abstract: Integrated sensing and communication (ISAC) systems traditionally presuppose that sensing and communication (S&C) channels remain approximately constant during their coherence time. However, a "DISCO" reconfigurable intelligent surface (DRIS), i.e., an illegitimate RIS with random, time-varying reflection properties that acts like a "disco ball," introduces a paradigm shift that enables active cha… ▽ More

    Submitted 11 April, 2024; originally announced April 2024.

    Comments: This paper has been submitted for possible publication. For the code of the DISCO RIS is available on Github (https://github.com/huanhuan1799/Disco-Intelligent-Reflecting-Surfaces-Active-Channel-Aging-for-Fully-Passive-Jamming-Attacks)

  47. arXiv:2404.07281  [pdf, other

    quant-ph cs.IT cs.LG

    Certifying almost all quantum states with few single-qubit measurements

    Authors: Hsin-Yuan Huang, John Preskill, Mehdi Soleimanifar

    Abstract: Certifying that an n-qubit state synthesized in the lab is close to the target state is a fundamental task in quantum information science. However, existing rigorous protocols either require deep quantum circuits or exponentially many single-qubit measurements. In this work, we prove that almost all n-qubit target states, including those with exponential circuit complexity, can be certified from o… ▽ More

    Submitted 10 April, 2024; originally announced April 2024.

    Comments: 63 pages, 5 figures

  48. arXiv:2404.07092  [pdf, other

    eess.SP physics.optics

    Net 835-Gb/s/λ Carrier- and LO-Free 100-km Transmission Using Channel-Aware Phase Retrieval Reception

    Authors: Hanzi Huang, Haoshuo Chen, Qian Hu, Di Che, Yetian Huang, Brian Stern, Nicolas K. Fontaine, Mikael Mazur, Lauren Dallachiesa, Roland Ryf, Zhengxuan Li, Yingxiong Song

    Abstract: We experimentally demonstrate the first carrier- and LO-free 800G/λ receiver enabling direct compatibility with standard coherent transmitters via phase retrieval, achieving net 835-Gb/s transmission over 100-km SMF and record 8.27-b/s/Hz net optical spectral efficiency.

    Submitted 10 April, 2024; originally announced April 2024.

    Comments: 3 pages, 3 figures

  49. arXiv:2404.06681  [pdf, other

    cs.AI cs.LG stat.ME

    Causal Unit Selection using Tractable Arithmetic Circuits

    Authors: Haiying Huang, Adnan Darwiche

    Abstract: The unit selection problem aims to find objects, called units, that optimize a causal objective function which describes the objects' behavior in a causal context (e.g., selecting customers who are about to churn but would most likely change their mind if encouraged). While early studies focused mainly on bounding a specific class of counterfactual objective functions using data, more recent work… ▽ More

    Submitted 9 April, 2024; originally announced April 2024.

  50. arXiv:2404.06098  [pdf, other

    astro-ph.CO astro-ph.GA

    Weak lensing combined with the kinetic Sunyaev Zel'dovich effect: A study of baryonic feedback

    Authors: L. Bigwood, A. Amon, A. Schneider, J. Salcido, I. G. McCarthy, C. Preston, D. Sanchez, D. Sijacki, E. Schaan, S. Ferraro, N. Battaglia, A. Chen, S. Dodelson, A. Roodman, A. Pieres, A. Ferte, A. Alarcon, A. Drlica-Wagner, A. Choi, A. Navarro-Alsina, A. Campos, A. J. Ross, A. Carnero Rosell, B. Yin, B. Yanny , et al. (100 additional authors not shown)

    Abstract: Extracting precise cosmology from weak lensing surveys requires modelling the non-linear matter power spectrum, which is suppressed at small scales due to baryonic feedback processes. However, hydrodynamical galaxy formation simulations make widely varying predictions for the amplitude and extent of this effect. We use measurements of Dark Energy Survey Year 3 weak lensing (WL) and Atacama Cosmolo… ▽ More

    Submitted 9 April, 2024; originally announced April 2024.