
Showing 301–350 of 7,384 results for author: Liu, H

  1. arXiv:2404.19055  [pdf, other]

    cs.CL

    Plan of Thoughts: Heuristic-Guided Problem Solving with Large Language Models

    Authors: Houjun Liu

    Abstract: While language models (LMs) offer significant capability in zero-shot reasoning tasks across a wide range of domains, they do not perform satisfactorily in problems which require multi-step reasoning. Previous approaches to mitigate this involve breaking a larger, multi-step task into sub-tasks and asking the language model to generate proposals ("thoughts") for each sub-task and using exhaustiv…

    Submitted 29 April, 2024; originally announced April 2024.

    Comments: 7 pages, 2 figures

  2. arXiv:2404.18415  [pdf, other]

    astro-ph.EP

    Photo-dynamical Analysis of Circumbinary Multi-planet system TOI-1338: a Fully Coplanar Configuration with a Puffy Planet

    Authors: Mu-Tian Wang, Hui-Gen Liu

    Abstract: TOI-1338 is the first circumbinary planet system discovered by TESS. It has one transiting planet at P$\sim$95 day and an outer non-transiting planet at P$\sim$215 day complemented by RV observation. Here we present a global photo-dynamical modeling of the TOI-1338 system that self-consistently accounts for the mutual gravitational interactions between all known bodies in the system. As a result,…

    Submitted 29 April, 2024; originally announced April 2024.

    Comments: 13 Figures, 3 Tables; Accepted by AJ

  3. arXiv:2404.18412  [pdf]

    cond-mat.mtrl-sci cond-mat.str-el

    Uncovering an Interfacial Band Resulting from Orbital Hybridization in Nickelate Heterostructures

    Authors: Mingyao Chen, Huimin Liu, Xu He, Minjuan Li, Chi Sin Tang, Mengxia Sun, Krishna Prasad Koirala, Mark E. Bowden, Yangyang Li, Xiongfang Liu, Difan Zhou, Shuo Sun, Mark B. H. Breese, Chuanbing Cai, Yingge Du, Andrew T. S. Wee, Le Wang, Xinmao Yin

    Abstract: The interaction of atomic orbitals at the interface of perovskite oxide heterostructures has been investigated for its profound impact on the band structures and electronic properties, giving rise to unique electronic states and a variety of tunable functionalities. In this study, we conducted an extensive investigation of the optical and electronic properties of epitaxial NdNiO3 thin films grown…

    Submitted 29 April, 2024; originally announced April 2024.

    Comments: 26 pages, 4 figures

  4. arXiv:2404.18411  [pdf, other]

    cs.RO cs.CV

    Multi-modal Perception Dataset of In-water Objects for Autonomous Surface Vehicles

    Authors: Mingi Jeong, Arihant Chadda, Ziang Ren, Luyang Zhao, Haowen Liu, Monika Roznere, Aiwei Zhang, Yitao Jiang, Sabriel Achong, Samuel Lensgraf, Alberto Quattrini Li

    Abstract: This paper introduces the first publicly accessible multi-modal perception dataset for autonomous maritime navigation, focusing on in-water obstacles within the aquatic environment to enhance situational awareness for Autonomous Surface Vehicles (ASVs). This dataset, consisting of diverse objects encountered under varying environmental conditions, aims to bridge the research gap in marine robotics…

    Submitted 29 April, 2024; originally announced April 2024.

    Comments: Accepted to the IEEE ICRA Workshop on Field Robotics 2024

  5. arXiv:2404.18344  [pdf, ps, other]

    math.DG cs.IT

    Some Computational Results on Koszul-Vinberg Cochain Complexes

    Authors: Hanwen Liu, Jun Zhang

    Abstract: An affine connection is said to be flat if its curvature tensor vanishes identically. Koszul-Vinberg (KV for abbreviation) cohomology has been invoked to study the deformation theory of flat and torsion-free affine connections on tangent bundle. In this Note, we compute explicitly the differentials of various specific KV cochains, and study their relation to classical objects in information geomet…

    Submitted 28 April, 2024; originally announced April 2024.

    Comments: 9 pages, 0 figures

  6. arXiv:2404.18304  [pdf, other]

    cs.IR cs.AI

    Retrieval-Oriented Knowledge for Click-Through Rate Prediction

    Authors: Huanshuo Liu, Bo Chen, Menghui Zhu, Jianghao Lin, Jiarui Qin, Yang Yang, Hao Zhang, Ruiming Tang

    Abstract: Click-through rate (CTR) prediction plays an important role in personalized recommendations. Recently, sample-level retrieval-based models (e.g., RIM) have achieved remarkable performance by retrieving and aggregating relevant samples. However, their inefficiency at the inference stage makes them impractical for industrial applications. To overcome this issue, this paper proposes a universal plug-…

    Submitted 28 April, 2024; originally announced April 2024.

  7. arXiv:2404.18255  [pdf, other]

    cs.CL cs.AI

    PatentGPT: A Large Language Model for Intellectual Property

    Authors: Zilong Bai, Ruiji Zhang, Linqing Chen, Qijun Cai, Yuan Zhong, Cong Wang, Yan Fang, Jie Fang, Jing Sun, Weikuan Wang, Lizhi Zhou, Haoran Hua, Tian Qiu, Chaochao Wang, Cheng Sun, Jianping Lu, Yixin Wang, Yubin Xia, Meng Hu, Haowen Liu, Peng Xu, Licong Xu, Fu Bian, Xiaolong Gu, Lisha Zhang, et al. (2 additional authors not shown)

    Abstract: In recent years, large language models (LLMs) have attracted significant attention due to their exceptional performance across a multitude of natural language processing tasks, and have been widely applied in various fields. However, the application of large language models in the Intellectual Property (IP) domain is challenging due to the strong need for specialized knowledge, privacy protection, pro…

    Submitted 4 June, 2024; v1 submitted 28 April, 2024; originally announced April 2024.

    Comments: 19 pages, 9 figures

    ACM Class: I.2.7

  8. arXiv:2404.18225  [pdf, other]

    cs.RO

    Quadruped robot traversing 3D complex environments with limited perception

    Authors: Yi Cheng, Hang Liu, Guoping Pan, Linqi Ye, Houde Liu, Bin Liang

    Abstract: Traversing 3-D complex environments has always been a significant challenge for legged locomotion. Existing methods typically rely on external sensors such as vision and lidar to preemptively react to obstacles by acquiring environmental information. However, in scenarios like nighttime or dense forests, external sensors often fail to function properly, necessitating robots to rely on propriocepti…

    Submitted 29 April, 2024; v1 submitted 28 April, 2024; originally announced April 2024.

    Comments: 10 pages, 8 figures, submitted to IROS 2024

  9. arXiv:2404.18201  [pdf, other]

    cs.RO

    What Foundation Models can Bring for Robot Learning in Manipulation : A Survey

    Authors: Dingzhe Li, Yixiang Jin, Yong A, Hongze Yu, Jun Shi, Xiaoshuai Hao, Peng Hao, Huaping Liu, Fuchun Sun, Bin Fang

    Abstract: The realization of universal robots is an ultimate goal of researchers. However, a key hurdle in achieving this goal lies in the robots' ability to manipulate objects in their unstructured surrounding environments according to different tasks. The learning-based approach is considered an effective way to address generalization. The impressive performance of foundation models in the fields of compu…

    Submitted 28 April, 2024; originally announced April 2024.

  10. Static Application Security Testing (SAST) Tools for Smart Contracts: How Far Are We?

    Authors: Kaixuan Li, Yue Xue, Sen Chen, Han Liu, Kairan Sun, Ming Hu, Haijun Wang, Yang Liu, Yixiang Chen

    Abstract: In recent years, the importance of smart contract security has been heightened by the increasing number of attacks against them. To address this issue, a multitude of static application security testing (SAST) tools have been proposed for detecting vulnerabilities in smart contracts. However, objectively comparing these tools to determine their effectiveness remains challenging. Existing studies o…

    Submitted 29 June, 2024; v1 submitted 28 April, 2024; originally announced April 2024.

    Comments: to appear at FSE 2024

  11. arXiv:2404.18130  [pdf, other]

    cs.AI cs.CL

    Logic Agent: Enhancing Validity with Logic Rule Invocation

    Authors: Hanmeng Liu, Zhiyang Teng, Chaoli Zhang, Yue Zhang

    Abstract: Chain-of-Thought (CoT) prompting has emerged as a pivotal technique for augmenting the inferential capabilities of language models during reasoning tasks. Despite its advancements, CoT often grapples with challenges in validating reasoning validity and ensuring informativeness. Addressing these limitations, this paper introduces the Logic Agent (LA), an agent-based framework aimed at enhancing the…

    Submitted 28 April, 2024; originally announced April 2024.

  12. arXiv:2404.17806  [pdf, other]

    cs.SD cs.CL cs.LG eess.AS

    T-CLAP: Temporal-Enhanced Contrastive Language-Audio Pretraining

    Authors: Yi Yuan, Zhuo Chen, Xubo Liu, Haohe Liu, Xuenan Xu, Dongya Jia, Yuanzhe Chen, Mark D. Plumbley, Wenwu Wang

    Abstract: Contrastive language-audio pretraining (CLAP) has been developed to align the representations of audio and language, achieving remarkable performance in retrieval and classification tasks. However, current CLAP struggles to capture temporal information within audio and text features, presenting substantial limitations for tasks such as audio retrieval and generation. To address this gap, we introd…

    Submitted 27 April, 2024; originally announced April 2024.

    Comments: Preprint submitted to IEEE MLSP 2024

  13. arXiv:2404.17685  [pdf]

    cs.RO

    Localization Through Particle Filter Powered Neural Network Estimated Monocular Camera Poses

    Authors: Yi Shen, Hao Liu, Xinxin Liu, Wenjing Zhou, Chang Zhou, Yizhou Chen

    Abstract: The reduced cost and computational and calibration requirements of monocular cameras make them ideal positioning sensors for mobile robots, albeit at the expense of any meaningful depth measurement. Solutions proposed by some scholars to this localization problem involve fusing pose estimates from convolutional neural networks (CNNs) with pose estimates from geometric constraints on motion to gene…

    Submitted 26 April, 2024; originally announced April 2024.

  14. arXiv:2404.17379  [pdf]

    cs.RO

    Adaptive speed planning for Unmanned Vehicle Based on Deep Reinforcement Learning

    Authors: Hao Liu, Yi Shen, Wenjing Zhou, Yuelin Zou, Chang Zhou, Shuyao He

    Abstract: In order to solve the problem of frequent deceleration of unmanned vehicles when approaching obstacles, this article uses a Deep Q-Network (DQN) and its extension, the Double Deep Q-Network (DDQN), to develop a local navigation system that adapts to obstacles while maintaining optimal speed planning. By integrating improved reward functions and obstacle angle determination methods, the system demo…

    Submitted 26 April, 2024; originally announced April 2024.

  15. arXiv:2404.17286  [pdf, other]

    hep-th

    Black Hole Singularity from OPE

    Authors: Nejc Čeplak, Hong Liu, Andrei Parnachev, Samuel Valach

    Abstract: Eternal asymptotically AdS black holes are dual to thermofield double states in the boundary CFT. It has long been known that black hole singularities have certain signatures in boundary thermal two-point functions related to null geodesics bouncing off the singularities (bouncing geodesics). In this paper we shed light on the manifestations of black hole singularities in the dual CFT. We decompos…

    Submitted 26 April, 2024; originally announced April 2024.

    Comments: 36+29 pages, 18 figures

  16. arXiv:2404.17147  [pdf, other]

    cs.CV cs.LG

    On the Federated Learning Framework for Cooperative Perception

    Authors: Zhenrong Zhang, Jianan Liu, Xi Zhou, Tao Huang, Qing-Long Han, Jingxin Liu, Hongbin Liu

    Abstract: Cooperative perception is essential to enhance the efficiency and safety of future transportation systems, requiring extensive data sharing among vehicles on the road, which raises significant privacy concerns. Federated learning offers a promising solution by enabling data privacy-preserving collaborative enhancements in perception, decision-making, and planning among connected and autonomous veh…

    Submitted 26 April, 2024; originally announced April 2024.

  17. arXiv:2404.16776  [pdf, other]

    cs.CL

    Modeling Selective Feature Attention for Representation-based Siamese Text Matching

    Authors: Jianxiang Zang, Hui Liu

    Abstract: Representation-based Siamese networks have risen to popularity in lightweight text matching due to their low deployment and inference costs. While word-level attention mechanisms have been implemented within Siamese networks to improve performance, we propose Feature Attention (FA), a novel downstream block designed to enrich the modeling of dependencies among embedding features. Employing "squeez…

    Submitted 25 April, 2024; originally announced April 2024.

    Comments: Accepted by IJCAI2024

  18. arXiv:2404.16555  [pdf, other]

    cs.IR

    MMGRec: Multimodal Generative Recommendation with Transformer Model

    Authors: Han Liu, Yinwei Wei, Xuemeng Song, Weili Guan, Yuan-Fang Li, Liqiang Nie

    Abstract: Multimodal recommendation aims to recommend user-preferred candidates based on her/his historically interacted items and associated multimodal information. Previous studies commonly employ an embed-and-retrieve paradigm: learning user and item representations in the same embedding space, then retrieving similar candidate items for a user via embedding inner product. However, this paradigm suffers…

    Submitted 25 April, 2024; originally announced April 2024.

  19. arXiv:2404.16425  [pdf, other]

    astro-ph.HE

    Soft X-ray prompt emission from a high-redshift gamma-ray burst EP240315a

    Authors: Y. Liu, H. Sun, D. Xu, D. S. Svinkin, J. Delaunay, N. R. Tanvir, H. Gao, C. Zhang, Y. Chen, X. -F. Wu, B. Zhang, W. Yuan, J. An, G. Bruni, D. D. Frederiks, G. Ghirlanda, J. -W. Hu, A. Li, C. -K. Li, J. -D. Li, D. B. Malesani, L. Piro, G. Raman, R. Ricci, E. Troja, et al. (170 additional authors not shown)

    Abstract: Long gamma-ray bursts (GRBs) are believed to originate from core collapse of massive stars. High-redshift GRBs can probe the star formation and reionization history of the early universe, but their detection remains rare. Here we report the detection of a GRB triggered in the 0.5--4 keV band by the Wide-field X-ray Telescope (WXT) on board the Einstein Probe (EP) mission, designated as EP240315a,…

    Submitted 25 April, 2024; originally announced April 2024.

    Comments: 41 pages, 8 figures, 7 tables

  20. arXiv:2404.16235  [pdf, other]

    nucl-ex nucl-th

    Inclusive studies of two- and three-nucleon short-range correlations in $^3$H and $^3$He

    Authors: S. Li, S. N. Santiesteban, J. Arrington, R. Cruz-Torres, L. Kurbany, D. Abrams, S. Alsalmi, D. Androic, K. Aniol, T. Averett, C. Ayerbe Gayoso, J. Bane, S. Barcus, J. Barrow, A. Beck, V. Bellini, H. Bhatt, D. Bhetuwal, D. Biswas, D. Bulumulla, A. Camsonne, J. Castellanos, J. Chen, J-P. Chen, D. Chrisman, et al. (91 additional authors not shown)

    Abstract: Inclusive electron scattering at carefully chosen kinematics can isolate scattering from short-range correlations (SRCs), produced through hard, short-distance interactions of nucleons in the nucleus. Because the two-nucleon (2N) SRCs arise from the same N-N interaction in all nuclei, the cross section in the SRC-dominated regime is identical up to an overall scaling factor, and the A/2H cross sec…

    Submitted 24 April, 2024; originally announced April 2024.

  21. arXiv:2404.15657  [pdf, other]

    cs.LG cs.AI

    FedSI: Federated Subnetwork Inference for Efficient Uncertainty Quantification

    Authors: Hui Chen, Hengyu Liu, Zhangkai Wu, Xuhui Fan, Longbing Cao

    Abstract: While deep neural networks (DNNs) based personalized federated learning (PFL) is demanding for addressing data heterogeneity and shows promising performance, existing methods for federated learning (FL) suffer from efficient systematic uncertainty quantification. The Bayesian DNNs-based PFL is usually questioned of either over-simplified model structures or high computational and memory costs. In…

    Submitted 24 April, 2024; originally announced April 2024.

  22. arXiv:2404.15284  [pdf, other]

    eess.SP cs.AI

    Global 4D Ionospheric STEC Prediction based on DeepONet for GNSS Rays

    Authors: Dijia Cai, Zenghui Shi, Haiyang Fu, Huan Liu, Hongyi Qian, Yun Sui, Feng Xu, Ya-Qiu Jin

    Abstract: The ionosphere is a vitally dynamic charged particle region in the Earth's upper atmosphere, playing a crucial role in applications such as radio communication and satellite navigation. The Slant Total Electron Contents (STEC) is an important parameter for characterizing wave propagation, representing the integrated electron density along the ray of radio signals passing through the ionosphere. Th…

    Submitted 12 March, 2024; originally announced April 2024.

  23. arXiv:2404.15070  [pdf, other]

    cs.SI cs.AI

    BotDGT: Dynamicity-aware Social Bot Detection with Dynamic Graph Transformers

    Authors: Buyun He, Yingguang Yang, Qi Wu, Hao Liu, Renyu Yang, Hao Peng, Xiang Wang, Yong Liao, Pengyuan Zhou

    Abstract: Detecting social bots has evolved into a pivotal yet intricate task, aimed at combating the dissemination of misinformation and preserving the authenticity of online interactions. While earlier graph-based approaches, which leverage topological structure of social networks, yielded notable outcomes, they overlooked the inherent dynamicity of social networks -- In reality, they largely depicted the…

    Submitted 24 April, 2024; v1 submitted 23 April, 2024; originally announced April 2024.

    Comments: IJCAI 2024

  24. arXiv:2404.15028  [pdf, other]

    cs.CV

    PRISM: A Promptable and Robust Interactive Segmentation Model with Visual Prompts

    Authors: Hao Li, Han Liu, Dewei Hu, Jiacheng Wang, Ipek Oguz

    Abstract: In this paper, we present PRISM, a Promptable and Robust Interactive Segmentation Model, aiming for precise segmentation of 3D medical images. PRISM accepts various visual inputs, including points, boxes, and scribbles as sparse prompts, as well as masks as dense prompts. Specifically, PRISM is designed with four principles to achieve robustness: (1) Iterative learning. The model produces segmenta…

    Submitted 23 April, 2024; originally announced April 2024.

  25. arXiv:2404.14928  [pdf, other]

    cs.LG cs.AI cs.CL cs.SI

    Graph Machine Learning in the Era of Large Language Models (LLMs)

    Authors: Wenqi Fan, Shijie Wang, Jiani Huang, Zhikai Chen, Yu Song, Wenzhuo Tang, Haitao Mao, Hui Liu, Xiaorui Liu, Dawei Yin, Qing Li

    Abstract: Graphs play an important role in representing complex relationships in various domains like social networks, knowledge graphs, and molecular discovery. With the advent of deep learning, Graph Neural Networks (GNNs) have emerged as a cornerstone in Graph Machine Learning (Graph ML), facilitating the representation and processing of graph structures. Recently, LLMs have demonstrated unprecedented ca…

    Submitted 3 June, 2024; v1 submitted 23 April, 2024; originally announced April 2024.

  26. arXiv:2404.14775  [pdf, ps, other]

    nucl-th

    Properties of quark stars based on the density-dependent MIT bag model

    Authors: Min Ju, Pengcheng Chu, Xuhao Wu, He Liu

    Abstract: In this study, we extend the MIT bag model by incorporating the vector interaction among quarks and introducing a density-dependent bag pressure. Then we proceed to investigate the thermodynamic properties of strange quark matter (SQM) and pure up-down quark matter (udQM) in quark stars. The results demonstrate that the vector interaction among quarks and the density-dependent bag pressure have sig…

    Submitted 23 April, 2024; originally announced April 2024.

  27. arXiv:2404.14720  [pdf, other]

    cs.CR

    Incorporating Gradients to Rules: Towards Lightweight, Adaptive Provenance-based Intrusion Detection

    Authors: Lingzhi Wang, Xiangmin Shen, Weijian Li, Zhenyuan Li, R. Sekar, Han Liu, Yan Chen

    Abstract: As cyber-attacks become increasingly sophisticated and stealthy, it becomes more imperative and challenging to detect intrusion from normal behaviors. Through fine-grained causality analysis, provenance-based intrusion detection systems (PIDS) demonstrated a promising capacity to distinguish benign and malicious behaviors, attracting widespread attention from both industry and academia. Among dive…

    Submitted 22 April, 2024; originally announced April 2024.

  28. arXiv:2404.14700  [pdf, other]

    eess.AS cs.AI cs.CL cs.LG cs.SD

    FlashSpeech: Efficient Zero-Shot Speech Synthesis

    Authors: Zhen Ye, Zeqian Ju, Haohe Liu, Xu Tan, Jianyi Chen, Yiwen Lu, Peiwen Sun, Jiahao Pan, Weizhen Bian, Shulin He, Qifeng Liu, Yike Guo, Wei Xue

    Abstract: Recent progress in large-scale zero-shot speech synthesis has been significantly advanced by language models and diffusion models. However, the generation process of both methods is slow and computationally intensive. Efficient speech synthesis using a lower computing budget to achieve quality on par with previous work remains a significant challenge. In this paper, we present FlashSpeech, a large…

    Submitted 24 April, 2024; v1 submitted 22 April, 2024; originally announced April 2024.

    Comments: Efficient zero-shot speech synthesis

  29. arXiv:2404.14467  [pdf, other]

    cs.CL cs.AI

    Integrating Chemistry Knowledge in Large Language Models via Prompt Engineering

    Authors: Hongxuan Liu, Haoyu Yin, Zhiyao Luo, Xiaonan Wang

    Abstract: This paper presents a study on the integration of domain-specific knowledge in prompt engineering to enhance the performance of large language models (LLMs) in scientific domains. A benchmark dataset is curated to encapsulate the intricate physical-chemical properties of small molecules, their drugability for pharmacology, alongside the functional attributes of enzymes and crystal materials, under…

    Submitted 22 April, 2024; originally announced April 2024.

    Comments: 43 pages, 17 figures

  30. arXiv:2404.14013  [pdf, ps, other]

    math.CA

    A characterization of compactness via bilinear $T1$ theorem

    Authors: Mingming Cao, Honghai Liu, Zengyan Si, Kôzô Yabuta

    Abstract: We establish a bilinear $T1$ theorem to characterize the weighted compactness of bilinear Calderón--Zygmund operators. Let $T$ be a bilinear operator associated with a standard bilinear Calderón--Zygmund kernel. We demonstrate that $T$ can be extended to a compact bilinear operator from $L^{p_1}(w_1^{p_1}) \times L^{p_2}(w_2^{p_2})$ to $L^p(w^p)$ for all exponents…

    Submitted 22 April, 2024; originally announced April 2024.

    Comments: This is just a draft, but we post the file in its current form now, in response to several queries about the result and method. Eventually, these results will be a part of a more extensive work about compactness of bilinear singular integrals

    MSC Class: 42B20; 42B35

  31. arXiv:2404.13840  [pdf, other]

    hep-ex

    Study of $e^+e^-\toωX(3872)$ and $γX(3872)$ from 4.66 to 4.95 GeV

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, O. Afedulidis, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere, et al. (634 additional authors not shown)

    Abstract: Using data samples with an integrated luminosity of $4.5~\text{fb}^{-1}$ collected by the BESIII detector at center-of-mass energies ranging from 4.66 to 4.95 GeV, we study the processes of $e^+e^-\toωX(3872)$ and $e^+e^-\toγX(3872)$. With the $e^+e^-\toωX(3872)$ process, the branching fraction ratio $R\equiv\frac{\mathcal{B}(X(3872)\toγJ/ψ)}{\mathcal{B}(X(3872)\toπ^+π^- J/ψ)}$ is measured to be…

    Submitted 21 April, 2024; originally announced April 2024.

    Comments: 19 pages, 10 figures

  32. arXiv:2404.13677  [pdf, other]

    cs.CV eess.IV

    A Dataset and Model for Realistic License Plate Deblurring

    Authors: Haoyan Gong, Yuzheng Feng, Zhenrong Zhang, Xianxu Hou, Jingxin Liu, Siqi Huang, Hongbin Liu

    Abstract: Vehicle license plate recognition is a crucial task in intelligent traffic management systems. However, the challenge of achieving accurate recognition persists due to motion blur from fast-moving vehicles. Despite the widespread use of image synthesis approaches in existing deblurring and recognition algorithms, their effectiveness in real-world scenarios remains unproven. To address this, we int…

    Submitted 22 April, 2024; v1 submitted 21 April, 2024; originally announced April 2024.

    Comments: Accepted by IJCAI 2024

  33. arXiv:2404.13657  [pdf, other]

    cs.CV cs.AI

    MLP: Motion Label Prior for Temporal Sentence Localization in Untrimmed 3D Human Motions

    Authors: Sheng Yan, Mengyuan Liu, Yong Wang, Yang Liu, Chen Chen, Hong Liu

    Abstract: In this paper, we address the unexplored question of temporal sentence localization in human motions (TSLM), aiming to locate a target moment from a 3D human motion that semantically corresponds to a text query. Considering that 3D human motions are captured using specialized motion capture devices, motions with only a few joints lack complex scene information like objects and lighting. Due to thi…

    Submitted 21 April, 2024; originally announced April 2024.

    Comments: 13 pages, 9 figures

  34. arXiv:2404.13472  [pdf, other]

    physics.optics nlin.PS physics.app-ph

    Foundry manufacturing of octave-spanning microcombs

    Authors: Jizhao Zang, Haixin Liu, Travis C. Briles, Scott B. Papp

    Abstract: Soliton microcombs provide a chip-based, octave-spanning source for self-referencing and optical metrology. We explore use of a silicon-nitride integrated photonics foundry to manufacture octave-spanning microcombs. By group-velocity dispersion engineering with the waveguide cross-section, we shape the soliton spectrum for dispersive-wave spectral enhancements at the frequencies for f-2f self-refe…

    Submitted 20 April, 2024; originally announced April 2024.

  35. arXiv:2404.13390  [pdf, other]

    cs.CL

    Explanation based Bias Decoupling Regularization for Natural Language Inference

    Authors: Jianxiang Zang, Hui Liu

    Abstract: The robustness of Transformer-based Natural Language Inference encoders is frequently compromised as they tend to rely more on dataset biases than on the intended task-relevant features. Recent studies have attempted to mitigate this by reducing the weight of biased samples during the training process. However, these debiasing methods primarily focus on identifying which samples are biased without…

    Submitted 20 April, 2024; originally announced April 2024.

  36. Automatic BLAS Offloading on Unified Memory Architecture: A Study on NVIDIA Grace-Hopper

    Authors: Junjie Li, Yinzhi Wang, Xiao Liang, Hang Liu

    Abstract: Porting codes to GPU often requires major efforts. While several tools exist for automatically offloading numerical libraries such as BLAS and LAPACK, they often prove impractical due to the high cost of mandatory data transfer. The new unified memory architecture in NVIDIA Grace-Hopper allows high bandwidth cache-coherent memory access of all memory from both CPU and GPU, potentially eliminating bot…

    Submitted 5 June, 2024; v1 submitted 19 April, 2024; originally announced April 2024.

  37. arXiv:2404.12817  [pdf, other]

    hep-ex

    Determination of the CKM angle $φ_{3}$ from a combination of Belle and Belle II results

    Authors: Belle, Belle II Collaborations: I. Adachi, L. Aggarwal, H. Aihara, N. Akopov, A. Aloisio, S. Al Said, N. Anh Ky, D. M. Asner, H. Atmacan, V. Aushev, M. Aversano, R. Ayad, V. Babu, H. Bae, S. Bahinipati, P. Bambade, Sw. Banerjee, S. Bansal, M. Barrett, J. Baudot, A. Baur, A. Beaubien, et al. (377 additional authors not shown)

    Abstract: We report a determination of the CKM angle $φ_{3}$, also known as $γ$, from a combination of measurements using samples of up to 711 fb$^{-1}$ from the Belle experiment and up to 362 fb$^{-1}$ from the Belle II experiment. We combine results from analyses of $B^+\to DK^+, B^+\to Dπ^+$, and $B^+ \to D^{*}K^+$ decays, where $D$ is an admixture of $D^0$ and $\overline{D}{}^{0}$ mesons, in a likelihoo…

    Submitted 19 April, 2024; originally announced April 2024.

    Comments: 31 pages, 4 figures

    Report number: Belle II Preprint 2023-015, KEK Preprint 2023-31

  38. arXiv:2404.12803  [pdf, other]

    cs.CV cs.LG

    TextSquare: Scaling up Text-Centric Visual Instruction Tuning

    Authors: Jingqun Tang, Chunhui Lin, Zhen Zhao, Shu Wei, Binghong Wu, Qi Liu, Hao Feng, Yang Li, Siqi Wang, Lei Liao, Wei Shi, Yuliang Liu, Hao Liu, Yuan Xie, Xiang Bai, Can Huang

    Abstract: Text-centric visual question answering (VQA) has made great strides with the development of Multimodal Large Language Models (MLLMs), yet open-source models still fall short of leading models like GPT4V and Gemini, partly due to a lack of extensive, high-quality instruction tuning data. To this end, we introduce a new approach for creating a massive, high-quality instruction-tuning dataset, Square…

    Submitted 19 April, 2024; originally announced April 2024.

  39. arXiv:2404.12659  [pdf, ps, other]

    cs.CL

    SOS-1K: A Fine-grained Suicide Risk Classification Dataset for Chinese Social Media Analysis

    Authors: Hongzhi Qi, Hanfei Liu, Jianqiang Li, Qing Zhao, Wei Zhai, Dan Luo, Tian Yu He, Shuo Liu, Bing Xiang Yang, Guanghui Fu

    Abstract: On social media, users frequently express personal emotions, a subset of which may indicate potential suicidal tendencies. The implicit and varied forms of expression in internet language complicate accurate and rapid identification of suicidal intent on social media, thus creating challenges for timely intervention efforts. The development of deep learning models for suicide risk detection is…

    Submitted 19 April, 2024; originally announced April 2024.

  40. arXiv:2404.12582  [pdf]

    physics.optics

    Effective Sorting of Fractional Optical Vortex Modes

    Authors: Zhengyang Mao, Haigang Liu, Xianfeng Chen

    Abstract: The mode sorter is a crucial component of communication systems based on orbital angular momentum (OAM). However, schemes proposed so far can only effectively sort integer OAM (IOAM) modes. Here, we demonstrate the effective sorting of fractional OAM (FOAM) modes by utilizing the coordinate transformation method, which can convert FOAM modes to IOAM modes. The transformed IOAM modes are subseque…

    Submitted 18 April, 2024; originally announced April 2024.

    Comments: 6 pages, 5 figures

  41. arXiv:2404.12578  [pdf]

    cond-mat.mtrl-sci

    Mixed polytype/polymorph formation and its effects on the electronic properties in InSe films grown by molecular beam epitaxy on GaAs(111)B

    Authors: Maria Hilse, Justin Rodriguez, Jennifer Gray, Jinyuan Yao, Shaoqing Ding, Derrick Shao Heng Liu, Mo Li, Joshua Young, Ying Liu, Roman Engel-Herbert

    Abstract: The top-down synthesis of inherently ferroelectric semiconductors and their integration with traditional material platforms have the potential to enable new low power logic devices, and to harness the bulk photoelectric effect for more efficient photovoltaic cells. InSe is a layered van der Waals compound exhibiting multiple polytypes, with semiconducting gamma-InSe revealing a non-centrosymmetric…

    Submitted 18 April, 2024; originally announced April 2024.

  42. arXiv:2404.12374  [pdf

    cond-mat.str-el cond-mat.mtrl-sci

    Tunable Kondo physics in a van der Waals kagome antiferromagnet

    Authors: Boqin Song, Yuyang Xie, Wei-Jian Li, Hui Liu, Qinghua Zhang, Jian-gang Guo, Lin Zhao, Shun-Li Yu, Xingjiang Zhou, Xiaolong Chen, Tianping Ying

    Abstract: Kondo lattice physics, describing the hybridization of localized spin matrix with dispersive conduction electrons, breeds numerous discoveries in the realm of strongly correlated quantum matter. Generally observed in lanthanide and actinide compounds, increasing attention has been directed towards alternative pathways for achieving flat band structures, such as moiré superlattices and Kagome m…

    Submitted 18 April, 2024; originally announced April 2024.

  43. arXiv:2404.12322  [pdf, other

    cs.CV cs.AI

    Generalizable Face Landmarking Guided by Conditional Face Warping

    Authors: Jiayi Liang, Haotian Liu, Hongteng Xu, Dixin Luo

    Abstract: As a significant step for human face modeling, editing, and generation, face landmarking aims at extracting facial keypoints from images. A generalizable face landmarker is required in practice because real-world facial images, e.g., the avatars in animations and games, are often stylized in various ways. However, achieving generalizable face landmarking is challenging due to the diversity of faci…

    Submitted 21 April, 2024; v1 submitted 18 April, 2024; originally announced April 2024.

    Comments: Accepted in CVPR 2024

  44. The 2018 outburst of MAXI J1820+070 as seen by Insight-HXMT

    Authors: Ningyue Fan, Songyu Li, Rui Zhan, Honghui Liu, Zuobin Zhang, Cosimo Bambi, Long Ji, Xiang Ma, James F. Steiner, Shuang-Nan Zhang, Menglei Zhou

    Abstract: We present an analysis of the whole 2018 outburst of the black hole X-ray binary MAXI J1820+070 with Insight-HXMT data. We focus our study on the temporal evolution of the parameters of the source. We employ two different models to fit the disk's thermal spectrum: the Newtonian model DISKBB and the relativistic model NKBB. These two models provide different pictures of the source in the soft state…

    Submitted 1 July, 2024; v1 submitted 18 April, 2024; originally announced April 2024.

    Comments: 14 pages, 8 figures. v2: refereed version

    Journal ref: Astrophys.J. 969: 61 (2024)

  45. arXiv:2404.12139  [pdf, other

    cs.CV

    Omniview-Tuning: Boosting Viewpoint Invariance of Vision-Language Pre-training Models

    Authors: Shouwei Ruan, Yinpeng Dong, Hanqing Liu, Yao Huang, Hang Su, Xingxing Wei

    Abstract: Vision-Language Pre-training (VLP) models like CLIP have achieved remarkable success in computer vision and particularly demonstrated superior robustness to distribution shifts of 2D images. However, their robustness under 3D viewpoint variations is still limited, which can hinder the development for real-world applications. This paper successfully addresses this concern while keeping VLPs' origin…

    Submitted 18 April, 2024; originally announced April 2024.

    Comments: 20 pages

  46. arXiv:2404.11890  [pdf, other

    math.NA cs.LG

    FCNCP: A Coupled Nonnegative CANDECOMP/PARAFAC Decomposition Based on Federated Learning

    Authors: Yukai Cai, Hang Liu, Xiulin Wang, Hong** Li, Ziyi Wang, Chuanshuai Yang, Fengyu Cong

    Abstract: In the field of brain science, data sharing across servers is becoming increasingly challenging due to issues such as industry competition, privacy security, and administrative procedure policies and regulations. Therefore, there is an urgent need to develop new methods for data analysis and processing that enable scientific collaboration without data sharing. In view of this, this study proposes…

    Submitted 18 April, 2024; originally announced April 2024.

  47. arXiv:2404.11884  [pdf, other

    cs.CV

    Seeing Motion at Nighttime with an Event Camera

    Authors: Haoyue Liu, Shihan Peng, Lin Zhu, Yi Chang, Hanyu Zhou, Luxin Yan

    Abstract: We focus on a very challenging task: imaging dynamic scenes at nighttime. Most previous methods rely on the low-light enhancement of a conventional RGB camera. However, they would inevitably face a dilemma between the long exposure time of nighttime and the motion blur of dynamic scenes. Event cameras react to dynamic changes with higher temporal resolution (microsecond) and higher dynamic range (…

    Submitted 17 April, 2024; originally announced April 2024.

    Comments: Accepted by CVPR 2024

  48. arXiv:2404.11631  [pdf, other

    cs.DC

    A Preliminary Study on Accelerating Simulation Optimization with GPU Implementation

    Authors: Jinghai He, Haoyu Liu, Yuhang Wu, Zeyu Zheng, Tingyu Zhu

    Abstract: We provide a preliminary study on utilizing a GPU (Graphics Processing Unit) to accelerate computation for three simulation optimization tasks with either first-order or second-order algorithms. Compared to the implementation using only a CPU (Central Processing Unit), the GPU implementation benefits from computational advantages of parallel processing for large-scale matrix and vector operations.…

    Submitted 16 April, 2024; originally announced April 2024.

  49. arXiv:2404.11401  [pdf, other

    cs.CV

    RainyScape: Unsupervised Rainy Scene Reconstruction using Decoupled Neural Rendering

    Authors: Xianqiang Lyu, Hui Liu, Junhui Hou

    Abstract: We propose RainyScape, an unsupervised framework for reconstructing clean scenes from a collection of multi-view rainy images. RainyScape consists of two main modules: a neural rendering module and a rain-prediction module that incorporates a predictor network and a learnable latent embedding that captures the rain characteristics of the scene. Specifically, based on the spectral bias property of…

    Submitted 17 April, 2024; originally announced April 2024.

  50. arXiv:2404.11036  [pdf, other

    cs.LG cs.CL

    Cross-Platform Hate Speech Detection with Weakly Supervised Causal Disentanglement

    Authors: Paras Sheth, Tharindu Kumarage, Raha Moraffah, Aman Chadha, Huan Liu

    Abstract: Content moderation faces a challenging task as social media's ability to spread hate speech contrasts with its role in promoting global connectivity. With rapidly evolving slang and hate speech, the adaptability of conventional deep learning to the fluid landscape of online dialogue remains limited. In response, causality-inspired disentanglement has shown promise by segregating platform specific…

    Submitted 16 April, 2024; originally announced April 2024.