Showing 1–50 of 10,185 results for author: liu, J

  1. arXiv:2407.05872  [pdf, other]

    cs.LG

    Scaling Exponents Across Parameterizations and Optimizers

    Authors: Katie Everett, Lechao Xiao, Mitchell Wortsman, Alexander A. Alemi, Roman Novak, Peter J. Liu, Izzeddin Gur, Jascha Sohl-Dickstein, Leslie Pack Kaelbling, Jaehoon Lee, Jeffrey Pennington

    Abstract: Robust and effective scaling of models from small to large width typically requires the precise adjustment of many algorithmic and architectural details, such as parameterization and optimizer choices. In this work, we propose a new perspective on parameterization by investigating a key assumption in prior work about the alignment between parameters and data and derive new theoretical results unde…

    Submitted 8 July, 2024; originally announced July 2024.

    Comments: 63 pages, International Conference on Machine Learning 2024

  2. arXiv:2407.05831  [pdf, other]

    hep-ph

    Revisiting for maximal flavor violating $Z^{'}_{eμ}$ and its phenomenology constraints

    Authors: Jia Liu, Muyuan Song, Haohao Zhang

    Abstract: Lepton flavor violation (LFV), observed conclusively in neutrino oscillations, remains a pivotal area of investigation due to its absence in the Standard Model (SM). Beyond the Standard Model (BSM) physics explores charged lepton flavor violation (CLFV), particularly through new particle candidates such as the $Z^{'}$. This article focuses on maximal LFV interactions facilitated by the $Z^{'}$ bos…

    Submitted 8 July, 2024; originally announced July 2024.

    Comments: 32 pages, 9 figures

  3. arXiv:2407.05705  [pdf, other]

    cs.AI

    Fast and Continual Knowledge Graph Embedding via Incremental LoRA

    Authors: Jiajun Liu, Wenjun Ke, Peng Wang, Jiahao Wang, Jinhua Gao, Ziyu Shang, Guozheng Li, Zijie Xu, Ke Ji, Yining Li

    Abstract: Continual Knowledge Graph Embedding (CKGE) aims to efficiently learn new knowledge and simultaneously preserve old knowledge. Dominant approaches primarily focus on alleviating catastrophic forgetting of old knowledge but neglect efficient learning for the emergence of new knowledge. However, in real-world scenarios, knowledge graphs (KGs) are continuously growing, which brings a significant chall…

    Submitted 8 July, 2024; originally announced July 2024.

    Comments: Accepted by IJCAI2024

  4. arXiv:2407.05645  [pdf, other]

    cs.CV cs.MM

    OneDiff: A Generalist Model for Image Difference

    Authors: Erdong Hu, Longteng Guo, Tongtian Yue, Zijia Zhao, Shuning Xue, Jing Liu

    Abstract: In computer vision, Image Difference Captioning (IDC) is crucial for accurately describing variations between closely related images. Traditional IDC methods often rely on specialist models, which restrict their applicability across varied contexts. This paper introduces the OneDiff model, a novel generalist approach that utilizes a robust vision-language model architecture, integrating a siamese…

    Submitted 8 July, 2024; originally announced July 2024.

  5. arXiv:2407.05558  [pdf]

    math.OC eess.SY

    Hidden Convexity-Based Distributed Operation of Integrated Electricity-Gas Systems

    Authors: Rong-Peng Liu, Yue Song, Junhong Liu, Xiaozhe Wang, Jinpeng Guo, Yunhe Hou

    Abstract: We propose a hidden convexity-based method to address distributed optimal energy flow (OEF) problems for transmission-level integrated electricity-gas systems. First, we develop a node-wise decoupling method to decompose an OEF problem into multiple OEF subproblems. Then, we propose a hidden convexity-based method to equivalently reformulate nonconvex OEF subproblems as semi-definite programs. Th…

    Submitted 7 July, 2024; originally announced July 2024.

    Comments: 7 pages

  6. arXiv:2407.05552  [pdf, other]

    cs.CV

    Ada-adapter:Fast Few-shot Style Personlization of Diffusion Model with Pre-trained Image Encoder

    Authors: Jia Liu, Changlin Li, Qirui Sun, Jiahui Ming, Chen Fang, Jue Wang, Bing Zeng, Shuaicheng Liu

    Abstract: Fine-tuning advanced diffusion models for high-quality image stylization usually requires large training datasets and substantial computational resources, hindering their practical applicability. We propose Ada-Adapter, a novel framework for few-shot style personalization of diffusion models. Ada-Adapter leverages off-the-shelf diffusion models and pre-trained image feature encoders to learn a com…

    Submitted 7 July, 2024; originally announced July 2024.

    Comments: 16 pages, 11 figures

    MSC Class: 68T07 ACM Class: I.4.0

  7. arXiv:2407.05491  [pdf, other]

    cond-mat.soft

    Cornerstones are the Key Stones: Using Interpretable Machine Learning to Probe the Clogging Process in 2D Granular Hoppers

    Authors: Jesse M. Hanlan, Sam Dillavou, Andrea J. Liu, Douglas J. Durian

    Abstract: The sudden arrest of flow by formation of a stable arch over an outlet is a unique and characteristic feature of granular materials. Previous work suggests that grains near the outlet randomly sample configurational flow microstates until a clog-causing flow microstate is reached. However, factors that lead to clogging remain elusive. Here we experimentally observe over 50,000 clogging events for…

    Submitted 7 July, 2024; originally announced July 2024.

    Comments: 16 pages, 11 figures

  8. arXiv:2407.05415  [pdf, other]

    cs.CV

    DIVESPOT: Depth Integrated Volume Estimation of Pile of Things Based on Point Cloud

    Authors: Yiran Ling, Rongqiang Zhao, Yixuan Shen, Dongbo Li, Jing Jin, Jie Liu

    Abstract: Non-contact volume estimation of pile-type objects has considerable potential in industrial scenarios, including grain, coal, mining, and stone materials. However, using existing method for these scenarios is challenged by unstable measurement poses, significant light interference, the difficulty of training data collection, and the computational burden brought by large piles. To address the above…

    Submitted 7 July, 2024; originally announced July 2024.

  9. arXiv:2407.05332  [pdf, other]

    quant-ph

    Experimental investigation of direct non-Hermitian measurement and uncertainty relation towards high-dimensional quantum domain

    Authors: Yi-Tao Wang, Zhao-An Wang, Zhi-Peng Li, Xiao-Dong Zeng, Jia-Ming Ren, Wei Liu, Yuan-Ze Yang, Nai-Jie Guo, Lin-Ke Xie, Jun-You Liu, Yu-Hang Ma, Jian-Shun Tang, Chengjie Zhang, Chuan-Feng Li, Guang-Can Guo

    Abstract: Non-Hermitian dynamics in quantum systems have unveiled novel phenomena, yet the implementation of valid non-Hermitian quantum measurement remains a challenge, because a universal quantum projective mechanism on the complete but skewed non-Hermitian eigenstates is not explicit in experiment. This limitation hinders the direct acquisition of non-Hermitian observable statistics (e.g., non-Hermitian…

    Submitted 7 July, 2024; originally announced July 2024.

    Comments: 6 pages, 4 figures

  10. arXiv:2407.05286  [pdf, other]

    cs.LG math.OC

    Stability and Generalization for Stochastic Recursive Momentum-based Algorithms for (Strongly-)Convex One to $K$-Level Stochastic Optimizations

    Authors: Xiaokang Pan, Xingyu Li, Jin Liu, Tao Sun, Kai Sun, Lixing Chen, Zhe Qu

    Abstract: STOchastic Recursive Momentum (STORM)-based algorithms have been widely developed to solve one to $K$-level ($K \geq 3$) stochastic optimization problems. Specifically, they use estimators to mitigate the biased gradient issue and achieve near-optimal convergence results. However, there is relatively little work on understanding their generalization performance, particularly evident during the tra…

    Submitted 7 July, 2024; originally announced July 2024.

  11. arXiv:2407.05155  [pdf, other]

    cs.IT eess.SP

    Wi-Fi Beyond Communications: Experimental Evaluation of Respiration Monitoring and Motion Detection Using COTS Devices

    Authors: Jiuyu Liu, Yi Ma, Rahim Tafazolli

    Abstract: Wi-Fi sensing has become an attractive option for non-invasive monitoring of human activities and vital signs. This paper explores the feasibility of using state-of-the-art commercial off-the-shelf (COTS) devices for Wi-Fi sensing applications, particularly respiration monitoring and motion detection. We utilize the Intel AX210 network interface card (NIC) to transmit Wi-Fi signals in both 2.4 GHz…

    Submitted 6 July, 2024; originally announced July 2024.

    Comments: This work has been accepted by IEEE ICCC Workshop 2024. Copyright may be transferred without notice, after which this version may no longer be accessible

  12. arXiv:2407.04881  [pdf, ps, other]

    math-ph math.ST nlin.CD

    Coupled Stochastic-Statistical Equations for Filtering Multiscale Turbulent Systems

    Authors: Di Qi, Jian-Guo Liu

    Abstract: We present a new strategy for filtering high-dimensional multiscale systems characterized by high-order non-Gaussian statistics using observations from leading-order moments. A closed stochastic-statistical modeling framework suitable for systematic theoretical analysis and efficient numerical simulations is designed. Optimal filtering solutions are derived based on the explicit coupling structure…

    Submitted 5 July, 2024; originally announced July 2024.

    Comments: 35 pages

  13. arXiv:2407.04877  [pdf]

    cond-mat.mtrl-sci cs.LG

    Leveraging Data Mining, Active Learning, and Domain Adaptation in a Multi-Stage, Machine Learning-Driven Approach for the Efficient Discovery of Advanced Acidic Oxygen Evolution Electrocatalysts

    Authors: Rui Ding, Jianguo Liu, Kang Hua, Xuebin Wang, Xiaoben Zhang, Minhua Shao, Yuxin Chen, Junhong Chen

    Abstract: Developing advanced catalysts for acidic oxygen evolution reaction (OER) is crucial for sustainable hydrogen production. This study introduces a novel, multi-stage machine learning (ML) approach to streamline the discovery and optimization of complex multi-metallic catalysts. Our method integrates data mining, active learning, and domain adaptation throughout the materials discovery process. Unlik…

    Submitted 5 July, 2024; originally announced July 2024.

    Comments: 95 pages (main text 37 pages; supplementary materials 58 pages); 38 figures (main text 6 figures; supplementary materials 32 figures)

  14. arXiv:2407.04846  [pdf, other]

    cs.LG cs.AI

    Amazing Things Come From Having Many Good Models

    Authors: Cynthia Rudin, Chudi Zhong, Lesia Semenova, Margo Seltzer, Ronald Parr, Jiachang Liu, Srikar Katta, Jon Donnelly, Harry Chen, Zachery Boner

    Abstract: The Rashomon Effect, coined by Leo Breiman, describes the phenomenon that there exist many equally good predictive models for the same dataset. This phenomenon happens for many real datasets and when it does, it sparks both magic and consternation, but mostly magic. In light of the Rashomon Effect, this perspective piece proposes reshaping the way we think about machine learning, particularly for…

    Submitted 5 July, 2024; originally announced July 2024.

    Journal ref: ICML (spotlight), 2024

  15. arXiv:2407.04591  [pdf, other]

    cs.LG math.OC

    Proximal Point Method for Online Saddle Point Problem

    Authors: Qing-xin Meng, Jian-wei Liu

    Abstract: This paper focuses on the online saddle point problem, which involves a sequence of two-player time-varying convex-concave games. Considering the nonstationarity of the environment, we adopt the duality gap and the dynamic Nash equilibrium regret as performance metrics for algorithm design. We present three variants of the proximal point method: the Online Proximal Point Method (OPPM), the Optimis…

    Submitted 5 July, 2024; originally announced July 2024.

  16. arXiv:2407.04220  [pdf, other]

    astro-ph.CO gr-qc hep-ph

    Detecting the dark sector through scalar-induced gravitational waves

    Authors: Xiao-Bin Sui, Jing Liu, Xing-Yu Yang, Rong-Gen Cai

    Abstract: We investigate the evolution of cosmological scalar perturbations in the case that the background radiation is weakly coupled to a light scalar field $φ$. The light scalar $φ$ is a homogeneous background field with a large initial value. In the radiation-dominated Universe, the coupling term introduces an effective mass to $φ$ and the background ultra-relativistic particles. The oscillations of…

    Submitted 4 July, 2024; originally announced July 2024.

    Comments: 13 pages, 4 figures

  17. arXiv:2407.03799  [pdf, other]

    cs.NI

    Your Mega-Constellations Can Be Slim:A Cost-Effective Approach for Constructing Survivable and Performant LEO Satellite Networks

    Authors: Zeqi Lai, Yibo Wang, Hewu Li, Qian Wu, Qi Zhang, Yunan Hou, Jun Liu, Yuanjie Li

    Abstract: In this paper, we investigate an important research problem facing the upcoming satellite Internet: from a network perspective, how many satellites exactly do we need to construct a survivable and performant LSN? To answer this question, we first formulate the survivable and performant LSN design (SPLD) problem, which aims to find the minimum number of needed satellites to construct an LSN that ca…

    Submitted 4 July, 2024; originally announced July 2024.

  18. arXiv:2407.03721  [pdf, other]

    astro-ph.HE astro-ph.SR

    Discovery of a dusty yellow supergiant progenitor for the Type IIb SN 2017gkk

    Authors: Zexi Niu, Ning-Chen Sun, Jifeng Liu

    Abstract: Type IIb supernovae are important subclass of stripped-envelope supernovae (SNe), which show H lines only at early times. Their progenitors are believed to contain a low-mass H envelope before explosion. This work reports the discovery of a progenitor candidate in pre-explosion Hubble Space Telescope images for the Type IIb SN 2017gkk. With detailed analysis of its spectral energy distribution and…

    Submitted 4 July, 2024; originally announced July 2024.

    Comments: 6 pages, 3 figures. ApJL accepted. Comments are welcome

  19. arXiv:2407.03625  [pdf, other]

    cs.SE

    Augmenting LLMs to Repair Obsolete Test Cases with Static Collector and Neural Reranker

    Authors: Jun Liu, Jiwei Yan, Yuanyuan Xie, Jun Yan, Jian Zhang

    Abstract: During software evolution, it is advocated that test code should co-evolve with production code. In real development scenarios, test updating may lag behind production code changing, which may cause the project to fail to compile or bring other troubles. Existing techniques based on pre-trained language models can be adopted to repair obsolete tests caused by such unsynchronized code changes, espe…

    Submitted 4 July, 2024; originally announced July 2024.

  20. arXiv:2407.03201  [pdf, other]

    quant-ph cond-mat.mes-hall physics.app-ph

    Wideband Coherent Microwave Conversion via Magnon Nonlinearity in Hybrid Quantum System

    Authors: Jiahao Wu, Jiacheng Liu, Zheyu Ren, Man Yin Leung, Wai Kuen Leung, Kin On Ho, Xiangrong Wang, Qiming Shao, Sen Yang

    Abstract: Frequency conversion is a widely realized physical process in nonlinear systems of optics and electronics. As an emerging nonlinear platform, spintronic devices have the potential to achieve stronger frequency conversion. Here, we demonstrated a microwave frequency conversion method in a hybrid quantum system, integrating nitrogen-vacancy centers in diamond with magnetic thin film CoFeB. We achiev…

    Submitted 3 July, 2024; originally announced July 2024.

    Comments: 11 pages, 5 figures

    Journal ref: npj Spintronics volume 2, Article number: 30 (2024)

  21. Frequency-Controlled Diffusion Model for Versatile Text-Guided Image-to-Image Translation

    Authors: Xiang Gao, Zhengbo Xu, Junhan Zhao, Jiaying Liu

    Abstract: Recently, large-scale text-to-image (T2I) diffusion models have emerged as a powerful tool for image-to-image translation (I2I), allowing open-domain image translation via user-provided text prompts. This paper proposes frequency-controlled diffusion model (FCDiffusion), an end-to-end diffusion-based framework that contributes a novel solution to text-guided I2I from a frequency-domain perspective…

    Submitted 3 July, 2024; originally announced July 2024.

    Comments: Proceedings of the 38th AAAI Conference on Artificial Intelligence (AAAI 2024)

    Journal ref: Proceedings of the AAAI Conference on Artificial Intelligence, 2024, 38(3), 1824-1832

  22. arXiv:2407.02933  [pdf, other]

    cs.RO

    Online Time-Informed Kinodynamic Motion Planning of Nonlinear Systems

    Authors: Fei Meng, Jianbang Liu, Haojie Shi, Han Ma, Hongliang Ren, Max Q. -H. Meng

    Abstract: Sampling-based kinodynamic motion planners (SKMPs) are powerful in finding collision-free trajectories for high-dimensional systems under differential constraints. Time-informed set (TIS) can provide the heuristic search domain to accelerate their convergence to the time-optimal solution. However, existing TIS approximation methods suffer from the curse of dimensionality, computational burden, and…

    Submitted 3 July, 2024; originally announced July 2024.

  23. arXiv:2407.02899  [pdf, other]

    hep-ex

    Measurement of the branching fraction of the decay $J/ψ\to p \bar{p} η$

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, O. Afedulidis, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere, et al. (639 additional authors not shown)

    Abstract: A high precision measurement of the branching fraction of the decay $J/ψ\to p \bar{p} η$ is performed using $(10 087 \pm 44) \times 10^6$ $J/ψ$ events recorded by the BESIII detector at the BEPCII storage ring. The branching fractions of the two decays $J/ψ\to p \bar{p} η(η\to γγ)$ and $J/ψ\to p \bar{p} η(η\to π^+ π^- π^0)$ are measured individually to be…

    Submitted 3 July, 2024; originally announced July 2024.

  24. arXiv:2407.02824  [pdf, other]

    cs.SE

    Exploring the Capabilities of LLMs for Code Change Related Tasks

    Authors: Lishui Fan, Jiakun Liu, Zhongxin Liu, David Lo, Xin Xia, Shanping Li

    Abstract: Developers deal with code-change-related tasks daily, e.g., reviewing code. Pre-trained code and code-change-oriented models have been adapted to help developers with such tasks. Recently, large language models (LLMs) have shown their effectiveness in code-related tasks. However, existing LLMs for code focus on general code syntax and semantics rather than the differences between two code versions…

    Submitted 3 July, 2024; originally announced July 2024.

  25. arXiv:2407.02767  [pdf, other]

    cond-mat.mtrl-sci cond-mat.mes-hall

    Comparison of Short-Range Order in GeSn Grown by Molecular Beam Epitaxy and Chemical Vapor Deposition

    Authors: Shang Liu, Yunfan Liang, Haochen Zhao, Nirosh M. Eldose, Jin-Hee Bae, Omar Concepcion, Xiaochen Jin, Shunda Chen, Ilias Bikmukhametov, Austin Akey, Cory T. Cline, Alejandra Cuervo Covian, Xiaoxin Wang, Tianshu Li, Yuping Zeng, Dan Buca, Shui-Qing Yu, Gregory J. Salamo, Shengbai Zhang, Jifeng Liu

    Abstract: Atomic short-range order (SRO) in direct-bandgap GeSn for infrared photonics has recently attracted attention due to its notable impact on band structures. However, the SRO in GeSn thin films grown by different methods have hardly been compared. This paper compares SRO in GeSn thin films of similar compositions grown by molecular beam epitaxy (MBE) and chemical vapor deposition (CVD) using atom pr…

    Submitted 2 July, 2024; originally announced July 2024.

  26. arXiv:2407.02723  [pdf, other]

    cs.CL

    e-Health CSIRO at "Discharge Me!" 2024: Generating Discharge Summary Sections with Fine-tuned Language Models

    Authors: Jinghui Liu, Aaron Nicolson, Jason Dowling, Bevan Koopman, Anthony Nguyen

    Abstract: Clinical documentation is an important aspect of clinicians' daily work and often demands a significant amount of time. The BioNLP 2024 Shared Task on Streamlining Discharge Documentation (Discharge Me!) aims to alleviate this documentation burden by automatically generating discharge summary sections, including brief hospital course and discharge instruction, which are often time-consuming to syn…

    Submitted 2 July, 2024; originally announced July 2024.

    Comments: BioNLP @ ACL 2024

  27. arXiv:2407.02423  [pdf, other]

    cs.LG math.CT

    On the Anatomy of Attention

    Authors: Nikhil Khatri, Tuomas Laakkonen, Jonathon Liu, Vincent Wang-Maścianica

    Abstract: We introduce a category-theoretic diagrammatic formalism in order to systematically relate and reason about machine learning models. Our diagrams present architectures intuitively but without loss of essential detail, where natural relationships between models are captured by graphical transformations, and important differences and similarities can be identified at a glance. In this paper, we focu…

    Submitted 7 July, 2024; v1 submitted 2 July, 2024; originally announced July 2024.

    Comments: Replaced to fix typos

    MSC Class: 68T01; 18M30 ACM Class: I.2.6

  28. arXiv:2407.02392  [pdf, other]

    cs.CV

    TokenPacker: Efficient Visual Projector for Multimodal LLM

    Authors: Wentong Li, Yuqian Yuan, Jian Liu, Dongqi Tang, Song Wang, Jianke Zhu, Lei Zhang

    Abstract: The visual projector serves as an essential bridge between the visual encoder and the Large Language Model (LLM) in a Multimodal LLM (MLLM). Typically, MLLMs adopt a simple MLP to preserve all visual contexts via one-to-one transformation. However, the visual tokens are redundant and can be considerably increased when dealing with high-resolution images, impairing the efficiency of MLLMs significa…

    Submitted 2 July, 2024; originally announced July 2024.

    Comments: 16 pages, Codes:https://github.com/CircleRadon/TokenPacker

  29. arXiv:2407.02376  [pdf, other]

    astro-ph.HE

    A new subclass of gamma-ray burst originating from compact binary merger

    Authors: Chen-Wei Wang, Wen-Jun Tan, Shao-Lin Xiong, Shu-Xu Yi, Rahim Moradi, Bing Li, Zhen Zhang, Yu Wang, Yan-Zhi Meng, Jia-Cong Liu, Yue Wang, Sheng-Lun Xie, Wang-Chen Xue, Zheng-Hang Yu, Peng Zhang, Wen-Long Zhang, Yan-Qiu Zhang, Chao Zheng

    Abstract: Type I gamma-ray bursts (GRBs) are believed to originate from compact binary merger usually with duration less than 2 seconds for the main emission. However, recent observations of GRB 211211A and GRB 230307A indicate that some merger-origin GRBs could last much longer. Since they show strikingly similar properties (indicating a common mechanism) which are different from the classic "long"-short b…

    Submitted 2 July, 2024; originally announced July 2024.

  30. arXiv:2407.02319  [pdf, other]

    cond-mat.mtrl-sci

    Catalogue of $C$-paired spin-valley locking in antiferromagnetic systems

    Authors: Mengli Hu, Xingkai Cheng, Zhenqiao Huang, Junwei Liu

    Abstract: Antiferromagnetic materials (AFMs) have been gaining lots of attentions due to its great potential in spintronics devices and the recently discovered novel spin structure in the momentum space, i.e., $C$-paired spin-valley or spin-momentum locking (CSVL), where spins and valleys/momenta are locked to each other due to the crystal symmetry guaranteeing zero magnetization. Here, we systematically st…

    Submitted 2 July, 2024; originally announced July 2024.

    Comments: Main text: 9 pages, 4 figures

  31. arXiv:2407.02273  [pdf, other]

    cs.CL

    Multilingual Trolley Problems for Language Models

    Authors: Zhijing Jin, Sydney Levine, Max Kleiman-Weiner, Giorgio Piatti, Jiarui Liu, Fernando Gonzalez Adauto, Francesco Ortu, András Strausz, Mrinmaya Sachan, Rada Mihalcea, Yejin Choi, Bernhard Schölkopf

    Abstract: As large language models (LLMs) are deployed in more and more real-world situations, it is crucial to understand their decision-making when faced with moral dilemmas. Inspired by a large-scale cross-cultural study of human moral preferences, "The Moral Machine Experiment", we set up the same set of moral choices for LLMs. We translate 1K vignettes of moral dilemmas, parametrically varied across ke…

    Submitted 2 July, 2024; originally announced July 2024.

  32. arXiv:2407.02057  [pdf, other]

    cs.LG cs.SI

    HC-GLAD: Dual Hyperbolic Contrastive Learning for Unsupervised Graph-Level Anomaly Detection

    Authors: Yali Fu, Jindong Li, Jiahong Liu, Qianli Xing, Qi Wang, Irwin King

    Abstract: Unsupervised graph-level anomaly detection (UGAD) has garnered increasing attention in recent years due to its significance. However, most existing methods only rely on traditional graph neural networks to explore pairwise relationships but such kind of pairwise edges are not enough to describe multifaceted relationships involving anomaly. There is an emergency need to exploit node group informati…

    Submitted 2 July, 2024; originally announced July 2024.

  33. arXiv:2407.02052  [pdf, other]

    eess.AS cs.SD

    The USTC-NERCSLIP Systems for The ICMC-ASR Challenge

    Authors: Minghui Wu, Luzhen Xu, Jie Zhang, Haitao Tang, Yanyan Yue, Ruizhi Liao, Jintao Zhao, Zhengzhe Zhang, Yichi Wang, Haoyin Yan, Hongliang Yu, Tongle Ma, Jiachen Liu, Chongliang Wu, Yongchao Li, Yanyong Zhang, Xin Fang, Yue Zhang

    Abstract: This report describes the submitted system to the In-Car Multi-Channel Automatic Speech Recognition (ICMC-ASR) challenge, which considers the ASR task with multi-speaker overlapping and Mandarin accent dynamics in the ICMC case. We implement the front-end speaker diarization using the self-supervised learning representation based multi-speaker embedding and beamforming using the speaker position,…

    Submitted 2 July, 2024; originally announced July 2024.

    Comments: Accepted at ICASSP 2024

  34. arXiv:2407.01905  [pdf, other]

    cs.CV

    Enhancing Multi-Class Anomaly Detection via Diffusion Refinement with Dual Conditioning

    Authors: Jiawei Zhan, Jinxiang Lai, Bin-Bin Gao, Jun Liu, Xiaochen Chen, Chengjie Wang

    Abstract: Anomaly detection, the technique of identifying abnormal samples using only normal samples, has attracted widespread interest in industry. Existing one-model-per-category methods often struggle with limited generalization capabilities due to their focus on a single category, and can fail when encountering variations in product. Recent feature reconstruction methods, as representatives in one-model…

    Submitted 1 July, 2024; originally announced July 2024.

  35. arXiv:2407.01612  [pdf, other]

    math.CO

    A Note on Improved bounds for the Oriented Radius of Mixed Multigraphs

    Authors: Hengzhe Li, Zhiwei Ding, Jianbing Liu, Yanhong Gao, Shuli Zhao

    Abstract: For a positive integer $r$, let $f(r)$ denote the smallest number such that any 2-edge connected mixed graph with radius $r$ has an oriented radius of at most $f(r)$. Recently, Babu, Benson, and Rajendraprasad significantly improved the upper bound of $f(r)$ by establishing that $f(r) \leq 1.5r^2 + r + 1$, see [Improved bounds for the oriented radius of mixed multigraphs, J. Graph Theory, 103 (202…

    Submitted 27 June, 2024; originally announced July 2024.

    Comments: 7 pages, 1 figure

    MSC Class: 05C12; 05C40

  36. arXiv:2407.01560  [pdf, other]

    cs.GR cs.AI

    3DMeshNet: A Three-Dimensional Differential Neural Network for Structured Mesh Generation

    Authors: Jiaming Peng, Xinhai Chen, Jie Liu

    Abstract: Mesh generation is a crucial step in numerical simulations, significantly impacting simulation accuracy and efficiency. However, generating meshes remains time-consuming and requires expensive computational resources. In this paper, we propose a novel method, 3DMeshNet, for three-dimensional structured mesh generation. The method embeds the meshing-related differential equations into the loss func…

    Submitted 7 May, 2024; originally announced July 2024.

  37. arXiv:2407.01552  [pdf]

    cs.NI physics.optics

    High Spectral-Efficiency, Ultra-low MIMO SDM Transmission over a Field-Deployed Multi-Core OAM Fiber

    Authors: Junyi Liu, Zengquan Xu, Shuqi Mo, Yuming Huang, Yining Huang, Zhenhua Li, Yuying Guo, Lei Shen, Shuo Xu, Ran Gao, Cheng Du, Qian Feng, Jie Luo, Jie Liu, Siyuan Yu

    Abstract: Few-mode multi-core fiber (FM-MCF) based Space-Division Multiplexing (SDM) systems possess the potential to maximize the number of multiplexed spatial channels per fiber by harnessing both the space (fiber cores) and mode (optical mode per core) dimensions. However, to date, no SDM transmissions over field-deployed FM-MCFs in realistic outdoor settings have been reported, which contrasts with SDM…

    Submitted 29 April, 2024; originally announced July 2024.

    Comments: 17 pages, 8 figures

  38. arXiv:2407.01290  [pdf, other]

    cs.LG cs.AI

    Hypformer: Exploring Efficient Hyperbolic Transformer Fully in Hyperbolic Space

    Authors: Menglin Yang, Harshit Verma, Delvin Ce Zhang, Jiahong Liu, Irwin King, Rex Ying

    Abstract: Hyperbolic geometry have shown significant potential in modeling complex structured data, particularly those with underlying tree-like and hierarchical structures. Despite the impressive performance of various hyperbolic neural networks across numerous domains, research on adapting the Transformer to hyperbolic space remains limited. Previous attempts have mainly focused on modifying self-attentio…

    Submitted 1 July, 2024; originally announced July 2024.

    Comments: KDD 2024

  39. arXiv:2407.01017  [pdf, other]

    cs.CV

    Coding for Intelligence from the Perspective of Category

    Authors: Wenhan Yang, Zixuan Hu, Lilang Lin, Jiaying Liu, Ling-Yu Duan

    Abstract: Coding, which targets compressing and reconstructing data, and intelligence, often regarded at an abstract computational level as being centered around model learning and prediction, interweave recently to give birth to a series of significant progress. The recent trends demonstrate the potential homogeneity of these two fields, especially when deep-learning models aid these two categories for bet…

    Submitted 2 July, 2024; v1 submitted 1 July, 2024; originally announced July 2024.

  40. arXiv:2407.00993  [pdf, other]

    cs.AI cs.CL

    Mobile-Bench: An Evaluation Benchmark for LLM-based Mobile Agents

    Authors: Shihan Deng, Weikai Xu, Hongda Sun, Wei Liu, Tao Tan, Jianfeng Liu, Ang Li, Jian Luan, Bin Wang, Rui Yan, Shuo Shang

    Abstract: With the remarkable advancements of large language models (LLMs), LLM-based agents have become a research hotspot in human-computer interaction. However, there is a scarcity of benchmarks available for LLM-based mobile agents. Benchmarking these agents generally faces three main challenges: (1) The inefficiency of UI-only operations imposes limitations to task evaluation. (2) Specific instructions…

    Submitted 1 July, 2024; originally announced July 2024.

  41. arXiv:2407.00906  [pdf, other]

    cs.CV cs.LG

    GSO-YOLO: Global Stability Optimization YOLO for Construction Site Detection

    Authors: Yuming Zhang, Dongzhi Guan, Shouxin Zhang, Junhao Su, Yunzhi Han, Jiabin Liu

    Abstract: Safety issues at construction sites have long plagued the industry, posing risks to worker safety and causing economic damage due to potential hazards. With the advancement of artificial intelligence, particularly in the field of computer vision, the automation of safety monitoring on construction sites has emerged as a solution to this longstanding issue. Despite achieving impressive performance,…

    Submitted 30 June, 2024; originally announced July 2024.

  42. arXiv:2407.00743  [pdf, other]

    cs.MM cs.AI cs.CL eess.AS

    AIMDiT: Modality Augmentation and Interaction via Multimodal Dimension Transformation for Emotion Recognition in Conversations

    Authors: Sheng Wu, Jiaxing Liu, Longbiao Wang, Dongxiao He, Xiaobao Wang, Jianwu Dang

    Abstract: Emotion Recognition in Conversations (ERC) is a popular task in natural language processing, which aims to recognize the emotional state of the speaker in conversations. While current research primarily emphasizes contextual modeling, there exists a dearth of investigation into effective multimodal fusion methods. We propose a novel framework called AIMDiT to solve the problem of multimodal fusion…

    Submitted 12 April, 2024; originally announced July 2024.

  43. arXiv:2407.00506  [pdf, other]

    cs.AI cs.GT cs.LG

    ShapG: new feature importance method based on the Shapley value

    Authors: Chi Zhao, Jing Liu, Elena Parilina

    Abstract: With wide application of Artificial Intelligence (AI), it has become particularly important to make decisions of AI systems explainable and transparent. In this paper, we proposed a new Explainable Artificial Intelligence (XAI) method called ShapG (Explanations based on Shapley value for Graphs) for measuring feature importance. ShapG is a model-agnostic global explanation method. At the first sta…

    Submitted 29 June, 2024; originally announced July 2024.

    MSC Class: 68T01; 68T20

  44. arXiv:2407.00382  [pdf, other]

    math.NA cs.AI cs.CE cs.LG

    Towards Universal Mesh Movement Networks

    Authors: Mingrui Zhang, Chunyang Wang, Stephan Kramer, Joseph G. Wallwork, Siyi Li, Jiancheng Liu, Xiang Chen, Matthew D. Piggott

    Abstract: Solving complex Partial Differential Equations (PDEs) accurately and efficiently is an essential and challenging problem in all scientific and engineering disciplines. Mesh movement methods provide the capability to improve the accuracy of the numerical solution without increasing the overall mesh degree of freedom count. Conventional sophisticated mesh movement methods are extremely expensive and…

    Submitted 1 July, 2024; v1 submitted 29 June, 2024; originally announced July 2024.

  45. arXiv:2407.00341  [pdf, other]

    cs.CL

    Iterative Data Augmentation with Large Language Models for Aspect-based Sentiment Analysis

    Authors: Haiyun Li, Qihuang Zhong, Ke Zhu, Juhua Liu, Bo Du, Dacheng Tao

    Abstract: Aspect-based Sentiment Analysis (ABSA) is an important sentiment analysis task, which aims to determine the sentiment polarity towards an aspect in a sentence. Due to the expensive and limited labeled data, data augmentation (DA) has become the standard for improving the performance of ABSA. However, current DA methods usually have some shortcomings: 1) poor fluency and coherence, 2) lack of diver…

    Submitted 29 June, 2024; originally announced July 2024.

    Comments: Work in process

  46. A slightly oblate dark matter halo revealed by a retrograde precessing Galactic disk warp

    Authors: Yang Huang, Qikang Feng, Tigran Khachaturyants, Huawei Zhang, Jifeng Liu, Juntai Shen, Timothy C. Beers, Youjun Lu, Song Wang, Haibo Yuan

    Abstract: The shape of the dark matter (DM) halo is key to understanding the hierarchical formation of the Galaxy. Despite extensive efforts in recent decades, however, its shape remains a matter of debate, with suggestions ranging from strongly oblate to prolate. Here, we present a new constraint on its present shape by directly measuring the evolution of the Galactic disk warp with time, as traced by accu…

    Submitted 29 June, 2024; originally announced July 2024.

    Comments: Published in Nature Astronomy on June 27th, 2024. Final published version here: https://www.nature.com/articles/s41550-024-02309-5

  47. arXiv:2407.00297  [pdf]

    eess.IV cs.CV

    UADSN: Uncertainty-Aware Dual-Stream Network for Facial Nerve Segmentation

    Authors: Guanghao Zhu, Lin Liu, Jing Zhang, Xiaohui Du, Ruqian Hao, Juanxiu Liu

    Abstract: Facial nerve segmentation is crucial for preoperative path planning in cochlear implantation surgery. Recently, researchers have proposed some segmentation methods, such as atlas-based and deep learning-based methods. However, since the facial nerve is a tubular organ with a diameter of only 1.0-1.5mm, it is challenging to locate and segment the facial nerve in CT scans. In this work, we propose a…

    Submitted 28 June, 2024; originally announced July 2024.

  48. arXiv:2407.00136  [pdf, other]

    hep-ex

    Observation of the Electromagnetic Dalitz Transition $h_c \rightarrow e^+e^-η_c$

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, S. Ahmed, M. Albrecht, R. Aliberti, A. Amoroso, M. R. An, Q. An, X. H. Bai, Y. Bai, O. Bakina, R. Baldini Ferroli, I. Balossino, Y. Ban, K. Begzsuren, N. Berger, M. Bertani, D. Bettoni, F. Bianchi, J. Bloms, A. Bortone, I. Boyko, R. A. Briere, et al. (495 additional authors not shown)

    Abstract: Using $(27.12\pm 0.14)\times10^8$ $ψ(3686)$ decays and data samples of $e^+e^-$ collisions with $\sqrt{s}$ from 4.130 to 4.780~GeV collected with the BESIII detector, we report the first observation of the electromagnetic Dalitz transition $h_c\to e^+e^-η_c$ with a statistical significance of $5.4σ$. We measure the ratio of the branching fractions…

    Submitted 2 July, 2024; v1 submitted 28 June, 2024; originally announced July 2024.

  49. arXiv:2407.00115  [pdf, other]

    cs.LG cs.AI

    Instance Temperature Knowledge Distillation

    Authors: Zhengbo Zhang, Yuxi Zhou, Jia Gong, Jun Liu, Zhigang Tu

    Abstract: Knowledge distillation (KD) enhances the performance of a student network by allowing it to learn the knowledge transferred from a teacher network incrementally. Existing methods dynamically adjust the temperature to enable the student network to adapt to the varying learning difficulties at different learning stages of KD. KD is a continuous process, but when adjusting the temperature, these meth…

    Submitted 7 July, 2024; v1 submitted 27 June, 2024; originally announced July 2024.

    ACM Class: I.4.0

  50. arXiv:2407.00042  [pdf]

    q-bio.NC cs.SI eess.SY

    Module control of network analysis in psychopathology

    Authors: Chunyu Pan, Quan Zhang, Yue Zhu, Shengzhou Kong, Juan Liu, Changsheng Zhang, Fei Wang, Xizhe Zhang

    Abstract: The network approach to characterizing psychopathology departs from traditional latent categorical and dimensional approaches. Causal interplay among symptoms contributed to dynamic psychopathology system. Therefore, analyzing the symptom clusters is critical for understanding mental disorders. Furthermore, despite extensive research studying the topological features of symptom networks, the contr…

    Submitted 30 May, 2024; originally announced July 2024.