Skip to main content

Showing 1–50 of 370 results for author: Kang, Z

.
  1. arXiv:2406.12501  [pdf, other

    cs.IR

    Improving Multi-modal Recommender Systems by Denoising and Aligning Multi-modal Content and User Feedback

    Authors: Guipeng Xv, Xinyu Li, Ruobing Xie, Chen Lin, Chong Liu, Feng Xia, Zhanhui Kang, Leyu Lin

    Abstract: Multi-modal recommender systems (MRSs) are pivotal in diverse online web platforms and have garnered considerable attention in recent years. However, previous studies overlook the challenges of (1) noisy multi-modal content, (2) noisy user feedback, and (3) aligning multi-modal content with user feedback. In order to tackle these challenges, we propose Denoising and Aligning Multi-modal Recommende… ▽ More

    Submitted 18 June, 2024; originally announced June 2024.

  2. arXiv:2406.01684  [pdf, other

    hep-ph nucl-th

    Color Glass Condensate meets High Twist Expansion

    Authors: Yu Fu, Zhong-Bo Kang, Farid Salazar, Xin-Nian Wang, Hongxi Xing

    Abstract: We establish the correspondence between two well-known frameworks for QCD multiple scattering in nuclear media: the Color Glass Condensate (CGC) and the High-Twist (HT) expansion formalism. We argue that a consistent matching between both frameworks, in their common domain of validity, is achieved by incorporating the sub-eikonal longitudinal momentum phase in the CGC formalism, which mediates the… ▽ More

    Submitted 3 June, 2024; originally announced June 2024.

    Comments: 34 pages, 12 figures, 1 table

    Report number: INT-PUB-24-024

  3. arXiv:2405.15280  [pdf, other

    cs.IR cs.AI cs.LG

    DFGNN: Dual-frequency Graph Neural Network for Sign-aware Feedback

    Authors: Yiqing Wu, Ruobing Xie, Zhao Zhang, Xu Zhang, Fuzhen Zhuang, Leyu Lin, Zhanhui Kang, Yongjun Xu

    Abstract: The graph-based recommendation has achieved great success in recent years. However, most existing graph-based recommendations focus on capturing user preference based on positive edges/feedback, while ignoring negative edges/feedback (e.g., dislike, low rating) that widely exist in real-world recommender systems. How to utilize negative feedback in graph-based recommendations still remains underex… ▽ More

    Submitted 24 May, 2024; originally announced May 2024.

    Comments: Accepted by KDD 2024 Research Track

  4. arXiv:2405.14309  [pdf, other

    hep-ph

    Gamma-ray Signal from $Z_{N\geq 3}$ Dark Matter-Companion Models

    Authors: Jun Guo, Zhaofeng Kang, Ji-Gang Zhao

    Abstract: In Ref.~\cite{Guo:2021rre}, we proposed to replace the final dark matter (DM) particle in the semi-annihilation mode $\rm DM+DM\to antiDM+Higgs~boson$ with its $Z_{N\geq 3}$ companion, thus reducing DM number density without DM-nucleon scattering. In this work, we study the indirect detection signals from DM annihilation, the Higgs boson pair with one of them from the companion decay being on- or… ▽ More

    Submitted 23 May, 2024; originally announced May 2024.

    Comments: 14 pages, 3 figures

  5. arXiv:2405.05694  [pdf, other

    hep-ph

    Matter Asymmetries in the $Z_N$ Dark matter -companion Models

    Authors: Shao-Long Chen, Zhaofeng Kang, Ze-Kun Liu, Peng Zhang

    Abstract: A class of $Z_{N\geq 3}$-symmetric WIMP dark matter models that are characterized by the semi-annihilation into the companion of dark matter has been proposed in ref.~\cite{Guo:2021rre}, providing a mechanism to evade the stringent direct detection constraint. In this work, we point out that such models naturally provide the three Sakharov elements necessary for dark matter asymmetry, and moreover… ▽ More

    Submitted 6 June, 2024; v1 submitted 9 May, 2024; originally announced May 2024.

    Comments: 25 pages,14 figures

  6. arXiv:2405.03562  [pdf, other

    cs.IR

    ID-centric Pre-training for Recommendation

    Authors: Yiqing Wu, Ruobing Xie, Zhao Zhang, Fuzhen Zhuang, Xu Zhang, Leyu Lin, Zhanhui Kang, Yongjun Xu

    Abstract: Classical sequential recommendation models generally adopt ID embeddings to store knowledge learned from user historical behaviors and represent items. However, these unique IDs are challenging to be transferred to new domains. With the thriving of pre-trained language model (PLM), some pioneer works adopt PLM for pre-trained recommendation, where modality information (e.g., text) is considered un… ▽ More

    Submitted 7 May, 2024; v1 submitted 6 May, 2024; originally announced May 2024.

  7. arXiv:2404.19502  [pdf, other

    hep-ph

    Interplay between Vector-like Lepton and Seesaw Mechanism:Oblique Corrections

    Authors: Shuyang Han, Zhaofeng Kang, Jiang Zhu

    Abstract: The non-vanishing neutrino mass strongly hints the existence of right-handed neutrinos (RHNs), singlets of the standard model (SM). However, they are highly decoupled from the SM and difficult to probe. In this work, we consider the Majorana RHNs from the type-I seesaw mechanism may well mix with the heavy neutral lepton dwelling in certain vector-like lepton (VLL), thus acquiring a sizable electr… ▽ More

    Submitted 30 April, 2024; originally announced April 2024.

  8. arXiv:2404.16697  [pdf, other

    quant-ph

    High-Coherence Kerr-cat qubit in 2D architecture

    Authors: Ahmed Hajr, Bingcheng Qing, Ke Wang, Gerwin Koolstra, Zahra Pedramrazi, Ziqi Kang, Larry Chen, Long B. Nguyen, Christian Junger, Noah Goss, Irwin Huang, Bibek Bhandari, Nicholas E. Frattini, Shruti Puri, Justin Dressel, Andrew N. Jordan, David Santiago, Irfan Siddiqi

    Abstract: The Kerr-cat qubit is a bosonic qubit in which multi-photon Schrodinger cat states are stabilized by applying a two-photon drive to an oscillator with a Kerr nonlinearity. The suppressed bit-flip rate with increasing cat size makes this qubit a promising candidate to implement quantum error correction codes tailored for noise-biased qubits. However, achieving strong light-matter interactions neces… ▽ More

    Submitted 19 May, 2024; v1 submitted 25 April, 2024; originally announced April 2024.

  9. arXiv:2404.15704  [pdf, other

    cs.LG cs.AI cs.SD eess.AS

    Efficient Multi-Model Fusion with Adversarial Complementary Representation Learning

    Authors: Zuheng Kang, Yayun He, Jianzong Wang, Junqing Peng, **g Xiao

    Abstract: Single-model systems often suffer from deficiencies in tasks such as speaker verification (SV) and image classification, relying heavily on partial prior knowledge during decision-making, resulting in suboptimal performance. Although multi-model fusion (MMF) can mitigate some of these issues, redundancy in learned representations may limits improvements. To this end, we propose an adversarial comp… ▽ More

    Submitted 24 April, 2024; originally announced April 2024.

    Comments: Accepted by the 2024 International Joint Conference on Neural Networks (IJCNN 2024)

  10. arXiv:2404.14721  [pdf, other

    cs.LG

    Dynamically Anchored Prompting for Task-Imbalanced Continual Learning

    Authors: Chenxing Hong, Yan **, Zhiqi Kang, Yizhou Chen, Mengke Li, Yang Lu, Hanzi Wang

    Abstract: Existing continual learning literature relies heavily on a strong assumption that tasks arrive with a balanced data stream, which is often unrealistic in real-world applications. In this work, we explore task-imbalanced continual learning (TICL) scenarios where the distribution of task data is non-uniform across the whole learning process. We find that imbalanced tasks significantly challenge the… ▽ More

    Submitted 22 April, 2024; originally announced April 2024.

    Comments: Accepted by IJCAI 2024

  11. arXiv:2404.13892  [pdf, other

    cs.SD cs.AI eess.AS

    Retrieval-Augmented Audio Deepfake Detection

    Authors: Zuheng Kang, Yayun He, Botao Zhao, Xiaoyang Qu, Junqing Peng, **g Xiao, Jianzong Wang

    Abstract: With recent advances in speech synthesis including text-to-speech (TTS) and voice conversion (VC) systems enabling the generation of ultra-realistic audio deepfakes, there is growing concern about their potential misuse. However, most deepfake (DF) detection methods rely solely on the fuzzy knowledge learned by a single model, resulting in performance bottlenecks and transparency issues. Inspired… ▽ More

    Submitted 23 April, 2024; v1 submitted 22 April, 2024; originally announced April 2024.

    Comments: Accepted by the 2024 International Conference on Multimedia Retrieval (ICMR 2024)

  12. arXiv:2404.11375  [pdf, other

    cs.CV cs.MM

    Text-controlled Motion Mamba: Text-Instructed Temporal Grounding of Human Motion

    Authors: Xinghan Wang, Zixi Kang, Yadong Mu

    Abstract: Human motion understanding is a fundamental task with diverse practical applications, facilitated by the availability of large-scale motion capture datasets. Recent studies focus on text-motion tasks, such as text-based motion generation, editing and question answering. In this study, we introduce the novel task of text-based human motion grounding (THMG), aimed at precisely localizing temporal se… ▽ More

    Submitted 17 April, 2024; originally announced April 2024.

  13. arXiv:2404.08796  [pdf, other

    cs.IR

    The Elephant in the Room: Rethinking the Usage of Pre-trained Language Model in Sequential Recommendation

    Authors: Zekai Qu, Ruobing Xie, Chaojun Xiao, Xingwu Sun, Zhanhui Kang

    Abstract: Sequential recommendation (SR) has seen significant advancements with the help of Pre-trained Language Models (PLMs). Some PLM-based SR models directly use PLM to encode user historical behavior's text sequences to learn user representations, while there is seldom an in-depth exploration of the capability and suitability of PLM in behavior sequence modeling. In this work, we first conduct extensiv… ▽ More

    Submitted 17 April, 2024; v1 submitted 12 April, 2024; originally announced April 2024.

    Comments: 10 pages

  14. arXiv:2404.08793  [pdf, other

    cs.CR cs.CL cs.HC

    JailbreakLens: Visual Analysis of Jailbreak Attacks Against Large Language Models

    Authors: Yingchaojie Feng, Zhizhang Chen, Zhining Kang, Sijia Wang, Minfeng Zhu, Wei Zhang, Wei Chen

    Abstract: The proliferation of large language models (LLMs) has underscored concerns regarding their security vulnerabilities, notably against jailbreak attacks, where adversaries design jailbreak prompts to circumvent safety mechanisms for potential misuse. Addressing these concerns necessitates a comprehensive analysis of jailbreak prompts to evaluate LLMs' defensive capabilities and identify potential we… ▽ More

    Submitted 12 April, 2024; originally announced April 2024.

    Comments: Submitted to VIS 2024

  15. arXiv:2403.11116  [pdf, other

    cs.CV cs.AI

    PhD: A Prompted Visual Hallucination Evaluation Dataset

    Authors: Jiazhen Liu, Yuhan Fu, Ruobing Xie, Runquan Xie, Xingwu Sun, Fengzong Lian, Zhanhui Kang, Xirong Li

    Abstract: The rapid growth of Large Language Models (LLMs) has driven the development of Large Vision-Language Models (LVLMs). The challenge of hallucination, prevalent in LLMs, also emerges in LVLMs. However, most existing efforts mainly focus on object hallucination in LVLM, ignoring diverse types of LVLM hallucinations. In this study, we delve into the Intrinsic Vision-Language Hallucination (IVL-Hallu)… ▽ More

    Submitted 17 March, 2024; originally announced March 2024.

  16. arXiv:2403.03676  [pdf, other

    cs.LG

    Simplified PCNet with Robustness

    Authors: Bingheng Li, Xuanting Xie, Haoxiang Lei, Ruiyi Fang, Zhao Kang

    Abstract: Graph Neural Networks (GNNs) have garnered significant attention for their success in learning the representation of homophilic or heterophilic graphs. However, they cannot generalize well to real-world graphs with different levels of homophily. In response, the Possion-Charlier Network (PCNet) \cite{li2024pc}, the previous work, allows graph representation to be learned from heterophily to homoph… ▽ More

    Submitted 6 March, 2024; originally announced March 2024.

    Comments: 10 pages, 3 figures

  17. arXiv:2403.03670  [pdf, other

    cs.LG

    CDC: A Simple Framework for Complex Data Clustering

    Authors: Zhao Kang, Xuanting Xie, Bingheng Li, Erlin Pan

    Abstract: In today's data-driven digital era, the amount as well as complexity, such as multi-view, non-Euclidean, and multi-relational, of the collected data are growing exponentially or even faster. Clustering, which unsupervisely extracts valid knowledge from data, is extremely useful in practice. However, existing methods are independently developed to handle one particular challenge at the expense of t… ▽ More

    Submitted 6 March, 2024; originally announced March 2024.

    Comments: 10 pages, 5 figures

  18. arXiv:2403.03666  [pdf, other

    cs.LG

    Provable Filter for Real-world Graph Clustering

    Authors: Xuanting Xie, Erlin Pan, Zhao Kang, Wenyu Chen, Bingheng Li

    Abstract: Graph clustering, an important unsupervised problem, has been shown to be more resistant to advances in Graph Neural Networks (GNNs). In addition, almost all clustering methods focus on homophilic graphs and ignore heterophily. This significantly limits their applicability in practice, since real-world graphs exhibit a structural disparity and cannot simply be classified as homophily and heterophi… ▽ More

    Submitted 6 March, 2024; originally announced March 2024.

    Comments: 12 pages, 5 figures

  19. arXiv:2403.03659  [pdf, other

    cs.LG

    Robust Graph Structure Learning under Heterophily

    Authors: Xuanting Xie, Zhao Kang, Wenyu Chen

    Abstract: Graph is a fundamental mathematical structure in characterizing relations between different objects and has been widely used on various learning tasks. Most methods implicitly assume a given graph to be accurate and complete. However, real data is inevitably noisy and sparse, which will lead to inferior results. Despite the remarkable success of recent graph representation learning methods, they i… ▽ More

    Submitted 6 March, 2024; originally announced March 2024.

    Comments: 26 pages, 5 figures

  20. arXiv:2403.02775  [pdf, other

    cs.AI cs.LG

    EasyQuant: An Efficient Data-free Quantization Algorithm for LLMs

    Authors: Hanlin Tang, Yifu Sun, Decheng Wu, Kai Liu, Jianchen Zhu, Zhanhui Kang

    Abstract: Large language models (LLMs) have proven to be very superior to conventional methods in various tasks. However, their expensive computations and high memory requirements are prohibitive for deployment. Model quantization is an effective method for reducing this overhead. The problem is that in most previous works, the quantized model was calibrated using few samples from the training data, which m… ▽ More

    Submitted 5 March, 2024; originally announced March 2024.

  21. arXiv:2403.01886  [pdf, other

    cs.CL cs.AI

    FCDS: Fusing Constituency and Dependency Syntax into Document-Level Relation Extraction

    Authors: Xudong Zhu, Zhao Kang, Bei Hui

    Abstract: Document-level Relation Extraction (DocRE) aims to identify relation labels between entities within a single document. It requires handling several sentences and reasoning over them. State-of-the-art DocRE methods use a graph structure to connect entities across the document to capture dependency syntax information. However, this is insufficient to fully exploit the rich syntax information in the… ▽ More

    Submitted 4 March, 2024; originally announced March 2024.

    Comments: Appear in COLING 2024

  22. arXiv:2402.18581  [pdf, other

    cs.NE cs.AI

    Multi-objective Optimal Roadside Units Deployment in Urban Vehicular Networks

    Authors: Weian Guo, Zecheng Kang, Dongyang Li, Lun Zhang, Li Li

    Abstract: The significance of transportation efficiency, safety, and related services is increasing in urban vehicular networks. Within such networks, roadside units (RSUs) serve as intermediates in facilitating communication. Therefore, the deployment of RSUs is of utmost importance in ensuring the quality of communication services. However, the optimization objectives, such as time delay and deployment co… ▽ More

    Submitted 14 January, 2024; originally announced February 2024.

    Comments: This manuscript has been submitted to the journal of IEEE Transactions on Vehicular Technology

  23. arXiv:2402.13607  [pdf, other

    cs.CV cs.CL

    CODIS: Benchmarking Context-Dependent Visual Comprehension for Multimodal Large Language Models

    Authors: Fuwen Luo, Chi Chen, Zihao Wan, Zhaolu Kang, Qidong Yan, Yingjie Li, Xiaolong Wang, Siyu Wang, Ziyue Wang, Xiaoyue Mi, Peng Li, Ning Ma, Maosong Sun, Yang Liu

    Abstract: Multimodal large language models (MLLMs) have demonstrated promising results in a variety of tasks that combine vision and language. As these models become more integral to research and applications, conducting comprehensive evaluations of their capabilities has grown increasingly important. However, most existing benchmarks fail to consider that, in certain situations, images need to be interpret… ▽ More

    Submitted 4 June, 2024; v1 submitted 21 February, 2024; originally announced February 2024.

  24. arXiv:2402.04883  [pdf, other

    cs.CV

    Toward Accurate Camera-based 3D Object Detection via Cascade Depth Estimation and Calibration

    Authors: Chaoqun Wang, Yiran Qin, Zijian Kang, Ningning Ma, Ruimao Zhang

    Abstract: Recent camera-based 3D object detection is limited by the precision of transforming from image to 3D feature spaces, as well as the accuracy of object localization within the 3D space. This paper aims to address such a fundamental problem of camera-based 3D object detection: How to effectively learn depth information for accurate feature lifting and object localization. Different from previous met… ▽ More

    Submitted 7 February, 2024; originally announced February 2024.

    Comments: Accepted to ICRA2024

  25. arXiv:2402.01516  [pdf, other

    cs.CV

    Cross-view Masked Diffusion Transformers for Person Image Synthesis

    Authors: Trung X. Pham, Zhang Kang, Chang D. Yoo

    Abstract: We present X-MDPT ($\underline{Cross}$-view $\underline{M}$asked $\underline{D}$iffusion $\underline{P}$rediction $\underline{T}$ransformers), a novel diffusion model designed for pose-guided human image generation. X-MDPT distinguishes itself by employing masked diffusion transformers that operate on latent patches, a departure from the commonly-used Unet structures in existing works. The model c… ▽ More

    Submitted 3 June, 2024; v1 submitted 2 February, 2024; originally announced February 2024.

    Comments: ICML 2024

  26. arXiv:2401.03849  [pdf, other

    hep-ph hep-th

    Confinement Bubble Wall Velocity via Quasiparticle Determination

    Authors: Zhaofeng Kang, Jiang Zhu

    Abstract: Lattice simulations reveal that the deconfinement-confinement (D-C) phase transition (PT) of the hot pure $SU(N>2)$ Yang-Mills system is first order. This system can be described by a pool of quasigluons moving in the Polyakov loop background, and in this picture, we establish an effective distribution function for quasigluons, which encodes interactions among quasigluons and in particular the con… ▽ More

    Submitted 30 January, 2024; v1 submitted 8 January, 2024; originally announced January 2024.

    Comments: 7 pages, 3 figures

  27. arXiv:2401.02913  [pdf, other

    cs.IR

    Plug-in Diffusion Model for Sequential Recommendation

    Authors: Haokai Ma, Ruobing Xie, Lei Meng, Xin Chen, Xu Zhang, Leyu Lin, Zhanhui Kang

    Abstract: Pioneering efforts have verified the effectiveness of the diffusion models in exploring the informative uncertainty for recommendation. Considering the difference between recommendation and image synthesis tasks, existing methods have undertaken tailored refinements to the diffusion and reverse process. However, these approaches typically use the highest-score item in corpus for user interest pred… ▽ More

    Submitted 5 January, 2024; originally announced January 2024.

    Comments: Accepted by AAAI 2024

  28. arXiv:2401.01941  [pdf, other

    hep-ph hep-ex nucl-th

    The DIS 1-Jettiness Event Shape at N$^3$LL+${\cal O}(α_s^2)$

    Authors: Haotian Cao, Zhong-Bo Kang, Xiaohui Liu, Sonny Mantry

    Abstract: We present results for the $Ï„_1$ and $Ï„_{1a}$ 1-Jettiness global event shape distributions, for Deep Inelastic Scattering (DIS), at the N$^3$LL + ${\cal O}(α_s^2)$ level of accuracy. These event-shape distributions quantify and characterize the pattern of final state radiation in electron-nucleus collisions. They can be used as a probe of nuclear structure functions, nuclear medium effects in jet… ▽ More

    Submitted 21 June, 2024; v1 submitted 3 January, 2024; originally announced January 2024.

    Comments: 42 pages, 8 figures, references added, new appendix with extended discussion on shape function added, version to appear in Physical Review D

  29. arXiv:2312.17484  [pdf, other

    cs.CL cs.AI

    Truth Forest: Toward Multi-Scale Truthfulness in Large Language Models through Intervention without Tuning

    Authors: Zhongzhi Chen, Xingwu Sun, Xianfeng Jiao, Fengzong Lian, Zhanhui Kang, Di Wang, Cheng-Zhong Xu

    Abstract: Despite the great success of large language models (LLMs) in various tasks, they suffer from generating hallucinations. We introduce Truth Forest, a method that enhances truthfulness in LLMs by uncovering hidden truth representations using multi-dimensional orthogonal probes. Specifically, it creates multiple orthogonal bases for modeling truth by incorporating orthogonal constraints into the prob… ▽ More

    Submitted 14 January, 2024; v1 submitted 29 December, 2023; originally announced December 2023.

    Comments: Accepted as AAAI 2024

  30. arXiv:2312.14438  [pdf, other

    cs.LG cs.AI cs.SI

    PC-Conv: Unifying Homophily and Heterophily with Two-fold Filtering

    Authors: Bingheng Li, Erlin Pan, Zhao Kang

    Abstract: Recently, many carefully crafted graph representation learning methods have achieved impressive performance on either strong heterophilic or homophilic graphs, but not both. Therefore, they are incapable of generalizing well across real-world graphs with different levels of homophily. This is attributed to their neglect of homophily in heterophilic graphs, and vice versa. In this paper, we propose… ▽ More

    Submitted 22 December, 2023; originally announced December 2023.

    Comments: Accepted by AAAI2024

  31. arXiv:2312.14066  [pdf, other

    cs.LG

    Upper Bounding Barlow Twins: A Novel Filter for Multi-Relational Clustering

    Authors: Xiaowei Qian, Bingheng Li, Zhao Kang

    Abstract: Multi-relational clustering is a challenging task due to the fact that diverse semantic information conveyed in multi-layer graphs is difficult to extract and fuse. Recent methods integrate topology structure and node attribute information through graph filtering. However, they often use a low-pass filter without fully considering the correlation among multiple graphs. To overcome this drawback, w… ▽ More

    Submitted 21 December, 2023; originally announced December 2023.

    Comments: Accepted by AAAI 2024

  32. arXiv:2312.09226  [pdf, other

    hep-ph hep-ex nucl-ex nucl-th

    Nuclear modified transverse momentum dependent parton distribution and fragmentation functions

    Authors: Mishary Alrashed, Zhong-Bo Kang, John Terry, Hongxi Xing, Congyue Zhang

    Abstract: In this study, we extend our previous global analysis of nuclear-modified transverse momentum distribution functions (nTMDs) to also consider the nuclear-modified collinear fragmentation function. Our methodology incorporates the global set of experimental data from both Drell-Yan production and Semi-Inclusive Deep Inelastic Scattering. Through a comprehensive global extraction of these distributi… ▽ More

    Submitted 14 December, 2023; originally announced December 2023.

    Comments: 39 pages, 16 figures

    Report number: LA-UR-23-33438

  33. arXiv:2311.17142  [pdf, other

    hep-ph hep-ex nucl-ex nucl-th

    Transverse Energy-Energy Correlators in the Color-Glass Condensate at the Electron-Ion Collider

    Authors: Zhong-Bo Kang, Jani Penttala, Fanyi Zhao, Yiyu Zhou

    Abstract: We investigate the transverse energy-energy correlators (TEEC) in the small-$x$ regime at the upcoming Electron-Ion Collider (EIC). Focusing on the back-to-back production of electron-hadron pairs in both $ep$ and $eA$ collisions, we establish a factorization theorem given in terms of the hard function, quark distributions, soft functions, and TEEC jet functions, where the gluon saturation effect… ▽ More

    Submitted 28 November, 2023; originally announced November 2023.

    Comments: 11 pages, 3 figures

    Report number: MIT-CTP/5649

    Journal ref: Phys. Rev. D 109, 094012 (2024)

  34. arXiv:2311.01033  [pdf, other

    cs.LG cs.AI cs.SI

    Non-Autoregressive Diffusion-based Temporal Point Processes for Continuous-Time Long-Term Event Prediction

    Authors: Wang-Tao Zhou, Zhao Kang, Ling Tian

    Abstract: Continuous-time long-term event prediction plays an important role in many application scenarios. Most existing works rely on autoregressive frameworks to predict event sequences, which suffer from error accumulation, thus compromising prediction quality. Inspired by the success of denoising diffusion probabilistic models, we propose a diffusion-based non-autoregressive temporal point process mode… ▽ More

    Submitted 2 November, 2023; originally announced November 2023.

  35. arXiv:2311.00672  [pdf, other

    hep-ph hep-ex nucl-ex nucl-th

    Polarized fragmenting jet functions in Inclusive and Exclusive Jet Production

    Authors: Zhong-Bo Kang, Hongxi Xing, Fanyi Zhao, Yiyu Zhou

    Abstract: In this work, we present a complete theoretical framework for analyzing the distribution of polarized hadrons within jets, with and without measuring the transverse momentum relative to the standard jet axis. Using soft-collinear effective theory (SCET), we derive the factorization and provide the theoretical calculation of both semi-inclusive and exclusive fragmenting jet functions (FJFs) under l… ▽ More

    Submitted 5 May, 2024; v1 submitted 1 November, 2023; originally announced November 2023.

    Comments: 49 pages, 9 figures

    Report number: MIT-CTP/5633

    Journal ref: JHEP 03, 142 (2024)

  36. arXiv:2310.15929  [pdf, other

    cs.LG cs.AI cs.CL

    E-Sparse: Boosting the Large Language Model Inference through Entropy-based N:M Sparsity

    Authors: Yun Li, Lin Niu, Xipeng Zhang, Kai Liu, Jianchen Zhu, Zhanhui Kang

    Abstract: Traditional pruning methods are known to be challenging to work in Large Language Models (LLMs) for Generative AI because of their unaffordable training process and large computational demands. For the first time, we introduce the information entropy of hidden state features into a pruning metric design, namely E-Sparse, to improve the accuracy of N:M sparsity on LLM. E-Sparse employs the informat… ▽ More

    Submitted 22 March, 2024; v1 submitted 24 October, 2023; originally announced October 2023.

  37. arXiv:2310.15159  [pdf, other

    hep-ph hep-ex nucl-ex nucl-th

    Probing Transverse Momentum Dependent Structures with Azimuthal Dependence of Energy Correlators

    Authors: Zhong-Bo Kang, Kyle Lee, Ding Yu Shao, Fanyi Zhao

    Abstract: We study the azimuthal angle dependence of the energy-energy correlators $\langle \mathcal{E}(\hat{n}_1)\mathcal{E}(\hat{n}_2)\rangle$ in the back-to-back region for $e^+e^-$ annihilation and deep inelastic scattering (DIS) processes with general polarization of the proton beam. We demonstrate that the polarization information of the beam and the underlying partons from the hard scattering is prop… ▽ More

    Submitted 23 October, 2023; originally announced October 2023.

    Comments: 36 pages, 7 figures

    Report number: MIT-CTP/5632

    Journal ref: JHEP 03, 153 (2024)

  38. arXiv:2310.13540  [pdf, other

    cs.IR

    Thoroughly Modeling Multi-domain Pre-trained Recommendation as Language

    Authors: Zekai Qu, Ruobing Xie, Chaojun Xiao, Yuan Yao, Zhiyuan Liu, Fengzong Lian, Zhanhui Kang, Jie Zhou

    Abstract: With the thriving of pre-trained language model (PLM) widely verified in various of NLP tasks, pioneer efforts attempt to explore the possible cooperation of the general textual information in PLM with the personalized behavioral information in user historical behavior sequences to enhance sequential recommendation (SR). However, despite the commonalities of input format and task goal, there are h… ▽ More

    Submitted 27 November, 2023; v1 submitted 20 October, 2023; originally announced October 2023.

  39. arXiv:2310.12847  [pdf, other

    hep-ph hep-ex nucl-ex nucl-th

    Correspondence between Color Glass Condensate and High-Twist Formalism

    Authors: Yu Fu, Zhong-Bo Kang, Farid Salazar, Xin-Nian Wang, Hongxi Xing

    Abstract: The Color Glass Condensate (CGC) effective theory and the collinear factorization at high-twist (HT) are two well-known frameworks describing perturbative QCD multiple scatterings in nuclear media. It has long been recognized that these two formalisms have their own domain of validity in different kinematics regions. Taking direct photon production in proton-nucleus collisions as an example, we cl… ▽ More

    Submitted 19 October, 2023; originally announced October 2023.

    Comments: 7 pages, 3 figures + supplemental material

  40. Direct quarkonium-plus-gluon production in DIS in the Color Glass Condensate

    Authors: Zhong-Bo Kang, Emilie Li, Farid Salazar

    Abstract: We compute the differential cross-section for direct quarkonium production accompanied by a gluon in high-energy deep inelastic scattering (DIS) at small-$x$. We employ the Non-Relativistic QCD factorization framework, focusing on the $S$-wave contribution to the formation of the quarkonium, and including both color singlet and octet contributions. Our short distance coefficients for the productio… ▽ More

    Submitted 18 October, 2023; originally announced October 2023.

    Comments: 52 pages, 7 figures, 1 table

    Journal ref: JHEP 03, 027 (2024)

  41. arXiv:2310.04681  [pdf, other

    cs.SD cs.AI eess.AS

    VoiceExtender: Short-utterance Text-independent Speaker Verification with Guided Diffusion Model

    Authors: Yayun He, Zuheng Kang, Jianzong Wang, Junqing Peng, **g Xiao

    Abstract: Speaker verification (SV) performance deteriorates as utterances become shorter. To this end, we propose a new architecture called VoiceExtender which provides a promising solution for improving SV performance when handling short-duration speech signals. We use two guided diffusion models, the built-in and the external speaker embedding (SE) guided diffusion model, both of which utilize a diffusio… ▽ More

    Submitted 6 October, 2023; originally announced October 2023.

    Comments: Accepted by the 2023 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU 2023)

  42. A Prototype-Based Neural Network for Image Anomaly Detection and Localization

    Authors: Chao Huang, Zhao Kang, Hong Wu

    Abstract: Image anomaly detection and localization perform not only image-level anomaly classification but also locate pixel-level anomaly regions. Recently, it has received much research attention due to its wide application in various fields. This paper proposes ProtoAD, a prototype-based neural network for image anomaly detection and localization. First, the patch features of normal images are extracted… ▽ More

    Submitted 25 May, 2024; v1 submitted 4 October, 2023; originally announced October 2023.

    Comments: Published in Neural Processing Letters 2024

    Journal ref: Neural Process Lett 56, 169 (2024)

  43. arXiv:2309.13629  [pdf, ps, other

    astro-ph.IM astro-ph.SR

    Periodic Variable Star Classification with Deep Learning: Handling Data Imbalance in an Ensemble Augmentation Way

    Authors: Zihan Kang, Yanxia Zhang, **gyi Zhang, Changhua Li, Minzhi Kong, Yongheng Zhao, Xue-Bing Wu

    Abstract: Time-domain astronomy is progressing rapidly with the ongoing and upcoming large-scale photometric sky surveys led by the Vera C. Rubin Observatory project (LSST). Billions of variable sources call for better automatic classification algorithms for light curves. Among them, periodic variable stars are frequently studied. Different categories of periodic variable stars have a high degree of class i… ▽ More

    Submitted 24 September, 2023; originally announced September 2023.

    Comments: 10 pages, 8 figures, accepted

    Journal ref: PASP 135 094501 (2023)

  44. arXiv:2309.07111  [pdf, other

    cond-mat.str-el cond-mat.mes-hall cond-mat.mtrl-sci

    Anomalous excitonic phase diagram in band-gap-tuned Ta2Ni(Se,S)5

    Authors: Cheng Chen, Weichen Tang, Xiang Chen, Zhibo Kang, Shuhan Ding, Kirsty Scott, Siqi Wang, Zhenglu Li, Jacob P. C. Ruff, Makoto Hashimoto, Dong-Hui Lu, Chris Jozwiak, Aaron Bostwick, Eli Rotenberg, Eduardo H. da Silva Neto, Robert J. Birgeneau, Yulin Chen, Steven G. Louie, Yao Wang, Yu He

    Abstract: During a band-gap-tuned semimetal-to-semiconductor transition, Coulomb attraction between electrons and holes can cause spontaneously formed excitons near the zero-band-gap point, or the Lifshitz transition point. This has become an important route to realize bulk excitonic insulators -- an insulating ground state distinct from single-particle band insulators. How this route manifests from weak to… ▽ More

    Submitted 13 September, 2023; originally announced September 2023.

    Comments: 27 pages, 4 + 9 figures

    Journal ref: Nat Commun 14, 7512 (2023)

  45. arXiv:2309.07084  [pdf, other

    cs.CV

    SupFusion: Supervised LiDAR-Camera Fusion for 3D Object Detection

    Authors: Yiran Qin, Chaoqun Wang, Zijian Kang, Ningning Ma, Zhen Li, Ruimao Zhang

    Abstract: In this paper, we propose a novel training strategy called SupFusion, which provides an auxiliary feature level supervision for effective LiDAR-Camera fusion and significantly boosts detection performance. Our strategy involves a data enhancement method named Polar Sampling, which densifies sparse objects and trains an assistant model to generate high-quality features as the supervision. These fea… ▽ More

    Submitted 31 October, 2023; v1 submitted 13 September, 2023; originally announced September 2023.

    Comments: Accepted to ICCV2023

  46. arXiv:2308.14323  [pdf

    physics.geo-ph

    Institutional map** and causal analysis of avalanche vulnerable areas based on multi-source data

    Authors: Zexuan Zhou, Bingqi Ma, Jianwei Zhu, Zhizhong Kang

    Abstract: Avalanche disaster is a major natural disaster that seriously threatens the national infrastructure and personnel's life safety. For a long time, the research of avalanche disaster prediction in the world is insufficient, there are only some basic models and basic conditions of occurrence, and there is no long series and wide range of avalanche disaster prediction products. Based on 7 different ba… ▽ More

    Submitted 28 August, 2023; originally announced August 2023.

    Comments: 19 pages, 13 figures

  47. arXiv:2308.01217  [pdf, other

    cs.CV

    TeachCLIP: Multi-Grained Teaching for Efficient Text-to-Video Retrieval

    Authors: Kaibin Tian, Ruixiang Zhao, Hu Hu, Runquan Xie, Fengzong Lian, Zhanhui Kang, Xirong Li

    Abstract: For text-to-video retrieval (T2VR), which aims to retrieve unlabeled videos by ad-hoc textual queries, CLIP-based methods are dominating. Compared to CLIP4Clip which is efficient and compact, the state-of-the-art models tend to compute video-text similarity by fine-grained cross-modal feature interaction and matching, putting their scalability for large-scale T2VR into doubt. For efficient T2VR, w… ▽ More

    Submitted 2 August, 2023; originally announced August 2023.

  48. Blinkverse: A Database of Fast Radio Bursts

    Authors: Jiaying Xu, Yi Feng, Di Li, Pei Wang, Yongkun Zhang, **tao Xie, Huaxi Chen, Han Wang, Zhixuan Kang, **g**g Hu, Yun Zheng, Chao-Wei Tsai, Xianglei Chen, Dengke Zhou

    Abstract: The volume of research on fast radio bursts (FRBs) observation have been seeing a dramatic growth. To facilitate the systematic analysis of the FRB population, we established a database platform, Blinkverse (https://blinkverse.alkaidos.cn), as a central inventory of FRBs from various observatories and with published properties, particularly dynamic spectra from FAST, CHIME, GBT, Arecibo, etc. Blin… ▽ More

    Submitted 1 August, 2023; originally announced August 2023.

    Comments: 13 pages, 9 figures

    Journal ref: Universe 2023, 9(7), 330

  49. arXiv:2307.12286  [pdf, ps, other

    cs.IT eess.SP

    Double-Active-IRS Aided Wireless Communication: Deployment Optimization and Capacity Scaling

    Authors: Zhenyu Kang, Changsheng You, Rui Zhang

    Abstract: In this letter, we consider a double-active-intelligent reflecting surface (IRS) aided wireless communication system, where two active IRSs are properly deployed to assist the communication from a base station (BS) to multiple users located in a given zone via the double-reflection links. Under the assumption of fixed per-element amplification power for each active-IRS element, we formulate a rate… ▽ More

    Submitted 23 July, 2023; originally announced July 2023.

  50. arXiv:2307.06935  [pdf, other

    hep-ph hep-ex

    Collins-type Energy-Energy Correlators and Nucleon Structure

    Authors: Zhong-Bo Kang, Kyle Lee, Ding Yu Shao, Fanyi Zhao

    Abstract: We generalize the conventional Energy-Energy Correlator (EEC) to include the azimuthal angle dependence, so to define azimuthal angle dependent EEC observables. We study this new EEC observable in $e^+e^-$ and semi-inclusive deep inelastic scattering (SIDIS). In the back-to-back region, we find that the azimuthal angle dependent EEC is sensitive to both the unpolarized EEC jet function and a Colli… ▽ More

    Submitted 13 July, 2023; originally announced July 2023.

    Comments: Presented at DIS2023: XXX International Workshop on Deep-Inelastic Scattering and Related Subjects, Michigan State University, USA, 27-31 March 2023