Showing 51–100 of 860 results for author: Zhu, M

  1. arXiv:2404.13922  [pdf]

    hep-ex physics.plasm-ph

    A Platform for All-optical Thomson/ Compton Scattering with Versatile Parameters

    Authors: Siyu Chen, Wenchao Yan, Mingyang Zhu, Yaojun Li, Xichen Hu, Hao Xu, Jie Feng, Xulei Ge, Wenzhao Wang, Guangwei Lu, Mingxuan Wei, Lin Lu, Xiaojun Huang, Boyuan Li, Xiaohui Yuan, Feng Liu, Min Chen, Liming Chen, Jie Zhang

    Abstract: A dual-beam platform for all-optical electron-photon scattering, or Thomson/Compton scattering, with adjustable collision-angle and parameter tuning ability has been developed, which, in principle, can be used for the verification of strong-field quantum electrodynamics effects. Combining this platform with a 200 TW Ti:Sapphire laser system, we demonstrated the generation of inverse Compton scatte…

    Submitted 22 April, 2024; originally announced April 2024.

  2. arXiv:2404.12242  [pdf, other]

    cs.CL

    CMNEE: A Large-Scale Document-Level Event Extraction Dataset based on Open-Source Chinese Military News

    Authors: Mengna Zhu, Zijie Xu, Kaisheng Zeng, Kaiming Xiao, Mao Wang, Wenjun Ke, Hongbin Huang

    Abstract: Extracting structured event knowledge, including event triggers and corresponding arguments, from military texts is fundamental to many applications, such as intelligence analysis and decision assistance. However, event extraction in the military field faces the data scarcity problem, which impedes the research of event extraction models in this domain. To alleviate this problem, we propose CMNEE,…

    Submitted 18 April, 2024; originally announced April 2024.

    Comments: 13 pages, 7 figures, accepted to LREC-COLING 2024

  3. arXiv:2404.11943  [pdf, other]

    cs.HC

    AgentCoord: Visually Exploring Coordination Strategy for LLM-based Multi-Agent Collaboration

    Authors: Bo Pan, Jiaying Lu, Ke Wang, Li Zheng, Zhen Wen, Yingchaojie Feng, Minfeng Zhu, Wei Chen

    Abstract: The potential of automatic task-solving through Large Language Model (LLM)-based multi-agent collaboration has recently garnered widespread attention from both the research community and industry. While utilizing natural language to coordinate multiple agents presents a promising avenue for democratizing agent technology for general users, designing coordination strategies remains challenging with…

    Submitted 18 April, 2024; originally announced April 2024.

  4. arXiv:2404.10985  [pdf, ps, other]

    cs.CV stat.ML

    Pixel-Wise Symbol Spotting via Progressive Points Location for Parsing CAD Images

    Authors: Junbiao Pang, Zailin Dong, Jiaxin Deng, Mengyuan Zhu, Yunwei Zhang

    Abstract: Parsing Computer-Aided Design (CAD) drawings is a fundamental step for CAD revision, semantic-based management, and the generation of 3D prototypes in both the architecture and engineering industries. Labeling symbols from a CAD drawing is a challenging yet notorious task from a practical point of view. In this work, we propose to label and spot symbols from CAD images that are converted from CAD…

    Submitted 16 April, 2024; originally announced April 2024.

    Comments: 10 pages, 10 figures, 6 tables

  5. arXiv:2404.09322  [pdf]

    cs.DC cs.AI

    The intelligent prediction and assessment of financial information risk in the cloud computing model

    Authors: Yufu Wang, Mingwei Zhu, Jiaqiang Yuan, Guanghui Wang, Hong Zhou

    Abstract: Cloud computing (cloud computing) is a kind of distributed computing, referring to the network "cloud" will be a huge data calculation and processing program into countless small programs, and then, through the system composed of multiple servers to process and analyze these small programs to get the results and return to the user. This report explores the intersection of cloud computing and finan…

    Submitted 14 April, 2024; originally announced April 2024.

  6. arXiv:2404.08793  [pdf, other]

    cs.CR cs.CL cs.HC

    JailbreakLens: Visual Analysis of Jailbreak Attacks Against Large Language Models

    Authors: Yingchaojie Feng, Zhizhang Chen, Zhining Kang, Sijia Wang, Minfeng Zhu, Wei Zhang, Wei Chen

    Abstract: The proliferation of large language models (LLMs) has underscored concerns regarding their security vulnerabilities, notably against jailbreak attacks, where adversaries design jailbreak prompts to circumvent safety mechanisms for potential misuse. Addressing these concerns necessitates a comprehensive analysis of jailbreak prompts to evaluate LLMs' defensive capabilities and identify potential we…

    Submitted 12 April, 2024; originally announced April 2024.

    Comments: Submitted to VIS 2024

  7. arXiv:2404.08004  [pdf, other]

    cs.LG cs.RO

    GRANP: A Graph Recurrent Attentive Neural Process Model for Vehicle Trajectory Prediction

    Authors: Yuhao Luo, Kehua Chen, Meixin Zhu

    Abstract: As a vital component in autonomous driving, accurate trajectory prediction effectively prevents traffic accidents and improves driving efficiency. To capture complex spatial-temporal dynamics and social interactions, recent studies developed models based on advanced deep-learning methods. On the other hand, recent studies have explored the use of deep generative models to further account for traje…

    Submitted 9 April, 2024; originally announced April 2024.

  8. arXiv:2404.06591  [pdf, other]

    physics.soc-ph cs.IR

    Milgram's experiment in the knowledge space: Individual navigation strategies

    Authors: Manran Zhu, János Kertész

    Abstract: Data deluge characteristic for our times has led to information overload, posing a significant challenge to effectively finding our way through the digital landscape. Addressing this issue requires an in-depth understanding of how we navigate through the abundance of information. Previous research has discovered multiple patterns in how individuals navigate in the geographic, social, and informati…

    Submitted 9 April, 2024; originally announced April 2024.

    Comments: 25 pages, 8 figures

  9. arXiv:2404.05464  [pdf, ps, other]

    gr-qc astro-ph.CO hep-th

    Parity violation in primordial tensor non-Gaussianities from matter bounce cosmology

    Authors: Shingo Akama, Mian Zhu

    Abstract: It has been shown that primordial tensor non-Gaussianities from a cubic Weyl action with a non-dynamical coupling are suppressed by the so-called slow-roll parameter in a conventional framework of slow-roll inflation. In this paper, we consider matter bounce cosmology in which the background spacetime is no longer quasi-de Sitter, and hence one might expect that the matter bounce models could pred…

    Submitted 8 April, 2024; originally announced April 2024.

  10. arXiv:2404.02937  [pdf, other]

    cs.LG cs.AI

    Towards Responsible and Reliable Traffic Flow Prediction with Large Language Models

    Authors: Xusen Guo, Qiming Zhang, Junyue Jiang, Mingxing Peng, Hao, Yang, Meixin Zhu

    Abstract: Traffic forecasting is crucial for intelligent transportation systems. It has experienced significant advancements thanks to the power of deep learning in capturing latent patterns of traffic data. However, recent deep-learning architectures require intricate model designs and lack an intuitive understanding of the mapping from input data to predicted results. Achieving both accuracy and responsib…

    Submitted 21 April, 2024; v1 submitted 3 April, 2024; originally announced April 2024.

    Comments: 27 pages, 8 figures

  11. arXiv:2404.01151  [pdf, other]

    cs.CV

    Detect2Interact: Localizing Object Key Field in Visual Question Answering (VQA) with LLMs

    Authors: Jialou Wang, Manli Zhu, Yulei Li, Honglei Li, Longzhi Yang, Wai Lok Woo

    Abstract: Localization plays a crucial role in enhancing the practicality and precision of VQA systems. By enabling fine-grained identification and interaction with specific parts of an object, it significantly improves the system's ability to provide contextually relevant and spatially accurate responses, crucial for applications in dynamic environments like robotics and augmented reality. However, traditi…

    Submitted 1 April, 2024; originally announced April 2024.

    Comments: Accepted to IEEE Intelligent Systems

  12. arXiv:2404.01150  [pdf, other]

    cs.RO

    Visual-inertial state estimation based on Chebyshev polynomial optimization

    Authors: Hongyu Zhang, Maoran Zhu, Qi Cai, Yuanxin Wu

    Abstract: This paper proposes an innovative state estimation method for visual-inertial fusion based on Chebyshev polynomial optimization. Specifically, the pose is modeled as a Chebyshev polynomial of a certain order, and its time derivatives are used to calculate linear acceleration and angular velocity, which, along with inertial measurements, constitute dynamic constraints. This is coupled with a visual…

    Submitted 1 April, 2024; originally announced April 2024.

  13. arXiv:2404.00712  [pdf, other]

    cs.LG cs.AI cs.CY cs.IR

    Survey of Computerized Adaptive Testing: A Machine Learning Perspective

    Authors: Qi Liu, Yan Zhuang, Haoyang Bi, Zhenya Huang, Weizhe Huang, Jiatong Li, Junhao Yu, Zirui Liu, Zirui Hu, Yuting Hong, Zachary A. Pardos, Haiping Ma, Mengxiao Zhu, Shijin Wang, Enhong Chen

    Abstract: Computerized Adaptive Testing (CAT) provides an efficient and tailored method for assessing the proficiency of examinees, by dynamically adjusting test questions based on their performance. Widely adopted across diverse fields like education, healthcare, sports, and sociology, CAT has revolutionized testing practices. While traditional methods rely on psychometrics and statistics, the increasing c…

    Submitted 4 April, 2024; v1 submitted 31 March, 2024; originally announced April 2024.

  14. arXiv:2403.19345  [pdf]

    cs.IR cs.AI

    Intelligent Classification and Personalized Recommendation of E-commerce Products Based on Machine Learning

    Authors: Kangming Xu, Huiming Zhou, Haotian Zheng, Mingwei Zhu, Qi Xin

    Abstract: With the rapid evolution of the Internet and the exponential proliferation of information, users encounter information overload and the conundrum of choice. Personalized recommendation systems play a pivotal role in alleviating this burden by aiding users in filtering and selecting information tailored to their preferences and requirements. Such systems not only enhance user experience and satisfa…

    Submitted 28 March, 2024; originally announced March 2024.

  15. arXiv:2403.18660  [pdf, other]

    cs.GR cs.CV

    InstructBrush: Learning Attention-based Instruction Optimization for Image Editing

    Authors: Ruoyu Zhao, Qingnan Fan, Fei Kou, Shuai Qin, Hong Gu, Wei Wu, Pengcheng Xu, Mingrui Zhu, Nannan Wang, Xinbo Gao

    Abstract: In recent years, instruction-based image editing methods have garnered significant attention in image editing. However, despite encompassing a wide range of editing priors, these methods are helpless when handling editing tasks that are challenging to accurately describe through language. We propose InstructBrush, an inversion method for instruction-based image editing methods to bridge this gap.…

    Submitted 27 March, 2024; originally announced March 2024.

    Comments: Project Page: https://royzhao926.github.io/InstructBrush/

  16. arXiv:2403.18344  [pdf, other]

    cs.AI

    LC-LLM: Explainable Lane-Change Intention and Trajectory Predictions with Large Language Models

    Authors: Mingxing Peng, Xusen Guo, Xianda Chen, Meixin Zhu, Kehua Chen, Hao, Yang, Xuesong Wang, Yinhai Wang

    Abstract: To ensure safe driving in dynamic environments, autonomous vehicles should possess the capability to accurately predict the lane change intentions of surrounding vehicles in advance and forecast their future trajectories. Existing motion prediction approaches have ample room for improvement, particularly in terms of long-term prediction accuracy and interpretability. In this paper, we address thes…

    Submitted 27 March, 2024; originally announced March 2024.

  17. arXiv:2403.17552  [pdf, other]

    cs.CL

    Naive Bayes-based Context Extension for Large Language Models

    Authors: Jianlin Su, Murtadha Ahmed, Wenbo, Luo Ao, Mingren Zhu, Yunfeng Liu

    Abstract: Large Language Models (LLMs) have shown promising in-context learning abilities. However, conventional In-Context Learning (ICL) approaches are often impeded by length limitations of transformer architecture, which pose challenges when attempting to effectively integrate supervision from a substantial number of demonstration examples. In this paper, we introduce a novel framework, called Naive Bay…

    Submitted 26 March, 2024; originally announced March 2024.

    Comments: Accepted to main NAACL 2024

  18. Self-Supervised Learning for Medical Image Data with Anatomy-Oriented Imaging Planes

    Authors: Tianwei Zhang, Dong Wei, Mengmeng Zhu, Shi Gu, Yefeng Zheng

    Abstract: Self-supervised learning has emerged as a powerful tool for pretraining deep networks on unlabeled data, prior to transfer learning of target tasks with limited annotation. The relevance between the pretraining pretext and target tasks is crucial to the success of transfer learning. Various pretext tasks have been proposed to utilize properties of medical image data (e.g., three dimensionality), w…

    Submitted 7 April, 2024; v1 submitted 25 March, 2024; originally announced March 2024.

    Comments: Medical Image Analysis

  19. arXiv:2403.15943  [pdf, ps, other]

    cs.CV

    Advanced Feature Manipulation for Enhanced Change Detection Leveraging Natural Language Models

    Authors: Zhenglin Li, Yangchen Huang, Mengran Zhu, Jingyu Zhang, JingHao Chang, Houze Liu

    Abstract: Change detection is a fundamental task in computer vision that processes a bi-temporal image pair to differentiate between semantically altered and unaltered regions. Large language models (LLMs) have been utilized in various domains for their exceptional feature extraction capabilities and have shown promise in numerous downstream applications. In this study, we harness the power of a pre-trained…

    Submitted 13 June, 2024; v1 submitted 23 March, 2024; originally announced March 2024.

    Comments: This version is not our full version based on our new progress, related data, and methodology we are dealing with, and based on the rules and the laws, we are adjusting our current version

  20. arXiv:2403.14232  [pdf, other]

    cs.LG

    Contrastive Balancing Representation Learning for Heterogeneous Dose-Response Curves Estimation

    Authors: Minqin Zhu, Anpeng Wu, Haoxuan Li, Ruoxuan Xiong, Bo Li, Xiaoqing Yang, Xuan Qin, Peng Zhen, Jiecheng Guo, Fei Wu, Kun Kuang

    Abstract: Estimating the individuals' potential response to varying treatment doses is crucial for decision-making in areas such as precision medicine and management science. Most recent studies predict counterfactual outcomes by learning a covariate representation that is independent of the treatment variable. However, such independence constraints neglect much of the covariate information that is useful f…

    Submitted 21 March, 2024; originally announced March 2024.

  21. arXiv:2403.13245  [pdf, other]

    eess.SY cs.AI cs.DC cs.LG cs.RO

    Federated reinforcement learning for robot motion planning with zero-shot generalization

    Authors: Zhenyuan Yuan, Siyuan Xu, Minghui Zhu

    Abstract: This paper considers the problem of learning a control policy for robot motion planning with zero-shot generalization, i.e., no data collection and policy adaptation is needed when the learned policy is deployed in new environments. We develop a federated reinforcement learning framework that enables collaborative learning of multiple learners and a central server, i.e., the Cloud, without sharing…

    Submitted 7 April, 2024; v1 submitted 19 March, 2024; originally announced March 2024.

  22. arXiv:2403.12130  [pdf, other]

    astro-ph.GA

    Almost Optically Dark Galaxies in DECaLS (I): Detection, Optical Properties and Possible Origins

    Authors: Lin Du, Wei Du, Cheng Cheng, Ming Zhu, Haiyang Yu, Hong Wu

    Abstract: We report the discovery of eight optical counterparts of ALFALFA extragalactic objects from DECaLS, five of which are discovered for the first time. These objects were flagged as HI emission sources with no optical counterparts in SDSS before. Multi-band data reveal their unusual physical properties. They are faint and blue ($g-r=-0.35\sim0.55$), with quite low surface brightness (…

    Submitted 18 March, 2024; originally announced March 2024.

    Comments: 32 pages, 11 figures, accepted by the Astrophysical Journal

  23. arXiv:2403.10831  [pdf, other]

    cs.CV

    DUE: Dynamic Uncertainty-Aware Explanation Supervision via 3D Imputation

    Authors: Qilong Zhao, Yifei Zhang, Mengdan Zhu, Siyi Gu, Yuyang Gao, Xiaofeng Yang, Liang Zhao

    Abstract: Explanation supervision aims to enhance deep learning models by integrating additional signals to guide the generation of model explanations, showcasing notable improvements in both the predictability and explainability of the model. However, the application of explanation supervision to higher-dimensional data, such as 3D medical images, remains an under-explored domain. Challenges associated wit…

    Submitted 16 March, 2024; originally announced March 2024.

    Comments: 9 pages, 6 figures

  24. arXiv:2403.10720  [pdf, other]

    cs.AI

    Development and Application of a Monte Carlo Tree Search Algorithm for Simulating Da Vinci Code Game Strategies

    Authors: Ye Zhang, Mengran Zhu, Kailin Gui, Jiayue Yu, Yong Hao, Haozhan Sun

    Abstract: In this study, we explore the efficiency of the Monte Carlo Tree Search (MCTS), a prominent decision-making algorithm renowned for its effectiveness in complex decision environments, contingent upon the volume of simulations conducted. Notwithstanding its broad applicability, the algorithm's performance can be adversely impacted in certain scenarios, particularly within the domain of game strategy…

    Submitted 15 March, 2024; originally announced March 2024.

    Comments: This paper has been accepted by CVIDL2024

  25. arXiv:2403.09121  [pdf, other]

    cs.HC

    OutlineSpark: Igniting AI-powered Presentation Slides Creation from Computational Notebooks through Outlines

    Authors: Fengjie Wang, Yanna Lin, Leni Yang, Haotian Li, Mingyang Gu, Min Zhu, Huamin Qu

    Abstract: Computational notebooks are widely utilized for exploration and analysis. However, creating slides to communicate analysis results from these notebooks is quite tedious and time-consuming. Researchers have proposed automatic systems for generating slides from notebooks, which, however, often do not consider the process of users conceiving and organizing their messages from massive code cells. Thos…

    Submitted 14 March, 2024; originally announced March 2024.

    Comments: To appear in Proceedings of the CHI Conference on Human Factors in Computing Systems (CHI 2024)

  26. arXiv:2403.06199  [pdf, other]

    cs.CV cs.CL

    Mipha: A Comprehensive Overhaul of Multimodal Assistant with Small Language Models

    Authors: Minjie Zhu, Yichen Zhu, Xin Liu, Ning Liu, Zhiyuan Xu, Chaomin Shen, Yaxin Peng, Zhicai Ou, Feifei Feng, Jian Tang

    Abstract: Multimodal Large Language Models (MLLMs) have showcased impressive skills in tasks related to visual understanding and reasoning. Yet, their widespread application faces obstacles due to the high computational demands during both the training and inference phases, restricting their use to a limited audience within the research and user communities. In this paper, we investigate the design aspects…

    Submitted 25 March, 2024; v1 submitted 10 March, 2024; originally announced March 2024.

  27. arXiv:2403.04904  [pdf, other]

    cond-mat.mtrl-sci

    Antiferroelectric Nanodomains Stabilized by Chemical Disorder at Anti-phase Boundaries

    Authors: Menglin Zhu, Michael Xu, Yu Yun, Liyan Wu, Or Shafir, Colin Gilgenbach, Lane W. Martin, Ilya Grinberg, Jonathan E. Spanier, James M. LeBeau

    Abstract: Antiferroelectric perovskite oxides exhibit exceptional dielectric properties and high structural/chemical tunability, making them promising for a wide range of applications from high energy-density capacitors to solid-state cooling. However, tailoring the antiferroelectric phase stability through alloying is hampered by the complex interplay between chemistry and the alignment of dipole moments.…

    Submitted 7 March, 2024; originally announced March 2024.

  28. arXiv:2403.04753  [pdf, other]

    cs.GT

    Mechanism for Decision-aware Collaborative Federated Learning: A Pitfall of Shapley Values

    Authors: Meng Qi, Mingxi Zhu

    Abstract: This paper investigates mechanism design for decision-aware collaboration via federated learning (FL) platforms. Our framework consists of a digital platform and multiple decision-aware agents, each endowed with proprietary data sets. The platform offers an infrastructure that enables access to the data, creates incentives for collaborative learning aimed at operational decision-making, and conduc…

    Submitted 7 March, 2024; originally announced March 2024.

  29. arXiv:2403.03088  [pdf]

    cond-mat.soft cond-mat.mtrl-sci

    Shear-enhanced Liquid Crystal Spinning of Conjugated Polymer Fibers

    Authors: Hao Jiang, Chi-yuan Yang, Deyu Tu, Zhu Chen, Wei Huang, Liang-wen Feng, Hengda Sun, Hongzhi Wang, Simone Fabiano, Meifang Zhu, Gang Wang

    Abstract: Conjugated polymer fibers can be used to manufacture various soft fibrous optoelectronic devices, significantly advancing wearable devices and smart textiles. Recently, conjugated polymer-based fibrous electronic devices have been widely used in energy conversion, electrochemical sensing, and human-machine interaction. However, the insufficient mechanical properties of conjugated polymer fibers, t…

    Submitted 6 March, 2024; v1 submitted 5 March, 2024; originally announced March 2024.

  30. arXiv:2402.17979  [pdf, other]

    cs.CE cs.AI cs.LG

    Ensemble Methodology:Innovations in Credit Default Prediction Using LightGBM, XGBoost, and LocalEnsemble

    Authors: Mengran Zhu, Ye Zhang, Yulu Gong, Kaijuan Xing, Xu Yan, Jintong Song

    Abstract: In the realm of consumer lending, accurate credit default prediction stands as a critical element in risk mitigation and lending decision optimization. Extensive research has sought continuous improvement in existing models to enhance customer experiences and ensure the sound economic functioning of lending institutions. This study responds to the evolving landscape of credit default prediction, c…

    Submitted 27 February, 2024; originally announced February 2024.

  31. arXiv:2402.16038  [pdf]

    cs.CL cs.AI cs.LG

    Deep Learning Approaches for Improving Question Answering Systems in Hepatocellular Carcinoma Research

    Authors: Shuning Huo, Yafei Xiang, Hanyi Yu, Mengran Zhu, Yulu Gong

    Abstract: In recent years, advancements in natural language processing (NLP) have been fueled by deep learning techniques, particularly through the utilization of powerful computing resources like GPUs and TPUs. Models such as BERT and GPT-3, trained on vast amounts of data, have revolutionized language understanding and generation. These pre-trained models serve as robust bases for various tasks including…

    Submitted 25 February, 2024; originally announced February 2024.

  32. arXiv:2402.16036  [pdf]

    cs.RO cs.CV cs.LG

    Machine Learning-Based Vehicle Intention Trajectory Recognition and Prediction for Autonomous Driving

    Authors: Hanyi Yu, Shuning Huo, Mengran Zhu, Yulu Gong, Yafei Xiang

    Abstract: In recent years, the expansion of internet technology and advancements in automation have brought significant attention to autonomous driving technology. Major automobile manufacturers, including Volvo, Mercedes-Benz, and Tesla, have progressively introduced products ranging from assisted-driving vehicles to semi-autonomous vehicles. However, this period has also witnessed several traffic safety i…

    Submitted 25 February, 2024; originally announced February 2024.

  33. arXiv:2402.16035  [pdf]

    cs.CL cs.AI

    Text Understanding and Generation Using Transformer Models for Intelligent E-commerce Recommendations

    Authors: Yafei Xiang, Hanyi Yu, Yulu Gong, Shuning Huo, Mengran Zhu

    Abstract: With the rapid development of artificial intelligence technology, Transformer structural pre-training model has become an important tool for large language model (LLM) tasks. In the field of e-commerce, these models are especially widely used, from text understanding to generating recommendation systems, which provide powerful technical support for improving user experience and optimizing service…

    Submitted 25 February, 2024; originally announced February 2024.

  34. CLIPose: Category-Level Object Pose Estimation with Pre-trained Vision-Language Knowledge

    Authors: Xiao Lin, Minghao Zhu, Ronghao Dang, Guangliang Zhou, Shaolong Shu, Feng Lin, Chengju Liu, Qijun Chen

    Abstract: Most of existing category-level object pose estimation methods devote to learning the object category information from point cloud modality. However, the scale of 3D datasets is limited due to the high cost of 3D data collection and annotation. Consequently, the category features extracted from these limited point cloud samples may not be comprehensive. This motivates us to investigate whether we…

    Submitted 24 February, 2024; originally announced February 2024.

    Comments: 14 pages, 4 figures, 9 tables

  35. arXiv:2402.13669  [pdf, other]

    cs.CL

    Self-Distillation Bridges Distribution Gap in Language Model Fine-Tuning

    Authors: Zhaorui Yang, Tianyu Pang, Haozhe Feng, Han Wang, Wei Chen, Minfeng Zhu, Qian Liu

    Abstract: The surge in Large Language Models (LLMs) has revolutionized natural language processing, but fine-tuning them for specific tasks often encounters challenges in balancing performance and preserving general instruction-following abilities. In this paper, we posit that the distribution gap between task datasets and the LLMs serves as the primary underlying cause. To address the problem, we introduce…

    Submitted 28 May, 2024; v1 submitted 21 February, 2024; originally announced February 2024.

    Comments: ACL 2024

  36. arXiv:2402.13469  [pdf, other]

    quant-ph cs.PL

    The Quantum Abstract Machine

    Authors: Liyi Li, Le Chang, Rance Cleaveland, Mingwei Zhu, Xiaodi Wu

    Abstract: This paper develops a model of quantum behavior that is intended to support the abstract yet accurate design and functional verification of quantum communication protocols. The work is motivated by the need for conceptual tools for the development of quantum-communication systems that are usable by non-specialists in quantum physics while also correctly capturing at a useful abstraction the underl…

    Submitted 20 February, 2024; originally announced February 2024.

  37. arXiv:2402.10686  [pdf, other]

    cs.IT cs.CR cs.LG eess.SP

    Uncertainty, Calibration, and Membership Inference Attacks: An Information-Theoretic Perspective

    Authors: Meiyi Zhu, Caili Guo, Chunyan Feng, Osvaldo Simeone

    Abstract: In a membership inference attack (MIA), an attacker exploits the overconfidence exhibited by typical machine learning models to determine whether a specific data point was used to train a target model. In this paper, we analyze the performance of the state-of-the-art likelihood ratio attack (LiRA) within an information-theoretical framework that allows the investigation of the impact of the aleato…

    Submitted 16 February, 2024; originally announced February 2024.

    Comments: 27 pages, 13 figures

  38. arXiv:2402.09830  [pdf]

    cs.LG cs.AI cs.CE

    Utilizing GANs for Fraud Detection: Model Training with Synthetic Transaction Data

    Authors: Mengran Zhu, Yulu Gong, Yafei Xiang, Hanyi Yu, Shuning Huo

    Abstract: Anomaly detection is a critical challenge across various research domains, aiming to identify instances that deviate from normal data distributions. This paper explores the application of Generative Adversarial Networks (GANs) in fraud detection, comparing their advantages with traditional methods. GANs, a type of Artificial Neural Network (ANN), have shown promise in modeling complex data distrib…

    Submitted 15 February, 2024; originally announced February 2024.

  39. arXiv:2402.09820  [pdf]

    cs.CR cs.AI cs.LG q-fin.GN

    Utilizing Deep Learning for Enhancing Network Resilience in Finance

    Authors: Yulu Gong, Mengran Zhu, Shuning Huo, Yafei Xiang, Hanyi Yu

    Abstract: In the age of the Internet, people's lives are increasingly dependent on today's network technology. Maintaining network integrity and protecting the legitimate interests of users is at the heart of network construction. Threat detection is an important part of a complete and effective defense system. How to effectively detect unknown threats is one of the concerns of network protection. Currently…

    Submitted 18 February, 2024; v1 submitted 15 February, 2024; originally announced February 2024.

  40. arXiv:2402.06023  [pdf, other]

    cs.LG cs.AI cs.GT

    Decision Theory-Guided Deep Reinforcement Learning for Fast Learning

    Authors: Zelin Wan, Jin-Hee Cho, Mu Zhu, Ahmed H. Anwar, Charles Kamhoua, Munindar P. Singh

    Abstract: This paper introduces a novel approach, Decision Theory-guided Deep Reinforcement Learning (DT-guided DRL), to address the inherent cold start problem in DRL. By integrating decision theory principles, DT-guided DRL enhances agents' initial performance and robustness in complex environments, enabling more efficient and reliable convergence during learning. Our investigation encompasses two primary…

    Submitted 8 February, 2024; originally announced February 2024.

  41. arXiv:2402.03397  [pdf]

    q-bio.QM eess.IV

    A Comprehensive Approach to Diagnosing Temporomandibular Joint Diseases: AI-driven TMD Diagnostic System

    Authors: Y. Gua, C. T. Kong, D. D Zhangc, Y. J Baid, J. K. H. Tsoia, Hua Huangc, Y. Q. Dengc, Y. M Zhue

    Abstract: AI-driven TMD diagnostic system uses AI segmentation method to diagnose Temporomandibular Joint Disorders (TMD). By using segmentation, three important parts: temporal bone, temporomandibular joint (TMJ) disc and the condyle can be identified. The location and the size of each segment are used as the basic information to determine if the patient has a high chance of having Temporomandibular Joint…

    Submitted 4 February, 2024; originally announced February 2024.

  42. arXiv:2402.01858  [pdf, other]

    cs.LG cs.AI cs.CL cs.CV

    Explaining latent representations of generative models with large multimodal models

    Authors: Mengdan Zhu, Zhenke Liu, Bo Pan, Abhinav Angirekula, Liang Zhao

    Abstract: Learning interpretable representations of data generative latent factors is an important topic for the development of artificial intelligence. With the rise of the large multimodal model, it can align images with text to generate answers. In this work, we propose a framework to comprehensively explain each latent variable in the generative models using a large multimodal model. We further measure…

    Submitted 17 April, 2024; v1 submitted 2 February, 2024; originally announced February 2024.

    Comments: ICLR 2024 Workshop on Reliable and Responsible Foundation Models

  43. arXiv:2402.01576  [pdf, other]

    cs.RO

    Training Adversarial yet Safe Agent to Characterize Safety Performance of Highly Automated Vehicles

    Authors: Minghao Zhu, Anmol Sidhu, Keith A. Redmill

    Abstract: This paper focuses on safety performance testing and characterization of black-box highly automated vehicles (HAV). Existing testing approaches typically obtain the testing outcomes by deploying the HAV into a specific testing environment. Such a testing environment can involve various passively given testing strategies presented by other traffic participants such as (i) the naturalistic driving p…

    Submitted 2 February, 2024; originally announced February 2024.

  44. arXiv:2401.17364  [pdf, other]

    astro-ph.GA astro-ph.CO astro-ph.IM

    HiFAST: an HI data calibration and imaging pipeline for FAST

    Authors: Yingjie Jing, Jie Wang, Chen Xu, Ziming Liu, Qingze Chen, Tiantian Liang, Jinlong Xu, Yixian Cao, Jing Wang, Huijie Hu, Chuan-Peng Zhang, Qi Guo, Liang Gao, Mei Ai, Hengqian Gan, Xuyang Gao, Jinlin Han, Ligang Hou, Zhipeng Hou, Peng Jiang, Xu Kong, Fujia Li, Zerui Liu, Li Shao, Hengxing Pan, et al. (8 additional authors not shown)

    Abstract: The Five-hundred-meter Aperture Spherical radio Telescope (FAST) has the largest aperture and a 19-beam L-band receiver, making it powerful for investigating the neutral hydrogen atomic gas (HI) in the universe. We present HiFAST (https://hifast.readthedocs.io), a dedicated, modular, and self-contained calibration and imaging pipeline for processing the HI data of FAST. The pipeline consists of fr…

    Submitted 30 January, 2024; originally announced January 2024.

    Comments: Accepted by SCPMA. 21 pages, 14 figures. The pipeline is accessible at https://hifast.readthedocs.io

  45. arXiv:2401.16581  [pdf, other]

    cond-mat.str-el

    Continuum excitations in a spin-supersolid on a triangular lattice

    Authors: M. Zhu, V. Romerio, N. Steiger, S. D. Nabi, N. Murai, S. Ohira-Kawamura, K. Yu. Povarov, Y. Skourski, R. Sibille, L. Keller, Z. Yan, S. Gvasaliya, A. Zheludev

    Abstract: Magnetic, thermodynamic, neutron diffraction and inelastic neutron scattering are used to study spin correlations in the easy-axis XXZ triangular lattice magnet K2Co(SeO3)2. Despite the presence of quasi-2D "supersolid" magnetic order, the low-energy excitation spectrum contains no sharp modes and is instead a broad and structured multi-particle continuum. Applying a weak magnetic field drives the…

    Submitted 26 April, 2024; v1 submitted 29 January, 2024; originally announced January 2024.

  46. arXiv:2401.16459  [pdf, other]

    cs.CV

    Bridging Generative and Discriminative Models for Unified Visual Perception with Diffusion Priors

    Authors: Shiyin Dong, Mingrui Zhu, Kun Cheng, Nannan Wang, Xinbo Gao

    Abstract: The remarkable prowess of diffusion models in image generation has spurred efforts to extend their application beyond generative tasks. However, a persistent challenge exists in lacking a unified approach to apply diffusion models to visual perception tasks with diverse semantic granularity requirements. Our purpose is to establish a unified visual perception framework, capitalizing on the potenti…

    Submitted 29 January, 2024; originally announced January 2024.

    Comments: 18 pages, 11 figures

  47. arXiv:2401.15843  [pdf, other]

    cs.SE

    APIGen: Generative API Method Recommendation

    Authors: Yujia Chen, Cuiyun Gao, Muyijie Zhu, Qing Liao, Yong Wang, Guoai Xu

    Abstract: Automatic API method recommendation is an essential task of code intelligence, which aims to suggest suitable APIs for programming queries. Existing approaches can be categorized into two primary groups: retrieval-based and learning-based approaches. Although these approaches have achieved remarkable success, they still come with notable limitations. The retrieval-based approaches rely on the text…

    Submitted 28 January, 2024; originally announced January 2024.

    Comments: To appear in the proceedings of the 31st IEEE International Conference on Software Analysis, Evolution, and Reengineering (SANER 2024)

  48. arXiv:2401.15397  [pdf, other]

    astro-ph.GA

    FASHI: A search for extragalactic OH megamasers with FAST

    Authors: Chuan-Peng Zhang, Cheng Cheng, Ming Zhu, Jin-Long Xu, Peng Jiang

    Abstract: The FAST All Sky HI survey (FASHI) is broader in frequency band and sky volume, and deeper in detection sensitivity than the Arecibo Legacy Fast ALFA survey (ALFALFA). To efficiently expand the sample of OH megamasers (OHMs), whose strongest line has a rest frequency of 1667.35903 MHz, we directly matched the IRAS Point Source Catalog Redshift (PSCz) catalog with the corresponding FASHI data cube.…

    Submitted 16 June, 2024; v1 submitted 27 January, 2024; originally announced January 2024.

    Comments: 21 pages, 6 figures. Comments are welcome

  49. arXiv:2401.15002  [pdf, other]

    cs.CV

    BackdoorBench: A Comprehensive Benchmark and Analysis of Backdoor Learning

    Authors: Baoyuan Wu, Hongrui Chen, Mingda Zhang, Zihao Zhu, Shaokui Wei, Danni Yuan, Mingli Zhu, Ruotong Wang, Li Liu, Chao Shen

    Abstract: As an emerging and vital topic for studying deep neural networks' vulnerability (DNNs), backdoor learning has attracted increasing interest in recent years, and many seminal backdoor attack and defense algorithms are being developed successively or concurrently, in the status of a rapid arms race. However, mainly due to the diverse settings, and the difficulties of implementation and reproducibili…

    Submitted 26 January, 2024; originally announced January 2024.

  50. arXiv:2401.14148  [pdf, other]

    cs.CV

    LanDA: Language-Guided Multi-Source Domain Adaptation

    Authors: Zhenbin Wang, Lei Zhang, Lituan Wang, Minjuan Zhu

    Abstract: Multi-Source Domain Adaptation (MSDA) aims to mitigate changes in data distribution when transferring knowledge from multiple labeled source domains to an unlabeled target domain. However, existing MSDA techniques assume target domain images are available, yet overlook image-rich semantic information. Consequently, an open question is whether MSDA can be guided solely by textual cues in the absenc…

    Submitted 25 January, 2024; originally announced January 2024.

    Comments: 20 pages, 8 figures