Showing 51–100 of 487 results for author: Wen, Z

  1. arXiv:2402.01469  [pdf, other]

    cs.CL

    AMOR: A Recipe for Building Adaptable Modular Knowledge Agents Through Process Feedback

    Authors: Jian Guan, Wei Wu, Zujie Wen, Peng Xu, Hongning Wang, Minlie Huang

    Abstract: The notable success of large language models (LLMs) has sparked an upsurge in building language agents to complete various complex tasks. We present AMOR, an agent framework based on open-source LLMs, which reasons with external knowledge bases and adapts to specific domains through human supervision to the reasoning process. AMOR builds reasoning logic over a finite state machine (FSM) that solve…

    Submitted 2 February, 2024; originally announced February 2024.

    Comments: Work in progress

  2. arXiv:2402.01440  [pdf, other]

    cs.LG cs.AI cs.SI

    Few-Shot Learning on Graphs: from Meta-learning to Pre-training and Prompting

    Authors: Xingtong Yu, Yuan Fang, Zemin Liu, Yuxia Wu, Zhihao Wen, Jianyuan Bo, Xinming Zhang, Steven C. H. Hoi

    Abstract: Graph representation learning, a critical step in graph-centric tasks, has seen significant advancements. Earlier techniques often operate in an end-to-end setting, where performance heavily relies on the availability of ample labeled data. This constraint has spurred the emergence of few-shot learning on graphs, where only a few task-specific labels are available for each task. Given the extensiv…

    Submitted 2 March, 2024; v1 submitted 2 February, 2024; originally announced February 2024.

  3. arXiv:2401.16702  [pdf, other]

    cs.CV

    Multi-granularity Correspondence Learning from Long-term Noisy Videos

    Authors: Yijie Lin, Jie Zhang, Zhenyu Huang, Jia Liu, Zujie Wen, Xi Peng

    Abstract: Existing video-language studies mainly focus on learning short video clips, leaving long-term temporal dependencies rarely explored due to over-high computational cost of modeling long videos. To address this issue, one feasible solution is learning the correspondence between video clips and captions, which however inevitably encounters the multi-granularity noisy correspondence (MNC) problem. To…

    Submitted 29 January, 2024; originally announced January 2024.

    Comments: Accepted by ICLR 2024 (oral)

  4. arXiv:2401.13503  [pdf, other]

    cs.CV

    Learning Representations for Clustering via Partial Information Discrimination and Cross-Level Interaction

    Authors: Hai-Xin Zhang, Dong Huang, Hua-Bao Ling, Guang-Yu Zhang, Wei-jun Sun, Zi-hao Wen

    Abstract: In this paper, we present a novel deep image clustering approach termed PICI, which enforces the partial information discrimination and the cross-level interaction in a joint learning framework. In particular, we leverage a Transformer encoder as the backbone, through which the masked image modeling with two paralleled augmented views is formulated. After deriving the class tokens from the masked…

    Submitted 24 January, 2024; originally announced January 2024.

  5. arXiv:2401.10296  [pdf, other]

    astro-ph.HE astro-ph.SR

    The Study of Mode Switching behavior of PSR J0614+2229 Using the Parkes Ultra-wideband Receiver Observations

    Authors: Yanqing Cai, Shijun Dang, Rai Yuen, Lunhua Shang, Feifei Kou, Jianping Yuan, Lei Zhang, Zurong Zhou, Na Wang, Qingying Li, Zhigang Wen, Wenming Yan, Shuangqiang Wang, Shengnan Sun, Habtamu Menberu Tedila, Shuo Xiao, Xin Xu, Rushuang Zhao, Qijun Zhi, Aijun Dong, Bing Zhang, Wei Li, Yingying Ren, Yujia Liu

    Abstract: In this paper, we presented a detailed single pulse and polarization study of PSR J0614+2229 based on the archived data observed on 2019 August 15 (MJD 58710) and September 12 (MJD 58738) using the Ultra-wideband Low-frequency Receiver on the Parkes radio telescope. The single-pulse sequences show that this pulsar switches between two emission states, in which the emission of state A occurs earlie…

    Submitted 17 January, 2024; originally announced January 2024.

  6. arXiv:2401.09085  [pdf]

    physics.optics

    3D orientation super-resolution spatial-frequency-shift microscopy

    Authors: Xiaowei Liu, Mingwei Tang, Ning Zhou, Chenlei Pang, Zhong Wen, Xu Liu, Qing Yang

    Abstract: Super-resolution mapping of the 3D orientation of fluorophores reveals the alignment of biological structures where the fluorophores are tightly attached, and thus plays a vital role in studying the organization and dynamics of bio-complexes. However, current super-resolution imaging techniques are either limited to 2D orientation mapping or suffer from slow speed and the requirement of special la…

    Submitted 22 January, 2024; v1 submitted 17 January, 2024; originally announced January 2024.

    Comments: 22 pages, 5 figures

  7. arXiv:2401.05778  [pdf, other]

    cs.CL cs.AI

    Risk Taxonomy, Mitigation, and Assessment Benchmarks of Large Language Model Systems

    Authors: Tianyu Cui, Yanling Wang, Chuanpu Fu, Yong Xiao, Sijia Li, Xinhao Deng, Yunpeng Liu, Qinglin Zhang, Ziyi Qiu, Peiyang Li, Zhixing Tan, Junwu Xiong, Xinyu Kong, Zujie Wen, Ke Xu, Qi Li

    Abstract: Large language models (LLMs) have strong capabilities in solving diverse natural language processing tasks. However, the safety and security issues of LLM systems have become the major obstacle to their widespread application. Many studies have extensively investigated risks in LLM systems and developed the corresponding mitigation strategies. Leading-edge enterprises such as OpenAI, Google, Meta,…

    Submitted 11 January, 2024; originally announced January 2024.

  8. arXiv:2401.05596  [pdf]

    cs.CL cs.AI

    POMP: Probability-driven Meta-graph Prompter for LLMs in Low-resource Unsupervised Neural Machine Translation

    Authors: Shilong Pan, Zhiliang Tian, Liang Ding, Zhen Huang, Zhihua Wen, Dongsheng Li

    Abstract: Low-resource languages (LRLs) face challenges in supervised neural machine translation due to limited parallel data, prompting research into unsupervised methods. Unsupervised neural machine translation (UNMT) methods, including back-translation, transfer learning, and pivot-based translation, offer practical solutions for LRL translation, but they are hindered by issues like synthetic data noise,…

    Submitted 16 January, 2024; v1 submitted 10 January, 2024; originally announced January 2024.

  9. arXiv:2401.02682  [pdf, other]

    cs.LG cs.SI

    Homophily-Related: Adaptive Hybrid Graph Filter for Multi-View Graph Clustering

    Authors: Zichen Wen, Yawen Ling, Yazhou Ren, Tianyi Wu, Jianpeng Chen, Xiaorong Pu, Zhifeng Hao, Lifang He

    Abstract: Recently there is a growing focus on graph data, and multi-view graph clustering has become a popular area of research interest. Most of the existing methods are only applicable to homophilous graphs, yet the extensive real-world graph data can hardly fulfill the homophily assumption, where the connected nodes tend to belong to the same class. Several studies have pointed out that the poor perform…

    Submitted 5 January, 2024; originally announced January 2024.

    Comments: Accepted by AAAI2024

  10. arXiv:2312.16998  [pdf, other]

    eess.IV cs.CV

    Deep Unfolding Network with Spatial Alignment for multi-modal MRI reconstruction

    Authors: Hao Zhang, Qi Wang, Jun Shi, Shihui Ying, Zhijie Wen

    Abstract: Multi-modal Magnetic Resonance Imaging (MRI) offers complementary diagnostic information, but some modalities are limited by the long scanning time. To accelerate the whole acquisition process, MRI reconstruction of one modality from highly undersampled k-space data with another fully-sampled reference modality is an efficient solution. However, the misalignment between modalities, which is common…

    Submitted 28 December, 2023; originally announced December 2023.

  11. arXiv:2312.12693  [pdf, other]

    math.NA

    Anderson Accelerated Gauss-Newton-guided deep learning for nonlinear inverse problems with Application to Electrical Impedance Tomography

    Authors: Qingping Zhou, Guixian Xu, Zhexin Wen, Hongqiao Wang

    Abstract: Physics-guided deep learning is an important prevalent research topic in scientific machine learning, which has tremendous potential in various complex applications including science and engineering. In these applications, data is expensive to acquire and high accuracy is required for making decisions. In this work, we introduce an efficient physics-guided deep learning framework for the variation…

    Submitted 19 December, 2023; originally announced December 2023.

    MSC Class: 78A46; 68U10; 68T07

  12. arXiv:2312.07889  [pdf, other]

    math.OC cs.CE cs.CG cs.GR

    Adaptive Isogeometric Topology Optimization of Shell Structures based on PHT-splines

    Authors: Zepeng Wen, Qiong Pan, Xiaoya Zhai, Hongmei Kang, Falai Chen

    Abstract: This paper proposes an Adaptive Isogeometric Topology Optimization framework for shell structures based on PHT-splines (PHT-AITO). In this framework, the design domain, displacement, and density are represented by PHT-splines. Leveraging the local refinement capability of PHT-splines, mesh elements defining the density function are adaptively refined to achieve a suitable resolution at the interfa…

    Submitted 12 December, 2023; originally announced December 2023.

  13. arXiv:2312.06993  [pdf]

    cs.LG

    Dynamically configured physics-informed neural network in topology optimization applications

    Authors: Jichao Yin, Ziming Wen, Shuhao Li, Yaya Zhanga, Hu Wang

    Abstract: Integration of machine learning (ML) into the topology optimization (TO) framework is attracting increasing attention, but data acquisition in data-driven models is prohibitive. Compared with popular ML methods, the physics-informed neural network (PINN) can avoid generating enormous amounts of data when solving forward problems and additionally provide better inference. To this end, a dynamically…

    Submitted 12 December, 2023; originally announced December 2023.

    Comments: 31 pages, 22 figures

  14. arXiv:2312.06644  [pdf, other]

    cs.CV cs.AI cs.GR

    AnyHome: Open-Vocabulary Generation of Structured and Textured 3D Homes

    Authors: Rao Fu, Zehao Wen, Zichen Liu, Srinath Sridhar

    Abstract: Inspired by cognitive theories, we introduce AnyHome, a framework that translates any text into well-structured and textured indoor scenes at a house-scale. By prompting Large Language Models (LLMs) with designed templates, our approach converts provided textual narratives into amodal structured representations. These representations guarantee consistent and realistic spatial layouts by directing…

    Submitted 20 March, 2024; v1 submitted 11 December, 2023; originally announced December 2023.

  15. arXiv:2312.04293  [pdf, other]

    cs.CV cs.MM

    GPT-4V with Emotion: A Zero-shot Benchmark for Generalized Emotion Recognition

    Authors: Zheng Lian, Licai Sun, Haiyang Sun, Kang Chen, Zhuofan Wen, Hao Gu, Bin Liu, Jianhua Tao

    Abstract: Recently, GPT-4 with Vision (GPT-4V) has demonstrated remarkable visual capabilities across various tasks, but its performance in emotion recognition has not been fully evaluated. To bridge this gap, we present the quantitative evaluation results of GPT-4V on 21 benchmark datasets covering 6 tasks: visual sentiment analysis, tweet sentiment analysis, micro-expression recognition, facial emotion re…

    Submitted 17 March, 2024; v1 submitted 7 December, 2023; originally announced December 2023.

  16. arXiv:2312.01801  [pdf, other]

    cs.HC cs.SE

    SPROUT: Authoring Programming Tutorials with Interactive Visualization of Large Language Model Generation Process

    Authors: Yihan Liu, Zhen Wen, Luoxuan Weng, Ollie Woodman, Yi Yang, Wei Chen

    Abstract: The rapid development of large language models (LLMs), such as ChatGPT, has revolutionized the efficiency of creating programming tutorials. LLMs can be instructed with text prompts to generate comprehensive text descriptions of code snippets. However, the lack of transparency in the end-to-end generation process has hindered the understanding of model behavior and limited user control over the ge…

    Submitted 4 December, 2023; originally announced December 2023.

  17. arXiv:2312.01273  [pdf, other]

    math.OC

    An Augmented Lagrangian Primal-Dual Semismooth Newton Method for Multi-Block Composite Optimization

    Authors: Zhanwang Deng, Kangkang Deng, Jiang Hu, Zaiwen Wen

    Abstract: In this paper, we develop a novel primal-dual semismooth Newton method for solving linearly constrained multi-block convex composite optimization problems. First, a differentiable augmented Lagrangian (AL) function is constructed by utilizing the Moreau envelopes of the nonsmooth functions. It enables us to derive an equivalent saddle point problem and establish the strong AL duality under the Sla…

    Submitted 15 May, 2024; v1 submitted 2 December, 2023; originally announced December 2023.

    Comments: 27 pages

  18. arXiv:2312.01057  [pdf, other]

    cs.LG cs.AI cs.CL

    RLHF and IIA: Perverse Incentives

    Authors: Wanqiao Xu, Shi Dong, Xiuyuan Lu, Grace Lam, Zheng Wen, Benjamin Van Roy

    Abstract: Existing algorithms for reinforcement learning from human feedback (RLHF) can incentivize responses at odds with preferences because they are based on models that assume independence of irrelevant alternatives (IIA). The perverse incentives induced by IIA hinder innovations on query formats and learning algorithms.

    Submitted 1 February, 2024; v1 submitted 2 December, 2023; originally announced December 2023.

  19. arXiv:2310.18894  [pdf, other]

    cs.CV

    Emergence of Shape Bias in Convolutional Neural Networks through Activation Sparsity

    Authors: Tianqin Li, Ziqi Wen, Yangfan Li, Tai Sing Lee

    Abstract: Current deep-learning models for object recognition are known to be heavily biased toward texture. In contrast, human visual systems are known to be biased toward shape and structure. What could be the design principles in human visual systems that led to this difference? How could we introduce more shape bias into the deep learning models? In this paper, we report that sparse coding, a ubiquitous…

    Submitted 29 October, 2023; originally announced October 2023.

    Comments: Published as NeurIPS 2023 (Oral)

  20. arXiv:2310.11531  [pdf, ps, other]

    cs.LG cs.AI eess.SY stat.ML

    Efficient Online Learning with Offline Datasets for Infinite Horizon MDPs: A Bayesian Approach

    Authors: Dengwang Tang, Rahul Jain, Botao Hao, Zheng Wen

    Abstract: In this paper, we study the problem of efficient online reinforcement learning in the infinite horizon setting when there is an offline dataset to start with. We assume that the offline dataset is generated by an expert but with unknown level of competence, i.e., it is not perfect and not necessarily using the optimal policy. We show that if the learning agent models the behavioral policy (paramet…

    Submitted 1 February, 2024; v1 submitted 17 October, 2023; originally announced October 2023.

    Comments: 22 pages

    MSC Class: 93E35

  21. Dual-Branch Knowledge Distillation for Noise-Robust Synthetic Speech Detection

    Authors: Cunhang Fan, Mingming Ding, Jianhua Tao, Ruibo Fu, Jiangyan Yi, Zhengqi Wen, Zhao Lv

    Abstract: Most research in synthetic speech detection (SSD) focuses on improving performance on standard noise-free datasets. However, in actual situations, noise interference is usually present, causing significant performance degradation in SSD systems. To improve noise robustness, this paper proposes a dual-branch knowledge distillation synthetic speech detection (DKDSSD) method. Specifically, a parallel…

    Submitted 16 April, 2024; v1 submitted 13 October, 2023; originally announced October 2023.

  22. arXiv:2310.07555  [pdf, other]

    cs.CV

    Does resistance to style-transfer equal Global Shape Bias? Measuring network sensitivity to global shape configuration

    Authors: Ziqi Wen, Tianqin Li, Zhi Jing, Tai Sing Lee

    Abstract: Deep learning models are known to exhibit a strong texture bias, while human tends to rely heavily on global shape structure for object recognition. The current benchmark for evaluating a model's global shape bias is a set of style-transferred images with the assumption that resistance to the attack of style transfer is related to the development of global structure sensitivity in the model. In th…

    Submitted 29 February, 2024; v1 submitted 11 October, 2023; originally announced October 2023.

  23. arXiv:2310.06713  [pdf, other]

    cs.LG stat.AP

    Interpretable Traffic Event Analysis with Bayesian Networks

    Authors: Tong Yuan, Jian Yang, Zeyi Wen

    Abstract: Although existing machine learning-based methods for traffic accident analysis can provide good quality results to downstream tasks, they lack interpretability which is crucial for this critical problem. This paper proposes an interpretable framework based on Bayesian Networks for traffic accident prediction. To enable the ease of interpretability, we design a dataset construction pipeline to feed…

    Submitted 10 October, 2023; originally announced October 2023.

    Comments: 11 pages, 7 figures

    MSC Class: 62F15 ACM Class: G.3

  24. arXiv:2310.05388  [pdf, other]

    cs.CL

    GROVE: A Retrieval-augmented Complex Story Generation Framework with A Forest of Evidence

    Authors: Zhihua Wen, Zhiliang Tian, Wei Wu, Yuxin Yang, Yanqi Shi, Zhen Huang, Dongsheng Li

    Abstract: Conditional story generation is significant in human-machine interaction, particularly in producing stories with complex plots. While Large language models (LLMs) perform well on multiple NLP tasks, including story generation, it is challenging to generate stories with both complex and creative plots. Existing methods often rely on detailed prompts to guide LLMs to meet target conditions, which in…

    Submitted 23 October, 2023; v1 submitted 8 October, 2023; originally announced October 2023.

    Comments: Findings of EMNLP 2023

  25. arXiv:2310.01419  [pdf, other]

    cs.IR cs.LG

    Design Principles of Robust Multi-Armed Bandit Framework in Video Recommendations

    Authors: Belhassen Bayar, Phanideep Gampa, Ainur Yessenalina, Zhen Wen

    Abstract: Current multi-armed bandit approaches in recommender systems (RS) have focused more on devising effective exploration techniques, while not adequately addressing common exploitation challenges related to distributional changes and item cannibalization. Little work exists to guide the design of robust bandit frameworks that can address these frequent challenges in RS. In this paper, we propose a ne…

    Submitted 24 September, 2023; originally announced October 2023.

    Comments: RecSys CARS 2023 Workshop paper

  26. arXiv:2310.00212  [pdf, other]

    cs.LG cs.AI cs.CL

    Pairwise Proximal Policy Optimization: Harnessing Relative Feedback for LLM Alignment

    Authors: Tianhao Wu, Banghua Zhu, Ruoyu Zhang, Zhaojin Wen, Kannan Ramchandran, Jiantao Jiao

    Abstract: Large Language Models (LLMs) can acquire extensive world knowledge through pre-training on large corpora. However, due to exposure to low-quality data, LLMs may exhibit harmful behavior without aligning with human values. The dominant approach for steering LLMs towards beneficial behavior involves Reinforcement Learning with Human Feedback (RLHF), with Proximal Policy Optimization (PPO) serving as…

    Submitted 9 October, 2023; v1 submitted 29 September, 2023; originally announced October 2023.

    Comments: 19 pages, 5 figures

  27. arXiv:2309.17409  [pdf, ps, other]

    math.OC

    Sharper Convergence Guarantees for Federated Learning with Partial Model Personalization

    Authors: Yiming Chen, Liyuan Cao, Kun Yuan, Zaiwen Wen

    Abstract: Partial model personalization, which encompasses both shared and personal variables in its formulation, is a critical optimization problem in federated learning. It balances individual client needs with collective knowledge utilization, and serves as a general formulation covering various key scenarios, ranging from fully shared to fully personalized federated learning. This paper introduces two e…

    Submitted 29 September, 2023; originally announced September 2023.

  28. arXiv:2308.13295  [pdf]

    math.NA

    Resolution-independent generative models based on operator learning for physics-constrained Bayesian inverse problems

    Authors: Xinchao Jiang, Xin Wang, Ziming Wen, Hu Wang

    Abstract: The Bayesian inference approach is widely used to tackle inverse problems due to its versatile and natural ability to handle ill-posedness. However, it often faces challenges when dealing with situations involving continuous fields or large-resolution discrete representations (high-dimensional). Moreover, the prior distribution of unknown parameters is commonly difficult to be determined. In this…

    Submitted 25 August, 2023; originally announced August 2023.

  29. Voucher Abuse Detection with Prompt-based Fine-tuning on Graph Neural Networks

    Authors: Zhihao Wen, Yuan Fang, Yihan Liu, Yang Guo, Shuji Hao

    Abstract: Voucher abuse detection is an important anomaly detection problem in E-commerce. While many GNN-based solutions have emerged, the supervised paradigm depends on a large quantity of labeled data. A popular alternative is to adopt self-supervised pre-training using label-free data, and further fine-tune on a downstream task with limited labels. Nevertheless, the "pre-train, fine-tune" paradigm is of…

    Submitted 30 August, 2023; v1 submitted 19 August, 2023; originally announced August 2023.

    Comments: 7 pages, Accepted by CIKM23 Applied Research Track

  30. arXiv:2308.06470  [pdf, ps, other]

    math.OC

    On the Optimal Lower and Upper Complexity Bounds for a Class of Composite Optimization Problems

    Authors: Zhenyuan Zhu, Fan Chen, Junyu Zhang, Zaiwen Wen

    Abstract: We study the optimal lower and upper complexity bounds for finding approximate solutions to the composite problem $\min_x\ f(x)+h(Ax-b)$, where $f$ is smooth and $h$ is convex. Given access to the proximal operator of $h$, for strongly convex, convex, and nonconvex $f$, we design efficient first order algorithms with complexities $\tilde{O}\left(κ_A\sqrt{κ_f}\log\left(1/ε\right)\right)$,…

    Submitted 12 August, 2023; originally announced August 2023.

    MSC Class: 90C25; 90C26; 90C46; 90C60

  31. arXiv:2308.04149  [pdf]

    cond-mat.mtrl-sci

    Fully epitaxial fcc(111) magnetic tunnel junctions with a Co90Fe10/MgAlO/Co90Fe10 structure

    Authors: Jieyuan Song, Thomas Scheike, Cong He, Zhenchao Wen, Tadakatsu Ohkubo, Kazuhiro Hono, Hiroaki Sukegawa, Seiji Mitani

    Abstract: Magnetic tunnel junctions (MTJs) with bcc(001)-type structures such as Fe(001)/MgO(001)/Fe(001), have been widely used as the core of various spintronic devices such as magnetoresistive memories; however, the limited material selection of (001)-type MTJs hinders the further development of spintronic devices. Here, as an alternative to the (001)-type MTJs, an fcc(111)-type MTJ using a fully epitaxi…

    Submitted 8 August, 2023; originally announced August 2023.

    Comments: 18 pages, 5 figures

  32. arXiv:2307.14024  [pdf, other]

    cs.IR

    Multi-view Hypergraph Contrastive Policy Learning for Conversational Recommendation

    Authors: Sen Zhao, Wei Wei, Xian-Ling Mao, Shuai Zhu, Minghui Yang, Zujie Wen, Dangyang Chen, Feida Zhu

    Abstract: Conversational recommendation systems (CRS) aim to interactively acquire user preferences and accordingly recommend items to users. Accurately learning the dynamic user preferences is of crucial importance for CRS. Previous works learn the user preferences with pairwise relations from the interactive conversation and item knowledge, while largely ignoring the fact that factors for a relationship i…

    Submitted 26 July, 2023; originally announced July 2023.

  33. arXiv:2307.10230  [pdf, other]

    cs.IR

    Prompt Tuning on Graph-augmented Low-resource Text Classification

    Authors: Zhihao Wen, Yuan Fang

    Abstract: Text classification is a fundamental problem in information retrieval with many real-world applications, such as predicting the topics of online articles and the categories of e-commerce product descriptions. However, low-resource text classification, with no or few labeled samples, presents a serious concern for supervised learning. Meanwhile, many text data are inherently grounded on a network s…

    Submitted 27 November, 2023; v1 submitted 15 July, 2023; originally announced July 2023.

    Comments: 14 pages, journal under review. arXiv admin note: substantial text overlap with arXiv:2305.03324

  34. Quantivine: A Visualization Approach for Large-scale Quantum Circuit Representation and Analysis

    Authors: Zhen Wen, Yihan Liu, Siwei Tan, Jieyi Chen, Minfeng Zhu, Dongming Han, Jianwei Yin, Mingliang Xu, Wei Chen

    Abstract: Quantum computing is a rapidly evolving field that enables exponential speed-up over classical algorithms. At the heart of this revolutionary technology are quantum circuits, which serve as vital tools for implementing, analyzing, and optimizing quantum algorithms. Recent advancements in quantum computing and the increasing capability of quantum devices have led to the development of more complex…

    Submitted 18 July, 2023; originally announced July 2023.

    Comments: Accepted by IEEE VIS 2023

    Journal ref: IEEE Transactions on Visualization and Computer Graphics, 2023

  35. arXiv:2307.08929  [pdf, other]

    cond-mat.mtrl-sci cs.LG physics.app-ph physics.comp-ph

    Active learning of effective Hamiltonian for super-large-scale atomic structures

    Authors: Xingyue Ma, Hongying Chen, Ri He, Zhanbo Yu, Sergei Prokhorenko, Zheng Wen, Zhicheng Zhong, Jorge Iñiguez, L. Bellaiche, Di Wu, Yurong Yang

    Abstract: The first-principles-based effective Hamiltonian scheme provides one of the most accurate modeling technique for large-scale structures, especially for ferroelectrics. However, the parameterization of the effective Hamiltonian is complicated and can be difficult for some complex systems such as high-entropy perovskites. Here, we propose a general form of effective Hamiltonian and develop an active…

    Submitted 14 May, 2024; v1 submitted 17 July, 2023; originally announced July 2023.

    Comments: 11 pages, 4 figures

  36. arXiv:2307.08699  [pdf, other]

    cs.CV cs.AI

    Pair then Relation: Pair-Net for Panoptic Scene Graph Generation

    Authors: Jinghao Wang, Zhengyu Wen, Xiangtai Li, Zujin Guo, Jingkang Yang, Ziwei Liu

    Abstract: Panoptic Scene Graph (PSG) is a challenging task in Scene Graph Generation (SGG) that aims to create a more comprehensive scene graph representation using panoptic segmentation instead of boxes. Compared to SGG, PSG has several challenging problems: pixel-level segment outputs and full relationship exploration (It also considers thing and stuff relation). Thus, current PSG methods have limited per…

    Submitted 1 August, 2023; v1 submitted 17 July, 2023; originally announced July 2023.

    Comments: Project Page: https://github.com/king159/Pair-Net

  37. arXiv:2307.05074  [pdf, other]

    cs.IR cs.AI cs.DB

    Retrieval-augmented GPT-3.5-based Text-to-SQL Framework with Sample-aware Prompting and Dynamic Revision Chain

    Authors: Chunxi Guo, Zhiliang Tian, Jintao Tang, Shasha Li, Zhihua Wen, Kaixuan Wang, Ting Wang

    Abstract: Text-to-SQL aims at generating SQL queries for the given natural language questions and thus helping users to query databases. Prompt learning with large language models (LLMs) has emerged as a recent approach, which designs prompts to lead LLMs to understand the input question and generate the corresponding SQL. However, it faces challenges with strict SQL syntax requirements. Existing work promp…

    Submitted 4 September, 2023; v1 submitted 11 July, 2023; originally announced July 2023.

  38. arXiv:2307.02046  [pdf, other]

    cs.IR cs.AI cs.CL

    Recommender Systems in the Era of Large Language Models (LLMs)

    Authors: Zihuai Zhao, Wenqi Fan, Jiatong Li, Yunqing Liu, Xiaowei Mei, Yiqi Wang, Zhen Wen, Fei Wang, Xiangyu Zhao, Jiliang Tang, Qing Li

    Abstract: With the prosperity of e-commerce and web applications, Recommender Systems (RecSys) have become an important component of our daily life, providing personalized suggestions that cater to user preferences. While Deep Neural Networks (DNNs) have made significant advancements in enhancing recommender systems by modeling user-item interactions and incorporating textual side information, DNN-based met…

    Submitted 29 April, 2024; v1 submitted 5 July, 2023; originally announced July 2023.

    Comments: Accepted by IEEE TKDE

  39. arXiv:2307.00783  [pdf, other]

    math.OC cs.AI cs.LG

    Monte Carlo Policy Gradient Method for Binary Optimization

    Authors: Cheng Chen, Ruitao Chen, Tianyou Li, Ruichen Ao, Zaiwen Wen

    Abstract: Binary optimization has a wide range of applications in combinatorial optimization problems such as MaxCut, MIMO detection, and MaxSAT. However, these problems are typically NP-hard due to the binary constraints. We develop a novel probabilistic model to sample the binary solution according to a parameterized policy distribution. Specifically, minimizing the KL divergence between the parameterized…

    Submitted 3 July, 2023; originally announced July 2023.

    MSC Class: 90C09; 90C27; 90C59; 60J45; 60J20

  40. Reciprocating Magnetic Fields in the Pulsar Wind Observed from the Black Widow Pulsar J1720-0534

    Authors: Chen-Chen Miao, Victoria Blackmon, Wei-Wei Zhu, Dong-Zi Li, Mingyu Ge, Xiao-Peng You, Maura McLaughlin, Di Li, Na Wang, Pei Wang, Jia-Rui Niu, M. Cruces, Jian-Ping Yuan, Jun-Tao Bai, D. J. Champion, Yu-Tong Chen, Ming-Min Chi, P. C. C. Freire, Yi Feng, Zhen-Ye Gan, M. Kramer, Fei-Fei Kou, Yu-Xi Li, Xue-Li Miao, Ling-Qi Meng, et al. (19 additional authors not shown)

    Abstract: We report the radio observations of the eclipsing black widow pulsar J1720-0534, a 3.26 ms pulsar in orbit with a low mass companion of mass 0.029 to 0.034 M$_{\odot}$. We obtain the phase-connected timing ephemeris and polarization profile of this millisecond pulsar (MSP) using the Five-hundred-meter Aperture Spherical Radio Telescope (FAST), the Green Bank Telescope (GBT), and the Parkes Telesco…

    Submitted 28 August, 2023; v1 submitted 2 July, 2023; originally announced July 2023.

    Comments: 15 pages, 8 figures, 1 table, accepted by RAA

  41. arXiv:2307.00358  [pdf, ps, other]

    math.OC

    The Error in Multivariate Linear Extrapolation with Applications to Derivative-Free Optimization

    Authors: Liyuan Cao, Zaiwen Wen, Ya-xiang Yuan

    Abstract: We study in this paper the function approximation error of multivariate linear extrapolation. The sharp error bound of linear interpolation already exists in the literature. However, linear extrapolation is used far more often in applications such as derivative-free optimization, while its error is not well-studied. We introduce in this paper a method to numerically compute the sharp bound on the…

    Submitted 1 July, 2023; originally announced July 2023.

    Comments: arXiv admin note: text overlap with arXiv:2209.12606

  42. arXiv:2306.15401  [pdf, other]

    cs.MM cs.HC

    Explainable Multimodal Emotion Recognition

    Authors: Zheng Lian, Haiyang Sun, Licai Sun, Hao Gu, Zhuofan Wen, Siyuan Zhang, Shun Chen, Mingyu Xu, Ke Xu, Kang Chen, Lan Chen, Shan Liang, Ya Li, Jiangyan Yi, Bin Liu, Jianhua Tao

    Abstract: Multimodal emotion recognition is an important research topic in artificial intelligence, whose main goal is to integrate multimodal clues to identify human emotional states. Current works generally assume accurate labels for benchmark datasets and focus on developing more effective architectures. However, emotion annotation relies on subjective judgment. To obtain more reliable labels, existing d…

    Submitted 23 May, 2024; v1 submitted 27 June, 2023; originally announced June 2023.

  43. arXiv:2306.14112  [pdf, other]

    cs.IR

    Enhancing Dynamic Image Advertising with Vision-Language Pre-training

    Authors: Zhoufutu Wen, Xinyu Zhao, Zhipeng Jin, Yi Yang, Wei Jia, Xiaodong Chen, Shuanglong Li, Lin Liu

    Abstract: In the multimedia era, image is an effective medium in search advertising. Dynamic Image Advertising (DIA), a system that matches queries with ad images and generates multimodal ads, is introduced to improve user experience and ad revenue. The core of DIA is a query-image matching module performing ad image retrieval and relevance modeling. Current query-image matching suffers from limited and inc…

    Submitted 24 June, 2023; originally announced June 2023.

    Comments: 6 pages, 3 figures, accepted to SIRIP 2023

  44. arXiv:2306.10508  [pdf, other]

    cs.CV cs.RO

    QCNeXt: A Next-Generation Framework For Joint Multi-Agent Trajectory Prediction

    Authors: Zikang Zhou, Zihao Wen, Jianping Wang, Yung-Hui Li, Yu-Kai Huang

    Abstract: Estimating the joint distribution of on-road agents' future trajectories is essential for autonomous driving. In this technical report, we propose a next-generation framework for joint multi-agent trajectory prediction called QCNeXt. First, we adopt the query-centric encoding paradigm for the task of joint multi-agent trajectory prediction. Powered by this encoding scheme, our scene encoder is equ…

    Submitted 18 June, 2023; originally announced June 2023.

    Comments: Technical report for the 1st place solution of the Argoverse 2 Multi-Agent Motion Forecasting Competition at the CVPR 2023 Workshop on Autonomous Driving

  45. Controllable Multi-Objective Re-ranking with Policy Hypernetworks

    Authors: Sirui Chen, Yuan Wang, Zijing Wen, Zhiyu Li, Changshuo Zhang, Xiao Zhang, Quan Lin, Cheng Zhu, Jun Xu

    Abstract: Multi-stage ranking pipelines have become widely used strategies in modern recommender systems, where the final stage aims to return a ranked list of items that balances a number of requirements such as user preference, diversity, novelty etc. Linear scalarization is arguably the most widely used technique to merge multiple requirements into one optimization objective, by summing up the requiremen…

    Submitted 17 July, 2023; v1 submitted 8 June, 2023; originally announced June 2023.

  46. Knowing-how & Knowing-that: A New Task for Machine Comprehension of User Manuals

    Authors: Hongru Liang, Jia Liu, Weihong Du, Dingnan Jin, Wenqiang Lei, Zujie Wen, Jiancheng Lv

    Abstract: The machine reading comprehension (MRC) of user manuals has huge potential in customer service. However, current methods have trouble answering complex questions. Therefore, we introduce the Knowing-how & Knowing-that task that requires the model to answer factoid-style, procedure-style, and inconsistent questions about user manuals. We resolve this task by jointly representing the steps and facts…

    Submitted 8 August, 2023; v1 submitted 7 June, 2023; originally announced June 2023.

    Journal ref: Findings of the Association for Computational Linguistics: ACL 2023. (2023)

  47. arXiv:2306.04099  [pdf, other]

    cs.LG

    NTKCPL: Active Learning on Top of Self-Supervised Model by Estimating True Coverage

    Authors: Ziting Wen, Oscar Pizarro, Stefan Williams

    Abstract: High annotation cost for training machine learning classifiers has driven extensive research in active learning and self-supervised learning. Recent research has shown that in the context of supervised learning different active learning strategies need to be applied at various stages of the training process to ensure improved performance over the random baseline. We refer to the point where the nu…

    Submitted 6 June, 2023; originally announced June 2023.

  48. arXiv:2305.20068  [pdf, other]

    cs.RO cs.LG

    TOFG: A Unified and Fine-Grained Environment Representation in Autonomous Driving

    Authors: Zihao Wen, Yifan Zhang, Xinhong Chen, Jianping Wang

    Abstract: In autonomous driving, an accurate understanding of environment, e.g., the vehicle-to-vehicle and vehicle-to-lane interactions, plays a critical role in many driving tasks such as trajectory prediction and motion planning. Environment information comes from high-definition (HD) map and historical trajectories of vehicles. Due to the heterogeneity of the map data and trajectory data, many data-driv…

    Submitted 31 May, 2023; originally announced May 2023.

    Comments: Accepted by ICRA 2023

  49. arXiv:2305.13774  [pdf, other]

    cs.SD eess.AS

    ADD 2023: the Second Audio Deepfake Detection Challenge

    Authors: Jiangyan Yi, Jianhua Tao, Ruibo Fu, Xinrui Yan, Chenglong Wang, Tao Wang, Chu Yuan Zhang, Xiaohui Zhang, Yan Zhao, Yong Ren, Le Xu, Junzuo Zhou, Hao Gu, Zhengqi Wen, Shan Liang, Zheng Lian, Shuai Nie, Haizhou Li

    Abstract: Audio deepfake detection is an emerging topic in the artificial intelligence community. The second Audio Deepfake Detection Challenge (ADD 2023) aims to spur researchers around the world to build new innovative technologies that can further accelerate and foster research on detecting and analyzing deepfake speech utterances. Different from previous challenges (e.g. ADD 2022), ADD 2023 focuses on s…

    Submitted 23 May, 2023; originally announced May 2023.

  50. arXiv:2305.10011  [pdf]

    physics.optics

    Super-Resolution Imaging via Angular Magnification

    Authors: Yi Zhou, Dingpeng Liao, Kun Zhang, Zijie Ma, Shikai Wu, Jun Ma, Xuemei Dai, Zhengguo Shang, Zhongquan Wen, Gang Chen

    Abstract: The far-field resolution of optical imaging systems is restricted by the Abbe diffraction limit, a direct result of the wave nature of light. One successful technological approach to circumventing this limit is to reduce the effective size of a point-spread-function. In the past decades, great endeavors have been made to engineer an effective point-spread-function by exploiting different mechanism…

    Submitted 17 May, 2023; originally announced May 2023.