Skip to main content

Showing 1–50 of 268 results for author: Deng, X

Searching in archive cs. Search in all archives.
.
  1. arXiv:2407.00918  [pdf, other

    cs.CR cs.AI cs.LG

    Robust and Reliable Early-Stage Website Fingerprinting Attacks via Spatial-Temporal Distribution Analysis

    Authors: Xinhao Deng, Qi Li, Ke Xu

    Abstract: Website Fingerprinting (WF) attacks identify the websites visited by users by performing traffic analysis, compromising user privacy. Particularly, DL-based WF attacks demonstrate impressive attack performance. However, the effectiveness of DL-based WF attacks relies on the collected complete and pure traffic during the page loading, which impacts the practicality of these attacks. The WF performa… ▽ More

    Submitted 30 June, 2024; originally announced July 2024.

    Comments: To appear in the Proceedings of The ACM Conference on Computer and Communications Security (CCS), 2024

  2. arXiv:2406.18540  [pdf, other

    cs.CV cs.CR

    Fully Exploiting Every Real Sample: SuperPixel Sample Gradient Model Stealing

    Authors: Yunlong Zhao, Xiaoheng Deng, Yi**g Liu, Xinjun Pei, Jiazhi Xia, Wei Chen

    Abstract: Model stealing (MS) involves querying and observing the output of a machine learning model to steal its capabilities. The quality of queried data is crucial, yet obtaining a large amount of real data for MS is often challenging. Recent works have reduced reliance on real data by using generative models. However, when high-dimensional query data is required, these methods are impractical due to the… ▽ More

    Submitted 18 May, 2024; originally announced June 2024.

    Comments: Accepted to CVPR 2024

  3. arXiv:2406.15859  [pdf, other

    cs.IR cs.AI

    LLM-Powered Explanations: Unraveling Recommendations Through Subgraph Reasoning

    Authors: Guangsi Shi, Xiaofeng Deng, Linhao Luo, Lijuan Xia, Lei Bao, Bei Ye, Fei Du, Shirui Pan, Yuxiao Li

    Abstract: Recommender systems are pivotal in enhancing user experiences across various web applications by analyzing the complicated relationships between users and items. Knowledge graphs(KGs) have been widely used to enhance the performance of recommender systems. However, KGs are known to be noisy and incomplete, which are hard to provide reliable explanations for recommendation results. An explainable r… ▽ More

    Submitted 29 June, 2024; v1 submitted 22 June, 2024; originally announced June 2024.

  4. arXiv:2406.15459  [pdf, other

    cs.GT cs.CE cs.LG

    Large-Scale Contextual Market Equilibrium Computation through Deep Learning

    Authors: Yunxuan Ma, Yide Bian, Hao Xu, Weitao Yang, **gshu Zhao, Zhijian Duan, Feng Wang, Xiaotie Deng

    Abstract: Market equilibrium is one of the most fundamental solution concepts in economics and social optimization analysis. Existing works on market equilibrium computation primarily focus on settings with a relatively small number of buyers. Motivated by this, our paper investigates the computation of market equilibrium in scenarios with a large-scale buyer population, where buyers and goods are represent… ▽ More

    Submitted 10 June, 2024; originally announced June 2024.

    Comments: 22 pages

  5. arXiv:2406.14232  [pdf, other

    cs.LG cs.AI

    Enhancing robustness of data-driven SHM models: adversarial training with circle loss

    Authors: Xiangli Yang, Xijie Deng, Hanwei Zhang, Yang Zou, Jianxi Yang

    Abstract: Structural health monitoring (SHM) is critical to safeguarding the safety and reliability of aerospace, civil, and mechanical infrastructure. Machine learning-based data-driven approaches have gained popularity in SHM due to advancements in sensors and computational power. However, machine learning models used in SHM are vulnerable to adversarial examples -- even small changes in input can lead to… ▽ More

    Submitted 20 June, 2024; originally announced June 2024.

    Comments: 12 pages, 9 figures

  6. arXiv:2406.10534  [pdf, other

    cs.LG cs.AI physics.flu-dyn

    A Finite Difference Informed Graph Network for Solving Steady-State Incompressible Flows on Block-Structured Grids

    Authors: Yiye Zou, Tianyu Li, Shufan Zou, **gyu Wang, Lai** Zhang, Xiaogang Deng

    Abstract: Recently, advancements in deep learning have enabled physics-informed neural networks (PINNs) to solve partial differential equations (PDEs). Numerical differentiation (ND) using the finite difference (FD) method is efficient in physics-constrained designs, even in parameterized settings, often employing body-fitted block-structured grids for complex flow cases. However, convolution operators in C… ▽ More

    Submitted 15 June, 2024; originally announced June 2024.

  7. arXiv:2406.08098  [pdf, other

    cs.SE

    Scalable Defect Detection via Traversal on Code Graph

    Authors: Zhengyao Liu, Xitong Zhong, Xing**g Deng, Shuo Hong, Xiang Gao, Hailong Sun

    Abstract: Detecting defects and vulnerabilities in the early stage has long been a challenge in software engineering. Static analysis, a technique that inspects code without execution, has emerged as a key strategy to address this challenge. Among recent advancements, the use of graph-based representations, particularly Code Property Graph (CPG), has gained traction due to its comprehensive depiction of cod… ▽ More

    Submitted 12 June, 2024; originally announced June 2024.

  8. arXiv:2406.07550  [pdf, other

    cs.CV

    An Image is Worth 32 Tokens for Reconstruction and Generation

    Authors: Qihang Yu, Mark Weber, Xueqing Deng, Xiaohui Shen, Daniel Cremers, Liang-Chieh Chen

    Abstract: Recent advancements in generative models have highlighted the crucial role of image tokenization in the efficient synthesis of high-resolution images. Tokenization, which transforms images into latent representations, reduces computational demands compared to directly processing pixels and enhances the effectiveness and efficiency of the generation process. Prior methods, such as VQGAN, typically… ▽ More

    Submitted 11 June, 2024; originally announced June 2024.

    Comments: A compact 1D Image Tokenization method, leading to SOTA generation performance while being substantially faster. Project page at https://yucornetto.github.io/projects/titok.html

  9. arXiv:2406.05642  [pdf, other

    nlin.CG cs.DS

    The Invertibility of Cellular Automata with Menory: Correcting Errors and New Conclusions

    Authors: Chen Wang, Xiang Deng, Chao Wang

    Abstract: Cellular automata with memory (CAM) are widely used in fields such as image processing, pattern recognition, simulation, and cryptography. The invertibility of CAM is generally considered to be chaotic. Paper [Invertible behavior in elementary cellular automata with memory, Juan C. Seck-Tuoh-Mora et al., Information Sciences, 2012] presented necessary and sufficient conditions for the invertibilit… ▽ More

    Submitted 9 June, 2024; originally announced June 2024.

  10. arXiv:2406.05427  [pdf, other

    cs.LG

    Decision Mamba: A Multi-Grained State Space Model with Self-Evolution Regularization for Offline RL

    Authors: Qi Lv, Xiang Deng, Gongwei Chen, Michael Yu Wang, Liqiang Nie

    Abstract: While the conditional sequence modeling with the transformer architecture has demonstrated its effectiveness in dealing with offline reinforcement learning (RL) tasks, it is struggle to handle out-of-distribution states and actions. Existing work attempts to address this issue by data augmentation with the learned policy or adding extra constraints with the value-based RL algorithm. However, these… ▽ More

    Submitted 8 June, 2024; originally announced June 2024.

  11. arXiv:2405.17932  [pdf, ps, other

    cs.LG cs.DC

    Towards Communication-efficient Federated Learning via Sparse and Aligned Adaptive Optimization

    Authors: Xiumei Deng, Jun Li, Kang Wei, Long Shi, Zeihui Xiong, Ming Ding, Wen Chen, Shi **, H. Vincent Poor

    Abstract: Adaptive moment estimation (Adam), as a Stochastic Gradient Descent (SGD) variant, has gained widespread popularity in federated learning (FL) due to its fast convergence. However, federated Adam (FedAdam) algorithms suffer from a threefold increase in uplink communication overhead compared to federated SGD (FedSGD) algorithms, which arises from the necessity to transmit both local model updates a… ▽ More

    Submitted 28 May, 2024; originally announced May 2024.

  12. arXiv:2405.17914  [pdf, other

    cs.LG

    Trustworthy DNN Partition for Blockchain-enabled Digital Twin in Wireless IIoT Networks

    Authors: Xiumei Deng, Jun Li, Long Shi, Kang Wei, Ming Ding, Yumeng Shao, Wen Chen, Shi **

    Abstract: Digital twin (DT) has emerged as a promising solution to enhance manufacturing efficiency in industrial Internet of Things (IIoT) networks. To promote the efficiency and trustworthiness of DT for wireless IIoT networks, we propose a blockchain-enabled DT (B-DT) framework that employs deep neural network (DNN) partitioning technique and reputation-based consensus mechanism, wherein the DTs maintain… ▽ More

    Submitted 28 May, 2024; originally announced May 2024.

  13. arXiv:2405.16276  [pdf, ps, other

    cs.GT

    Mechanism Design for LLM Fine-tuning with Multiple Reward Models

    Authors: Haoran Sun, Yurong Chen, Siwei Wang, Wei Chen, Xiaotie Deng

    Abstract: Recent research on fine-tuning large language models (LLMs) through the aggregation of multiple preferences has attracted considerable attention. However, the existing literature predominantly focuses on the empirical performance of aggregation algorithms, while neglecting the underlying motivation for agents to misreport their preferences. In this paper, we formalize this as a multi-parameter mec… ▽ More

    Submitted 25 May, 2024; originally announced May 2024.

  14. arXiv:2405.13726  [pdf, other

    cs.LG

    Score-based Generative Models with Adaptive Momentum

    Authors: Ziqing Wen, Xiaoge Deng, ** Luo, Tao Sun, Dongsheng Li

    Abstract: Score-based generative models have demonstrated significant practical success in data-generating tasks. The models establish a diffusion process that perturbs the ground truth data to Gaussian noise and then learn the reverse process to transform noise into data. However, existing denoising methods such as Langevin dynamic and numerical stochastic differential equation solvers enjoy randomness but… ▽ More

    Submitted 22 May, 2024; originally announced May 2024.

  15. arXiv:2405.11647  [pdf, other

    cs.AI cs.LG

    Hummer: Towards Limited Competitive Preference Dataset

    Authors: Li Jiang, Yusen Wu, Junwu Xiong, **gqing Ruan, Yichuan Ding, Qingpei Guo, Zujie Wen, Jun Zhou, Xiaotie Deng

    Abstract: Preference datasets are essential for incorporating human preferences into pre-trained language models, playing a key role in the success of Reinforcement Learning from Human Feedback. However, these datasets often demonstrate conflicting alignment objectives, leading to increased vulnerability to jailbreak attacks and challenges in adapting downstream tasks to prioritize specific alignment object… ▽ More

    Submitted 20 May, 2024; v1 submitted 19 May, 2024; originally announced May 2024.

    Comments: 9 pages, 5 figures

  16. arXiv:2405.09839  [pdf, other

    cs.LG

    Advances in Robust Federated Learning: Heterogeneity Considerations

    Authors: Chuan Chen, Tianchi Liao, Xiaojun Deng, Zihou Wu, Sheng Huang, Zibin Zheng

    Abstract: In the field of heterogeneous federated learning (FL), the key challenge is to efficiently and collaboratively train models across multiple clients with different data distributions, model structures, task objectives, computational capabilities, and communication resources. This diversity leads to significant heterogeneity, which increases the complexity of model training. In this paper, we first… ▽ More

    Submitted 16 May, 2024; originally announced May 2024.

  17. arXiv:2405.08748  [pdf, other

    cs.CV

    Hunyuan-DiT: A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding

    Authors: Zhimin Li, Jianwei Zhang, Qin Lin, Jiangfeng Xiong, Yanxin Long, Xinchi Deng, Yingfang Zhang, Xingchao Liu, Minbin Huang, Zedong Xiao, Dayou Chen, Jiajun He, Jiahao Li, Wenyue Li, Chen Zhang, Rongwei Quan, Jianxiang Lu, Jiabin Huang, Xiaoyan Yuan, Xiaoxiao Zheng, Yixuan Li, Jihong Zhang, Chao Zhang, Meng Chen, Jie Liu , et al. (20 additional authors not shown)

    Abstract: We present Hunyuan-DiT, a text-to-image diffusion transformer with fine-grained understanding of both English and Chinese. To construct Hunyuan-DiT, we carefully design the transformer structure, text encoder, and positional encoding. We also build from scratch a whole data pipeline to update and evaluate data for iterative model optimization. For fine-grained language understanding, we train a Mu… ▽ More

    Submitted 14 May, 2024; originally announced May 2024.

    Comments: Project Page: https://dit.hunyuan.tencent.com/

  18. arXiv:2405.08733  [pdf, other

    cs.GR cs.CV

    A Simple Approach to Differentiable Rendering of SDFs

    Authors: Zichen Wang, Xi Deng, Ziyi Zhang, Wenzel Jakob, Steve Marschner

    Abstract: We present a simple algorithm for differentiable rendering of surfaces represented by Signed Distance Fields (SDF), which makes it easy to integrate rendering into gradient-based optimization pipelines. To tackle visibility-related derivatives that make rendering non-differentiable, existing physically based differentiable rendering methods often rely on elaborate guiding data structures or repara… ▽ More

    Submitted 7 June, 2024; v1 submitted 14 May, 2024; originally announced May 2024.

  19. arXiv:2405.00263  [pdf, other

    cs.CL cs.AI cs.LG

    Clover: Regressive Lightweight Speculative Decoding with Sequential Knowledge

    Authors: Bin Xiao, Chunan Shi, Xiaonan Nie, Fan Yang, Xiangwei Deng, Lei Su, Weipeng Chen, Bin Cui

    Abstract: Large language models (LLMs) suffer from low efficiency as the mismatch between the requirement of auto-regressive decoding and the design of most contemporary GPUs. Specifically, billions to trillions of parameters must be loaded to the GPU cache through its limited memory bandwidth for computation, but only a small batch of tokens is actually computed. Consequently, the GPU spends most of its ti… ▽ More

    Submitted 30 April, 2024; originally announced May 2024.

  20. arXiv:2404.15700  [pdf, other

    cs.CV cs.RO

    MAS-SAM: Segment Any Marine Animal with Aggregated Features

    Authors: Tianyu Yan, Zifu Wan, Xinhao Deng, **** Zhang, Yang Liu, Huchuan Lu

    Abstract: Recently, Segment Anything Model (SAM) shows exceptional performance in generating high-quality object masks and achieving zero-shot image segmentation. However, as a versatile vision model, SAM is primarily trained with large-scale natural light images. In underwater scenes, it exhibits substantial performance degradation due to the light scattering and absorption. Meanwhile, the simplicity of th… ▽ More

    Submitted 9 May, 2024; v1 submitted 24 April, 2024; originally announced April 2024.

    Comments: Accepted by IJCAI2024 as Poster

  21. arXiv:2404.11894  [pdf, other

    cs.GR

    Rendering Participating Media Using Path Graphs

    Authors: Becky Hu, Xi Deng, Fujun Luan, Miloš Hašan, Steve Marschner

    Abstract: Rendering volumetric scattering media, including clouds, fog, smoke, and other complex materials, is crucial for realism in computer graphics. Traditional path tracing, while unbiased, requires many long path samples to converge in scenes with scattering media, and a lot of work is wasted by paths that make a negligible contribution to the image. Methods to make better use of the information learn… ▽ More

    Submitted 18 April, 2024; originally announced April 2024.

  22. arXiv:2404.10603  [pdf, other

    cs.CV

    Enhancing 3D Fidelity of Text-to-3D using Cross-View Correspondences

    Authors: Seungwook Kim, Kejie Li, Xueqing Deng, Yichun Shi, Minsu Cho, Peng Wang

    Abstract: Leveraging multi-view diffusion models as priors for 3D optimization have alleviated the problem of 3D consistency, e.g., the Janus face problem or the content drift problem, in zero-shot text-to-3D models. However, the 3D geometric fidelity of the output remains an unresolved issue; albeit the rendered 2D views are realistic, the underlying geometry may contain errors such as unreasonable concavi… ▽ More

    Submitted 16 April, 2024; originally announced April 2024.

    Comments: 25 pages, 22 figures, accepted to CVPR 2024

  23. arXiv:2404.10561  [pdf, other

    cs.LG q-bio.QM stat.ML

    HiGraphDTI: Hierarchical Graph Representation Learning for Drug-Target Interaction Prediction

    Authors: Bin Liu, Siqi Wu, ** Wang, Xin Deng, Ao Zhou

    Abstract: The discovery of drug-target interactions (DTIs) plays a crucial role in pharmaceutical development. The deep learning model achieves more accurate results in DTI prediction due to its ability to extract robust and expressive features from drug and target chemical structures. However, existing deep learning methods typically generate drug features via aggregating molecular atom representations, ig… ▽ More

    Submitted 16 April, 2024; originally announced April 2024.

  24. arXiv:2404.08639  [pdf, other

    cs.CV

    COCONut: Modernizing COCO Segmentation

    Authors: Xueqing Deng, Qihang Yu, Peng Wang, Xiaohui Shen, Liang-Chieh Chen

    Abstract: In recent decades, the vision community has witnessed remarkable progress in visual recognition, partially owing to advancements in dataset benchmarks. Notably, the established COCO benchmark has propelled the development of modern detection and segmentation systems. However, the COCO segmentation benchmark has seen comparatively slow improvement over the last decade. Originally equipped with coar… ▽ More

    Submitted 12 April, 2024; originally announced April 2024.

    Comments: Accepted at CVPR2024, data available at https://xdeng7.github.io/coconut.github.io/

  25. arXiv:2404.06078  [pdf, other

    cs.IR

    End-to-end training of Multimodal Model and ranking Model

    Authors: Xiuqi Deng, Lu Xu, Xiyao Li, **kai Yu, Erpeng Xue, Zhongyuan Wang, Di Zhang, Zhaojie Liu, Guorui Zhou, Yang Song, Na Mou, Shen Jiang, Han Li

    Abstract: Traditional recommender systems heavily rely on ID features, which often encounter challenges related to cold-start and generalization. Modeling pre-extracted content features can mitigate these issues, but is still a suboptimal solution due to the discrepancies between training tasks and model parameters. End-to-end training presents a promising solution for these problems, yet most of the existi… ▽ More

    Submitted 9 April, 2024; originally announced April 2024.

    Comments: 9 pages, 8 figures

  26. arXiv:2404.04929  [pdf, other

    cs.RO

    RoboMP$^2$: A Robotic Multimodal Perception-Planning Framework with Multimodal Large Language Models

    Authors: Qi Lv, Hao Li, Xiang Deng, Rui Shao, Michael Yu Wang, Liqiang Nie

    Abstract: Multimodal Large Language Models (MLLMs) have shown impressive reasoning abilities and general intelligence in various domains. It inspires researchers to train end-to-end MLLMs or utilize large models to generate policies with human-selected prompts for embodied agents. However, these methods exhibit limited generalization capabilities on unseen tasks or scenarios, and overlook the multimodal env… ▽ More

    Submitted 8 June, 2024; v1 submitted 7 April, 2024; originally announced April 2024.

    Comments: Accepted by ICML 2024; Project page: https://aopolin-lv.github.io/RoboMP2.github.io/

    Journal ref: ICML 2024

  27. arXiv:2404.01606  [pdf

    cs.CR cs.DC

    Haina Storage: A Decentralized Secure Storage Framework Based on Improved Blockchain Structure

    Authors: Zijian Zhou, Caimei Wang, Xiaoheng Deng, Jianhao Lu, Qilue Wen, Chen Zhang, Hong Li

    Abstract: Although the decentralized storage technology based on the blockchain can effectively realize secure data storage on cloud services. However, there are still some problems in the existing schemes, such as low storage capacity and low efficiency. To address related issues, we propose a novel decentralized storage framework, which mainly includes four aspects: (1) we proposed a Bi-direction Circular… ▽ More

    Submitted 1 April, 2024; originally announced April 2024.

    Comments: 24 pages, 21 figures

  28. arXiv:2403.16557  [pdf, ps, other

    cs.LG cs.DC

    Accelerating Federated Learning by Selecting Beneficial Herd of Local Gradients

    Authors: ** Luo, Xiaoge Deng, Ziqing Wen, Tao Sun, Dongsheng Li

    Abstract: Federated Learning (FL) is a distributed machine learning framework in communication network systems. However, the systems' Non-Independent and Identically Distributed (Non-IID) data negatively affect the convergence efficiency of the global model, since only a subset of these data samples are beneficial for model convergence. In pursuit of this subset, a reliable approach involves determining a m… ▽ More

    Submitted 25 March, 2024; originally announced March 2024.

  29. arXiv:2403.11863  [pdf, other

    eess.SY cs.RO

    Context-aware LLM-based Safe Control Against Latent Risks

    Authors: Quan Khanh Luu, Xiyu Deng, Anh Van Ho, Yorie Nakahira

    Abstract: It is challenging for autonomous control systems to perform complex tasks in the presence of latent risks. Motivated by this challenge, this paper proposes an integrated framework that involves Large Language Models (LLMs), stochastic gradient descent (SGD), and optimization-based control. In the first phrase, the proposed framework breaks down complex tasks into a sequence of smaller subtasks, wh… ▽ More

    Submitted 18 March, 2024; originally announced March 2024.

  30. arXiv:2403.10188  [pdf, other

    cs.CR cs.AR

    Taiyi: A high-performance CKKS accelerator for Practical Fully Homomorphic Encryption

    Authors: Shengyu Fan, Xianglong Deng, Zhuoyu Tian, Zhicheng Hu, Liang Chang, Rui Hou, Dan Meng, Mingzhe Zhang

    Abstract: Fully Homomorphic Encryption (FHE), a novel cryptographic theory enabling computation directly on ciphertext data, offers significant security benefits but is hampered by substantial performance overhead. In recent years, a series of accelerator designs have significantly enhanced the performance of FHE applications, bringing them closer to real-world applicability. However, these accelerators fac… ▽ More

    Submitted 15 March, 2024; originally announced March 2024.

    Comments: 14 pages, 15 figures

  31. arXiv:2403.08857  [pdf, other

    cs.CV

    DialogGen: Multi-modal Interactive Dialogue System for Multi-turn Text-to-Image Generation

    Authors: Minbin Huang, Yanxin Long, Xinchi Deng, Ruihang Chu, Jiangfeng Xiong, Xiaodan Liang, Hong Cheng, Qinglin Lu, Wei Liu

    Abstract: Text-to-image (T2I) generation models have significantly advanced in recent years. However, effective interaction with these models is challenging for average users due to the need for specialized prompt engineering knowledge and the inability to perform multi-turn image generation, hindering a dynamic and iterative creation process. Recent attempts have tried to equip Multi-modal Large Language M… ▽ More

    Submitted 13 March, 2024; originally announced March 2024.

    Comments: Project page: https://hunyuan-dialoggen.github.io/

  32. arXiv:2403.07346  [pdf, other

    cs.CV

    Complementing Event Streams and RGB Frames for Hand Mesh Reconstruction

    Authors: Jian** Jiang, Xinyu Zhou, Bingxuan Wang, Xiaoming Deng, Chao Xu, Boxin Shi

    Abstract: Reliable hand mesh reconstruction (HMR) from commonly-used color and depth sensors is challenging especially under scenarios with varied illuminations and fast motions. Event camera is a highly promising alternative for its high dynamic range and dense temporal resolution properties, but it lacks key texture appearance for hand mesh reconstruction. In this paper, we propose EvRGBHand -- the first… ▽ More

    Submitted 12 March, 2024; originally announced March 2024.

  33. arXiv:2403.05818  [pdf

    cs.LG q-bio.QM

    PR-NET: Leveraging Pathway Refined Network Structures for Prostate Cancer Patient Condition Prediction

    Authors: R. Li, J. Liu, X. L. Deng, X. Liu, J. C. Guo, W. Y. Wu, L. Yang

    Abstract: The diagnosis and monitoring of Castrate Resistant Prostate Cancer (CRPC) are crucial for cancer patients, but the current models (such as P-NET) have limitations in terms of parameter count, generalization, and cost. To address the issue, we develop a more accurate and efficient Prostate Cancer patient condition prediction model, named PR-NET. By compressing and optimizing the network structure o… ▽ More

    Submitted 12 March, 2024; v1 submitted 9 March, 2024; originally announced March 2024.

  34. arXiv:2402.17200  [pdf, other

    cs.CV eess.IV

    Enhancing Quality of Compressed Images by Mitigating Enhancement Bias Towards Compression Domain

    Authors: Qunliang Xing, Mai Xu, Shengxi Li, Xin Deng, Meisong Zheng, Huaida Liu, Ying Chen

    Abstract: Existing quality enhancement methods for compressed images focus on aligning the enhancement domain with the raw domain to yield realistic images. However, these methods exhibit a pervasive enhancement bias towards the compression domain, inadvertently regarding it as more realistic than the raw domain. This bias makes enhanced images closely resemble their compressed counterparts, thus degrading… ▽ More

    Submitted 19 March, 2024; v1 submitted 26 February, 2024; originally announced February 2024.

    Comments: Accepted to CVPR 2024

  35. arXiv:2402.14486  [pdf, ps, other

    cs.GT cs.AI cs.LG econ.TH

    Are Bounded Contracts Learnable and Approximately Optimal?

    Authors: Yurong Chen, Zhaohua Chen, Xiaotie Deng, Zhiyi Huang

    Abstract: This paper considers the hidden-action model of the principal-agent problem, in which a principal incentivizes an agent to work on a project using a contract. We investigate whether contracts with bounded payments are learnable and approximately optimal. Our main results are two learning algorithms that can find a nearly optimal bounded contract using a polynomial number of queries, under two stan… ▽ More

    Submitted 22 February, 2024; originally announced February 2024.

  36. arXiv:2402.12957  [pdf, other

    cs.DC

    Energy-Efficient Wireless Federated Learning via Doubly Adaptive Quantization

    Authors: Xuefeng Han, Wen Chen, Jun Li, Ming Ding, Qingqing Wu, Kang Wei, Xiumei Deng, Zhen Mei

    Abstract: Federated learning (FL) has been recognized as a viable distributed learning paradigm for training a machine learning model across distributed clients without uploading raw data. However, FL in wireless networks still faces two major challenges, i.e., large communication overhead and high energy consumption, which are exacerbated by client heterogeneity in dataset sizes and wireless channels. Whil… ▽ More

    Submitted 20 February, 2024; originally announced February 2024.

  37. arXiv:2402.11904  [pdf, other

    cs.GT cs.LG

    Scalable Virtual Valuations Combinatorial Auction Design by Combining Zeroth-Order and First-Order Optimization Method

    Authors: Zhijian Duan, Haoran Sun, Yichong Xia, Siqiang Wang, Zhilin Zhang, Chuan Yu, Jian Xu, Bo Zheng, Xiaotie Deng

    Abstract: Automated auction design seeks to discover empirically high-revenue and incentive-compatible mechanisms using machine learning. Ensuring dominant strategy incentive compatibility (DSIC) is crucial, and the most effective approach is to confine the mechanism to Affine Maximizer Auctions (AMAs). Nevertheless, existing AMA-based approaches encounter challenges such as scalability issues (arising from… ▽ More

    Submitted 19 February, 2024; originally announced February 2024.

  38. arXiv:2402.08303   

    cs.CL cs.AI cs.CE cs.HC cs.LG

    ChatCell: Facilitating Single-Cell Analysis with Natural Language

    Authors: Yin Fang, Kangwei Liu, Ningyu Zhang, Xinle Deng, Penghui Yang, Zhuo Chen, Xiangru Tang, Mark Gerstein, Xiaohui Fan, Huajun Chen

    Abstract: As Large Language Models (LLMs) rapidly evolve, their influence in science is becoming increasingly prominent. The emerging capabilities of LLMs in task generalization and free-form dialogue can significantly advance fields like chemistry and biology. However, the field of single-cell biology, which forms the foundational building blocks of living organisms, still faces several challenges. High kn… ▽ More

    Submitted 19 February, 2024; v1 submitted 13 February, 2024; originally announced February 2024.

    Comments: I have decided to temporarily withdraw this draft as I am in the process of making further revisions to improve its content. Code: https://github.com/zjunlp/ChatCell Dataset: https://huggingface.co/datasets/zjunlp/ChatCell-Instructions Demo: https://chat.openai.com/g/g-vUwj222gQ-chatcell

  39. arXiv:2402.04476  [pdf, other

    cs.CV cs.AI cs.CL

    Dual-View Visual Contextualization for Web Navigation

    Authors: Jihyung Kil, Chan Hee Song, Boyuan Zheng, Xiang Deng, Yu Su, Wei-Lun Chao

    Abstract: Automatic web navigation aims to build a web agent that can follow language instructions to execute complex and diverse tasks on real-world websites. Existing work primarily takes HTML documents as input, which define the contents and action spaces (i.e., actionable elements and operations) of webpages. Nevertheless, HTML documents may not provide a clear task-related context for each element, mak… ▽ More

    Submitted 30 March, 2024; v1 submitted 6 February, 2024; originally announced February 2024.

    Comments: Accepted to CVPR 2024

  40. arXiv:2401.17268  [pdf, other

    cs.CL cs.AI cs.LG

    Weaver: Foundation Models for Creative Writing

    Authors: Tiannan Wang, Jiamin Chen, Qingrui Jia, Shuai Wang, Ruoyu Fang, Huilin Wang, Zhaowei Gao, Chunzhao Xie, Chuou Xu, Jihong Dai, Yibin Liu, Jialong Wu, Shengwei Ding, Long Li, Zhiwei Huang, Xinle Deng, Teng Yu, Gangan Ma, Han Xiao, Zixin Chen, Danjun Xiang, Yunxia Wang, Yuanyuan Zhu, Yi Xiao, **g Wang , et al. (21 additional authors not shown)

    Abstract: This work introduces Weaver, our first family of large language models (LLMs) dedicated to content creation. Weaver is pre-trained on a carefully selected corpus that focuses on improving the writing capabilities of large language models. We then fine-tune Weaver for creative and professional writing purposes and align it to the preference of professional writers using a suit of novel methods for… ▽ More

    Submitted 30 January, 2024; originally announced January 2024.

  41. arXiv:2401.12489  [pdf

    cs.LG cs.AI

    Unsupervised Learning Method for the Wave Equation Based on Finite Difference Residual Constraints Loss

    Authors: Xin Feng, Yi Jiang, Jia-Xian Qin, Lai-** Zhang, Xiao-Gang Deng

    Abstract: The wave equation is an important physical partial differential equation, and in recent years, deep learning has shown promise in accelerating or replacing traditional numerical methods for solving it. However, existing deep learning methods suffer from high data acquisition costs, low training efficiency, and insufficient generalization capability for boundary conditions. To address these issues,… ▽ More

    Submitted 23 January, 2024; originally announced January 2024.

    Comments: in Chinese language

  42. arXiv:2401.06044  [pdf, other

    cs.SE

    Safeguarding DeFi Smart Contracts against Oracle Deviations

    Authors: Xun Deng, Sidi Mohamed Beillahi, Cyrus Minwalla, Han Du, Andreas Veneris, Fan Long

    Abstract: This paper presents OVer, a framework designed to automatically analyze the behavior of decentralized finance (DeFi) protocols when subjected to a "skewed" oracle input. OVer firstly performs symbolic analysis on the given contract and constructs a model of constraints. Then, the framework leverages an SMT solver to identify parameters that allow its secure operation. Furthermore, guard statements… ▽ More

    Submitted 11 January, 2024; originally announced January 2024.

    Comments: 13 pages; extended version of paper accepted in ICSE'24

    ACM Class: D.2.4

  43. arXiv:2401.05778  [pdf, other

    cs.CL cs.AI

    Risk Taxonomy, Mitigation, and Assessment Benchmarks of Large Language Model Systems

    Authors: Tianyu Cui, Yanling Wang, Chuanpu Fu, Yong Xiao, Sijia Li, Xinhao Deng, Yunpeng Liu, Qinglin Zhang, Ziyi Qiu, Peiyang Li, Zhixing Tan, Junwu Xiong, Xinyu Kong, Zujie Wen, Ke Xu, Qi Li

    Abstract: Large language models (LLMs) have strong capabilities in solving diverse natural language processing tasks. However, the safety and security issues of LLM systems have become the major obstacle to their widespread application. Many studies have extensively investigated risks in LLM systems and developed the corresponding mitigation strategies. Leading-edge enterprises such as OpenAI, Google, Meta,… ▽ More

    Submitted 11 January, 2024; originally announced January 2024.

  44. arXiv:2312.11953  [pdf, ps, other

    cs.GT

    Competition among Pairwise Lottery Contests

    Authors: Xiaotie Deng, Hangxin Gan, Ningyuan Li, Weian Li, Qi Qi

    Abstract: We investigate a two-stage competitive model involving multiple contests. In this model, each contest designer chooses two participants from a pool of candidate contestants and determines the biases. Contestants strategically distribute their efforts across various contests within their budget. We first show the existence of a pure strategy Nash equilibrium (PNE) for the contestants, and propose a… ▽ More

    Submitted 20 December, 2023; v1 submitted 19 December, 2023; originally announced December 2023.

    Comments: Accepted by the 38th Annual AAAI Conference on Artificial Intelligence (AAAI 2024)

  45. arXiv:2312.11063  [pdf, ps, other

    cs.GT cs.AI cs.DS cs.LG econ.TH

    A survey on algorithms for Nash equilibria in finite normal-form games

    Authors: Hanyu Li, Wenhan Huang, Zhijian Duan, David Henry Mguni, Kun Shao, Jun Wang, Xiaotie Deng

    Abstract: Nash equilibrium is one of the most influential solution concepts in game theory. With the development of computer science and artificial intelligence, there is an increasing demand on Nash equilibrium computation, especially for Internet economics and multi-agent learning. This paper reviews various algorithms computing the Nash equilibrium and its approximation solutions in finite normal-form ga… ▽ More

    Submitted 18 December, 2023; originally announced December 2023.

    Comments: The published version is in Computer Science Review

  46. arXiv:2312.07669  [pdf, other

    cs.CV

    GMTalker: Gaussian Mixture-based Audio-Driven Emotional talking video Portraits

    Authors: Yibo Xia, Lizhen Wang, Xiang Deng, Xiaoyan Luo, Yebin Liu

    Abstract: Synthesizing high-fidelity and emotion-controllable talking video portraits, with audio-lip sync, vivid expressions, realistic head poses, and eye blinks, has been an important and challenging task in recent years. Most existing methods suffer in achieving personalized and precise emotion control, smooth transitions between different emotion states, and the generation of diverse motions. To tackle… ▽ More

    Submitted 28 May, 2024; v1 submitted 12 December, 2023; originally announced December 2023.

    Comments: Project page: https://bob35buaa.github.io/GMTalker. This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible

  47. arXiv:2312.04653  [pdf, other

    cs.LG cs.GT

    Learning Thresholds with Latent Values and Censored Feedback

    Authors: Jiahao Zhang, Tao Lin, Weiqiang Zheng, Zhe Feng, Yifeng Teng, Xiaotie Deng

    Abstract: In this paper, we investigate a problem of actively learning threshold in latent space, where the unknown reward $g(γ, v)$ depends on the proposed threshold $γ$ and latent value $v$ and it can be $only$ achieved if the threshold is lower than or equal to the unknown latent value. This problem has broad applications in practical scenarios, e.g., reserve price optimization in online auctions, online… ▽ More

    Submitted 7 December, 2023; originally announced December 2023.

    Comments: 18 pages

  48. arXiv:2311.18537  [pdf, other

    cs.CV

    A Simple Video Segmenter by Tracking Objects Along Axial Trajectories

    Authors: Ju He, Qihang Yu, Inkyu Shin, Xueqing Deng, Alan Yuille, Xiaohui Shen, Liang-Chieh Chen

    Abstract: Video segmentation requires consistently segmenting and tracking objects over time. Due to the quadratic dependency on input size, directly applying self-attention to video segmentation with high-resolution input features poses significant challenges, often leading to insufficient GPU memory capacity. Consequently, modern video segmenters either extend an image segmenter without incorporating any… ▽ More

    Submitted 12 June, 2024; v1 submitted 30 November, 2023; originally announced November 2023.

    Comments: The paper and model names have been updated to better reflect the methodological contributions

  49. arXiv:2311.15203  [pdf, ps, other

    cs.GT

    Learning against Non-credible Auctions

    Authors: Qian Wang, Xuanzhi Xia, Zongjun Yang, Xiaotie Deng, Yuqing Kong, Zhilin Zhang, Liang Wang, Chuan Yu, Jian Xu, Bo Zheng

    Abstract: The standard framework of online bidding algorithm design assumes that the seller commits himself to faithfully implementing the rules of the adopted auction. However, the seller may attempt to cheat in execution to increase his revenue if the auction belongs to the class of non-credible auctions. For example, in a second-price auction, the seller could create a fake bid between the highest bid an… ▽ More

    Submitted 26 November, 2023; originally announced November 2023.

  50. arXiv:2311.11860  [pdf, other

    cs.CV

    LION : Empowering Multimodal Large Language Model with Dual-Level Visual Knowledge

    Authors: Gongwei Chen, Leyang Shen, Rui Shao, Xiang Deng, Liqiang Nie

    Abstract: Multimodal Large Language Models (MLLMs) have endowed LLMs with the ability to perceive and understand multi-modal signals. However, most of the existing MLLMs mainly adopt vision encoders pretrained on coarsely aligned image-text pairs, leading to insufficient extraction and reasoning of visual knowledge. To address this issue, we devise a dual-Level vIsual knOwledge eNhanced Multimodal Large Lan… ▽ More

    Submitted 26 November, 2023; v1 submitted 20 November, 2023; originally announced November 2023.

    Comments: Technical Report. Project page: https://rshaojimmy.github.io/Projects/JiuTian-LION Code: https://github.com/rshaojimmy/JiuTian