Skip to main content

Showing 1–50 of 310 results for author: Jiang, T

Searching in archive cs. Search in all archives.
.
  1. arXiv:2407.00456  [pdf, other

    cs.SE cs.AI

    Beyond Functional Correctness: Investigating Coding Style Inconsistencies in Large Language Models

    Authors: Yanlin Wang, Tianyue Jiang, Mingwei Liu, Jiachi Chen, Zibin Zheng

    Abstract: Large language models (LLMs) have brought a paradigm shift to the field of code generation, offering the potential to enhance the software development process. However, previous research mainly focuses on the accuracy of code generation, while coding style differences between LLMs and human developers remain under-explored. In this paper, we empirically analyze the differences in coding style betw… ▽ More

    Submitted 29 June, 2024; originally announced July 2024.

    Comments: 13pages, 14 figures

  2. arXiv:2406.17286  [pdf

    cs.RO eess.SY

    Prioritized experience replay-based DDQN for Unmanned Vehicle Path Planning

    Authors: Liu Lipeng, Letian Xu, Jiabei Liu, Haopeng Zhao, Tongzhou Jiang, Tianyao Zheng

    Abstract: Path planning module is a key module for autonomous vehicle navigation, which directly affects its operating efficiency and safety. In complex environments with many obstacles, traditional planning algorithms often cannot meet the needs of intelligence, which may lead to problems such as dead zones in unmanned vehicles. This paper proposes a path planning algorithm based on DDQN and combines it wi… ▽ More

    Submitted 25 June, 2024; originally announced June 2024.

    Comments: 4 pages, 6 figures, 2024 5th International Conference on Information Science, Parallel and Distributed Systems

  3. arXiv:2406.03695  [pdf, other

    cs.CR

    FACOS: Enabling Privacy Protection Through Fine-Grained Access Control with On-chain and Off-chain System

    Authors: Chao Liu, Cankun Hou, Tianyu Jiang, Jianting Ning, Hui Qiao, Yusen Wu

    Abstract: Data-driven landscape across finance, government, and healthcare, the continuous generation of information demands robust solutions for secure storage, efficient dissemination, and fine-grained access control. Blockchain technology emerges as a significant tool, offering decentralized storage while upholding the tenets of data security and accessibility. However, on-chain and off-chain strategies… ▽ More

    Submitted 5 June, 2024; originally announced June 2024.

  4. arXiv:2406.01595  [pdf, other

    cs.CV

    MultiPly: Reconstruction of Multiple People from Monocular Video in the Wild

    Authors: Zeren Jiang, Chen Guo, Manuel Kaufmann, Tianjian Jiang, Julien Valentin, Otmar Hilliges, Jie Song

    Abstract: We present MultiPly, a novel framework to reconstruct multiple people in 3D from monocular in-the-wild videos. Reconstructing multiple individuals moving and interacting naturally from monocular in-the-wild videos poses a challenging task. Addressing it necessitates precise pixel-level disentanglement of individuals without any prior knowledge about the subjects. Moreover, it requires recovering i… ▽ More

    Submitted 3 June, 2024; originally announced June 2024.

    Comments: Project page: https://eth-ait.github.io/MultiPly/

  5. arXiv:2405.16094  [pdf, other

    cs.CV

    PLUG: Revisiting Amodal Segmentation with Foundation Model and Hierarchical Focus

    Authors: Zhaochen Liu, Limeng Qiao, Xiangxiang Chu, Tingting Jiang

    Abstract: Aiming to predict the complete shapes of partially occluded objects, amodal segmentation is an important step towards visual intelligence. With crucial significance, practical prior knowledge derives from sufficient training, while limited amodal annotations pose challenges to achieve better performance. To tackle this problem, utilizing the mighty priors accumulated in the foundation model, we pr… ▽ More

    Submitted 3 June, 2024; v1 submitted 25 May, 2024; originally announced May 2024.

  6. arXiv:2405.15544  [pdf, other

    q-bio.QM cs.AI cs.LG

    Knowledge-enhanced Relation Graph and Task Sampling for Few-shot Molecular Property Prediction

    Authors: Zeyu Wang, Tianyi Jiang, Yao Lu, Xiaoze Bao, Shanqing Yu, Bin Wei, Qi Xuan

    Abstract: Recently, few-shot molecular property prediction (FSMPP) has garnered increasing attention. Despite impressive breakthroughs achieved by existing methods, they often overlook the inherent many-to-many relationships between molecules and properties, which limits their performance. For instance, similar substructures of molecules can inspire the exploration of new compounds. Additionally, the relati… ▽ More

    Submitted 24 May, 2024; originally announced May 2024.

  7. arXiv:2405.12130  [pdf, other

    cs.CL cs.LG

    MoRA: High-Rank Updating for Parameter-Efficient Fine-Tuning

    Authors: Ting Jiang, Shaohan Huang, Shengyue Luo, Zihan Zhang, Haizhen Huang, Furu Wei, Weiwei Deng, Feng Sun, Qi Zhang, Deqing Wang, Fuzhen Zhuang

    Abstract: Low-rank adaptation is a popular parameter-efficient fine-tuning method for large language models. In this paper, we analyze the impact of low-rank updating, as implemented in LoRA. Our findings suggest that the low-rank updating mechanism may limit the ability of LLMs to effectively learn and memorize new knowledge. Inspired by this observation, we propose a new method called MoRA, which employs… ▽ More

    Submitted 20 May, 2024; originally announced May 2024.

    Comments: Work in Progress

  8. arXiv:2405.11344  [pdf

    cs.LG cs.AI

    Improved Content Understanding With Effective Use of Multi-task Contrastive Learning

    Authors: Akanksha Bindal, Sudarshan Ramanujam, Dave Golland, TJ Hazen, Tina Jiang, Fengyu Zhang, Peng Yan

    Abstract: In enhancing LinkedIn core content recommendation models, a significant challenge lies in improving their semantic understanding capabilities. This paper addresses the problem by leveraging multi-task learning, a method that has shown promise in various domains. We fine-tune a pre-trained, transformer-based LLM using multi-task contrastive learning with data from a diverse set of semantic labeling… ▽ More

    Submitted 21 May, 2024; v1 submitted 18 May, 2024; originally announced May 2024.

  9. Vehicles Swarm Intelligence: Cooperation in both Longitudinal and Lateral Dimensions

    Authors: Jia Hu, Nuoheng Zhang, Haoran Wang, Tenglong Jiang, Junnian Zheng, Feilong Liu

    Abstract: Longitudinal-only platooning methods are facing great challenges on running mobility, since they may be impeded by slow-moving vehicles from time to time. To address this issue, this paper proposes a vehicles swarming method coupled both longitudinal and lateral cooperation. The proposed method bears the following contributions: i) enhancing driving mobility by swarming like a bee colony; ii) ensu… ▽ More

    Submitted 13 May, 2024; originally announced May 2024.

  10. arXiv:2405.03129  [pdf, other

    eess.SP cs.IT

    Active Sensing for Multiuser Beam Tracking with Reconfigurable Intelligent Surface

    Authors: Han Han, Tao Jiang, Wei Yu

    Abstract: This paper studies a beam tracking problem in which an access point (AP), in collaboration with a reconfigurable intelligent surface (RIS), dynamically adjusts its downlink beamformers and the reflection pattern at the RIS in order to maintain reliable communications with multiple mobile user equipments (UEs). Specifically, the mobile UEs send uplink pilots to the AP periodically during the channe… ▽ More

    Submitted 31 May, 2024; v1 submitted 5 May, 2024; originally announced May 2024.

  11. arXiv:2404.15454  [pdf, ps, other

    math.ST cs.IT

    Prediction from compression for models with infinite memory, with applications to hidden Markov and renewal processes

    Authors: Yanjun Han, Tianze Jiang, Yihong Wu

    Abstract: Consider the problem of predicting the next symbol given a sample path of length n, whose joint distribution belongs to a distribution class that may have long-term memory. The goal is to compete with the conditional predictor that knows the true model. For both hidden Markov models (HMMs) and renewal processes, we determine the optimal prediction risk in Kullback- Leibler divergence up to univers… ▽ More

    Submitted 23 April, 2024; originally announced April 2024.

    Comments: 37 Pages

  12. arXiv:2404.14162  [pdf, other

    cs.CV

    FLDM-VTON: Faithful Latent Diffusion Model for Virtual Try-on

    Authors: Chenhui Wang, Tao Chen, Zhihao Chen, Zhizhong Huang, Taoran Jiang, Qi Wang, Hongming Shan

    Abstract: Despite their impressive generative performance, latent diffusion model-based virtual try-on (VTON) methods lack faithfulness to crucial details of the clothes, such as style, pattern, and text. To alleviate these issues caused by the diffusion stochastic nature and latent supervision, we propose a novel Faithful Latent Diffusion Model for VTON, termed FLDM-VTON. FLDM-VTON improves the conventiona… ▽ More

    Submitted 19 May, 2024; v1 submitted 22 April, 2024; originally announced April 2024.

    Comments: Accepted by IJCAI 2024

  13. arXiv:2404.13277  [pdf, other

    eess.IV cs.CV

    Beyond Score Changes: Adversarial Attack on No-Reference Image Quality Assessment from Two Perspectives

    Authors: Chenxi Yang, Yujia Liu, Dingquan Li, Yan Zhong, Tingting Jiang

    Abstract: Deep neural networks have demonstrated impressive success in No-Reference Image Quality Assessment (NR-IQA). However, recent researches highlight the vulnerability of NR-IQA models to subtle adversarial perturbations, leading to inconsistencies between model predictions and subjective ratings. Current adversarial attacks, however, focus on perturbing predicted scores of individual images, neglecti… ▽ More

    Submitted 24 April, 2024; v1 submitted 20 April, 2024; originally announced April 2024.

    Comments: Submitted to a conference

  14. arXiv:2404.11996  [pdf, other

    cs.AI

    DST-GTN: Dynamic Spatio-Temporal Graph Transformer Network for Traffic Forecasting

    Authors: Songtao Huang, Hong** Song, Tianqi Jiang, Akbar Telikani, Jun Shen, Qingguo Zhou, Binbin Yong, Qiang Wu

    Abstract: Accurate traffic forecasting is essential for effective urban planning and congestion management. Deep learning (DL) approaches have gained colossal success in traffic forecasting but still face challenges in capturing the intricacies of traffic dynamics. In this paper, we identify and address this challenges by emphasizing that spatial features are inherently dynamic and change over time. A novel… ▽ More

    Submitted 18 April, 2024; originally announced April 2024.

  15. arXiv:2404.10358  [pdf, other

    cs.CV

    Improving Bracket Image Restoration and Enhancement with Flow-guided Alignment and Enhanced Feature Aggregation

    Authors: Wenjie Lin, Zhen Liu, Chengzhi Jiang, Mingyan Han, Ting Jiang, Shuaicheng Liu

    Abstract: In this paper, we address the Bracket Image Restoration and Enhancement (BracketIRE) task using a novel framework, which requires restoring a high-quality high dynamic range (HDR) image from a sequence of noisy, blurred, and low dynamic range (LDR) multi-exposure RAW inputs. To overcome this challenge, we present the IREANet, which improves the multiple exposure alignment and aggregation with a Fl… ▽ More

    Submitted 16 April, 2024; originally announced April 2024.

  16. arXiv:2404.06365  [pdf, other

    cs.CV cs.MM

    Dynamic Resolution Guidance for Facial Expression Recognition

    Authors: Jie Ou, Xu Li, Tianxiang Jiang, Yuanlun Xie

    Abstract: Facial expression recognition (FER) is vital for human-computer interaction and emotion analysis, yet recognizing expressions in low-resolution images remains challenging. This paper introduces a practical method called Dynamic Resolution Guidance for Facial Expression Recognition (DRGFER) to effectively recognize facial expressions in images with varying resolutions without compromising FER model… ▽ More

    Submitted 9 April, 2024; originally announced April 2024.

  17. arXiv:2404.04895  [pdf, other

    cs.NE

    Tensorized Ant Colony Optimization for GPU Acceleration

    Authors: Luming Yang, Tao Jiang, Ran Cheng

    Abstract: Ant Colony Optimization (ACO) is renowned for its effectiveness in solving Traveling Salesman Problems, yet it faces computational challenges in CPU-based environments, particularly with large-scale instances. In response, we introduce a Tensorized Ant Colony Optimization (TensorACO) to utilize the advancements of GPU acceleration. As the core, TensorACO fully transforms ant system and ant path in… ▽ More

    Submitted 12 April, 2024; v1 submitted 7 April, 2024; originally announced April 2024.

    Comments: Genetic and Evolutionary Computation Conference (GECCO '24)

  18. GPU-accelerated Evolutionary Multiobjective Optimization Using Tensorized RVEA

    Authors: Zhenyu Liang, Tao Jiang, Kebin Sun, Ran Cheng

    Abstract: Evolutionary multiobjective optimization has witnessed remarkable progress during the past decades. However, existing algorithms often encounter computational challenges in large-scale scenarios, primarily attributed to the absence of hardware acceleration. In response, we introduce a Tensorized Reference Vector Guided Evolutionary Algorithm (TensorRVEA) for harnessing the advancements of GPU acce… ▽ More

    Submitted 11 April, 2024; v1 submitted 1 April, 2024; originally announced April 2024.

    Comments: Genetic and Evolutionary Computation Conference (GECCO '24)

  19. arXiv:2403.20045  [pdf

    cs.NI

    Blockchain for Energy Market: A Comprehensive Survey

    Authors: Tianqi Jiang, Haoxiang Luo, Kun Yang, Gang Sun, Hongfang Yu, Qi Huang, Athanasios V. Vasilakos

    Abstract: The energy market encompasses the behavior of energy supply and trading within a platform system. By utilizing centralized or distributed trading, energy can be effectively managed and distributed across different regions, thereby achieving market equilibrium and satisfying both producers and consumers. However, recent years have presented unprecedented challenges and difficulties for the developm… ▽ More

    Submitted 5 April, 2024; v1 submitted 29 March, 2024; originally announced March 2024.

  20. arXiv:2403.17297  [pdf, other

    cs.CL cs.AI

    InternLM2 Technical Report

    Authors: Zheng Cai, Maosong Cao, Haojiong Chen, Kai Chen, Keyu Chen, Xin Chen, Xun Chen, Zehui Chen, Zhi Chen, Pei Chu, Xiaoyi Dong, Haodong Duan, Qi Fan, Zhaoye Fei, Yang Gao, Jiaye Ge, Chenya Gu, Yuzhe Gu, Tao Gui, Aijia Guo, Qipeng Guo, Conghui He, Yingfan Hu, Ting Huang, Tao Jiang , et al. (75 additional authors not shown)

    Abstract: The evolution of Large Language Models (LLMs) like ChatGPT and GPT-4 has sparked discussions on the advent of Artificial General Intelligence (AGI). However, replicating such advancements in open-source models has been challenging. This paper introduces InternLM2, an open-source LLM that outperforms its predecessors in comprehensive evaluations across 6 dimensions and 30 benchmarks, long-context m… ▽ More

    Submitted 25 March, 2024; originally announced March 2024.

  21. arXiv:2403.15377  [pdf, other

    cs.CV

    InternVideo2: Scaling Video Foundation Models for Multimodal Video Understanding

    Authors: Yi Wang, Kunchang Li, Xinhao Li, Jiashuo Yu, Yinan He, Guo Chen, Baoqi Pei, Rongkun Zheng, Jilan Xu, Zun Wang, Yansong Shi, Tianxiang Jiang, Songze Li, Hongjie Zhang, Yifei Huang, Yu Qiao, Yali Wang, Limin Wang

    Abstract: We introduce InternVideo2, a new video foundation model (ViFM) that achieves the state-of-the-art performance in action recognition, video-text tasks, and video-centric dialogue. Our approach employs a progressive training paradigm that unifies the different self- or weakly-supervised learning frameworks of masked video token reconstruction, cross-modal contrastive learning, and next token predict… ▽ More

    Submitted 22 March, 2024; originally announced March 2024.

    Comments: a technical report about video understanding

  22. arXiv:2403.11397  [pdf, other

    cs.CV eess.IV

    Defense Against Adversarial Attacks on No-Reference Image Quality Models with Gradient Norm Regularization

    Authors: Yujia Liu, Chenxi Yang, Dingquan Li, Jianhao Ding, Tingting Jiang

    Abstract: The task of No-Reference Image Quality Assessment (NR-IQA) is to estimate the quality score of an input image without additional information. NR-IQA models play a crucial role in the media industry, aiding in performance evaluation and optimization guidance. However, these models are found to be vulnerable to adversarial attacks, which introduce imperceptible perturbations to input images, resulti… ▽ More

    Submitted 17 March, 2024; originally announced March 2024.

    Comments: accepted by CVPR 2024

  23. arXiv:2403.09004  [pdf, ps, other

    cs.IT eess.SP

    Meta-Learning-Based Fronthaul Compression for Cloud Radio Access Networks

    Authors: Ruihua Qiao, Tao Jiang, Wei Yu

    Abstract: This paper investigates the fronthaul compression problem in a user-centric cloud radio access network, in which single-antenna users are served by a central processor (CP) cooperatively via a cluster of remote radio heads (RRHs). To satisfy the fronthaul capacity constraint, this paper proposes a transform-compress-forward scheme, which consists of well-designed transformation matrices and unifor… ▽ More

    Submitted 13 March, 2024; originally announced March 2024.

    Comments: 15 Pages, 13 Figures; accepted in IEEE Transactions on Wireless Communications

  24. arXiv:2403.01413  [pdf, other

    cs.HC cs.AI

    Exploring the Design of Generative AI in Supporting Music-based Reminiscence for Older Adults

    Authors: Yucheng **, Wanling Cai, Li Chen, Yizhe Zhang, Gavin Doherty, Tonglin Jiang

    Abstract: Music-based reminiscence has the potential to positively impact the psychological well-being of older adults. However, the aging process and physiological changes, such as memory decline and limited verbal communication, may impede the ability of older adults to recall their memories and life experiences. Given the advanced capabilities of generative artificial intelligence (AI) systems, such as g… ▽ More

    Submitted 3 March, 2024; originally announced March 2024.

  25. arXiv:2403.00134  [pdf, other

    cs.IT eess.SP

    Active Sensing for Reciprocal MIMO Channels

    Authors: Tao Jiang, Wei Yu

    Abstract: This paper addresses the design of transmit precoder and receive combiner matrices to support $N_{\rm s}$ independent data streams over a time-division duplex (TDD) point-to-point massive multiple-input multiple-output (MIMO) channel with either a fully digital or a hybrid structure. The optimal precoder and combiner design amounts to finding the top-$N_{\rm s}$ singular vectors of the channel mat… ▽ More

    Submitted 6 June, 2024; v1 submitted 29 February, 2024; originally announced March 2024.

    Comments: This paper is accepted in IEEE Transactions on Signal Processing

  26. arXiv:2402.17363  [pdf, other

    cs.RO cs.LG

    CGGM: A conditional graph generation model with adaptive sparsity for node anomaly detection in IoT networks

    Authors: Xianshi Su, Munan Li, Tongbang Jiang, Hao Long

    Abstract: Dynamic graphs are extensively employed for detecting anomalous behavior in nodes within the Internet of Things (IoT). Generative models are often used to address the issue of imbalanced node categories in dynamic graphs. Nevertheless, the constraints it faces include the monotonicity of adjacency relationships, the difficulty in constructing multi-dimensional features for nodes, and the lack of a… ▽ More

    Submitted 27 February, 2024; originally announced February 2024.

    Comments: 13 pages, 19 figures

  27. arXiv:2402.16153  [pdf, other

    cs.SD cs.AI cs.CL cs.LG cs.MM eess.AS

    ChatMusician: Understanding and Generating Music Intrinsically with LLM

    Authors: Ruibin Yuan, Hanfeng Lin, Yi Wang, Zeyue Tian, Shangda Wu, Tianhao Shen, Ge Zhang, Yuhang Wu, Cong Liu, Ziya Zhou, Ziyang Ma, Liumeng Xue, Ziyu Wang, Qin Liu, Tianyu Zheng, Yizhi Li, Yinghao Ma, Yiming Liang, Xiaowei Chi, Ruibo Liu, Zili Wang, Pengfei Li, **gcheng Wu, Chenghua Lin, Qifeng Liu , et al. (10 additional authors not shown)

    Abstract: While Large Language Models (LLMs) demonstrate impressive capabilities in text generation, we find that their ability has yet to be generalized to music, humanity's creative language. We introduce ChatMusician, an open-source LLM that integrates intrinsic musical abilities. It is based on continual pre-training and finetuning LLaMA2 on a text-compatible music representation, ABC notation, and the… ▽ More

    Submitted 25 February, 2024; originally announced February 2024.

    Comments: GitHub: https://shanghaicannon.github.io/ChatMusician/

  28. arXiv:2402.14299  [pdf, other

    cs.RO cs.AI

    We Choose to Go to Space: Agent-driven Human and Multi-Robot Collaboration in Microgravity

    Authors: Miao Xin, Zhongrui You, Zihan Zhang, Taoran Jiang, Tingjia Xu, Haotian Liang, Guo**g Ge, Yuchen Ji, Shentong Mo, Jian Cheng

    Abstract: We present SpaceAgents-1, a system for learning human and multi-robot collaboration (HMRC) strategies under microgravity conditions. Future space exploration requires humans to work together with robots. However, acquiring proficient robot skills and adept collaboration under microgravity conditions poses significant challenges within ground laboratories. To address this issue, we develop a microg… ▽ More

    Submitted 22 February, 2024; originally announced February 2024.

  29. Localization of Dummy Data Injection Attacks in Power Systems Considering Incomplete Topological Information: A Spatio-Temporal Graph Wavelet Convolutional Neural Network Approach

    Authors: Zhaoyang Qu, Yunchang Dong, Yang Li, Siqi Song, Tao Jiang, Min Li, Qiming Wang, Lei Wang, Xiaoyong Bo, Jiye Zang, Qi Xu

    Abstract: The emergence of novel the dummy data injection attack (DDIA) poses a severe threat to the secure and stable operation of power systems. These attacks are particularly perilous due to the minimal Euclidean spatial separation between the injected malicious data and legitimate data, rendering their precise detection challenging using conventional distance-based methods. Furthermore, existing researc… ▽ More

    Submitted 27 January, 2024; originally announced January 2024.

    Comments: Accepted by Applied Energy

    Journal ref: Applied Energy 360 (2024) 122736

  30. arXiv:2401.12578  [pdf, other

    cs.CR

    ToDA: Target-oriented Diffusion Attacker against Recommendation System

    Authors: Xiaohao Liu, Zhulin Tao, Ting Jiang, He Chang, Yunshan Ma, Xianglin Huang, Xiang Wang

    Abstract: Recommendation systems (RS) have become indispensable tools for web services to address information overload, thus enhancing user experiences and bolstering platforms' revenues. However, with their increasing ubiquity, security concerns have also emerged. As the public accessibility of RS, they are susceptible to specific malicious attacks where adversaries can manipulate user profiles, leading to… ▽ More

    Submitted 16 April, 2024; v1 submitted 23 January, 2024; originally announced January 2024.

  31. arXiv:2401.12480  [pdf, other

    cs.CV

    Explore Synergistic Interaction Across Frames for Interactive Video Object Segmentation

    Authors: Kexin Li, Tao Jiang, Zongxin Yang, Yi Yang, Yueting Zhuang, Jun Xiao

    Abstract: Interactive Video Object Segmentation (iVOS) is a challenging task that requires real-time human-computer interaction. To improve the user experience, it is important to consider the user's input habits, segmentation quality, running time and memory consumption.However, existing methods compromise user experience with single input mode and slow running speed. Specifically, these methods only allow… ▽ More

    Submitted 4 February, 2024; v1 submitted 22 January, 2024; originally announced January 2024.

  32. arXiv:2401.07284  [pdf, other

    cs.CL

    Improving Domain Adaptation through Extended-Text Reading Comprehension

    Authors: Ting Jiang, Shaohan Huang, Shengyue Luo, Zihan Zhang, Haizhen Huang, Furu Wei, Weiwei Deng, Feng Sun, Qi Zhang, Deqing Wang, Fuzhen Zhuang

    Abstract: To enhance the domain-specific capabilities of large language models, continued pre-training on a domain-specific corpus is a prevalent method. Recent work demonstrates that adapting models using reading comprehension data formatted by regex-based patterns can significantly improve performance on domain-specific tasks. However, regex-based patterns are incapable of parsing raw corpora using domain… ▽ More

    Submitted 18 January, 2024; v1 submitted 14 January, 2024; originally announced January 2024.

    Comments: Work in Progress

  33. arXiv:2401.06992  [pdf, other

    cs.CV cs.AI

    Progressive Feature Fusion Network for Enhancing Image Quality Assessment

    Authors: Kaiqun Wu, Xiaoling Jiang, Rui Yu, Yonggang Luo, Tian Jiang, Xi Wu, Peng Wei

    Abstract: Image compression has been applied in the fields of image storage and video broadcasting. However, it's formidably tough to distinguish the subtle quality differences between those distorted images generated by different algorithms. In this paper, we propose a new image quality assessment framework to decide which image is better in an image group. To capture the subtle differences, a fine-grained… ▽ More

    Submitted 13 January, 2024; originally announced January 2024.

    Comments: Data Compression Conference

  34. arXiv:2401.05217  [pdf, other

    cs.CV eess.IV

    Exploring Vulnerabilities of No-Reference Image Quality Assessment Models: A Query-Based Black-Box Method

    Authors: Chenxi Yang, Yujia Liu, Dingquan Li, Tingting Jiang

    Abstract: No-Reference Image Quality Assessment (NR-IQA) aims to predict image quality scores consistent with human perception without relying on pristine reference images, serving as a crucial component in various visual tasks. Ensuring the robustness of NR-IQA methods is vital for reliable comparisons of different image processing techniques and consistent user experiences in recommendations. The attack m… ▽ More

    Submitted 25 April, 2024; v1 submitted 10 January, 2024; originally announced January 2024.

  35. arXiv:2401.03369  [pdf, other

    q-bio.MN cs.LG q-bio.BM

    Multi-Modal Representation Learning for Molecular Property Prediction: Sequence, Graph, Geometry

    Authors: Zeyu Wang, Tianyi Jiang, **huan Wang, Qi Xuan

    Abstract: Molecular property prediction refers to the task of labeling molecules with some biochemical properties, playing a pivotal role in the drug discovery and design process. Recently, with the advancement of machine learning, deep learning-based molecular property prediction has emerged as a solution to the resource-intensive nature of traditional methods, garnering significant attention. Among them,… ▽ More

    Submitted 8 January, 2024; v1 submitted 6 January, 2024; originally announced January 2024.

    Comments: 8 pages, 3 figures

  36. BLADE: Box-Level Supervised Amodal Segmentation through Directed Expansion

    Authors: Zhaochen Liu, Zhixuan Li, Tingting Jiang

    Abstract: Perceiving the complete shape of occluded objects is essential for human and machine intelligence. While the amodal segmentation task is to predict the complete mask of partially occluded objects, it is time-consuming and labor-intensive to annotate the pixel-level ground truth amodal masks. Box-level supervised amodal segmentation addresses this challenge by relying solely on ground truth boundin… ▽ More

    Submitted 25 February, 2024; v1 submitted 3 January, 2024; originally announced January 2024.

    Comments: Accepted to AAAI 2024

    Journal ref: Proceedings of the AAAI Conference on Artificial Intelligence. 38, 4 (Mar. 2024), 3846-3854

  37. arXiv:2401.01629  [pdf, ps, other

    cs.LG cs.AI cs.CY

    Synthetic Data in AI: Challenges, Applications, and Ethical Implications

    Authors: Shuang Hao, Wenfeng Han, Tao Jiang, Yi** Li, Haonan Wu, Chunlin Zhong, Zhangjun Zhou, He Tang

    Abstract: In the rapidly evolving field of artificial intelligence, the creation and utilization of synthetic datasets have become increasingly significant. This report delves into the multifaceted aspects of synthetic data, particularly emphasizing the challenges and potential biases these datasets may harbor. It explores the methodologies behind synthetic data generation, spanning traditional statistical… ▽ More

    Submitted 3 January, 2024; originally announced January 2024.

  38. arXiv:2312.09295  [pdf, other

    cs.NI cs.SI

    Networking for the Metaverse: The Standardization Landscape

    Authors: Cedric Westphal, Jungha Hong, Shin-Gak Kang, Leonardo Chiariglione, Tianji Jiang

    Abstract: New applications are being supported by current and future networks. In particular, it is expected that Metaverse applications will be deployed in the near future, as 5G and 6G network provide sufficient bandwidth and sufficiently low latency to provide a satisfying end-user experience. However, networks still need to evolve to better support this type of application. We present here a basic taxon… ▽ More

    Submitted 14 December, 2023; originally announced December 2023.

    Comments: To appear in ITU Journal on Future and Evolving Technologies J-FET December 2023

  39. arXiv:2312.09002  [pdf, other

    cs.IT cs.LG eess.SP

    Localization with Reconfigurable Intelligent Surface: An Active Sensing Approach

    Authors: Zhongze Zhang, Tao Jiang, Wei Yu

    Abstract: This paper addresses an uplink localization problem in which a base station (BS) aims to locate a remote user with the help of reconfigurable intelligent surfaces (RISs). We propose a strategy in which the user transmits pilots sequentially and the BS adaptively adjusts the sensing vectors, including the BS beamforming vector and multiple RIS reflection coefficients based on the observations alrea… ▽ More

    Submitted 15 December, 2023; v1 submitted 14 December, 2023; originally announced December 2023.

    Comments: Accepted in IEEE Transactions on Wireless Communications. This is an extended version of the previous arXiv paper arXiv:2310.13160

  40. arXiv:2312.07526  [pdf, other

    cs.CV

    RTMO: Towards High-Performance One-Stage Real-Time Multi-Person Pose Estimation

    Authors: Peng Lu, Tao Jiang, Yining Li, Xiangtai Li, Kai Chen, Wenming Yang

    Abstract: Real-time multi-person pose estimation presents significant challenges in balancing speed and precision. While two-stage top-down methods slow down as the number of people in the image increases, existing one-stage methods often fail to simultaneously deliver high accuracy and real-time performance. This paper introduces RTMO, a one-stage pose estimation framework that seamlessly integrates coordi… ▽ More

    Submitted 8 April, 2024; v1 submitted 12 December, 2023; originally announced December 2023.

    Comments: Accepted at CVPR 2024. Project page: https://github.com/open-mmlab/mmpose/tree/main/projects/rtmo

  41. arXiv:2311.12273  [pdf, other

    cs.NI eess.SY

    How AI-driven Digital Twins Can Empower Mobile Networks

    Authors: Tong Li, Fenyu Jiang, Qiaohong Yu, Wenzhen Huang, Tao Jiang, Depeng **

    Abstract: The growing complexity of next-generation networks exacerbates the modeling and algorithmic flaws of conventional network optimization methodology. In this paper, we propose a mobile network digital twin (MNDT) architecture for 6G networks. To address the modeling and algorithmic shortcomings, the MNDT uses a simulation-optimization structure. The feedback from the network simulation engine, which… ▽ More

    Submitted 20 November, 2023; originally announced November 2023.

  42. arXiv:2311.12079  [pdf, other

    cs.CV

    FreeKD: Knowledge Distillation via Semantic Frequency Prompt

    Authors: Yuan Zhang, Tao Huang, Jiaming Liu, Tao Jiang, Kuan Cheng, Shanghang Zhang

    Abstract: Knowledge distillation (KD) has been applied to various tasks successfully, and mainstream methods typically boost the student model via spatial imitation losses. However, the consecutive downsamplings induced in the spatial domain of teacher model is a type of corruption, hindering the student from analyzing what specific information needs to be imitated, which results in accuracy degradation. To… ▽ More

    Submitted 22 May, 2024; v1 submitted 20 November, 2023; originally announced November 2023.

    Comments: Accepted by CVPR 2024

  43. arXiv:2311.07582  [pdf

    cs.CL cs.AI

    Evaluating the Potential of Leading Large Language Models in Reasoning Biology Questions

    Authors: Xinyu Gong, Jason Holmes, Yiwei Li, Zhengliang Liu, Qi Gan, Zihao Wu, Jianli Zhang, Yusong Zou, Yuxi Teng, Tian Jiang, Hongtu Zhu, Wei Liu, Tianming Liu, Yajun Yan

    Abstract: Recent advances in Large Language Models (LLMs) have presented new opportunities for integrating Artificial General Intelligence (AGI) into biological research and education. This study evaluated the capabilities of leading LLMs, including GPT-4, GPT-3.5, PaLM2, Claude2, and SenseNova, in answering conceptual biology questions. The models were tested on a 108-question multiple-choice exam covering… ▽ More

    Submitted 4 November, 2023; originally announced November 2023.

  44. arXiv:2311.06062  [pdf, other

    cs.CL cs.CR cs.LG

    Practical Membership Inference Attacks against Fine-tuned Large Language Models via Self-prompt Calibration

    Authors: Wenjie Fu, Huandong Wang, Chen Gao, Guanghua Liu, Yong Li, Tao Jiang

    Abstract: Membership Inference Attacks (MIA) aim to infer whether a target data record has been utilized for model training or not. Prior attempts have quantified the privacy risks of language models (LMs) via MIAs, but there is still no consensus on whether existing MIA algorithms can cause remarkable privacy leakage on practical Large Language Models (LLMs). Existing MIAs designed for LMs can be classifie… ▽ More

    Submitted 25 June, 2024; v1 submitted 10 November, 2023; originally announced November 2023.

    Comments: Repo: https://github.com/wjfu99/MIA-LLMs

  45. arXiv:2311.06049  [pdf, other

    cs.SI cs.CY

    Privacy-Preserving Individual-Level COVID-19 Infection Prediction via Federated Graph Learning

    Authors: Wenjie Fu, Huandong Wang, Chen Gao, Guanghua Liu, Yong Li, Tao Jiang

    Abstract: Accurately predicting individual-level infection state is of great value since its essential role in reducing the damage of the epidemic. However, there exists an inescapable risk of privacy leakage in the fine-grained user mobility trajectories required by individual-level infection prediction. In this paper, we focus on develo** a framework of privacy-preserving individual-level infection pred… ▽ More

    Submitted 10 November, 2023; originally announced November 2023.

    Comments: accepted by TOIS

  46. arXiv:2311.01696  [pdf, other

    cs.CR cs.CV

    Universal Perturbation-based Secret Key-Controlled Data Hiding

    Authors: Donghua Wang, Wen Yao, Tingsong Jiang, Xiaoqian Chen

    Abstract: Deep neural networks (DNNs) are demonstrated to be vulnerable to universal perturbation, a single quasi-perceptible perturbation that can deceive the DNN on most images. However, the previous works are focused on using universal perturbation to perform adversarial attacks, while the potential usability of universal perturbation as data carriers in data hiding is less explored, especially for the k… ▽ More

    Submitted 2 November, 2023; originally announced November 2023.

    Comments: 18 pages, 8 tables, 10 figures

  47. arXiv:2311.01473  [pdf, other

    cs.CV cs.AI cs.LG

    Adversarial Examples in the Physical World: A Survey

    Authors: Jiakai Wang, Donghua Wang, ** Hu, Siyang Wu, Tingsong Jiang, Wen Yao, Aishan Liu, Xianglong Liu

    Abstract: Deep neural networks (DNNs) have demonstrated high vulnerability to adversarial examples. Besides the attacks in the digital world, the practical implications of adversarial examples in the physical world present significant challenges and safety concerns. However, current research on physical adversarial examples (PAEs) lacks a comprehensive understanding of their unique characteristics, leading… ▽ More

    Submitted 1 November, 2023; originally announced November 2023.

    Comments: Adversarial examples, physical-world scenarios, attacks and defenses

  48. arXiv:2310.17649  [pdf, other

    cs.RO cs.CV

    6-DoF Stability Field via Diffusion Models

    Authors: Takuma Yoneda, Tianchong Jiang, Gregory Shakhnarovich, Matthew R. Walter

    Abstract: A core capability for robot manipulation is reasoning over where and how to stably place objects in cluttered environments. Traditionally, robots have relied on object-specific, hand-crafted heuristics in order to perform such reasoning, with limited generalizability beyond a small number of object instances and object interaction patterns. Recent approaches instead learn notions of physical inter… ▽ More

    Submitted 26 October, 2023; originally announced October 2023.

    Comments: In submission

  49. arXiv:2310.13160  [pdf, other

    eess.SP cs.IT

    Active Sensing for Localization with Reconfigurable Intelligent Surface

    Authors: Zhongze Zhang, Tao Jiang, Wei Yu

    Abstract: This paper addresses an uplink localization problem in which the base station (BS) aims to locate a remote user with the aid of reconfigurable intelligent surface (RIS). This paper proposes a strategy in which the user transmits pilots over multiple time frames, and the BS adaptively adjusts the RIS reflection coefficients based on the observations already received so far in order to produce an ac… ▽ More

    Submitted 19 October, 2023; originally announced October 2023.

    Comments: Accepted in IEEE International Conference on Communications (ICC) 2023

  50. arXiv:2310.11044  [pdf, ps, other

    cs.IT eess.SP

    A Tutorial on Near-Field XL-MIMO Communications Towards 6G

    Authors: Haiquan Lu, Yong Zeng, Changsheng You, Yu Han, Jiayi Zhang, Zhe Wang, Zhenjun Dong, Shi **, Cheng-Xiang Wang, Tao Jiang, Xiaohu You, Rui Zhang

    Abstract: Extremely large-scale multiple-input multiple-output (XL-MIMO) is a promising technology for the sixth-generation (6G) mobile communication networks. By significantly boosting the antenna number or size to at least an order of magnitude beyond current massive MIMO systems, XL-MIMO is expected to unprecedentedly enhance the spectral efficiency and spatial resolution for wireless communication. The… ▽ More

    Submitted 3 April, 2024; v1 submitted 17 October, 2023; originally announced October 2023.

    Comments: 42 pages