Skip to main content

Showing 1–50 of 396 results for author: Fan, J

Searching in archive cs. Search in all archives.
.
  1. arXiv:2407.01111  [pdf, other

    cs.LG cs.AI stat.ML

    Proximity Matters: Local Proximity Preserved Balancing for Treatment Effect Estimation

    Authors: Hao Wang, Zhichao Chen, Yuan Shen, Jiajun Fan, Zhaoran Liu, Degui Yang, Xinggao Liu, Haoxuan Li

    Abstract: Heterogeneous treatment effect (HTE) estimation from observational data poses significant challenges due to treatment selection bias. Existing methods address this bias by minimizing distribution discrepancies between treatment groups in latent space, focusing on global alignment. However, the fruitful aspect of local proximity, where similar units exhibit similar outcomes, is often overlooked. In… ▽ More

    Submitted 1 July, 2024; originally announced July 2024.

    Comments: Code is available at https://anonymous.4open.science/status/ncr-B697

  2. Performative Debias with Fair-exposure Optimization Driven by Strategic Agents in Recommender Systems

    Authors: Zhichen Xiang, Hongke Zhao, Chuang Zhao, Ming He, Jian** Fan

    Abstract: Data bias, e.g., popularity impairs the dynamics of two-sided markets within recommender systems. This overshadows the less visible but potentially intriguing long-tail items that could capture user interest. Despite the abundance of research surrounding this issue, it still poses challenges and remains a hot topic in academic circles. Along this line, in this paper, we developed a re-ranking appr… ▽ More

    Submitted 25 June, 2024; originally announced June 2024.

    Comments: SIGKDD 2024 accepted paper

  3. arXiv:2406.16494  [pdf, other

    cs.IR cs.AI

    Cross-domain Transfer of Valence Preferences via a Meta-optimization Approach

    Authors: Chuang Zhao, Hongke Zhao, Ming He, Xiaomeng Li, Jian** Fan

    Abstract: Cross-domain recommendation offers a potential avenue for alleviating data sparsity and cold-start problems. Embedding and map**, as a classic cross-domain research genre, aims to identify a common map** function to perform representation transformation between two domains. Nevertheless, previous coarse-grained preference representations, non-personalized map** functions, and excessive relia… ▽ More

    Submitted 24 June, 2024; originally announced June 2024.

  4. arXiv:2406.13250  [pdf, other

    cs.AI cs.CL cs.LG

    LangTopo: Aligning Language Descriptions of Graphs with Tokenized Topological Modeling

    Authors: Zhong Guan, Hongke Zhao, Likang Wu, Ming He, Jianpin Fan

    Abstract: Recently, large language models (LLMs) have been widely researched in the field of graph machine learning due to their outstanding abilities in language comprehension and learning. However, the significant gap between natural language tasks and topological structure modeling poses a nonnegligible challenge. Specifically, since natural language descriptions are not sufficient for LLMs to understand… ▽ More

    Submitted 19 June, 2024; originally announced June 2024.

  5. arXiv:2406.13235  [pdf, other

    cs.IR cs.AI

    Enhancing Collaborative Semantics of Language Model-Driven Recommendations via Graph-Aware Learning

    Authors: Zhong Guan, Likang Wu, Hongke Zhao, Ming He, Jianpin Fan

    Abstract: Large Language Models (LLMs) are increasingly prominent in the recommendation systems domain. Existing studies usually utilize in-context learning or supervised fine-tuning on task-specific data to align LLMs into recommendations. However, the substantial bias in semantic spaces between language processing tasks and recommendation tasks poses a nonnegligible challenge. Specifically, without the ad… ▽ More

    Submitted 19 June, 2024; originally announced June 2024.

    Comments: 10pages

  6. arXiv:2406.11689  [pdf, other

    cs.CV

    Lightweight Model Pre-training via Language Guided Knowledge Distillation

    Authors: Mingsheng Li, Lin Zhang, Mingzhen Zhu, Zilong Huang, Gang Yu, Jiayuan Fan, Tao Chen

    Abstract: This paper studies the problem of pre-training for small models, which is essential for many mobile devices. Current state-of-the-art methods on this problem transfer the representational knowledge of a large network (as a Teacher) into a smaller model (as a Student) using self-supervised distillation, improving the performance of the small model on downstream tasks. However, existing approaches a… ▽ More

    Submitted 17 June, 2024; originally announced June 2024.

  7. arXiv:2406.10849  [pdf, other

    math.OC cs.CC

    A parallel framework for graphical optimal transport

    Authors: Jiaojiao Fan, Isabel Haasler, Qinsheng Zhang, Johan Karlsson, Yongxin Chen

    Abstract: We study multi-marginal optimal transport (MOT) problems where the underlying cost has a graphical structure. These graphical multi-marginal optimal transport problems have found applications in several domains including traffic flow control and regression problems in the Wasserstein space. MOT problem can be approached through two aspects: a single big MOT problem, or coupled minor OT problems. I… ▽ More

    Submitted 16 June, 2024; originally announced June 2024.

  8. arXiv:2406.09931  [pdf, other

    eess.IV cs.CV cs.LG

    SCKansformer: Fine-Grained Classification of Bone Marrow Cells via Kansformer Backbone and Hierarchical Attention Mechanisms

    Authors: Yifei Chen, Zhu Zhu, Shenghao Zhu, Linwei Qiu, Binfeng Zou, Fan Jia, Yunpeng Zhu, Chenyan Zhang, Zhaojie Fang, Feiwei Qin, ** Fan, Changmiao Wang, Yu Gao, Gang Yu

    Abstract: The incidence and mortality rates of malignant tumors, such as acute leukemia, have risen significantly. Clinically, hospitals rely on cytological examination of peripheral blood and bone marrow smears to diagnose malignant tumors, with accurate blood cell counting being crucial. Existing automated methods face challenges such as low feature expression capability, poor interpretability, and redund… ▽ More

    Submitted 14 June, 2024; originally announced June 2024.

    Comments: 15 pages, 6 figures

  9. arXiv:2406.07050  [pdf, other

    cs.CV

    DualMamba: A Lightweight Spectral-Spatial Mamba-Convolution Network for Hyperspectral Image Classification

    Authors: Jiamu Sheng, **gyi Zhou, Jiong Wang, Peng Ye, Jiayuan Fan

    Abstract: The effectiveness and efficiency of modeling complex spectral-spatial relations are both crucial for Hyperspectral image (HSI) classification. Most existing methods based on CNNs and transformers still suffer from heavy computational burdens and have room for improvement in capturing the global-local spectral-spatial feature representation. To this end, we propose a novel lightweight parallel desi… ▽ More

    Submitted 11 June, 2024; originally announced June 2024.

  10. arXiv:2406.04299  [pdf, other

    cs.LG cs.SI

    NoisyGL: A Comprehensive Benchmark for Graph Neural Networks under Label Noise

    Authors: Zhonghao Wang, Danyu Sun, Sheng Zhou, Haobo Wang, Jiapei Fan, Longtao Huang, Jiajun Bu

    Abstract: Graph Neural Networks (GNNs) exhibit strong potential in node classification task through a message-passing mechanism. However, their performance often hinges on high-quality node labels, which are challenging to obtain in real-world scenarios due to unreliable sources or adversarial attacks. Consequently, label noise is common in real-world graph data, negatively impacting GNNs by propagating inc… ▽ More

    Submitted 6 June, 2024; v1 submitted 6 June, 2024; originally announced June 2024.

    Comments: 28 pages, 15 figures

  11. arXiv:2406.03051  [pdf, other

    cs.CV

    Adapter-X: A Novel General Parameter-Efficient Fine-Tuning Framework for Vision

    Authors: Minglei Li, Peng Ye, Yongqi Huang, Lin Zhang, Tao Chen, Tong He, Jiayuan Fan, Wanli Ouyang

    Abstract: Parameter-efficient fine-tuning (PEFT) has become increasingly important as foundation models continue to grow in both popularity and size. Adapter has been particularly well-received due to their potential for parameter reduction and adaptability across diverse tasks. However, striking a balance between high efficiency and robust generalization across tasks remains a challenge for adapter-based m… ▽ More

    Submitted 5 June, 2024; v1 submitted 5 June, 2024; originally announced June 2024.

  12. arXiv:2406.02976  [pdf, other

    cs.CV cs.AI

    DA-Flow: Dual Attention Normalizing Flow for Skeleton-based Video Anomaly Detection

    Authors: Ruituo Wu, Yang Chen, Jian Xiao, Bing Li, Jicong Fan, Frédéric Dufaux, Ce Zhu, Yipeng Liu

    Abstract: Cooperation between temporal convolutional networks (TCN) and graph convolutional networks (GCN) as a processing module has shown promising results in skeleton-based video anomaly detection (SVAD). However, to maintain a lightweight model with low computational and storage complexity, shallow GCN and TCN blocks are constrained by small receptive fields and a lack of cross-dimension interaction cap… ▽ More

    Submitted 5 June, 2024; originally announced June 2024.

  13. arXiv:2406.02425  [pdf, other

    cs.CV cs.RO

    CoNav: A Benchmark for Human-Centered Collaborative Navigation

    Authors: Changhao Li, Xinyu Sun, Peihao Chen, Jugang Fan, Zixu Wang, Yanxia Liu, **hui Zhu, Chuang Gan, Mingkui Tan

    Abstract: Human-robot collaboration, in which the robot intelligently assists the human with the upcoming task, is an appealing objective. To achieve this goal, the agent needs to be equipped with a fundamental collaborative navigation ability, where the agent should reason human intention by observing human activities and then navigate to the human's intended destination in advance of the human. However, t… ▽ More

    Submitted 4 June, 2024; originally announced June 2024.

  14. arXiv:2406.00015  [pdf

    cs.CL

    Use of natural language processing to extract and classify papillary thyroid cancer features from surgical pathology reports

    Authors: Ricardo Loor-Torres, Yuqi Wu, Esteban Cabezas, Mariana Borras, David Toro-Tobon, Mayra Duran, Misk Al Zahidy, Maria Mateo Chavez, Cristian Soto Jacome, Jungwei W. Fan, Naykky M. Singh Ospina, Yonghui Wu, Juan P. Brito

    Abstract: Background We aim to use Natural Language Processing (NLP) to automate the extraction and classification of thyroid cancer risk factors from pathology reports. Methods We analyzed 1,410 surgical pathology reports from adult papillary thyroid cancer patients at Mayo Clinic, Rochester, MN, from 2010 to 2019. Structured and non-structured reports were used to create a consensus-based ground truth dic… ▽ More

    Submitted 22 May, 2024; originally announced June 2024.

    Comments: 21 pages, 6 figures, 7 tables

  15. arXiv:2405.17661  [pdf, other

    cs.CV

    RefDrop: Controllable Consistency in Image or Video Generation via Reference Feature Guidance

    Authors: Jiaojiao Fan, Haotian Xue, Qinsheng Zhang, Yongxin Chen

    Abstract: There is a rapidly growing interest in controlling consistency across multiple generated images using diffusion models. Among various methods, recent works have found that simply manipulating attention modules by concatenating features from multiple reference images provides an efficient approach to enhancing consistency without fine-tuning. Despite its popularity and success, few studies have elu… ▽ More

    Submitted 27 May, 2024; originally announced May 2024.

  16. arXiv:2405.10302  [pdf, other

    stat.ME cs.LG math.ST stat.ML

    Optimal Aggregation of Prediction Intervals under Unsupervised Domain Shift

    Authors: Jiawei Ge, Debarghya Mukherjee, Jianqing Fan

    Abstract: As machine learning models are increasingly deployed in dynamic environments, it becomes paramount to assess and quantify uncertainties associated with distribution shifts. A distribution shift occurs when the underlying data-generating process changes, leading to a deviation in the model's performance. The prediction interval, which captures the range of likely outcomes for a given prediction, se… ▽ More

    Submitted 16 May, 2024; originally announced May 2024.

  17. arXiv:2405.09996  [pdf, other

    cs.CV

    Driving-Video Dehazing with Non-Aligned Regularization for Safety Assistance

    Authors: Junkai Fan, Jiangwei Weng, Kun Wang, Yijun Yang, Jianjun Qian, Jun Li, Jian Yang

    Abstract: Real driving-video dehazing poses a significant challenge due to the inherent difficulty in acquiring precisely aligned hazy/clear video pairs for effective model training, especially in dynamic driving scenarios with unpredictable weather conditions. In this paper, we propose a pioneering approach that addresses this challenge through a nonaligned regularization strategy. Our core concept involve… ▽ More

    Submitted 16 May, 2024; originally announced May 2024.

    Comments: Accepted by CVPR 2024

  18. arXiv:2405.06814  [pdf, other

    cs.CV

    Dual-Task Vision Transformer for Rapid and Accurate Intracerebral Hemorrhage Classification on CT Images

    Authors: Jialiang Fan, Guoyu Lu, Xinhui Fan, Lucan Li

    Abstract: Intracerebral hemorrhage (ICH) is a severe and sudden medical condition caused by the rupture of blood vessels in the brain, leading to permanent damage to brain tissue and often resulting in functional disabilities or death in patients. Diagnosis and analysis of ICH typically rely on brain CT imaging. Given the urgency of ICH conditions, early treatment is crucial, necessitating rapid analysis of… ▽ More

    Submitted 18 June, 2024; v1 submitted 10 May, 2024; originally announced May 2024.

    Comments: 9 pages, 4 figure3

  19. arXiv:2405.04715  [pdf, other

    math.ST cs.LG stat.ME stat.ML

    Causality Pursuit from Heterogeneous Environments via Neural Adversarial Invariance Learning

    Authors: Yihong Gu, Cong Fang, Peter Bühlmann, Jianqing Fan

    Abstract: Pursuing causality from data is a fundamental problem in scientific discovery, treatment intervention, and transfer learning. This paper introduces a novel algorithmic method for addressing nonparametric invariance and causality learning in regression models across multiple environments, where the joint distribution of response variables and covariates varies, but the conditional expectations of o… ▽ More

    Submitted 30 June, 2024; v1 submitted 7 May, 2024; originally announced May 2024.

    Comments: 48 pages, 7 figures with appendix

    MSC Class: 62G08

  20. arXiv:2405.04046  [pdf

    cs.CR

    MBCT: A Monero-Based Covert Transmission Approach with On-chain Dynamic Session Key Negotiation

    Authors: Zhenshuai Yue, Haoran Zhu, Xiaolin Chang, Jelena Mišić, Vojislav B. Mišić, Junchao Fan

    Abstract: Traditional covert transmission (CT) approaches have been hindering CT application while blockchain technology offers new avenue. Current blockchain-based CT approaches require off-chain negotiation of critical information and often overlook the dynamic session keys updating, which increases the risk of message and key leakage. Additionally, in some approaches the covert transactions exhibit obvio… ▽ More

    Submitted 7 May, 2024; originally announced May 2024.

  21. arXiv:2405.02351  [pdf, other

    cs.LG cs.AI cs.DC physics.optics

    Towards General Neural Surrogate Solvers with Specialized Neural Accelerators

    Authors: Chenkai Mao, Robert Lupoiu, Tianxiang Dai, Mingkun Chen, Jonathan A. Fan

    Abstract: Surrogate neural network-based partial differential equation (PDE) solvers have the potential to solve PDEs in an accelerated manner, but they are largely limited to systems featuring fixed domain sizes, geometric layouts, and boundary conditions. We propose Specialized Neural Accelerator-Powered Domain Decomposition Methods (SNAP-DDM), a DDM-based approach to PDE solving in which subdomain proble… ▽ More

    Submitted 14 June, 2024; v1 submitted 2 May, 2024; originally announced May 2024.

    Comments: 9 pages, 7 Figures, to be published in ICML 2024

  22. arXiv:2405.01030  [pdf

    cs.CR cs.SE

    Towards Trust Proof for Secure Confidential Virtual Machines

    Authors: **gkai Mao, Haoran Zhu, Junchao Fan, Lin Li, Xiaolin Chang

    Abstract: The Virtual Machine (VM)-based Trusted-Execution-Environment (TEE) technology, like AMD Secure-Encrypted-Virtualization (SEV), enables the establishment of Confidential VMs (CVMs) to protect data privacy. But CVM lacks ways to provide the trust proof of its running state, degrading the user confidence of using CVM. The technology of virtual Trusted Platform Module (vTPM) can be used to generate tr… ▽ More

    Submitted 2 May, 2024; originally announced May 2024.

  23. arXiv:2404.18392  [pdf, other

    cs.DC

    Dflow, a Python framework for constructing cloud-native AI-for-Science workflows

    Authors: Xinzijian Liu, Yanbo Han, Zhuoyuan Li, Jiahao Fan, Chengqian Zhang, **zhe Zeng, Yifan Shan, Yannan Yuan, Wei-Hong Xu, Yun-Pei Liu, Yuzhi Zhang, Tongqi Wen, Darrin M. York, Zhicheng Zhong, Hang Zheng, Jun Cheng, Linfeng Zhang, Han Wang

    Abstract: In the AI-for-science era, scientific computing scenarios such as concurrent learning and high-throughput computing demand a new generation of infrastructure that supports scalable computing resources and automated workflow management on both cloud and high-performance supercomputers. Here we introduce Dflow, an open-source Python toolkit designed for scientists to construct workflows with simple… ▽ More

    Submitted 28 April, 2024; originally announced April 2024.

  24. arXiv:2404.17400  [pdf, other

    cs.CV cs.AI eess.IV

    Spatial-frequency Dual-Domain Feature Fusion Network for Low-Light Remote Sensing Image Enhancement

    Authors: Zishu Yao, Guodong Fan, **fu Fan, Min Gan, C. L. Philip Chen

    Abstract: Low-light remote sensing images generally feature high resolution and high spatial complexity, with continuously distributed surface features in space. This continuity in scenes leads to extensive long-range correlations in spatial domains within remote sensing images. Convolutional Neural Networks, which rely on local correlations for long-distance modeling, struggle to establish long-range corre… ▽ More

    Submitted 26 April, 2024; originally announced April 2024.

    Comments: 14 page

  25. arXiv:2404.12608  [pdf, other

    cs.DB cs.CL cs.PL

    Auto-Formula: Recommend Formulas in Spreadsheets using Contrastive Learning for Table Representations

    Authors: Sibei Chen, Yeye He, Weiwei Cui, Ju Fan, Song Ge, Haidong Zhang, Dongmei Zhang, Surajit Chaudhuri

    Abstract: Spreadsheets are widely recognized as the most popular end-user programming tools, which blend the power of formula-based computation, with an intuitive table-based interface. Today, spreadsheets are used by billions of users to manipulate tables, most of whom are neither database experts nor professional programmers. Despite the success of spreadsheets, authoring complex formulas remains challe… ▽ More

    Submitted 18 April, 2024; originally announced April 2024.

    Comments: full version of a paper to appear in SIGMOD 2024

  26. arXiv:2404.10318  [pdf, other

    cs.CV

    SRGS: Super-Resolution 3D Gaussian Splatting

    Authors: Xiang Feng, Yongbo He, Yubo Wang, Yan Yang, Wen Li, Yifei Chen, Zhenzhong Kuang, Jiajun ding, Jian** Fan, Yu Jun

    Abstract: Recently, 3D Gaussian Splatting (3DGS) has gained popularity as a novel explicit 3D representation. This approach relies on the representation power of Gaussian primitives to provide a high-quality rendering. However, primitives optimized at low resolution inevitably exhibit sparsity and texture deficiency, posing a challenge for achieving high-resolution novel view synthesis (HRNVS). To address t… ▽ More

    Submitted 18 June, 2024; v1 submitted 16 April, 2024; originally announced April 2024.

    Comments: The first to focus on the HRNVS of 3DGS

  27. arXiv:2404.08970  [pdf, other

    cs.LG

    Fast Gradient Computation for Gromov-Wasserstein Distance

    Authors: Wei Zhang, Zihao Wang, Jie Fan, Hao Wu, Yong Zhang

    Abstract: The Gromov-Wasserstein distance is a notable extension of optimal transport. In contrast to the classic Wasserstein distance, it solves a quadratic assignment problem that minimizes the pair-wise distance distortion under the transportation of distributions and thus could apply to distributions in different spaces. These properties make Gromov-Wasserstein widely applicable to many fields, such as… ▽ More

    Submitted 13 April, 2024; originally announced April 2024.

    Comments: Work in progress

  28. arXiv:2404.07771  [pdf, other

    cs.LG math.ST stat.ML

    An Overview of Diffusion Models: Applications, Guided Generation, Statistical Rates and Optimization

    Authors: Minshuo Chen, Song Mei, Jianqing Fan, Mengdi Wang

    Abstract: Diffusion models, a powerful and universal generative AI technology, have achieved tremendous success in computer vision, audio, reinforcement learning, and computational biology. In these applications, diffusion models provide flexible high-dimensional data modeling, and act as a sampler for generating new samples under active guidance towards task-desired properties. Despite the significant empi… ▽ More

    Submitted 11 April, 2024; originally announced April 2024.

  29. arXiv:2404.07442  [pdf, ps, other

    math.LO cs.LO

    Logics of False Belief and Radical Ignorance

    Authors: Jie Fan

    Abstract: In the literature, the question about how to axiomatize the transitive logic of false belief is thought of as hard and left as an open problem. In this paper, among other contributions, we deal with this problem. In more details, although the standard doxastic operator is undefinable with the operator of false belief, the former is {\em almost definable} with the latter. On one hand, the involved… ▽ More

    Submitted 10 April, 2024; originally announced April 2024.

    Comments: 21 pages

    MSC Class: 03B45; 03B42

  30. arXiv:2404.01700  [pdf, other

    cs.CV

    MotionChain: Conversational Motion Controllers via Multimodal Prompts

    Authors: Biao Jiang, Xin Chen, Chi Zhang, Fukun Yin, Zhuoyuan Li, Gang YU, Jiayuan Fan

    Abstract: Recent advancements in language models have demonstrated their adeptness in conducting multi-turn dialogues and retaining conversational context. However, this proficiency remains largely unexplored in other multimodal generative models, particularly in human motion models. By integrating multi-turn conversations in controlling continuous virtual human movements, generative human motion models can… ▽ More

    Submitted 3 April, 2024; v1 submitted 2 April, 2024; originally announced April 2024.

    Comments: 14 pages, 4 figures

  31. Decentralized Multimedia Data Sharing in IoV: A Learning-based Equilibrium of Supply and Demand

    Authors: Jiani Fan, Minrui Xu, Jiale Guo, Lwin Khin Shar, Jiawen Kang, Dusit Niyato, Kwok-Yan Lam

    Abstract: The Internet of Vehicles (IoV) has great potential to transform transportation systems by enhancing road safety, reducing traffic congestion, and improving user experience through onboard infotainment applications. Decentralized data sharing can improve security, privacy, reliability, and facilitate infotainment data sharing in IoVs. However, decentralized data sharing may not achieve the expected… ▽ More

    Submitted 29 March, 2024; originally announced March 2024.

    Journal ref: IEEE Transactions on Vehicular Technology (Volume: 73, Issue: 3, March 2024)

  32. A Learning-based Incentive Mechanism for Mobile AIGC Service in Decentralized Internet of Vehicles

    Authors: Jiani Fan, Minrui Xu, Ziyao Liu, Huanyi Ye, Chaojie Gu, Dusit Niyato, Kwok-Yan Lam

    Abstract: Artificial Intelligence-Generated Content (AIGC) refers to the paradigm of automated content generation utilizing AI models. Mobile AIGC services in the Internet of Vehicles (IoV) network have numerous advantages over traditional cloud-based AIGC services, including enhanced network efficiency, better reconfigurability, and stronger data security and privacy. Nonetheless, AIGC service provisioning… ▽ More

    Submitted 9 May, 2024; v1 submitted 29 March, 2024; originally announced March 2024.

    Comments: 2023 IEEE 98th Vehicular Technology Conference (VTC2023-Fall)

  33. Differentiated Security Architecture for Secure and Efficient Infotainment Data Communication in IoV Networks

    Authors: Jiani Fan, Lwin Khin Shar, Jiale Guo, Wenzhuo Yang, Dusit Niyato, Kwok-Yan Lam

    Abstract: This paper aims to provide differentiated security protection for infotainment data communication in Internet-of-Vehicle (IoV) networks. The IoV is a network of vehicles that uses various sensors, software, built-in hardware, and communication technologies to enable information exchange between pedestrians, cars, and urban infrastructure. Negligence on the security of infotainment data communicati… ▽ More

    Submitted 29 March, 2024; originally announced March 2024.

    Comments: 16th International Conference on Network and System Security

  34. arXiv:2403.17421  [pdf, other

    cs.IR cs.AI

    MA4DIV: Multi-Agent Reinforcement Learning for Search Result Diversification

    Authors: Yiqun Chen, Jiaxin Mao, Yi Zhang, Dehong Ma, Long Xia, Jun Fan, Daiting Shi, Zhicong Cheng, Simiu Gu, Dawei Yin

    Abstract: The objective of search result diversification (SRD) is to ensure that selected documents cover as many different subtopics as possible. Existing methods primarily utilize a paradigm of "greedy selection", i.e., selecting one document with the highest diversity score at a time. These approaches tend to be inefficient and are easily trapped in a suboptimal state. In addition, some other methods aim… ▽ More

    Submitted 27 March, 2024; v1 submitted 26 March, 2024; originally announced March 2024.

  35. arXiv:2403.16198  [pdf, other

    cs.CV

    Diffusion Model is a Good Pose Estimator from 3D RF-Vision

    Authors: Junqiao Fan, Jianfei Yang, Yuecong Xu, Lihua Xie

    Abstract: Human pose estimation (HPE) from Radio Frequency vision (RF-vision) performs human sensing using RF signals that penetrate obstacles without revealing privacy (e.g., facial information). Recently, mmWave radar has emerged as a promising RF-vision sensor, providing radar point clouds by processing RF signals. However, the mmWave radar has a limited resolution with severe noise, leading to inaccurat… ▽ More

    Submitted 24 March, 2024; originally announced March 2024.

  36. arXiv:2403.13236  [pdf, other

    eess.SY cs.AI math.OC

    Safety-Aware Reinforcement Learning for Electric Vehicle Charging Station Management in Distribution Network

    Authors: Jiarong Fan, Ariel Liebman, Hao Wang

    Abstract: The increasing integration of electric vehicles (EVs) into the grid can pose a significant risk to the distribution system operation in the absence of coordination. In response to the need for effective coordination of EVs within the distribution network, this paper presents a safety-aware reinforcement learning (RL) algorithm designed to manage EV charging stations while ensuring the satisfaction… ▽ More

    Submitted 19 March, 2024; originally announced March 2024.

    Comments: 2024 IEEE Power & Energy Society General Meeting (PESGM)

  37. arXiv:2403.11544  [pdf, ps, other

    cs.LG

    RL in Markov Games with Independent Function Approximation: Improved Sample Complexity Bound under the Local Access Model

    Authors: Junyi Fan, Yuxuan Han, Jialin Zeng, Jian-Feng Cai, Yang Wang, Yang Xiang, Jiheng Zhang

    Abstract: Efficiently learning equilibria with large state and action spaces in general-sum Markov games while overcoming the curse of multi-agency is a challenging problem. Recent works have attempted to solve this problem by employing independent linear function classes to approximate the marginal $Q$-value for each agent. However, existing sample complexity bounds under such a framework have a suboptimal… ▽ More

    Submitted 19 March, 2024; v1 submitted 18 March, 2024; originally announced March 2024.

    Comments: Accepted at the 27th International Conference on Artificial Intelligence and Statistics (AISTATS 2024)

  38. arXiv:2403.11530  [pdf, other

    cs.CV

    Continual Forgetting for Pre-trained Vision Models

    Authors: Hongbo Zhao, Bolin Ni, Haochen Wang, Junsong Fan, Fei Zhu, Yuxi Wang, Yuntao Chen, Gaofeng Meng, Zhaoxiang Zhang

    Abstract: For privacy and security concerns, the need to erase unwanted information from pre-trained vision models is becoming evident nowadays. In real-world scenarios, erasure requests originate at any time from both users and model owners. These requests usually form a sequence. Therefore, under such a setting, selective information is expected to be continuously removed from a pre-trained model while ma… ▽ More

    Submitted 18 March, 2024; originally announced March 2024.

    Comments: Accepted by CVPR 2024

  39. arXiv:2403.10925  [pdf, other

    cs.CV

    Learning Dual-Level Deformable Implicit Representation for Real-World Scale Arbitrary Super-Resolution

    Authors: Zhiheng Li, Muheng Li, Jixuan Fan, Lei Chen, Yansong Tang, Jie Zhou, Jiwen Lu

    Abstract: Scale arbitrary super-resolution based on implicit image function gains increasing popularity since it can better represent the visual world in a continuous manner. However, existing scale arbitrary works are trained and evaluated on simulated datasets, where low-resolution images are generated from their ground truths by the simplest bicubic downsampling. These models exhibit limited generalizati… ▽ More

    Submitted 16 March, 2024; originally announced March 2024.

  40. arXiv:2403.05352  [pdf, other

    cs.CV

    Enhancing Plausibility Evaluation for Generated Designs with Denoising Autoencoder

    Authors: Jiajie Fan, Amal Trigui, Thomas Bäck, Hao Wang

    Abstract: A great interest has arisen in using Deep Generative Models (DGM) for generative design. When assessing the quality of the generated designs, human designers focus more on structural plausibility, e.g., no missing component, rather than visual artifacts, e.g., noises in the images. Meanwhile, commonly used metrics such as Fréchet Inception Distance (FID) may not evaluate accurately as they tend to… ▽ More

    Submitted 8 March, 2024; originally announced March 2024.

  41. arXiv:2403.02714  [pdf, other

    cs.CV

    DomainVerse: A Benchmark Towards Real-World Distribution Shifts For Tuning-Free Adaptive Domain Generalization

    Authors: Feng Hou, ** Yuan, Ying Yang, Yang Liu, Yang Zhang, Cheng Zhong, Zhongchao Shi, Jian** Fan, Yong Rui, Zhiqiang He

    Abstract: Traditional cross-domain tasks, including domain adaptation and domain generalization, rely heavily on training model by source domain data. With the recent advance of vision-language models (VLMs), viewed as natural source models, the cross-domain task changes to directly adapt the pre-trained source model to arbitrary target domains equipped with prior domain knowledge, and we name this task Ada… ▽ More

    Submitted 5 March, 2024; originally announced March 2024.

    Comments: Currently in review for ICML 2024

  42. arXiv:2403.01422  [pdf, other

    cs.CV

    MovieLLM: Enhancing Long Video Understanding with AI-Generated Movies

    Authors: Zhende Song, Chenchen Wang, Jiamu Sheng, Chi Zhang, Gang Yu, Jiayuan Fan, Tao Chen

    Abstract: Development of multimodal models has marked a significant step forward in how machines understand videos. These models have shown promise in analyzing short video clips. However, when it comes to longer formats like movies, they often fall short. The main hurdles are the lack of high-quality, diverse video data and the intensive work required to collect or annotate such data. In face of these chal… ▽ More

    Submitted 24 June, 2024; v1 submitted 3 March, 2024; originally announced March 2024.

  43. arXiv:2403.01053  [pdf, other

    cs.LG cs.AI cs.CV

    Seeing Unseen: Discover Novel Biomedical Concepts via Geometry-Constrained Probabilistic Modeling

    Authors: Jianan Fan, Dongnan Liu, Hang Chang, Heng Huang, Mei Chen, Weidong Cai

    Abstract: Machine learning holds tremendous promise for transforming the fundamental practice of scientific discovery by virtue of its data-driven nature. With the ever-increasing stream of research data collection, it would be appealing to autonomously explore patterns and insights from observational data for discovering novel classes of phenotypes and concepts. However, in the biomedical domain, there are… ▽ More

    Submitted 5 March, 2024; v1 submitted 1 March, 2024; originally announced March 2024.

    Comments: CVPR 2024

  44. arXiv:2402.19011  [pdf, other

    cs.CR

    Ruledger: Ensuring Execution Integrity in Trigger-Action IoT Platforms

    Authors: **gwen Fan, Yi He, Bo Tang, Qi Li, Ravi Sandhu

    Abstract: Smart home IoT systems utilize trigger-action platforms, e.g., IFTTT, to manage devices from various vendors. However, they may be abused by triggering malicious rule execution with forged IoT devices or events violating the execution integrity and the intentions of the users. To address this issue, we propose a ledger based IoT platform called Ruledger, which ensures the correct execution of rule… ▽ More

    Submitted 29 February, 2024; originally announced February 2024.

    Journal ref: 10.1109/INFOCOM42981.2021.9488687

  45. Asphalt Concrete Characterization Using Digital Image Correlation: A Systematic Review of Best Practices, Applications, and Future Vision

    Authors: Siqi Wang, Zehui Zhu, Tao Ma, Jianwei Fan

    Abstract: Digital Image Correlation (DIC) is an optical technique that measures displacement and strain by tracking pattern movement in a sequence of captured images during testing. DIC has gained recognition in asphalt pavement engineering since the early 2000s. However, users often perceive the DIC technique as an out-of-box tool and lack a thorough understanding of its operational and measurement princip… ▽ More

    Submitted 26 February, 2024; originally announced February 2024.

    Comments: Journal of Testing and Evaluation

  46. arXiv:2402.16389  [pdf, other

    cs.CL cs.AI

    MoZIP: A Multilingual Benchmark to Evaluate Large Language Models in Intellectual Property

    Authors: Shiwen Ni, Minghuan Tan, Yuelin Bai, Fuqiang Niu, Min Yang, Bowen Zhang, Ruifeng Xu, Xiaojun Chen, Chengming Li, Xi** Hu, Ye Li, Jian** Fan

    Abstract: Large language models (LLMs) have demonstrated impressive performance in various natural language processing (NLP) tasks. However, there is limited understanding of how well LLMs perform in specific domains (e.g, the intellectual property (IP) domain). In this paper, we contribute a new benchmark, the first Multilingual-oriented quiZ on Intellectual Property (MoZIP), for the evaluation of LLMs in… ▽ More

    Submitted 26 February, 2024; originally announced February 2024.

    Journal ref: LREC-COLING 2024

  47. arXiv:2402.16347  [pdf, other

    cs.CL cs.DB

    CodeS: Towards Building Open-source Language Models for Text-to-SQL

    Authors: Haoyang Li, **g Zhang, Hanbing Liu, Ju Fan, Xiaokang Zhang, Jun Zhu, Renjie Wei, Hongyan Pan, Cui** Li, Hong Chen

    Abstract: Language models have shown promising performance on the task of translating natural language questions into SQL queries (Text-to-SQL). However, most of the state-of-the-art (SOTA) approaches rely on powerful yet closed-source large language models (LLMs), such as ChatGPT and GPT-4, which may have the limitations of unclear model architectures, data privacy risks, and expensive inference overheads.… ▽ More

    Submitted 26 February, 2024; originally announced February 2024.

    Comments: Accepted to SIGMOD 2024

  48. arXiv:2402.16033  [pdf, other

    cs.CV

    Diving Deep into Regions: Exploiting Regional Information Transformer for Single Image Deraining

    Authors: Baiang Li, Zhao Zhang, Huan Zheng, Xiaogang Xu, Yanyan Wei, **gyi Zhang, Jicong Fan, Meng Wang

    Abstract: Transformer-based Single Image Deraining (SID) methods have achieved remarkable success, primarily attributed to their robust capability in capturing long-range interactions. However, we've noticed that current methods handle rain-affected and unaffected regions concurrently, overlooking the disparities between these areas, resulting in confusion between rain streaks and background parts, and inab… ▽ More

    Submitted 25 February, 2024; originally announced February 2024.

  49. arXiv:2402.11224  [pdf, other

    cs.LG cs.CR

    Neural Networks with (Low-Precision) Polynomial Approximations: New Insights and Techniques for Accuracy Improvement

    Authors: Chi Zhang, **g**g Fan, Man Ho Au, Siu Ming Yiu

    Abstract: Replacing non-polynomial functions (e.g., non-linear activation functions such as ReLU) in a neural network with their polynomial approximations is a standard practice in privacy-preserving machine learning. The resulting neural network, called polynomial approximation of neural network (PANN) in this paper, is compatible with advanced cryptosystems to enable privacy-preserving model inference. Us… ▽ More

    Submitted 7 June, 2024; v1 submitted 17 February, 2024; originally announced February 2024.

  50. arXiv:2402.06213  [pdf, other

    cs.CV

    Multi-source-free Domain Adaptation via Uncertainty-aware Adaptive Distillation

    Authors: Yaxuan Song, Jianan Fan, Dongnan Liu, Weidong Cai

    Abstract: Source-free domain adaptation (SFDA) alleviates the domain discrepancy among data obtained from domains without accessing the data for the awareness of data privacy. However, existing conventional SFDA methods face inherent limitations in medical contexts, where medical data are typically collected from multiple institutions using various equipment. To address this problem, we propose a simple yet… ▽ More

    Submitted 9 February, 2024; originally announced February 2024.

    Comments: Accepted by ISBI 2024