Skip to main content

Showing 1–50 of 104 results for author: Wan, S

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.17681  [pdf, other

    cs.CL

    VarBench: Robust Language Model Benchmarking Through Dynamic Variable Perturbation

    Authors: Kun Qian, Shunji Wan, Claudia Tang, Youzhi Wang, Xuanming Zhang, Maximillian Chen, Zhou Yu

    Abstract: As large language models achieve impressive scores on traditional benchmarks, an increasing number of researchers are becoming concerned about benchmark data leakage during pre-training, commonly known as the data contamination problem. To ensure fair evaluation, recent benchmarks release only the training and validation sets, kee** the test set labels closed-source. They require anyone wishing… ▽ More

    Submitted 26 June, 2024; v1 submitted 25 June, 2024; originally announced June 2024.

  2. Crowd-Sourced NeRF: Collecting Data from Production Vehicles for 3D Street View Reconstruction

    Authors: Tong Qin, Changze Li, Haoyang Ye, Shaowei Wan, Minzhen Li, Hongwei Liu, Ming Yang

    Abstract: Recently, Neural Radiance Fields (NeRF) achieved impressive results in novel view synthesis. Block-NeRF showed the capability of leveraging NeRF to build large city-scale models. For large-scale modeling, a mass of image data is necessary. Collecting images from specially designed data-collection vehicles can not support large-scale applications. How to acquire massive high-quality data remains an… ▽ More

    Submitted 23 June, 2024; originally announced June 2024.

  3. arXiv:2406.09486  [pdf, other

    cs.CV cs.AI

    SeMOPO: Learning High-quality Model and Policy from Low-quality Offline Visual Datasets

    Authors: Shenghua Wan, Ziyuan Chen, Le Gan, Shuai Feng, De-Chuan Zhan

    Abstract: Model-based offline reinforcement Learning (RL) is a promising approach that leverages existing data effectively in many real-world applications, especially those involving high-dimensional inputs like images and videos. To alleviate the distribution shift issue in offline RL, existing model-based methods heavily rely on the uncertainty of learned dynamics. However, the model uncertainty estimatio… ▽ More

    Submitted 13 June, 2024; originally announced June 2024.

    Comments: 23 pages, 10 figures

  4. arXiv:2405.02435  [pdf, other

    cs.CR cs.SE

    Bridging the Gap: A Study of AI-based Vulnerability Management between Industry and Academia

    Authors: Shengye Wan, Joshua Saxe, Craig Gomes, Sahana Chennabasappa, Avilash Rath, Kun Sun, Xinda Wang

    Abstract: Recent research advances in Artificial Intelligence (AI) have yielded promising results for automated software vulnerability management. AI-based models are reported to greatly outperform traditional static analysis tools, indicating a substantial workload relief for security engineers. However, the industry remains very cautious and selective about integrating AI-based techniques into their secur… ▽ More

    Submitted 3 May, 2024; originally announced May 2024.

    Comments: Accepted by IEEE/IFIP International Conference on Dependable Systems and Networks, Industry Track, 2024

  5. arXiv:2404.13161  [pdf, other

    cs.CR cs.LG

    CyberSecEval 2: A Wide-Ranging Cybersecurity Evaluation Suite for Large Language Models

    Authors: Manish Bhatt, Sahana Chennabasappa, Yue Li, Cyrus Nikolaidis, Daniel Song, Shengye Wan, Faizan Ahmad, Cornelius Aschermann, Yaohui Chen, Dhaval Kapil, David Molnar, Spencer Whitman, Joshua Saxe

    Abstract: Large language models (LLMs) introduce new security risks, but there are few comprehensive evaluation suites to measure and reduce these risks. We present BenchmarkName, a novel benchmark to quantify LLM security risks and capabilities. We introduce two new areas for testing: prompt injection and code interpreter abuse. We evaluated multiple state-of-the-art (SOTA) LLMs, including GPT-4, Mistral,… ▽ More

    Submitted 19 April, 2024; originally announced April 2024.

  6. arXiv:2404.08916  [pdf, other

    cs.CV cs.LG

    Meply: A Large-scale Dataset and Baseline Evaluations for Metastatic Perirectal Lymph Node Detection and Segmentation

    Authors: Weidong Guo, Hantao Zhang, Shouhong Wan, Bingbing Zou, Wanqin Wang, Chenyang Qiu, Jun Li, Peiquan **

    Abstract: Accurate segmentation of metastatic lymph nodes in rectal cancer is crucial for the staging and treatment of rectal cancer. However, existing segmentation approaches face challenges due to the absence of pixel-level annotated datasets tailored for lymph nodes around the rectum. Additionally, metastatic lymph nodes are characterized by their relatively small size, irregular shapes, and lower contra… ▽ More

    Submitted 13 April, 2024; originally announced April 2024.

    Comments: 13 pages

  7. arXiv:2404.03386  [pdf, other

    cs.RO cs.AI cs.LG

    SENSOR: Imitate Third-Person Expert's Behaviors via Active Sensoring

    Authors: Kaichen Huang, Minghao Shao, Shenghua Wan, Hai-Hang Sun, Shuai Feng, Le Gan, De-Chuan Zhan

    Abstract: In many real-world visual Imitation Learning (IL) scenarios, there is a misalignment between the agent's and the expert's perspectives, which might lead to the failure of imitation. Previous methods have generally solved this problem by domain alignment, which incurs extra computation and storage costs, and these methods fail to handle the \textit{hard cases} where the viewpoint gap is too large.… ▽ More

    Submitted 4 April, 2024; originally announced April 2024.

  8. arXiv:2404.03382  [pdf, other

    cs.LG cs.AI

    DIDA: Denoised Imitation Learning based on Domain Adaptation

    Authors: Kaichen Huang, Hai-Hang Sun, Shenghua Wan, Minghao Shao, Shuai Feng, Le Gan, De-Chuan Zhan

    Abstract: Imitating skills from low-quality datasets, such as sub-optimal demonstrations and observations with distractors, is common in real-world applications. In this work, we focus on the problem of Learning from Noisy Demonstrations (LND), where the imitator is required to learn from data with noise that often occurs during the processes of data collection or transmission. Previous IL methods improve t… ▽ More

    Submitted 4 April, 2024; originally announced April 2024.

  9. arXiv:2404.00999  [pdf, other

    cs.CL

    What Causes the Failure of Explicit to Implicit Discourse Relation Recognition?

    Authors: Wei Liu, Stephen Wan, Michael Strube

    Abstract: We consider an unanswered question in the discourse processing community: why do relation classifiers trained on explicit examples (with connectives removed) perform poorly in real implicit scenarios? Prior work claimed this is due to linguistic dissimilarity between explicit and implicit examples but provided no empirical evidence. In this study, we show that one cause for such failure is a label… ▽ More

    Submitted 1 April, 2024; originally announced April 2024.

    Comments: Accepted by NAACL2024 (Long Paper)

  10. arXiv:2403.14066  [pdf, other

    eess.IV cs.CV

    LeFusion: Synthesizing Myocardial Pathology on Cardiac MRI via Lesion-Focus Diffusion Models

    Authors: Hantao Zhang, Jiancheng Yang, Shouhong Wan, Pascal Fua

    Abstract: Data generated in clinical practice often exhibits biases, such as long-tail imbalance and algorithmic unfairness. This study aims to mitigate these challenges through data synthesis. Previous efforts in medical imaging synthesis have struggled with separating lesion information from background context, leading to difficulties in generating high-quality backgrounds and limited control over the syn… ▽ More

    Submitted 20 March, 2024; originally announced March 2024.

    Comments: 13 pages

  11. arXiv:2403.09976  [pdf, other

    cs.LG cs.CV

    AD3: Implicit Action is the Key for World Models to Distinguish the Diverse Visual Distractors

    Authors: Yucen Wang, Shenghua Wan, Le Gan, Shuai Feng, De-Chuan Zhan

    Abstract: Model-based methods have significantly contributed to distinguishing task-irrelevant distractors for visual control. However, prior research has primarily focused on heterogeneous distractors like noisy background videos, leaving homogeneous distractors that closely resemble controllable agents largely unexplored, which poses significant challenges to existing methods. To tackle this problem, we p… ▽ More

    Submitted 5 June, 2024; v1 submitted 14 March, 2024; originally announced March 2024.

  12. arXiv:2401.03673  [pdf, other

    cs.SI physics.data-an

    Comparing discriminating abilities of evaluation metrics in link prediction

    Authors: Xinshan Jiao, Shuyan Wan, Qian Liu, Yilin Bi, Yan-Li Lee, En Xu, Dong Hao, Tao Zhou

    Abstract: Link prediction aims to predict the potential existence of links between two unconnected nodes within a network based on the known topological characteristics. Evaluation metrics are used to assess the effectiveness of algorithms in link prediction. The discriminating ability of these evaluation metrics is vitally important for accurately evaluating link prediction algorithms. In this study, we pr… ▽ More

    Submitted 8 January, 2024; originally announced January 2024.

  13. arXiv:2312.04724  [pdf, other

    cs.CR cs.LG

    Purple Llama CyberSecEval: A Secure Coding Benchmark for Language Models

    Authors: Manish Bhatt, Sahana Chennabasappa, Cyrus Nikolaidis, Shengye Wan, Ivan Evtimov, Dominik Gabi, Daniel Song, Faizan Ahmad, Cornelius Aschermann, Lorenzo Fontana, Sasha Frolov, Ravi Prakash Giri, Dhaval Kapil, Yiannis Kozyrakis, David LeBlanc, James Milazzo, Aleksandar Straumann, Gabriel Synnaeve, Varun Vontimitta, Spencer Whitman, Joshua Saxe

    Abstract: This paper presents CyberSecEval, a comprehensive benchmark developed to help bolster the cybersecurity of Large Language Models (LLMs) employed as coding assistants. As what we believe to be the most extensive unified cybersecurity safety benchmark to date, CyberSecEval provides a thorough evaluation of LLMs in two crucial security domains: their propensity to generate insecure code and their lev… ▽ More

    Submitted 7 December, 2023; originally announced December 2023.

  14. arXiv:2311.14281  [pdf, ps, other

    cs.CV

    Multi-modal Instance Refinement for Cross-domain Action Recognition

    Authors: Yuan Qing, Naixing Wu, Shaohua Wan, Lixin Duan

    Abstract: Unsupervised cross-domain action recognition aims at adapting the model trained on an existing labeled source domain to a new unlabeled target domain. Most existing methods solve the task by directly aligning the feature distributions of source and target domains. However, this would cause negative transfer during domain adaptation due to some negative training samples in both domains. In the sour… ▽ More

    Submitted 24 November, 2023; originally announced November 2023.

    Comments: Accepted by PRCV 2023

  15. arXiv:2311.13165  [pdf, other

    cs.AI

    Multimodal Large Language Models: A Survey

    Authors: Jiayang Wu, Wensheng Gan, Zefeng Chen, Shicheng Wan, Philip S. Yu

    Abstract: The exploration of multimodal language models integrates multiple data types, such as images, text, language, audio, and other heterogeneity. While the latest large language models excel in text-based tasks, they often struggle to understand and process other data types. Multimodal models address this limitation by combining various modalities, enabling a more comprehensive understanding of divers… ▽ More

    Submitted 22 November, 2023; originally announced November 2023.

    Comments: IEEE BigData 2023. 10 pages

  16. arXiv:2311.05804  [pdf, other

    cs.AI

    Model-as-a-Service (MaaS): A Survey

    Authors: Wensheng Gan, Shicheng Wan, Philip S. Yu

    Abstract: Due to the increased number of parameters and data in the pre-trained model exceeding a certain level, a foundation model (e.g., a large language model) can significantly improve downstream task performance and emerge with some novel special abilities (e.g., deep learning, complex reasoning, and human alignment) that were not present before. Foundation models are a form of generative artificial in… ▽ More

    Submitted 9 November, 2023; originally announced November 2023.

    Comments: Preprint. 3 figures, 1 tables

  17. arXiv:2310.09705  [pdf, other

    cs.LG cs.SI

    SGA: A Graph Augmentation Method for Signed Graph Neural Networks

    Authors: Zeyu Zhang, Shuyan Wan, Sijie Wang, Xianda Zheng, Xinrui Zhang, Kaiqi Zhao, Jiamou Liu, Dong Hao

    Abstract: Signed Graph Neural Networks (SGNNs) are vital for analyzing complex patterns in real-world signed graphs containing positive and negative links. However, three key challenges hinder current SGNN-based signed graph representation learning: sparsity in signed graphs leaves latent structures undiscovered, unbalanced triangles pose representation difficulties for SGNN models, and real-world signed gr… ▽ More

    Submitted 14 October, 2023; originally announced October 2023.

  18. arXiv:2308.08283  [pdf, other

    eess.IV cs.CV cs.LG

    CARE: A Large Scale CT Image Dataset and Clinical Applicable Benchmark Model for Rectal Cancer Segmentation

    Authors: Hantao Zhang, Weidong Guo, Chenyang Qiu, Shouhong Wan, Bingbing Zou, Wanqin Wang, Peiquan **

    Abstract: Rectal cancer segmentation of CT image plays a crucial role in timely clinical diagnosis, radiotherapy treatment, and follow-up. Although current segmentation methods have shown promise in delineating cancerous tissues, they still encounter challenges in achieving high segmentation precision. These obstacles arise from the intricate anatomical structures of the rectum and the difficulties in perfo… ▽ More

    Submitted 16 August, 2023; originally announced August 2023.

    Comments: 8 pages

  19. arXiv:2308.00009  [pdf

    eess.IV cs.LG

    A 3D deep learning classifier and its explainability when assessing coronary artery disease

    Authors: Wing Keung Cheung, Jeremy Kalindjian, Robert Bell, Arjun Nair, Leon J. Menezes, Riyaz Patel, Simon Wan, Kacy Chou, Jiahang Chen, Ryo Torii, Rhodri H. Davies, James C. Moon, Daniel C. Alexander, Joseph Jacob

    Abstract: Early detection and diagnosis of coronary artery disease (CAD) could save lives and reduce healthcare costs. In this study, we propose a 3D Resnet-50 deep learning model to directly classify normal subjects and CAD patients on computed tomography coronary angiography images. Our proposed method outperforms a 2D Resnet-50 model by 23.65%. Explainability is also provided by using a Grad-GAM. Further… ▽ More

    Submitted 29 July, 2023; originally announced August 2023.

  20. arXiv:2306.10695  [pdf, other

    cs.LG cs.AI cs.CV

    SeMAIL: Eliminating Distractors in Visual Imitation via Separated Models

    Authors: Shenghua Wan, Yucen Wang, Minghao Shao, Ruying Chen, De-Chuan Zhan

    Abstract: Model-based imitation learning (MBIL) is a popular reinforcement learning method that improves sample efficiency on high-dimension input sources, such as images and videos. Following the convention of MBIL research, existing algorithms are highly deceptive by task-irrelevant information, especially moving distractors in videos. To tackle this problem, we propose a new algorithm - named Separated M… ▽ More

    Submitted 19 June, 2023; originally announced June 2023.

    Comments: 18 pages, 7 figures

  21. arXiv:2306.10528  [pdf, other

    cs.DC

    Boosting the Performance of Degraded Reads in RS-coded Distributed Storage Systems

    Authors: Tian Xie, Juntao Fang, Shenggang wan, Changsheng Xie, Xubin He

    Abstract: Reed-Solomon (RS) codes have been increasingly adopted by distributed storage systems in place of replication,because they provide the same level of availability with much lower storage overhead. However, a key drawback of those RS-coded distributed storage systems is the poor latency of degraded reads, which can be incurred by data failures or hot spots,and are not rare in production environments… ▽ More

    Submitted 18 June, 2023; originally announced June 2023.

  22. arXiv:2305.06272  [pdf, other

    cs.IR cs.CR cs.DC cs.LG

    FedPDD: A Privacy-preserving Double Distillation Framework for Cross-silo Federated Recommendation

    Authors: Sheng Wan, Dashan Gao, Hanlin Gu, Daning Hu

    Abstract: Cross-platform recommendation aims to improve recommendation accuracy by gathering heterogeneous features from different platforms. However, such cross-silo collaborations between platforms are restricted by increasingly stringent privacy protection regulations, thus data cannot be aggregated for training. Federated learning (FL) is a practical solution to deal with the data silo problem in recomm… ▽ More

    Submitted 30 January, 2024; v1 submitted 9 May, 2023; originally announced May 2023.

    Comments: Accepted by IJCNN2023

  23. arXiv:2305.00044  [pdf, other

    econ.GN cs.LG

    Hedonic Prices and Quality Adjusted Price Indices Powered by AI

    Authors: Patrick Bajari, Zhihao Cen, Victor Chernozhukov, Manoj Manukonda, Suhas Vijaykumar, ** Wang, Ramon Huerta, Junbo Li, Ling Leng, George Monokroussos, Shan Wan

    Abstract: Accurate, real-time measurements of price index changes using electronic records are essential for tracking inflation and productivity in today's economic environment. We develop empirical hedonic models that can process large amounts of unstructured product data (text, images, prices, quantities) and output accurate hedonic price estimates and derived indices. To accomplish this, we generate abst… ▽ More

    Submitted 28 April, 2023; originally announced May 2023.

    Comments: Revised CEMMAP Working Paper (CWP08/23)

  24. arXiv:2304.10770  [pdf, other

    cs.LG cs.AI cs.IT

    DEIR: Efficient and Robust Exploration through Discriminative-Model-Based Episodic Intrinsic Rewards

    Authors: Shanchuan Wan, Yu** Tang, Yingtao Tian, Tomoyuki Kaneko

    Abstract: Exploration is a fundamental aspect of reinforcement learning (RL), and its effectiveness is a deciding factor in the performance of RL algorithms, especially when facing sparse extrinsic rewards. Recent studies have shown the effectiveness of encouraging exploration with intrinsic rewards estimated from novelties in observations. However, there is a gap between the novelty of an observation and a… ▽ More

    Submitted 18 May, 2023; v1 submitted 21 April, 2023; originally announced April 2023.

    Comments: Accepted as a conference paper to the 32nd International Joint Conference on Artificial Intelligence (IJCAI-23)

  25. arXiv:2304.06632  [pdf, other

    cs.AI cs.CY cs.HC

    AI-Generated Content (AIGC): A Survey

    Authors: Jiayang Wu, Wensheng Gan, Zefeng Chen, Shicheng Wan, Hong Lin

    Abstract: To address the challenges of digital intelligence in the digital economy, artificial intelligence-generated content (AIGC) has emerged. AIGC uses artificial intelligence to assist or replace manual content generation by generating content based on user-inputted keywords or requirements. The development of large model algorithms has significantly strengthened the capabilities of AIGC, which makes A… ▽ More

    Submitted 25 March, 2023; originally announced April 2023.

    Comments: Preprint. 14 figures, 4 tables

  26. arXiv:2304.06111  [pdf, other

    cs.CY cs.NI

    Web3: The Next Internet Revolution

    Authors: Shicheng Wan, Hong Lin, Wensheng Gan, Jiahui Chen, Philip S. Yu

    Abstract: Since the first appearance of the World Wide Web, people more rely on the Web for their cyber social activities. The second phase of World Wide Web, named Web 2.0, has been extensively attracting worldwide people that participate in building and enjoying the virtual world. Nowadays, the next internet revolution: Web3 is going to open new opportunities for traditional social models. The decentraliz… ▽ More

    Submitted 22 March, 2023; originally announced April 2023.

    Comments: Preprint. 5 figures, 2 tables

  27. arXiv:2304.06032  [pdf, other

    cs.CY

    Web 3.0: The Future of Internet

    Authors: Wensheng Gan, Zhenqiang Ye, Shicheng Wan, Philip S. Yu

    Abstract: With the rapid growth of the Internet, human daily life has become deeply bound to the Internet. To take advantage of massive amounts of data and information on the internet, the Web architecture is continuously being reinvented and upgraded. From the static informative characteristics of Web 1.0 to the dynamic interactive features of Web 2.0, scholars and engineers have worked hard to make the in… ▽ More

    Submitted 23 March, 2023; originally announced April 2023.

    Comments: ACM Web Conference 2023

  28. arXiv:2303.04627  [pdf, other

    cs.DB

    Fairness-driven Skilled Task Assignment with Extra Budget in Spatial Crowdsourcing

    Authors: Yunjun Zhou, Shuhan Wan, Detian Zhang, Shiting Wen

    Abstract: With the prevalence of mobile devices and ubiquitous wireless networks, spatial crowdsourcing has attracted much attention from both academic and industry communities. On spatial crowdsourcing platforms, task requesters can publish spatial tasks and workers need to move to destinations to perform them. In this paper, we formally define the Skilled Task Assignment with Extra Budget (STAEB), which a… ▽ More

    Submitted 8 March, 2023; originally announced March 2023.

  29. arXiv:2301.10167  [pdf, other

    eess.SP cs.LG physics.optics

    EEG Opto-processor: epileptic seizure detection using diffractive photonic computing units

    Authors: Tao Yan, Maoqi Zhang, Sen Wan, Kaifeng Shang, Haiou Zhang, Xun Cao, Xing Lin, Qionghai Dai

    Abstract: Electroencephalography (EEG) analysis extracts critical information from brain signals, which has provided fundamental support for various applications, including brain-disease diagnosis and brain-computer interface. However, the real-time processing of large-scale EEG signals at high energy efficiency has placed great challenges for electronic processors on edge computing devices. Here, we propos… ▽ More

    Submitted 9 December, 2022; originally announced January 2023.

    Comments: 22 pages, 5 figures

  30. arXiv:2212.10252  [pdf, other

    cs.AI

    MDL-based Compressing Sequential Rules

    Authors: Xinhong Chen, Wensheng Gan, Shicheng Wan, Tianlong Gu

    Abstract: Nowadays, with the rapid development of the Internet, the era of big data has come. The Internet generates huge amounts of data every day. However, extracting meaningful information from massive data is like looking for a needle in a haystack. Data mining techniques can provide various feasible methods to solve this problem. At present, many sequential rule mining (SRM) algorithms are presented to… ▽ More

    Submitted 20 December, 2022; originally announced December 2022.

    Comments: Preprint. 6 figures, 8 tables

  31. arXiv:2211.14951  [pdf, other

    cs.CY cs.DB

    Metaverse in Education: Vision, Opportunities, and Challenges

    Authors: Hong Lin, Shicheng Wan, Wensheng Gan, Jiahui Chen, Han-Chieh Chao

    Abstract: Traditional education has been updated with the development of information technology in human history. Within big data and cyber-physical systems, the Metaverse has generated strong interest in various applications (e.g., entertainment, business, and cultural travel) over the last decade. As a novel social work idea, the Metaverse consists of many kinds of technologies, e.g., big data, interactio… ▽ More

    Submitted 27 November, 2022; originally announced November 2022.

    Comments: IEEE BigData 2022. 10 pages, 5 figures, 3 tables

  32. arXiv:2208.14230  [pdf, other

    cs.DB cs.AI

    A Generic Algorithm for Top-K On-Shelf Utility Mining

    Authors: Jiahui Chen, Xu Guo, Wensheng Gan, Shichen Wan, Philip S. Yu

    Abstract: On-shelf utility mining (OSUM) is an emerging research direction in data mining. It aims to discover itemsets that have high relative utility in their selling time period. Compared with traditional utility mining, OSUM can find more practical and meaningful patterns in real-life applications. However, there is a major drawback to traditional OSUM. For normal users, it is hard to define a minimum t… ▽ More

    Submitted 26 August, 2022; originally announced August 2022.

    Comments: Preprint. 4 figures, 4 table

  33. arXiv:2208.12551  [pdf, other

    cs.AI

    Itemset Utility Maximization with Correlation Measure

    Authors: Jiahui Chen, Yixin Xu, Shicheng Wan, Wensheng Gan, Jerry Chun-Wei Lin

    Abstract: As an important data mining technology, high utility itemset mining (HUIM) is used to find out interesting but hidden information (e.g., profit and risk). HUIM has been widely applied in many application scenarios, such as market analysis, medical detection, and web click stream analysis. However, most previous HUIM approaches often ignore the relationship between items in an itemset. Therefore, m… ▽ More

    Submitted 26 August, 2022; originally announced August 2022.

    Comments: Preprint. 5 figures, 7 tables

  34. arXiv:2208.12439  [pdf, other

    cs.DB cs.AI

    Temporal Fuzzy Utility Maximization with Remaining Measure

    Authors: Shicheng Wan, Zhenqiang Ye, Wensheng Gan, Jiahui Chen

    Abstract: High utility itemset mining approaches discover hidden patterns from large amounts of temporal data. However, an inescapable problem of high utility itemset mining is that its discovered results hide the quantities of patterns, which causes poor interpretability. The results only reflect the shop** trends of customers, which cannot help decision makers quantify collected information. In linguist… ▽ More

    Submitted 26 August, 2022; originally announced August 2022.

    Comments: Preprint. 9 figures, 11 tables

  35. arXiv:2208.04589  [pdf, other

    cs.LG cs.AI

    Long-term Causal Effects Estimation via Latent Surrogates Representation Learning

    Authors: Ruichu Cai, Weilin Chen, Zeqin Yang, Shu Wan, Chen Zheng, Xiaoqing Yang, Jiecheng Guo

    Abstract: Estimating long-term causal effects based on short-term surrogates is a significant but challenging problem in many real-world applications, e.g., marketing and medicine. Despite its success in certain domains, most existing methods estimate causal effects in an idealistic and simplistic way - ignoring the causal structure among short-term outcomes and treating all of them as surrogates. However,… ▽ More

    Submitted 21 November, 2023; v1 submitted 9 August, 2022; originally announced August 2022.

  36. Incremental Few-Shot Semantic Segmentation via Embedding Adaptive-Update and Hyper-class Representation

    Authors: Guangchen Shi, Yirui Wu, Jun Liu, Shaohua Wan, Wenhai Wang, Tong Lu

    Abstract: Incremental few-shot semantic segmentation (IFSS) targets at incrementally expanding model's capacity to segment new class of images supervised by only a few samples. However, features learned on old classes could significantly drift, causing catastrophic forgetting. Moreover, few samples for pixel-level segmentation on new classes lead to notorious overfitting issues in each learning session. In… ▽ More

    Submitted 26 July, 2022; originally announced July 2022.

    Journal ref: Proceedings of the 30th ACM International Conference on Multimedia 2022

  37. arXiv:2206.07893  [pdf, other

    cs.CV cs.MM eess.IV

    PeQuENet: Perceptual Quality Enhancement of Compressed Video with Adaptation- and Attention-based Network

    Authors: Sai** Zhang, Luis Herranz, Marta Mrak, Marc Gorriz Blanch, Shuai Wan, Fuzheng Yang

    Abstract: In this paper we propose a generative adversarial network (GAN) framework to enhance the perceptual quality of compressed videos. Our framework includes attention and adaptation to different quantization parameters (QPs) in a single model. The attention module exploits global receptive fields that can capture and align long-range correlations between consecutive frames, which can be beneficial for… ▽ More

    Submitted 15 June, 2022; originally announced June 2022.

  38. arXiv:2206.06157  [pdf, other

    cs.DB cs.AI

    Towards Target High-Utility Itemsets

    Authors: **bao Miao, Wensheng Gan, Shicheng Wan, Yongdong Wu, Philippe Fournier-Viger

    Abstract: For applied intelligence, utility-driven pattern discovery algorithms can identify insightful and useful patterns in databases. However, in these techniques for pattern discovery, the number of patterns can be huge, and the user is often only interested in a few of those patterns. Hence, targeted high-utility itemset mining has emerged as a key research topic, where the aim is to find a subset of… ▽ More

    Submitted 9 June, 2022; originally announced June 2022.

    Comments: Preprint. 6 figures, 5 tables

  39. arXiv:2205.13384  [pdf, other

    cs.CV

    Continual Learning for Visual Search with Backward Consistent Feature Embedding

    Authors: Timmy S. T. Wan, Jun-Cheng Chen, Tzer-Yi Wu, Chu-Song Chen

    Abstract: In visual search, the gallery set could be incrementally growing and added to the database in practice. However, existing methods rely on the model trained on the entire dataset, ignoring the continual updating of the model. Besides, as the model updates, the new model must re-extract features for the entire gallery set to maintain compatible feature space, imposing a high computational cost for a… ▽ More

    Submitted 26 May, 2022; originally announced May 2022.

    Comments: 15 pages with supplementary material; accepted to CVPR 2022

  40. Hyperspectral Image Classification With Contrastive Graph Convolutional Network

    Authors: Wentao Yu, Sheng Wan, Guangyu Li, Jian Yang, Chen Gong

    Abstract: Recently, Graph Convolutional Network (GCN) has been widely used in Hyperspectral Image (HSI) classification due to its satisfactory performance. However, the number of labeled pixels is very limited in HSI, and thus the available supervision information is usually insufficient, which will inevitably degrade the representation ability of most existing GCN-based methods. To enhance the feature repr… ▽ More

    Submitted 11 May, 2022; originally announced May 2022.

  41. arXiv:2205.06754  [pdf, other

    eess.IV cs.CV

    Slimmable Video Codec

    Authors: Zhaocheng Liu, Luis Herranz, Fei Yang, Sai** Zhang, Shuai Wan, Marta Mrak, Marc Górriz Blanch

    Abstract: Neural video compression has emerged as a novel paradigm combining trainable multilayer neural networks and machine learning, achieving competitive rate-distortion (RD) performances, but still remaining impractical due to heavy neural architectures, with large memory and computational demands. In addition, models are usually optimized for a single RD tradeoff. Recent slimmable image codecs can dyn… ▽ More

    Submitted 13 May, 2022; originally announced May 2022.

    Comments: Computer Vision and Pattern Recognition Workshop(CLIC2022)

  42. arXiv:2205.03380  [pdf, ps, other

    eess.IV cs.CV math.OC

    Multi-mode Tensor Train Factorization with Spatial-spectral Regularization for Remote Sensing Images Recovery

    Authors: Gaohang Yu, Shaochun Wan, Liqun Qi, Yanwei Xu

    Abstract: Tensor train (TT) factorization and corresponding TT rank, which can well express the low-rankness and mode correlations of higher-order tensors, have attracted much attention in recent years. However, TT factorization based methods are generally not sufficient to characterize low-rankness along each mode of third-order tensor. Inspired by this, we generalize the tensor train factorization to the… ▽ More

    Submitted 5 May, 2022; originally announced May 2022.

    Comments: 21 pages

  43. arXiv:2204.01690  [pdf, other

    cs.CR cs.LG

    Deep Image: A precious image based deep learning method for online malware detection in IoT Environment

    Authors: Meysam Ghahramani, Rahim Taheri, Mohammad Shojafar, Reza Javidan, Shaohua Wan

    Abstract: The volume of malware and the number of attacks in IoT devices are rising everyday, which encourages security professionals to continually enhance their malware analysis tools. Researchers in the field of cyber security have extensively explored the usage of sophisticated analytics and the efficiency of malware detection. With the introduction of new malware kinds and attack routes, security exper… ▽ More

    Submitted 4 April, 2022; originally announced April 2022.

    Comments: 10 pages, 17 figures, SUBMITTED TO IEEE INTERNET OF THINGS JOURNAL, MARCH 2022

  44. arXiv:2203.10975  [pdf, other

    stat.ML cs.LG stat.AP stat.ME

    GCF: Generalized Causal Forest for Heterogeneous Treatment Effect Estimation in Online Marketplace

    Authors: Shu Wan, Chen Zheng, Zhonggen Sun, Mengfan Xu, Xiaoqing Yang, Hongtu Zhu, Jiecheng Guo

    Abstract: Uplift modeling is a rapidly growing approach that utilizes causal inference and machine learning methods to directly estimate the heterogeneous treatment effects, which has been widely applied to various online marketplaces to assist large-scale decision-making in recent years. The existing popular models, like causal forest (CF), are limited to either discrete treatments or posing parametric ass… ▽ More

    Submitted 23 September, 2022; v1 submitted 21 March, 2022; originally announced March 2022.

  45. arXiv:2201.08944  [pdf, other

    eess.IV cs.CV cs.MM

    DCNGAN: A Deformable Convolutional-Based GAN with QP Adaptation for Perceptual Quality Enhancement of Compressed Video

    Authors: Sai** Zhang, Luis Herranz, Marta Mrak, Marc Gorriz Blanch, Shuai Wan, Fuzheng Yang

    Abstract: In this paper, we propose a deformable convolution-based generative adversarial network (DCNGAN) for perceptual quality enhancement of compressed videos. DCNGAN is also adaptive to the quantization parameters (QPs). Compared with optical flows, deformable convolutions are more effective and efficient to align frames. Deformable convolutions can operate on multiple frames, thus leveraging more temp… ▽ More

    Submitted 27 January, 2022; v1 submitted 21 January, 2022; originally announced January 2022.

    Comments: 5 pages, 4 figures

  46. arXiv:2112.01039  [pdf, ps, other

    cs.LG cs.AI cs.DC cs.MA cs.NI

    How global observation works in Federated Learning: Integrating vertical training into Horizontal Federated Learning

    Authors: Shuo Wan, Jiaxun Lu, **yi Fan, Yunfeng Shao, Chenghui Peng, Khaled B. Letaief

    Abstract: Federated learning (FL) has recently emerged as a transformative paradigm that jointly train a model with distributed data sets in IoT while avoiding the need for central data collection. Due to the limited observation range, such data sets can only reflect local information, which limits the quality of trained models. In practice, the global information and local observations would require a join… ▽ More

    Submitted 10 December, 2021; v1 submitted 2 December, 2021; originally announced December 2021.

  47. arXiv:2111.15026  [pdf, other

    cs.DB cs.AI

    Anomaly Rule Detection in Sequence Data

    Authors: Wensheng Gan, Lili Chen, Shicheng Wan, Jiahui Chen, Chien-Ming Chen

    Abstract: Analyzing sequence data usually leads to the discovery of interesting patterns and then anomaly detection. In recent years, numerous frameworks and methods have been proposed to discover interesting patterns in sequence data as well as detect anomalous behavior. However, existing algorithms mainly focus on frequency-driven analytic, and they are challenging to be applied in real-world settings. In… ▽ More

    Submitted 29 November, 2021; originally announced November 2021.

    Comments: Preprint. 6 figures, 7 tables

  48. arXiv:2111.00309  [pdf, other

    cs.DB cs.AI

    TargetUM: Targeted High-Utility Itemset Querying

    Authors: **bao Miao, Shicheng Wan, Wensheng Gan, Jiayi Sun, Jiahui Chen

    Abstract: Traditional high-utility itemset mining (HUIM) aims to determine all high-utility itemsets (HUIs) that satisfy the minimum utility threshold (\textit{minUtil}) in transaction databases. However, in most applications, not all HUIs are interesting because only specific parts are required. Thus, targeted mining based on user preferences is more important than traditional mining tasks. This paper is t… ▽ More

    Submitted 30 October, 2021; originally announced November 2021.

    Comments: Preprint. 7 figures, 9 tables

  49. arXiv:2111.00307  [pdf, other

    cs.DB

    FUIM: Fuzzy Utility Itemset Mining

    Authors: Shicheng Wan, Wensheng Gan, Xu Guo, Jiahui Chen, Unil Yun

    Abstract: Because of usefulness and comprehensibility, fuzzy data mining has been extensively studied and is an emerging topic in recent years. Compared with utility-driven itemset mining technologies, fuzzy utility mining not only takes utilities (e.g., profits) into account, but also considers quantities of items in each transaction for discovering high fuzzy utility itemsets (HFUIs). Thus, fuzziness can… ▽ More

    Submitted 30 October, 2021; originally announced November 2021.

    Comments: Preprint. 10 figures, 6 tables

  50. arXiv:2109.11913  [pdf

    cs.MM

    Spatial Information Refinement for Chroma Intra Prediction in Video Coding

    Authors: Chengyi Zou, Shuai Wan, Tiannan Ji, Marta Mrak, Marc Gorriz Blanch, Luis Herranz

    Abstract: Video compression benefits from advanced chroma intra prediction methods, such as the Cross-Component Linear Model (CCLM) which uses linear models to approximate the relationship between the luma and chroma components. Recently it has been proven that advanced cross-component prediction methods based on Neural Networks (NN) can bring additional coding gains. In this paper, spatial information refi… ▽ More

    Submitted 24 September, 2021; originally announced September 2021.