Showing 1–50 of 163 results for author: Gan, W

Searching in archive cs.
  1. arXiv:2406.05070

    cs.DB

    Targeted Mining Precise-positioning Episode Rules

    Authors: Jian Zhu, Xiaoye Chen, Wensheng Gan, Zefeng Chen, Philip S. Yu

    Abstract: The era characterized by an exponential increase in data has led to the widespread adoption of data intelligence as a crucial task. Within the field of data mining, frequent episode mining has emerged as an effective tool for extracting valuable and essential information from event sequences. Various algorithms have been developed to discover frequent episodes and subsequently derive episode rules…

    Submitted 7 June, 2024; originally announced June 2024.

    Comments: IEEE TETCI, 14 pages

  2. arXiv:2405.13055

    cs.CL cs.AI cs.CY

    Large Language Models for Medicine: A Survey

    Authors: Yanxin Zheng, Wensheng Gan, Zefeng Chen, Zhenlian Qi, Qian Liang, Philip S. Yu

    Abstract: To address challenges in the digital economy's landscape of digital intelligence, large language models (LLMs) have been developed. Improvements in computational power and available resources have significantly advanced LLMs, allowing their integration into diverse domains for human life. Medical LLMs are essential application tools with potential across various medical scenarios. In this paper, w…

    Submitted 19 May, 2024; originally announced May 2024.

    Comments: Preprint. 5 figures,5 tables

  3. arXiv:2405.13001

    cs.CL cs.AI cs.CY

    Large Language Models for Education: A Survey

    Authors: Hanyi Xu, Wensheng Gan, Zhenlian Qi, Jiayang Wu, Philip S. Yu

    Abstract: Artificial intelligence (AI) has a profound impact on traditional education. In recent years, large language models (LLMs) have been increasingly used in various applications such as natural language processing, computer vision, speech recognition, and autonomous driving. LLMs have also been applied in many fields, including recommendation, finance, government, education, legal affairs, and financ…

    Submitted 11 May, 2024; originally announced May 2024.

    Comments: Journal of Machine Learning and Cybernetics. 4 tables, 6 figures

  4. arXiv:2405.12496

    eess.AS cs.NI cs.SD eess.SP

    A Survey of Integrating Wireless Technology into Active Noise Control

    Authors: Xiaoyi Shen, Dongyuan Shi, Zhengding Luo, Junwei Ji, Woon-Seng Gan

    Abstract: Active Noise Control (ANC) is a widely adopted technology for reducing environmental noise across various scenarios. This paper focuses on enhancing noise reduction performance, particularly through the refinement of signal quality fed into ANC systems. We discuss the main wireless technique integrated into the ANC system, equipped with some innovative algorithms, in diverse environments. Instead…

    Submitted 21 May, 2024; originally announced May 2024.

  5. arXiv:2405.07536

    cs.RO eess.SY

    Multi-AUV Kinematic Task Assignment based on Self-organizing Map Neural Network and Dubins Path Generator

    Authors: Xin Li, Wenyang Gan, Pang Wen, Daqi Zhu

    Abstract: To deal with the task assignment problem of multi-AUV systems under kinematic constraints, which means steering capability constraints for underactuated AUVs or other vehicles likely, an improved task assignment algorithm is proposed combining the Dubins Path algorithm with improved SOM neural network algorithm. At first, the aimed tasks are assigned to the AUVs by improved SOM neural network meth…

    Submitted 24 June, 2024; v1 submitted 13 May, 2024; originally announced May 2024.

  6. arXiv:2404.18428

    cs.DB

    Geospatial Big Data: Survey and Challenges

    Authors: Jiayang Wu, Wensheng Gan, Han-Chieh Chao, Philip S. Yu

    Abstract: In recent years, geospatial big data (GBD) has obtained attention across various disciplines, categorized into big earth observation data and big human behavior data. Identifying geospatial patterns from GBD has been a vital research focus in the fields of urban management and environmental sustainability. This paper reviews the evolution of GBD mining and its integration with advanced artificial…

    Submitted 29 April, 2024; originally announced April 2024.

    Comments: IEEE JSTARS. 14 pages, 5 figures

  7. arXiv:2403.18139

    eess.IV cs.CV

    Pseudo-MRI-Guided PET Image Reconstruction Method Based on a Diffusion Probabilistic Model

    Authors: Weijie Gan, Huidong Xie, Carl von Gall, Günther Platsch, Michael T. Jurkiewicz, Andrea Andrade, Udunna C. Anazodo, Ulugbek S. Kamilov, Hongyu An, Jorge Cabello

    Abstract: Anatomically guided PET reconstruction using MRI information has been shown to have the potential to improve PET image quality. However, these improvements are limited to PET scans with paired MRI information. In this work we employed a diffusion probabilistic model (DPM) to infer T1-weighted-MRI (deep-MRI) images from FDG-PET brain images. We then use the DPM-generated T1w-MRI to guide the PET re…

    Submitted 26 March, 2024; originally announced March 2024.

  8. Unsupervised learning based end-to-end delayless generative fixed-filter active noise control

    Authors: Zhengding Luo, Dongyuan Shi, Xiaoyi Shen, Woon-Seng Gan

    Abstract: Delayless noise control is achieved by our earlier generative fixed-filter active noise control (GFANC) framework through efficient coordination between the co-processor and real-time controller. However, the one-dimensional convolutional neural network (1D CNN) in the co-processor requires initial training using labelled noise datasets. Labelling noise data can be resource-intensive and may intro…

    Submitted 8 February, 2024; originally announced February 2024.

    Comments: 2024 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2024)

  9. arXiv:2402.02694

    eess.AS cs.LG cs.SD

    Description on IEEE ICME 2024 Grand Challenge: Semi-supervised Acoustic Scene Classification under Domain Shift

    Authors: Jisheng Bai, Mou Wang, Haohe Liu, Han Yin, Yafei Jia, Siwei Huang, Yutong Du, Dongzhe Zhang, Dongyuan Shi, Woon-Seng Gan, Mark D. Plumbley, Susanto Rahardja, Bin Xiang, Jianfeng Chen

    Abstract: Acoustic scene classification (ASC) is a crucial research problem in computational auditory scene analysis, and it aims to recognize the unique acoustic characteristics of an environment. One of the challenges of the ASC task is the domain shift between training and testing data. Since 2018, ASC challenges have focused on the generalization of ASC models across different recording devices. Althoug…

    Submitted 28 February, 2024; v1 submitted 4 February, 2024; originally announced February 2024.

  10. arXiv:2401.13998

    eess.IV cs.CV

    WAL-Net: Weakly supervised auxiliary task learning network for carotid plaques classification

    Authors: Haitao Gan, Lingchao Fu, Ran Zhou, Weiyan Gan, Furong Wang, Xiaoyan Wu, Zhi Yang, Zhongwei Huang

    Abstract: The classification of carotid artery ultrasound images is a crucial means for diagnosing carotid plaques, holding significant clinical relevance for predicting the risk of stroke. Recent research suggests that utilizing plaque segmentation as an auxiliary task for classification can enhance performance by leveraging the correlation between segmentation and classification tasks. However, this appro…

    Submitted 27 January, 2024; v1 submitted 25 January, 2024; originally announced January 2024.

  11. arXiv:2401.08678

    eess.AS cs.SD

    Sub-band and Full-band Interactive U-Net with DPRNN for Demixing Cross-talk Stereo Music

    Authors: Han Yin, Mou Wang, Jisheng Bai, Dongyuan Shi, Woon-Seng Gan, Jianfeng Chen

    Abstract: This paper presents a detailed description of our proposed methods for the ICASSP 2024 Cadenza Challenge. Experimental results show that the proposed system can achieve better performance than official baselines.

    Submitted 10 January, 2024; originally announced January 2024.

    Comments: Submitted to ICASSP 2024

  12. arXiv:2401.01599

    cs.LG math.ST

    Generalization Error Curves for Analytic Spectral Algorithms under Power-law Decay

    Authors: Yicheng Li, Weiye Gan, Zuoqiang Shi, Qian Lin

    Abstract: The generalization error curve of certain kernel regression method aims at determining the exact order of generalization error with various source condition, noise level and choice of the regularization parameter rather than the minimax rate. In this work, under mild assumptions, we rigorously provide a full characterization of the generalization error curves of the kernel gradient descent method…

    Submitted 3 January, 2024; originally announced January 2024.

  13. arXiv:2312.10073

    cs.IR cs.AI

    Data Scarcity in Recommendation Systems: A Survey

    Authors: Zefeng Chen, Wensheng Gan, Jiayang Wu, Kaixia Hu, Hong Lin

    Abstract: The prevalence of online content has led to the widespread adoption of recommendation systems (RSs), which serve diverse purposes such as news, advertisements, and e-commerce recommendations. Despite their significance, data scarcity issues have significantly impaired the effectiveness of existing RS models and hindered their progress. To address this challenge, the concept of knowledge transfer,…

    Submitted 7 December, 2023; originally announced December 2023.

    Comments: ACM Transactions on Recommender Systems, 32 pages

  14. arXiv:2312.03718

    cs.CL cs.AI

    Large Language Models in Law: A Survey

    Authors: Jinqi Lai, Wensheng Gan, Jiayang Wu, Zhenlian Qi, Philip S. Yu

    Abstract: The advent of artificial intelligence (AI) has significantly impacted the traditional judicial industry. Moreover, recently, with the development of AI-generated content (AIGC), AI and law have found applications in various domains, including image recognition, automatic text generation, and interactive chat. With the rapid emergence and growing popularity of large models, it is evident that AI wi…

    Submitted 25 November, 2023; originally announced December 2023.

    Comments: Preprint

  15. arXiv:2311.18810

    cs.CV

    Convergence of Nonconvex PnP-ADMM with MMSE Denoisers

    Authors: Chicago Park, Shirin Shoushtari, Weijie Gan, Ulugbek S. Kamilov

    Abstract: Plug-and-Play Alternating Direction Method of Multipliers (PnP-ADMM) is a widely-used algorithm for solving inverse problems by integrating physical measurement models and convolutional neural network (CNN) priors. PnP-ADMM has been theoretically proven to converge for convex data-fidelity terms and nonexpansive CNNs. It has however been observed that PnP-ADMM often empirically converges even for…

    Submitted 30 November, 2023; originally announced November 2023.

  16. arXiv:2311.15445

    cs.CV eess.IV

    FLAIR: A Conditional Diffusion Framework with Applications to Face Video Restoration

    Authors: Zihao Zou, Jiaming Liu, Shirin Shoushtari, Yubo Wang, Weijie Gan, Ulugbek S. Kamilov

    Abstract: Face video restoration (FVR) is a challenging but important problem where one seeks to recover a perceptually realistic face videos from a low-quality input. While diffusion probabilistic models (DPMs) have been shown to achieve remarkable performance for face image restoration, they often fail to preserve temporally coherent, high-quality videos, compromising the fidelity of reconstructed faces.…

    Submitted 26 November, 2023; originally announced November 2023.

    Comments: 32 pages, 27 figures

  17. arXiv:2311.13165

    cs.AI

    Multimodal Large Language Models: A Survey

    Authors: Jiayang Wu, Wensheng Gan, Zefeng Chen, Shicheng Wan, Philip S. Yu

    Abstract: The exploration of multimodal language models integrates multiple data types, such as images, text, language, audio, and other heterogeneity. While the latest large language models excel in text-based tasks, they often struggle to understand and process other data types. Multimodal models address this limitation by combining various modalities, enabling a more comprehensive understanding of divers…

    Submitted 22 November, 2023; originally announced November 2023.

    Comments: IEEE BigData 2023. 10 pages

  18. arXiv:2311.13160

    cs.AI

    Large Language Models in Education: Vision and Opportunities

    Authors: Wensheng Gan, Zhenlian Qi, Jiayang Wu, Jerry Chun-Wei Lin

    Abstract: With the rapid development of artificial intelligence technology, large language models (LLMs) have become a hot research topic. Education plays an important role in human social development and progress. Traditional education faces challenges such as individual student differences, insufficient allocation of teaching resources, and assessment of teaching effectiveness. Therefore, the applications…

    Submitted 22 November, 2023; originally announced November 2023.

    Comments: IEEE BigData 2023. 10 pages

  19. arXiv:2311.10945

    cs.CL cs.AI

    An Empirical Bayes Framework for Open-Domain Dialogue Generation

    Authors: Jing Yang Lee, Kong Aik Lee, Woon-Seng Gan

    Abstract: To engage human users in meaningful conversation, open-domain dialogue agents are required to generate diverse and contextually coherent dialogue. Despite recent advancements, which can be attributed to the usage of pretrained language models, the generation of diverse and coherent dialogue remains an open research problem. A popular approach to address this issue involves the adaptation of variat…

    Submitted 17 November, 2023; originally announced November 2023.

  20. arXiv:2311.10943

    cs.CL

    Partially Randomizing Transformer Weights for Dialogue Response Diversity

    Authors: Jing Yang Lee, Kong Aik Lee, Woon-Seng Gan

    Abstract: Despite recent progress in generative open-domain dialogue, the issue of low response diversity persists. Prior works have addressed this issue via either novel objective functions, alternative learning approaches such as variational frameworks, or architectural extensions such as the Randomized Link (RL) Transformer. However, these approaches typically entail either additional difficulties during…

    Submitted 17 November, 2023; originally announced November 2023.

  21. arXiv:2311.07226

    cs.RO cs.AI

    Large Language Models for Robotics: A Survey

    Authors: Fanlong Zeng, Wensheng Gan, Yongheng Wang, Ning Liu, Philip S. Yu

    Abstract: The human ability to learn, generalize, and control complex manipulation tasks through multi-modality feedback suggests a unique capability, which we refer to as dexterity intelligence. Understanding and assessing this intelligence is a complex task. Amidst the swift progress and extensive proliferation of large language models (LLMs), their applications in the field of robotics have garnered incr…

    Submitted 13 November, 2023; originally announced November 2023.

    Comments: Preprint. 4 figures, 3 tables

  22. arXiv:2311.05804

    cs.AI

    Model-as-a-Service (MaaS): A Survey

    Authors: Wensheng Gan, Shicheng Wan, Philip S. Yu

    Abstract: Due to the increased number of parameters and data in the pre-trained model exceeding a certain level, a foundation model (e.g., a large language model) can significantly improve downstream task performance and emerge with some novel special abilities (e.g., deep learning, complex reasoning, and human alignment) that were not present before. Foundation models are a form of generative artificial in…

    Submitted 9 November, 2023; originally announced November 2023.

    Comments: Preprint. 3 figures, 1 tables

  23. arXiv:2311.02121

    cs.CV

    Enhancing Monocular Height Estimation from Aerial Images with Street-view Images

    Authors: Xiaomou Hou, Wanshui Gan, Naoto Yokoya

    Abstract: Accurate height estimation from monocular aerial imagery presents a significant challenge due to its inherently ill-posed nature. This limitation is rooted in the absence of adequate geometric constraints available to the model when training with monocular imagery. Without additional geometric information to supplement the monocular image data, the model's ability to provide reliable estimations i…

    Submitted 3 November, 2023; originally announced November 2023.

  24. arXiv:2311.02003

    eess.IV cs.CV

    A Structured Pruning Algorithm for Model-based Deep Learning

    Authors: Chicago Park, Weijie Gan, Zihao Zou, Yuyang Hu, Zhixin Sun, Ulugbek S. Kamilov

    Abstract: There is a growing interest in model-based deep learning (MBDL) for solving imaging inverse problems. MBDL networks can be seen as iterative algorithms that estimate the desired image using a physical measurement model and a learned image prior specified using a convolutional neural net (CNNs). The iterative nature of MBDL networks increases the test-time computational complexity, which limits the…

    Submitted 3 November, 2023; originally announced November 2023.

  25. arXiv:2311.00230

    cs.CV

    DINO-Mix: Enhancing Visual Place Recognition with Foundational Vision Model and Feature Mixing

    Authors: Gaoshuang Huang, Yang Zhou, Xiaofei Hu, Chenglong Zhang, Luying Zhao, Wenjian Gan, Mingbo Hou

    Abstract: Utilizing visual place recognition (VPR) technology to ascertain the geographical location of publicly available images is a pressing issue for real-world VPR applications. Although most current VPR methods achieve favorable results under ideal conditions, their performance in complex environments, characterized by lighting variations, seasonal changes, and occlusions caused by moving objects, is…

    Submitted 5 December, 2023; v1 submitted 31 October, 2023; originally announced November 2023.

    Comments: Under review / Open source code

  26. arXiv:2310.13699

    cs.HC cs.ET

    Interaction in Metaverse: A Survey

    Authors: Hong Lin, Zirun Gan, Wensheng Gan, Zhenlian Qi, Yuehua Wang, Philip S. Yu

    Abstract: Human-computer interaction (HCI) emerged with the birth of the computer and has been upgraded through decades of development. Metaverse has attracted a lot of interest with its immersive experience, and HCI is the entrance to the Metaverse for people. It is predictable that HCI will determine the immersion of the Metaverse. However, the technologies of HCI in Metaverse are not mature enough. There…

    Submitted 27 September, 2023; originally announced October 2023.

    Comments: Preprint. 3 figures, 3 tables

  27. arXiv:2310.07504

    eess.IV cs.CV

    PtychoDV: Vision Transformer-Based Deep Unrolling Network for Ptychographic Image Reconstruction

    Authors: Weijie Gan, Qiuchen Zhai, Michael Thompson McCann, Cristina Garcia Cardona, Ulugbek S. Kamilov, Brendt Wohlberg

    Abstract: Ptychography is an imaging technique that captures multiple overlapping snapshots of a sample, illuminated coherently by a moving localized probe. The image recovery from ptychographic data is generally achieved via an iterative algorithm that solves a nonlinear phase retrieval problem derived from measured diffraction patterns. However, these iterative approaches have high computational cost. In…

    Submitted 6 March, 2024; v1 submitted 11 October, 2023; originally announced October 2023.

  28. arXiv:2309.16102

    cs.AI cs.DB

    Discovering Utility-driven Interval Rules

    Authors: Chunkai Zhang, Maohua Lyu, Huaijin Hao, Wensheng Gan, Philip S. Yu

    Abstract: For artificial intelligence, high-utility sequential rule mining (HUSRM) is a knowledge discovery method that can reveal the associations between events in the sequences. Recently, abundant methods have been proposed to discover high-utility sequence rules. However, the existing methods are all related to point-based sequences. Interval events that persist for some time are common. Traditional int…

    Submitted 27 September, 2023; originally announced September 2023.

    Comments: Preprint. 11 figures, 5 tables

  29. arXiv:2308.07767

    eess.AS cs.SD

    Preliminary investigation of the short-term in situ performance of an automatic masker selection system

    Authors: Bhan Lam, Zhen-Ting Ong, Kenneth Ooi, Wen-Hui Ong, Trevor Wong, Karn N. Watcharasupat, Woon-Seng Gan

    Abstract: Soundscape augmentation or "masking" introduces wanted sounds into the acoustic environment to improve acoustic comfort. Usually, the masker selection and playback strategies are either arbitrary or based on simple rules (e.g. -3 dBA), which may lead to sub-optimal increment or even reduction in acoustic comfort for dynamic acoustic environments. To reduce ambiguity in the selection of maskers, an…

    Submitted 15 August, 2023; originally announced August 2023.

    Comments: paper submitted to the 52nd International Congress and Exposition on Noise Control Engineering held in Chiba, Greater Tokyo, Japan, on 20-23 August 2023 (Inter-Noise 2023)

    ACM Class: J.2; J.4

  30. arXiv:2308.03684

    eess.AS cs.SD

    Active Noise Control based on the Momentum Multichannel Normalized Filtered-x Least Mean Square Algorithm

    Authors: Dongyuan Shi, Woon-Seng Gan, Bhan Lam, Shulin Wen, Xiaoyi Shen

    Abstract: Multichannel active noise control (MCANC) is widely utilized to achieve significant noise cancellation area in the complicated acoustic field. Meanwhile, the filter-x least mean square (FxLMS) algorithm gradually becomes the benchmark solution for the implementation of MCANC due to its low computational complexity. However, its slow convergence speed more or less undermines the performance of deal…

    Submitted 7 August, 2023; originally announced August 2023.

    Comments: Conference: INTER-NOISE and NOISE-CON Congress and Conference Proceedings 2020 At Korea Volume: 261

  31. Anti-noise window: Subjective perception of active noise reduction and effect of informational masking

    Authors: Bhan Lam, Kelvin Chee Quan Lim, Kenneth Ooi, Zhen-Ting Ong, Dongyuan Shi, Woon-Seng Gan

    Abstract: Reviving natural ventilation (NV) for urban sustainability presents challenges for indoor acoustic comfort. Active control and interference-based noise mitigation strategies, such as the use of loudspeakers, offer potential solutions to achieve acoustic comfort while maintaining NV. However, these approaches are not commonly integrated or evaluated from a perceptual standpoint. This study examines…

    Submitted 8 July, 2023; originally announced July 2023.

    Comments: Accepted manuscript submitted to Sustainable Cities and Society

    Journal ref: Sustain. Cities Soc., 104763, 2023

  32. arXiv:2306.06470

    cs.DB

    TALENT: Targeted Mining of Non-overlapping Sequential Patterns

    Authors: Zefeng Chen, Wensheng Gan, Gengsen Huang, Zhenlian Qi, Yan Li, Philip S. Yu

    Abstract: With the widespread application of efficient pattern mining algorithms, sequential patterns that allow gap constraints have become a valuable tool to discover knowledge from biological data such as DNA and protein sequences. Among all kinds of gap-constrained mining, non-overlapping sequence mining can mine interesting patterns and satisfy the anti-monotonic property (the Apriori property). Howeve…

    Submitted 10 June, 2023; originally announced June 2023.

    Comments: Preprint. 9 figures, 5 tables

  33. arXiv:2305.12672

    eess.IV cs.CV cs.LG

    Block Coordinate Plug-and-Play Methods for Blind Inverse Problems

    Authors: Weijie Gan, Shirin Shoushtari, Yuyang Hu, Jiaming Liu, Hongyu An, Ulugbek S. Kamilov

    Abstract: Plug-and-play (PnP) prior is a well-known class of methods for solving imaging inverse problems by computing fixed-points of operators combining physical measurement models and learned image denoisers. While PnP methods have been extensively used for image recovery with known measurement operators, there is little work on PnP for solving blind inverse problems. We address this gap by presenting a…

    Submitted 26 October, 2023; v1 submitted 21 May, 2023; originally announced May 2023.

  34. arXiv:2304.13931

    cs.DB

    Open Metaverse: Issues, Evolution, and Future

    Authors: Zefeng Chen, Wensheng Gan, Jiayi Sun, Jiayang Wu, Philip S. Yu

    Abstract: With the evolution of content on the web and the Internet, there is a need for cyberspace that can be used to work, live, and play in digital worlds regardless of geography. The Metaverse provides the possibility of future Internet and represents a future trend. In the future, the Metaverse will be a space where the real and the virtual are combined. In this article, we have a comprehensive survey…

    Submitted 26 April, 2023; originally announced April 2023.

    Comments: Preprint. 6 figures, 2 tables

  35. arXiv:2304.11947

    cs.DB

    Towards Top-$K$ Non-Overlapping Sequential Patterns

    Authors: Zefeng Chen, Wensheng Gan, Gengsen Huang, Yan Li, Zhenlian Qi

    Abstract: Sequential pattern mining (SPM) has excellent prospects and application spaces and has been widely used in different fields. The non-overlapping SPM, as one of the data mining techniques, has been used to discover patterns that have requirements for gap constraints in some specific mining tasks, such as bio-data mining. And for the non-overlapping sequential patterns with gap constraints, the Nett…

    Submitted 24 April, 2023; originally announced April 2023.

    Comments: Preprint. 5 figures, 5 tables

  36. arXiv:2304.06632

    cs.AI cs.CY cs.HC

    AI-Generated Content (AIGC): A Survey

    Authors: Jiayang Wu, Wensheng Gan, Zefeng Chen, Shicheng Wan, Hong Lin

    Abstract: To address the challenges of digital intelligence in the digital economy, artificial intelligence-generated content (AIGC) has emerged. AIGC uses artificial intelligence to assist or replace manual content generation by generating content based on user-inputted keywords or requirements. The development of large model algorithms has significantly strengthened the capabilities of AIGC, which makes A…

    Submitted 25 March, 2023; originally announced April 2023.

    Comments: Preprint. 14 figures, 4 tables

  37. arXiv:2304.06111

    cs.CY cs.NI

    Web3: The Next Internet Revolution

    Authors: Shicheng Wan, Hong Lin, Wensheng Gan, Jiahui Chen, Philip S. Yu

    Abstract: Since the first appearance of the World Wide Web, people more rely on the Web for their cyber social activities. The second phase of World Wide Web, named Web 2.0, has been extensively attracting worldwide people that participate in building and enjoying the virtual world. Nowadays, the next internet revolution: Web3 is going to open new opportunities for traditional social models. The decentraliz…

    Submitted 22 March, 2023; originally announced April 2023.

    Comments: Preprint. 5 figures, 2 tables

  38. arXiv:2304.06032

    cs.CY

    Web 3.0: The Future of Internet

    Authors: Wensheng Gan, Zhenqiang Ye, Shicheng Wan, Philip S. Yu

    Abstract: With the rapid growth of the Internet, human daily life has become deeply bound to the Internet. To take advantage of massive amounts of data and information on the internet, the Web architecture is continuously being reinvented and upgraded. From the static informative characteristics of Web 1.0 to the dynamic interactive features of Web 2.0, scholars and engineers have worked hard to make the in…

    Submitted 23 March, 2023; originally announced April 2023.

    Comments: ACM Web Conference 2023

  39. arXiv:2304.03679

    cs.IR

    T2Ranking: A large-scale Chinese Benchmark for Passage Ranking

    Authors: Xiaohui Xie, Qian Dong, Bingning Wang, Feiyang Lv, Ting Yao, Weinan Gan, Zhijing Wu, Xiangsheng Li, Haitao Li, Yiqun Liu, Jin Ma

    Abstract: Passage ranking involves two stages: passage retrieval and passage re-ranking, which are important and challenging topics for both academics and industries in the area of Information Retrieval (IR). However, the commonly-used datasets for passage ranking usually focus on the English language. For non-English scenarios, such as Chinese, the existing datasets are limited in terms of data scale, fine…

    Submitted 7 April, 2023; originally announced April 2023.

    Comments: This Resource paper has been accepted by the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR 2023)

  40. arXiv:2304.01689

    cs.CR

    Privacy-Preserving Federated Discovery of DNA Motifs with Differential Privacy

    Authors: Yao Chen, Wensheng Gan, Gengsen Huang, Yongdong Wu, Philip S. Yu

    Abstract: DNA motif discovery is an important issue in gene research, which aims to identify transcription factor binding sites (i.e., motifs) in DNA sequences to reveal the mechanisms that regulate gene expression. However, the phenomenon of data silos and the problem of privacy leakage have seriously hindered the development of DNA motif discovery. On the one hand, the phenomenon of data silos makes data…

    Submitted 4 April, 2023; originally announced April 2023.

    Comments: Preprint. 7 figures, 1 table

  41. arXiv:2303.17987

    cs.CR cs.AI cs.LG

    Federated Learning for Metaverse: A Survey

    Authors: Yao Chen, Shan Huang, Wensheng Gan, Gengsen Huang, Yongdong Wu

    Abstract: The metaverse, which is at the stage of innovation and exploration, faces the dilemma of data collection and the problem of private data leakage in the process of development. This can seriously hinder the widespread deployment of the metaverse. Fortunately, federated learning (FL) is a solution to the above problems. FL is a distributed machine learning paradigm with privacy-preserving features d…

    Submitted 23 March, 2023; originally announced March 2023.

    Comments: ACM Web Conference 2023

  42. arXiv:2303.14510

    cs.DB

    Targeted Mining of Top-k High Utility Itemsets

    Authors: Shan Huang, Wensheng Gan, Jinbao Miao, Xuming Han, Philippe Fournier-Viger

    Abstract: Finding high-importance patterns in data is an emerging data mining task known as High-utility itemset mining (HUIM). Given a minimum utility threshold, a HUIM algorithm extracts all the high-utility itemsets (HUIs) whose utility values are not less than the threshold. This can reveal a wealth of useful information, but the precise needs of users are not well taken into account. In particular, use…

    Submitted 25 March, 2023; originally announced March 2023.

    Comments: Preprint. 5 figures, 5 tables

  43. arXiv:2303.11123

    cs.HC

    The Human-Centric Metaverse: A Survey

    Authors: Riyan Yang, Lin Li, Wensheng Gan, Zefeng Chen, Zhenlian Qi

    Abstract: In the era of the Web of Things, the Metaverse is expected to be the landing site for the next generation of the Internet, resulting in the increased popularity of related technologies and applications in recent years and gradually becoming the focus of Internet research. The Metaverse, as a link between the real and virtual worlds, can provide users with immersive experiences. As the concept of t…

    Submitted 20 March, 2023; originally announced March 2023.

    Comments: ACM Web Conference 2023

  44. arXiv:2303.10076

    cs.CV

    A Simple Framework for 3D Occupancy Estimation in Autonomous Driving

    Authors: Wanshui Gan, Ningkai Mo, Hongbin Xu, Naoto Yokoya

    Abstract: The task of estimating 3D occupancy from surrounding-view images is an exciting development in the field of autonomous driving, following the success of Bird's Eye View (BEV) perception. This task provides crucial 3D attributes of the driving environment, enhancing the overall understanding and perception of the surrounding space. In this work, we present a simple framework for 3D occupancy estima…

    Submitted 16 November, 2023; v1 submitted 17 March, 2023; originally announced March 2023.

    Comments: 15 pages, 8 figures

  45. Implementing Continuous HRTF Measurement in Near-Field

    Authors: Ee-Leng Tan, Santi Peksi, Woon-Seng Gan

    Abstract: Head-related transfer function (HRTF) is an essential component to create an immersive listening experience over headphones for virtual reality (VR) and augmented reality (AR) applications. Metaverse combines VR and AR to create immersive digital experiences, and users are very likely to interact with virtual objects in the near-field (NF). The HRTFs of such objects are highly individualized and d…

    Submitted 15 June, 2023; v1 submitted 15 March, 2023; originally announced March 2023.

    Comments: 5 pages, 9 figures, Submitted to 2023 IEEE International Conference on Acoustics, Speech and Signal Processing

  46. Autonomous Soundscape Augmentation with Multimodal Fusion of Visual and Participant-linked Inputs

    Authors: Kenneth Ooi, Karn N. Watcharasupat, Bhan Lam, Zhen-Ting Ong, Woon-Seng Gan

    Abstract: Autonomous soundscape augmentation systems typically use trained models to pick optimal maskers to effect a desired perceptual change. While acoustic information is paramount to such systems, contextual information, including participant demographics and the visual environment, also influences acoustic perception. Hence, we propose modular modifications to an existing attention-based deep neural n…

    Submitted 14 March, 2023; originally announced March 2023.

    Comments: 5 pages, 2 figures. Submitted to 2023 IEEE International Conference on Acoustics, Speech and Signal Processing

    Journal ref: ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Jun. 2023, pp. 1-5

  47. arXiv:2303.07820  [pdf, other]

    cs.CV

    Adaptive Rotated Convolution for Rotated Object Detection

    Authors: Yifan Pu, Yiru Wang, Zhuofan Xia, Yizeng Han, Yulin Wang, Weihao Gan, Zidong Wang, Shiji Song, Gao Huang

    Abstract: Rotated object detection aims to identify and locate objects in images with arbitrary orientation. In this scenario, the oriented directions of objects vary considerably across different images, while multiple orientations of objects exist within an image. This intrinsic characteristic makes it challenging for standard backbone networks to extract high-quality features of these arbitrarily orienta…

    Submitted 21 September, 2023; v1 submitted 14 March, 2023; originally announced March 2023.

    Comments: ICCV 2023

  48. Deep Generative Fixed-filter Active Noise Control

    Authors: Zhengding Luo, Dongyuan Shi, Xiaoyi Shen, Junwei Ji, Woon-Seng Gan

    Abstract: Due to the slow convergence and poor tracking ability, conventional LMS-based adaptive algorithms are less capable of handling dynamic noises. Selective fixed-filter active noise control (SFANC) can significantly reduce response time by selecting appropriate pre-trained control filters for different noises. Nonetheless, the limited number of pre-trained control filters may affect noise reduction p…

    Submitted 10 March, 2023; originally announced March 2023.

    Comments: Accepted by ICASSP 2023. Code will be available after publication

    Journal ref: ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

  49. arXiv:2212.14255  [pdf, other]

    cs.DB cs.AI

    HUSP-SP: Faster Utility Mining on Sequence Data

    Authors: Chunkai Zhang, Yuting Yang, Zilin Du, Wensheng Gan, Philip S. Yu

    Abstract: High-utility sequential pattern mining (HUSPM) has emerged as an important topic due to its wide application and considerable popularity. However, due to the combinatorial explosion of the search space when the HUSPM problem encounters a low utility threshold or large-scale data, it may be time-consuming and memory-costly to address the HUSPM problem. Several algorithms have been proposed for addr…

    Submitted 29 December, 2022; originally announced December 2022.

    Comments: ACM TKDD, 7 figures, 2 tables

  50. arXiv:2212.10452  [pdf, other]

    cs.DB cs.AI

    Towards Sequence Utility Maximization under Utility Occupancy Measure

    Authors: Gengsen Huang, Wensheng Gan, Philip S. Yu

    Abstract: The discovery of utility-driven patterns is a useful and difficult research topic. It can extract significant and interesting information from specific and varied databases, increasing the value of the services provided. In practice, the measure of utility is often used to demonstrate the importance, profit, or risk of an object or a pattern. In the database, although utility is a flexible criteri…

    Submitted 20 December, 2022; originally announced December 2022.

    Comments: Preprint. 7 figures, 8 tables