Skip to main content

Showing 1–14 of 14 results for author: Tang, F

Searching in archive eess. Search in all archives.
.
  1. arXiv:2406.08305  [pdf, other

    cs.NI eess.SP

    Large Language Model(LLM) assisted End-to-End Network Health Management based on Multi-Scale Semanticization

    Authors: Fengxiao Tang, Xiaonan Wang, Xun Yuan, Linfeng Luo, Ming Zhao, Nei Kato

    Abstract: Network device and system health management is the foundation of modern network operations and maintenance. Traditional health management methods, relying on expert identification or simple rule-based algorithms, struggle to cope with the dynamic heterogeneous networks (DHNs) environment. Moreover, current state-of-the-art distributed anomaly detection methods, which utilize specific machine learn… ▽ More

    Submitted 12 June, 2024; originally announced June 2024.

  2. arXiv:2405.11289  [pdf, other

    eess.IV cs.CV

    Diffusion Model Driven Test-Time Image Adaptation for Robust Skin Lesion Classification

    Authors: Ming Hu, Siyuan Yan, Peng Xia, Feilong Tang, Wenxue Li, Peibo Duan, Lin Zhang, Zongyuan Ge

    Abstract: Deep learning-based diagnostic systems have demonstrated potential in skin disease diagnosis. However, their performance can easily degrade on test domains due to distribution shifts caused by input-level corruptions, such as imaging equipment variability, brightness changes, and image blur. This will reduce the reliability of model deployment in real-world scenarios. Most existing solutions focus… ▽ More

    Submitted 18 May, 2024; originally announced May 2024.

  3. arXiv:2402.13763  [pdf, other

    cs.SD eess.AS

    Music Style Transfer with Time-Varying Inversion of Diffusion Models

    Authors: Sifei Li, Yuxin Zhang, Fan Tang, Chongyang Ma, Weiming dong, Changsheng Xu

    Abstract: With the development of diffusion models, text-guided image style transfer has demonstrated high-quality controllable synthesis results. However, the utilization of text for diverse music style transfer poses significant challenges, primarily due to the limited availability of matched audio-text datasets. Music, being an abstract and complex art form, exhibits variations and intricacies even withi… ▽ More

    Submitted 21 February, 2024; originally announced February 2024.

    Comments: 7 pages, 4 figures, AAAI 2024

  4. arXiv:2401.17800  [pdf, other

    cs.SD cs.MM eess.AS

    Dance-to-Music Generation with Encoder-based Textual Inversion of Diffusion Models

    Authors: Sifei Li, Weiming Dong, Yuxin Zhang, Fan Tang, Chongyang Ma, Oliver Deussen, Tong-Yee Lee, Changsheng Xu

    Abstract: The harmonious integration of music with dance movements is pivotal in vividly conveying the artistic essence of dance. This alignment also significantly elevates the immersive quality of gaming experiences and animation productions. While there has been remarkable advancement in creating high-fidelity music from textual descriptions, current methodologies mainly concentrate on modulating overarch… ▽ More

    Submitted 31 January, 2024; originally announced January 2024.

    Comments: 9 pages, 3 figures

  5. arXiv:2312.01740  [pdf, other

    eess.IV cs.CV

    MobileUtr: Revisiting the relationship between light-weight CNN and Transformer for efficient medical image segmentation

    Authors: Fenghe Tang, Bingkun Nian, Jianrui Ding, Quan Quan, Jie Yang, Wei Liu, S. Kevin Zhou

    Abstract: Due to the scarcity and specific imaging characteristics in medical images, light-weighting Vision Transformers (ViTs) for efficient medical image segmentation is a significant challenge, and current studies have not yet paid attention to this issue. This work revisits the relationship between CNNs and Transformers in lightweight universal networks for medical image segmentation, aiming to integra… ▽ More

    Submitted 4 December, 2023; originally announced December 2023.

    Comments: 13 pages

    ACM Class: I.4.6

  6. arXiv:2309.13227  [pdf, other

    cs.LG cs.SD eess.AS

    Importance of negative sampling in weak label learning

    Authors: Ankit Shah, Fuyu Tang, Zelin Ye, Rita Singh, Bhiksha Raj

    Abstract: Weak-label learning is a challenging task that requires learning from data "bags" containing positive and negative instances, but only the bag labels are known. The pool of negative instances is usually larger than positive instances, thus making selecting the most informative negative instance critical for performance. Such a selection strategy for negative instances from each bag is an open prob… ▽ More

    Submitted 22 September, 2023; originally announced September 2023.

  7. arXiv:2308.01239  [pdf, other

    eess.IV cs.CV

    CMUNeXt: An Efficient Medical Image Segmentation Network based on Large Kernel and Skip Fusion

    Authors: Fenghe Tang, Jianrui Ding, Lingtao Wang, Chun** Ning, S. Kevin Zhou

    Abstract: The U-shaped architecture has emerged as a crucial paradigm in the design of medical image segmentation networks. However, due to the inherent local limitations of convolution, a fully convolutional segmentation network with U-shaped architecture struggles to effectively extract global context information, which is vital for the precise localization of lesions. While hybrid architectures combining… ▽ More

    Submitted 2 August, 2023; v1 submitted 2 August, 2023; originally announced August 2023.

    Comments: 8 pages, 3 figures

    ACM Class: I.4.6

  8. arXiv:2303.06877  [pdf, other

    cs.CV eess.IV

    Progressive Open Space Expansion for Open-Set Model Attribution

    Authors: Tianyun Yang, Danding Wang, Fan Tang, Xinying Zhao, Juan Cao, Sheng Tang

    Abstract: Despite the remarkable progress in generative technology, the Janus-faced issues of intellectual property protection and malicious content supervision have arisen. Efforts have been paid to manage synthetic images by attributing them to a set of potential source models. However, the closed-set classification setting limits the application in real-world scenarios for handling contents generated by… ▽ More

    Submitted 13 March, 2023; originally announced March 2023.

    Comments: accepted to CVPR2023

  9. CMU-Net: A Strong ConvMixer-based Medical Ultrasound Image Segmentation Network

    Authors: Fenghe Tang, Lingtao Wang, Chun** Ning, Min Xian, Jianrui Ding

    Abstract: U-Net and its extensions have achieved great success in medical image segmentation. However, due to the inherent local characteristics of ordinary convolution operations, U-Net encoder cannot effectively extract global context information. In addition, simple skip connections cannot capture salient features. In this work, we propose a fully convolutional segmentation network (CMU-Net) which incorp… ▽ More

    Submitted 10 November, 2022; v1 submitted 24 October, 2022; originally announced October 2022.

    Comments: This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible

  10. arXiv:2203.13504  [pdf, other

    cs.CL cs.SD eess.AS

    EmoCaps: Emotion Capsule based Model for Conversational Emotion Recognition

    Authors: Zai**g Li, Fengxiao Tang, Ming Zhao, Yusen Zhu

    Abstract: Emotion recognition in conversation (ERC) aims to analyze the speaker's state and identify their emotion in the conversation. Recent works in ERC focus on context modeling but ignore the representation of contextual emotional tendency. In order to extract multi-modal information and the emotional tendency of the utterance effectively, we propose a new structure named Emoformer to extract multi-mod… ▽ More

    Submitted 25 March, 2022; originally announced March 2022.

    Comments: 9 pages, 5 figures, accepted by Finding of ACL 2022

  11. arXiv:2203.03635  [pdf, ps, other

    eess.IV cs.CV

    Stepwise Feature Fusion: Local Guides Global

    Authors: **feng Wang, Qiming Huang, Feilong Tang, Jia Meng, Jionglong Su, Sifan Song

    Abstract: Colonoscopy, currently the most efficient and recognized colon polyp detection technology, is necessary for early screening and prevention of colorectal cancer. However, due to the varying size and complex morphological features of colonic polyps as well as the indistinct boundary between polyps and mucosa, accurate segmentation of polyps is still challenging. Deep learning has become popular for… ▽ More

    Submitted 27 June, 2022; v1 submitted 7 March, 2022; originally announced March 2022.

    Comments: 10 pages, 5 figures

  12. arXiv:2105.14576  [pdf, other

    cs.CV eess.IV

    StyTr$^2$: Image Style Transfer with Transformers

    Authors: Yingying Deng, Fan Tang, Weiming Dong, Chongyang Ma, Xingjia Pan, Lei Wang, Changsheng Xu

    Abstract: The goal of image style transfer is to render an image with artistic features guided by a style reference while maintaining the original content. Owing to the locality in convolutional neural networks (CNNs), extracting and maintaining the global information of input images is difficult. Therefore, traditional neural style transfer methods face biased content representation. To address this critic… ▽ More

    Submitted 1 April, 2022; v1 submitted 30 May, 2021; originally announced May 2021.

    Comments: Accepted by CVPR 2022

  13. arXiv:2103.16744  [pdf, other

    cs.CV eess.IV

    Deep Simultaneous Optimisation of Sampling and Reconstruction for Multi-contrast MRI

    Authors: Xinwen Liu, **g Wang, Fangfang Tang, Shekhar S. Chandra, Feng Liu, Stuart Crozier

    Abstract: MRI images of the same subject in different contrasts contain shared information, such as the anatomical structure. Utilizing the redundant information amongst the contrasts to sub-sample and faithfully reconstruct multi-contrast images could greatly accelerate the imaging speed, improve image quality and shorten scanning protocols. We propose an algorithm that generates the optimised sampling pat… ▽ More

    Submitted 30 March, 2021; originally announced March 2021.

    Comments: Presented at ISMRM 28th Annual Meeting & Exhibition (Poster #3619)

  14. arXiv:2009.08003  [pdf, other

    cs.CV eess.IV

    Arbitrary Video Style Transfer via Multi-Channel Correlation

    Authors: Yingying Deng, Fan Tang, Weiming Dong, Haibin Huang, Chongyang Ma, Changsheng Xu

    Abstract: Video style transfer is getting more attention in AI community for its numerous applications such as augmented reality and animation productions. Compared with traditional image style transfer, performing this task on video presents new challenges: how to effectively generate satisfactory stylized results for any specified style, and maintain temporal coherence across frames at the same time. Towa… ▽ More

    Submitted 19 January, 2021; v1 submitted 16 September, 2020; originally announced September 2020.