Showing 1–8 of 8 results for author: Xiong, S

Searching in archive eess.
  1. arXiv:2401.11675  [pdf, other]

    eess.IV

    Rethinking Cross-Attention for Infrared and Visible Image Fusion

    Authors: Lihua Jian, Songlei Xiong, Han Yan, Xiaoguang Niu, Shaowu Wu, Di Zhang

    Abstract: The salient information of an infrared image and the abundant texture of a visible image can be fused to obtain a comprehensive image. As can be known, the current fusion methods based on Transformer techniques for infrared and visible (IV) images have exhibited promising performance. However, the attention mechanism of the previous Transformer-based methods was prone to extract common information…

    Submitted 21 January, 2024; originally announced January 2024.

  2. arXiv:2310.04992  [pdf, other]

    eess.IV cs.CV

    VisionFM: a Multi-Modal Multi-Task Vision Foundation Model for Generalist Ophthalmic Artificial Intelligence

    Authors: Jianing Qiu, Jian Wu, Hao Wei, Peilun Shi, Minqing Zhang, Yunyun Sun, Lin Li, Hanruo Liu, Hongyi Liu, Simeng Hou, Yuyang Zhao, Xuehui Shi, Junfang Xian, Xiaoxia Qu, Sirui Zhu, Lijie Pan, Xiaoniao Chen, Xiaojia Zhang, Shuai Jiang, Kebing Wang, Chenlong Yang, Mingqiang Chen, Sujie Fan, Jianhua Hu, Aiguo Lv, et al. (17 additional authors not shown)

    Abstract: We present VisionFM, a foundation model pre-trained with 3.4 million ophthalmic images from 560,457 individuals, covering a broad range of ophthalmic diseases, modalities, imaging devices, and demography. After pre-training, VisionFM provides a foundation to foster multiple ophthalmic artificial intelligence (AI) applications, such as disease screening and diagnosis, disease prognosis, subclassifi…

    Submitted 7 October, 2023; originally announced October 2023.

  3. arXiv:2202.08509  [pdf, other]

    cs.SD cs.AI cs.CV cs.LG eess.AS

    A Study of Designing Compact Audio-Visual Wake Word Spotting System Based on Iterative Fine-Tuning in Neural Network Pruning

    Authors: Hengshun Zhou, Jun Du, Chao-Han Huck Yang, Shifu Xiong, Chin-Hui Lee

    Abstract: Audio-only-based wake word spotting (WWS) is challenging under noisy conditions due to environmental interference in signal transmission. In this paper, we investigate on designing a compact audio-visual WWS system by utilizing visual information to alleviate the degradation. Specifically, in order to use visual information, we first encode the detected lips to fixed-size vectors with MobileNet an…

    Submitted 17 February, 2022; originally announced February 2022.

    Comments: Accepted to ICASSP 2022. H. Zhou et al

  4. arXiv:2111.14992  [pdf, other]

    eess.SP cs.CR

    Network Traffic Shaping for Enhancing Privacy in IoT Systems

    Authors: Sijie Xiong, Anand D. Sarwate, Narayan B. Mandayam

    Abstract: Motivated by privacy issues caused by inference attacks on user activities in the packet sizes and timing information of Internet of Things (IoT) network traffic, we establish a rigorous event-level differential privacy (DP) model on infinite packet streams. We propose a memoryless traffic shaping mechanism satisfying a first-come-first-served queuing discipline that outputs traffic dependent on t…

    Submitted 29 November, 2021; originally announced November 2021.

    Comments: 18 pages, 10 figures, submitted to IEEE Transactions on Networking

  5. arXiv:2107.02412  [pdf, ps, other]

    cs.IT eess.SP

    GBLinks: GNN-Based Beam Selection and Link Activation for Ultra-dense D2D mmWave Networks

    Authors: S. He, S. Xiong, W. Zhang, Y. Yang, J. Ren, Y. Huang

    Abstract: In this paper, we consider the problem of joint beam selection and link activation across a set of communication pairs to effectively control the interference between communication pairs via inactivating part communication pairs in ultra-dense device-to-device (D2D) mmWave communication networks. The resulting optimization problem is formulated as an integer programming problem that is nonconvex a…

    Submitted 29 December, 2021; v1 submitted 6 July, 2021; originally announced July 2021.

    Comments: 31 pages, 9 figures, submitted to IEEE Trans. on Commun., July 2021, major revised in Dec. 2021

  6. arXiv:2008.08523  [pdf]

    cs.CV cs.LG eess.IV

    Scene Text Detection with Selected Anchor

    Authors: Anna Zhu, Hang Du, Shengwu Xiong

    Abstract: Object proposal technique with dense anchoring scheme for scene text detection were applied frequently to achieve high recall. It results in the significant improvement in accuracy but waste of computational searching, regression and classification. In this paper, we propose an anchor selection-based region proposal network (AS-RPN) using effective selected anchors instead of dense anchors to extr…

    Submitted 19 August, 2020; originally announced August 2020.

    Comments: 8 pages

  7. arXiv:1910.04919  [pdf]

    cs.CV cs.LG eess.IV

    From Species to Cultivar: Soybean Cultivar Recognition using Multiscale Sliding Chord Matching of Leaf Images

    Authors: Bin Wang, Yongsheng Gao, Xiaohan Yu, Xiaohui Yuan, Shengwu Xiong, Xianzhong Feng

    Abstract: Leaf image recognition techniques have been actively researched for plant species identification. However it remains unclear whether leaf patterns can provide sufficient information for cultivar recognition. This paper reports the first attempt on soybean cultivar recognition from plant leaves which is not only a challenging research problem but also important for soybean cultivar evaluation, sele…

    Submitted 10 October, 2019; originally announced October 2019.

    Comments: 33 pages, 8 figures

  8. arXiv:1812.02455  [pdf, ps, other]

    cs.CL cs.SD eess.AS

    The USTC-NEL Speech Translation system at IWSLT 2018

    Authors: Dan Liu, Junhua Liu, Wu Guo, Shifu Xiong, Zhiqiang Ma, Rui Song, Chongliang Wu, Quan Liu

    Abstract: This paper describes the USTC-NEL system to the speech translation task of the IWSLT Evaluation 2018. The system is a conventional pipeline system which contains 3 modules: speech recognition, post-processing and machine translation. We train a group of hybrid-HMM models for our speech recognition, and for machine translation we train transformer based neural machine translation models with speech…

    Submitted 6 December, 2018; originally announced December 2018.

    Comments: 5 pages, 8 tables