Showing 1–10 of 10 results for author: Ling, N

Searching in archive cs.
  1. arXiv:2405.09125  [pdf, other]

    cs.CV cs.AI

    HAAP: Vision-context Hierarchical Attention Autoregressive with Adaptive Permutation for Scene Text Recognition

    Authors: Honghui Chen, Yuhang Qiu, Jiabao Wang, Pingping Chen, Nam Ling

    Abstract: Internal Language Model (LM)-based methods use permutation language modeling (PLM) to solve the error correction caused by conditional independence in external LM-based methods. However, random permutations of human interference cause fit oscillations in the model training, and Iterative Refinement (IR) operation to improve multimodal information decoupling also introduces additional overhead. To…

    Submitted 15 May, 2024; originally announced May 2024.

    Comments: 12 pages, 10 figures

    MSC Class: 68T01 ACM Class: I.2.10

  2. arXiv:2404.13786  [pdf, other]

    eess.SY cs.AI cs.DC cs.LG

    Soar: Design and Deployment of A Smart Roadside Infrastructure System for Autonomous Driving

    Authors: Shuyao Shi, Neiwen Ling, Zhehao Jiang, Xuan Huang, Yuze He, Xiaoguang Zhao, Bufang Yang, Chen Bian, Jingfei Xia, Zhenyu Yan, Raymond Yeung, Guoliang Xing

    Abstract: Recently, smart roadside infrastructure (SRI) has demonstrated the potential of achieving fully autonomous driving systems. To explore the potential of infrastructure-assisted autonomous driving, this paper presents the design and deployment of Soar, the first end-to-end SRI system specifically designed to support autonomous driving systems. Soar consists of both software and hardware components ca…

    Submitted 21 April, 2024; originally announced April 2024.

  3. arXiv:2311.10986  [pdf, other]

    cs.LG

    EdgeFM: Leveraging Foundation Model for Open-set Learning on the Edge

    Authors: Bufang Yang, Lixing He, Neiwen Ling, Zhenyu Yan, Guoliang Xing, Xian Shuai, Xiaozhe Ren, Xin Jiang

    Abstract: Deep Learning (DL) models have been widely deployed on IoT devices with the help of advancements in DL algorithms and chips. However, the limited resources of edge devices make it hard for these on-device DL models to generalize to diverse environments and tasks. Although the recently emerged foundation models (FMs) show impressive generalization power, how to effectively leverage the rich knowledg…

    Submitted 22 November, 2023; v1 submitted 18 November, 2023; originally announced November 2023.

    Comments: Accepted to the 21st ACM Conference on Embedded Networked Sensor Systems (SenSys 2023)

  4. arXiv:2309.04806  [pdf, other]

    cs.CV cs.AI

    Timely Fusion of Surround Radar/Lidar for Object Detection in Autonomous Driving Systems

    Authors: Wenjing Xie, Tao Hu, Neiwen Ling, Guoliang Xing, Chun Jason Xue, Nan Guan

    Abstract: Fusing Radar and Lidar sensor data can fully utilize their complementary advantages and provide more accurate reconstruction of the surroundings for autonomous driving systems. Surround Radar/Lidar can provide 360-degree view sampling at minimal cost, making them promising sensing hardware solutions for autonomous driving systems. However, due to the intrinsic physical constraints, the rotating…

    Submitted 27 May, 2024; v1 submitted 9 September, 2023; originally announced September 2023.

  5. arXiv:2307.04339  [pdf, other]

    cs.DC cs.AI

    Miriam: Exploiting Elastic Kernels for Real-time Multi-DNN Inference on Edge GPU

    Authors: Zhihe Zhao, Neiwen Ling, Nan Guan, Guoliang Xing

    Abstract: Many applications, such as autonomous driving and augmented reality, require the concurrent running of multiple deep neural networks (DNNs) that pose different levels of real-time performance requirements. However, coordinating multiple DNN tasks with varying levels of criticality on edge GPUs remains an area of limited study. Unlike server-level GPUs, edge GPUs are resource-limited and lack hardwa…

    Submitted 10 July, 2023; originally announced July 2023.

  6. arXiv:2201.05752  [pdf, other]

    cs.LG cs.PL

    Moses: Efficient Exploitation of Cross-device Transferable Features for Tensor Program Optimization

    Authors: Zhihe Zhao, Xian Shuai, Yang Bai, Neiwen Ling, Nan Guan, Zhenyu Yan, Guoliang Xing

    Abstract: Achieving efficient execution of machine learning models has attracted significant attention recently. To generate tensor programs efficiently, a key component of DNN compilers is the cost model that can predict the performance of each configuration on specific devices. However, due to the rapid emergence of hardware platforms, it is increasingly labor-intensive to train domain-specific predictors…

    Submitted 14 January, 2022; originally announced January 2022.

  7. arXiv:2110.15569  [pdf, other]

    cs.CV

    Novel View Synthesis from a Single Image via Unsupervised learning

    Authors: Bingzheng Liu, Jianjun Lei, Bo Peng, Chuanbo Yu, Wanqing Li, Nam Ling

    Abstract: View synthesis aims to generate novel views from one or more given source views. Although existing methods have achieved promising performance, they usually require paired views of different poses to learn a pixel transformation. This paper proposes an unsupervised network to learn such a pixel transformation from a single source viewpoint. In particular, the network consists of a token transforma…

    Submitted 29 October, 2021; originally announced October 2021.

    Comments: 9 pages, submitted to TCSVT

  8. arXiv:2008.06940  [pdf, other]

    cs.LG cs.SI

    TempNodeEmb: Temporal Node Embedding considering temporal edge influence matrix

    Authors: Khushnood Abbas, Alireza Abbasi, Dong Shi, Niu Ling, Mingsheng Shang, Chen Liong, Bolun Chen

    Abstract: Understanding the evolutionary patterns of real-world evolving complex systems such as human interactions, transport networks, biological interactions, and computer networks has important implications in our daily lives. Predicting future links among the nodes in such networks reveals an important aspect of the evolution of temporal networks. To analyse networks, they are mapped to adjacency matri…

    Submitted 16 August, 2020; originally announced August 2020.

    Comments: IEEE double column 6 pages

  9. arXiv:2002.12521  [pdf, other]

    eess.IV cs.LG cs.MM

    Improved Image Coding Autoencoder With Deep Learning

    Authors: Licheng Xiao, Hairong Wang, Nam Ling

    Abstract: In this paper, we build autoencoder based pipelines for extreme end-to-end image compression based on Ballé's approach, which is the state-of-the-art open source implementation in image compression using deep learning. We deepened the network by adding one more hidden layer before each strided convolutional layer with exactly the same number of down-samplings and up-samplings. Our approach outperf…

    Submitted 27 February, 2020; originally announced February 2020.

  10. arXiv:1811.06679  [pdf, other]

    cs.CV

    HSCS: Hierarchical Sparsity Based Co-saliency Detection for RGBD Images

    Authors: Runmin Cong, Jianjun Lei, Huazhu Fu, Qingming Huang, Xiaochun Cao, Nam Ling

    Abstract: Co-saliency detection aims to discover common and salient objects in an image group containing more than two relevant images. Moreover, depth information has been demonstrated to be effective for many computer vision tasks. In this paper, we propose a novel co-saliency detection method for RGBD images based on hierarchical sparsity reconstruction and energy function refinement. With the assistance…

    Submitted 16 November, 2018; originally announced November 2018.

    Comments: 11 pages, 5 figures, Accepted by IEEE Transactions on Multimedia, https://rmcong.github.io/