Skip to main content

Showing 1–13 of 13 results for author: Ho, C M

Searching in archive cs. Search in all archives.
.
  1. arXiv:2206.01777  [pdf, other

    cs.CV eess.IV

    Real-Time Super-Resolution for Real-World Images on Mobile Devices

    Authors: Jie Cai, Zibo Meng, Jiaming Ding, Chiu Man Ho

    Abstract: Image Super-Resolution (ISR), which aims at recovering High-Resolution (HR) images from the corresponding Low-Resolution (LR) counterparts. Although recent progress in ISR has been remarkable. However, they are way too computationally intensive to be deployed on edge devices, since most of the recent approaches are deep learning-based. Besides, these methods always fail in real-world scenes, since… ▽ More

    Submitted 3 June, 2022; originally announced June 2022.

    Comments: arXiv admin note: text overlap with arXiv:2004.13674

  2. arXiv:2111.12317  [pdf, other

    cs.CL

    Handling tree-structured text: parsing directory pages

    Authors: Sarang Shrivastava, Afreen Shaikh, Shivani Shrivastava, Chung Ming Ho, Pradeep Reddy, Vijay Saraswat

    Abstract: The determination of the reading sequence of text is fundamental to document understanding. This problem is easily solved in pages where the text is organized into a sequence of lines and vertical alignment runs the height of the page (producing multiple columns which can be read from left to right). We present a situation -- the directory page parsing problem -- where information is presented on… ▽ More

    Submitted 24 November, 2021; originally announced November 2021.

  3. arXiv:2108.09322  [pdf, other

    cs.CV

    MM-ViT: Multi-Modal Video Transformer for Compressed Video Action Recognition

    Authors: Jiawei Chen, Chiu Man Ho

    Abstract: This paper presents a pure transformer-based approach, dubbed the Multi-Modal Video Transformer (MM-ViT), for video action recognition. Different from other schemes which solely utilize the decoded RGB frames, MM-ViT operates exclusively in the compressed video domain and exploits all readily available modalities, i.e., I-frames, motion vectors, residuals and audio waveform. In order to handle the… ▽ More

    Submitted 12 November, 2021; v1 submitted 20 August, 2021; originally announced August 2021.

    Comments: Winter Conference on Applications of Computer Vision (WACV) 2022

  4. arXiv:2105.12789  [pdf, other

    cs.CV

    RSCA: Real-time Segmentation-based Context-Aware Scene Text Detection

    Authors: Jiachen Li, Yuan Lin, Rongrong Liu, Chiu Man Ho, Humphrey Shi

    Abstract: Segmentation-based scene text detection methods have been widely adopted for arbitrary-shaped text detection recently, since they make accurate pixel-level predictions on curved text instances and can facilitate real-time inference without time-consuming processing on anchors. However, current segmentation-based models are unable to learn the shapes of curved texts and often require complex label… ▽ More

    Submitted 26 May, 2021; originally announced May 2021.

    Comments: CVPR 2021 Workshop

  5. arXiv:2105.08826  [pdf, other

    eess.IV cs.CV cs.LG

    Real-Time Video Super-Resolution on Smartphones with Deep Learning, Mobile AI 2021 Challenge: Report

    Authors: Andrey Ignatov, Andres Romero, Heewon Kim, Radu Timofte, Chiu Man Ho, Zibo Meng, Kyoung Mu Lee, Yuxiang Chen, Yutong Wang, Zeyu Long, Chenhao Wang, Yifei Chen, Boshen Xu, Shuhang Gu, Lixin Duan, Wen Li, Wang Bofei, Zhang Diankai, Zheng Chengjian, Liu Shaoli, Gao Si, Zhang Xiaofeng, Lu Kaidi, Xu Tianyu, Zheng Hui , et al. (6 additional authors not shown)

    Abstract: Video super-resolution has recently become one of the most important mobile-related problems due to the rise of video communication and streaming services. While many solutions have been proposed for this task, the majority of them are too computationally expensive to run on portable devices with limited hardware resources. To address this problem, we introduce the first Mobile AI challenge, where… ▽ More

    Submitted 17 May, 2021; originally announced May 2021.

    Comments: Mobile AI 2021 Workshop and Challenges: https://ai-benchmark.com/workshops/mai/2021/. arXiv admin note: substantial text overlap with arXiv:2105.07825. substantial text overlap with arXiv:2105.08629, arXiv:2105.07809, arXiv:2105.08630

  6. arXiv:2009.06943  [pdf, other

    eess.IV cs.CV

    AIM 2020 Challenge on Efficient Super-Resolution: Methods and Results

    Authors: Kai Zhang, Martin Danelljan, Yawei Li, Radu Timofte, Jie Liu, Jie Tang, Gangshan Wu, Yu Zhu, Xiangyu He, Wenjie Xu, Chenghua Li, Cong Leng, Jian Cheng, Guangyang Wu, Wenyi Wang, Xiaohong Liu, Hengyuan Zhao, Xiangtao Kong, **gwen He, Yu Qiao, Chao Dong, Xiaotong Luo, Liang Chen, Jiangtao Zhang, Maitreya Suin , et al. (60 additional authors not shown)

    Abstract: This paper reviews the AIM 2020 challenge on efficient single image super-resolution with focus on the proposed solutions and results. The challenge task was to super-resolve an input image with a magnification factor x4 based on a set of prior examples of low and corresponding high resolution images. The goal is to devise a network that reduces one or several aspects such as runtime, parameter co… ▽ More

    Submitted 15 September, 2020; originally announced September 2020.

  7. GIA-Net: Global Information Aware Network for Low-light Imaging

    Authors: Zibo Meng, Runsheng Xu, Chiu Man Ho

    Abstract: It is extremely challenging to acquire perceptually plausible images under low-light conditions due to low SNR. Most recently, U-Nets have shown promising results for low-light imaging. However, vanilla U-Nets generate images with artifacts such as color inconsistency due to the lack of global color information. In this paper, we propose a global information aware (GIA) module, which is capable of… ▽ More

    Submitted 14 September, 2020; originally announced September 2020.

    Comments: 16 pages 6 figures; accepted to AIM at ECCV 2020

    Journal ref: Computer Vision -- ECCV 2020 Workshops, 2020, 327--342

  8. arXiv:2008.01057  [pdf, other

    cs.CV

    Residual Frames with Efficient Pseudo-3D CNN for Human Action Recognition

    Authors: Jiawei Chen, Jenson Hsiao, Chiu Man Ho

    Abstract: Human action recognition is regarded as a key cornerstone in domains such as surveillance or video understanding. Despite recent progress in the development of end-to-end solutions for video-based action recognition, achieving state-of-the-art performance still requires using auxiliary hand-crafted motion representations, e.g., optical flow, which are usually computationally demanding. In this wor… ▽ More

    Submitted 3 August, 2020; originally announced August 2020.

  9. arXiv:2005.01996  [pdf, other

    eess.IV cs.CV

    NTIRE 2020 Challenge on Real-World Image Super-Resolution: Methods and Results

    Authors: Andreas Lugmayr, Martin Danelljan, Radu Timofte, Namhyuk Ahn, Dongwoon Bai, Jie Cai, Yun Cao, Junyang Chen, Kaihua Cheng, SeYoung Chun, Wei Deng, Mostafa El-Khamy, Chiu Man Ho, Xiaozhong Ji, Amin Kheradmand, Gwantae Kim, Hanseok Ko, Kanghyu Lee, Jungwon Lee, Hao Li, Ziluan Liu, Zhi-Song Liu, Shuai Liu, Yunhua Lu, Zibo Meng , et al. (21 additional authors not shown)

    Abstract: This paper reviews the NTIRE 2020 challenge on real world super-resolution. It focuses on the participating methods and final results. The challenge addresses the real world setting, where paired true high and low-resolution images are unavailable. For training, only one set of source input images is therefore provided along with a set of unpaired high-quality target images. In Track 1: Image Proc… ▽ More

    Submitted 5 May, 2020; originally announced May 2020.

  10. arXiv:2004.13674  [pdf, other

    eess.IV cs.CV

    Residual Channel Attention Generative Adversarial Network for Image Super-Resolution and Noise Reduction

    Authors: Jie Cai, Zibo Meng, Chiu Man Ho

    Abstract: Image super-resolution is one of the important computer vision techniques aiming to reconstruct high-resolution images from corresponding low-resolution ones. Most recently, deep learning-based approaches have been demonstrated for image super-resolution. However, as the deep networks go deeper, they become more difficult to train and more difficult to restore the finer texture details, especially… ▽ More

    Submitted 28 April, 2020; originally announced April 2020.

  11. arXiv:1811.08056  [pdf, ps, other

    cs.LG cs.CV stat.ML

    Gradient-Coherent Strong Regularization for Deep Neural Networks

    Authors: Dae Hoon Park, Chiu Man Ho, Yi Chang, Huaqing Zhang

    Abstract: Regularization plays an important role in generalization of deep neural networks, which are often prone to overfitting with their numerous parameters. L1 and L2 regularizers are common regularization tools in machine learning with their simplicity and effectiveness. However, we observe that imposing strong L1 or L2 regularization with stochastic gradient descent on deep neural networks easily fail… ▽ More

    Submitted 17 October, 2019; v1 submitted 19 November, 2018; originally announced November 2018.

  12. arXiv:1810.08322  [pdf, ps, other

    cs.LG cs.CV stat.ML

    Sequenced-Replacement Sampling for Deep Learning

    Authors: Chiu Man Ho, Dae Hoon Park, Wei Yang, Yi Chang

    Abstract: We propose sequenced-replacement sampling (SRS) for training deep neural networks. The basic idea is to assign a fixed sequence index to each sample in the dataset. Once a mini-batch is randomly drawn in each training iteration, we refill the original dataset by successively adding samples according to their sequence index. Thus we carry out replacement sampling but in a batched and sequenced way.… ▽ More

    Submitted 18 October, 2018; originally announced October 2018.

  13. arXiv:1412.7774  [pdf

    cs.NE

    Improved Parameter Identification Method Based on Moving Rate

    Authors: Chol Man Ho, Son Il Gwak, Song Ho Pak, Jong Won Ha

    Abstract: To improve the problem that the parameter identification for fuzzy neural network has many time complexities in calculating, an improved T-S fuzzy inference method and an parameter identification method for fuzzy neural network are proposed. It mainly includes three parts. First, improved fuzzy inference method based on production term for T-S Fuzzy model is explained. Then, compared with existing… ▽ More

    Submitted 24 December, 2014; originally announced December 2014.