Showing 1–8 of 8 results for author: Shih, H

  1. arXiv:2406.01033  [pdf]

    cs.CV cs.LG cs.MM

    Generalized Jersey Number Recognition Using Multi-task Learning With Orientation-guided Weight Refinement

    Authors: Yung-Hui Lin, Yu-Wen Chang, Huang-Chia Shih, Takahiro Ogawa

    Abstract: Jersey number recognition (JNR) has always been an important task in sports analytics. Improving recognition accuracy remains an ongoing challenge because images are subject to blurring, occlusion, deformity, and low resolution. Recent research has addressed these problems using number localization and optical character recognition. Some approaches apply player identification schemes to image sequ…

    Submitted 3 June, 2024; originally announced June 2024.

    Comments: 10 pages, 6 figures, 5 tables

  2. arXiv:2402.06982  [pdf, other]

    cs.CV cs.AI physics.med-ph

    Treatment-wise Glioblastoma Survival Inference with Multi-parametric Preoperative MRI

    Authors: Xiaofeng Liu, Nadya Shusharina, Helen A Shih, C.-C. Jay Kuo, Georges El Fakhri, Jonghye Woo

    Abstract: In this work, we aim to predict the survival time (ST) of glioblastoma (GBM) patients undergoing different treatments based on preoperative magnetic resonance (MR) scans. The personalized and precise treatment planning can be achieved by comparing the ST of different treatments. It is well established that both the current status of the patient (as represented by the MR scans) and the choice of tr…

    Submitted 10 February, 2024; originally announced February 2024.

    Comments: SPIE Medical Imaging 2024: Computer-Aided Diagnosis

  3. arXiv:2305.19404  [pdf, other]

    cs.CV cs.AI cs.LG physics.med-ph

    Incremental Learning for Heterogeneous Structure Segmentation in Brain Tumor MRI

    Authors: Xiaofeng Liu, Helen A. Shih, Fangxu Xing, Emiliano Santarnecchi, Georges El Fakhri, Jonghye Woo

    Abstract: Deep learning (DL) models for segmenting various anatomical structures have achieved great success via a static DL model that is trained in a single source domain. Yet, the static DL model is likely to perform poorly in a continually evolving environment, requiring appropriate model updates. In an incremental learning setting, we would expect that well-trained static models are updated, following…

    Submitted 30 May, 2023; originally announced May 2023.

    Comments: Early Accept to MICCAI 2023

  4. A Real Time 1280x720 Object Detection Chip With 585MB/s Memory Traffic

    Authors: Kuo-Wei Chang, Hsu-Tung Shih, Tian-Sheuan Chang, Shang-Hong Tsai, Chih-Chyau Yang, Chien-Ming Wu, Chun-Ming Huang

    Abstract: Memory bandwidth has become the real-time bottleneck of current deep learning accelerators (DLA), particularly for high definition (HD) object detection. Under resource constraints, this paper proposes a low memory traffic DLA chip with joint hardware and software optimization. To maximize hardware utilization under memory bandwidth, we morph and fuse the object detection model into a group fusion…

    Submitted 2 May, 2022; originally announced May 2022.

    Comments: 11 pages, 14 figures, to be published in IEEE Transactions on Very Large Scale Integration (VLSI) Systems

  5. arXiv:2205.00779  [pdf, other]

    cs.AR cs.CV cs.LG

    Zebra: Memory Bandwidth Reduction for CNN Accelerators With Zero Block Regularization of Activation Maps

    Authors: Hsu-Tung Shih, Tian-Sheuan Chang

    Abstract: The large amount of memory bandwidth between local buffer and external DRAM has become the speedup bottleneck of CNN hardware accelerators, especially for activation maps. To reduce memory bandwidth, we propose to learn pruning unimportant blocks dynamically with zero block regularization of activation maps (Zebra). This strategy has low computational overhead and could easily integrate with other…

    Submitted 2 May, 2022; originally announced May 2022.

    Comments: 5 pages, 5 figures, published in IEEE ISCAS 2021

  6. arXiv:2202.10277  [pdf, other]

    cs.CV cs.LG

    End-to-End High Accuracy License Plate Recognition Based on Depthwise Separable Convolution Networks

    Authors: Song-Ren Wang, Hong-Yang Shih, Zheng-Yi Shen, Wen-Kai Tai

    Abstract: Automatic license plate recognition plays a crucial role in modern transportation systems such as for traffic monitoring and vehicle violation detection. In real-world scenarios, license plate recognition still faces many challenges and is impaired by unpredictable interference such as weather or lighting conditions. Many machine learning based ALPR solutions have been proposed to solve such chall…

    Submitted 21 February, 2022; originally announced February 2022.

  7. arXiv:1706.03038  [pdf, other]

    cs.CV

    Okutama-Action: An Aerial View Video Dataset for Concurrent Human Action Detection

    Authors: Mohammadamin Barekatain, Miquel Martí, Hsueh-Fu Shih, Samuel Murray, Kotaro Nakayama, Yutaka Matsuo, Helmut Prendinger

    Abstract: Despite significant progress in the development of human action detection datasets and algorithms, no current dataset is representative of real-world aerial view scenarios. We present Okutama-Action, a new video dataset for aerial view concurrent human action detection. It consists of 43 minute-long fully-annotated sequences with 12 action classes. Okutama-Action features many challenges missing i…

    Submitted 15 June, 2017; v1 submitted 9 June, 2017; originally announced June 2017.

    Comments: Computer Vision and Pattern Recognition Workshops (CVPRW), Hawaii, USA, 2017

  8. A Survey on Content-Aware Video Analysis for Sports

    Authors: Huang-Chia Shih

    Abstract: Sports data analysis is becoming increasingly large-scale, diversified, and shared, but difficulty persists in rapidly accessing the most crucial information. Previous surveys have focused on the methodologies of sports video analysis from the spatiotemporal viewpoint instead of a content-based viewpoint, and few of these studies have considered semantics. This study develops a deeper interpretati…

    Submitted 3 March, 2017; originally announced March 2017.

    Comments: Accepted for publication in IEEE Transactions on Circuits and Systems for Video Technology (TCSVT)