Skip to main content

Showing 1–40 of 40 results for author: Chou, Y

Searching in archive cs. Search in all archives.
.
  1. arXiv:2407.00556  [pdf, other

    cs.MM

    Revisiting Vision-Language Features Adaptation and Inconsistency for Social Media Popularity Prediction

    Authors: Chih-Chung Hsu, Chia-Ming Lee, Yu-Fan Lin, Yi-Shiuan Chou, Chih-Yu Jian, Chi-Han Tsai

    Abstract: Social media popularity (SMP) prediction is a complex task involving multi-modal data integration. While pre-trained vision-language models (VLMs) like CLIP have been widely adopted for this task, their effectiveness in capturing the unique characteristics of social media content remains unexplored. This paper critically examines the applicability of CLIP-based features in SMP prediction, focusing… ▽ More

    Submitted 29 June, 2024; originally announced July 2024.

    Comments: Submission of the 7th Social Media Prediction Challenge

  2. arXiv:2406.19941  [pdf, other

    cs.CV

    GRACE: Graph-Regularized Attentive Convolutional Entanglement with Laplacian Smoothing for Robust DeepFake Video Detection

    Authors: Chih-Chung Hsu, Shao-Ning Chen, Mei-Hsuan Wu, Yi-Fang Wang, Chia-Ming Lee, Yi-Shiuan Chou

    Abstract: As DeepFake video manipulation techniques escalate, posing profound threats, the urgent need to develop efficient detection strategies is underscored. However, one particular issue lies with facial images being mis-detected, often originating from degraded videos or adversarial attacks, leading to unexpected temporal artifacts that can undermine the efficacy of DeepFake video detection techniques.… ▽ More

    Submitted 30 June, 2024; v1 submitted 28 June, 2024; originally announced June 2024.

    Comments: Submitted to TPAMI 2024

  3. arXiv:2406.01356  [pdf, other

    cs.CV

    MP-PolarMask: A Faster and Finer Instance Segmentation for Concave Images

    Authors: Ke-Lei Wang, Pin-Hsuan Chou, Young-Ching Chou, Chia-Jen Liu, Cheng-Kuan Lin, Yu-Chee Tseng

    Abstract: While there are a lot of models for instance segmentation, PolarMask stands out as a unique one that represents an object by a Polar coordinate system. With an anchor-box-free design and a single-stage framework that conducts detection and segmentation at one time, PolarMask is proved to be able to balance efficiency and accuracy. Hence, it can be easily connected with other downstream real-time a… ▽ More

    Submitted 3 June, 2024; originally announced June 2024.

  4. arXiv:2405.16466  [pdf, other

    cs.NE

    High-Performance Temporal Reversible Spiking Neural Networks with $O(L)$ Training Memory and $O(1)$ Inference Cost

    Authors: JiaKui Hu, Man Yao, Xuerui Qiu, Yuhong Chou, Yuxuan Cai, Ning Qiao, Yonghong Tian, Bo XU, Guoqi Li

    Abstract: Multi-timestep simulation of brain-inspired Spiking Neural Networks (SNNs) boost memory requirements during training and increase inference energy cost. Current training methods cannot simultaneously solve both training and inference dilemmas. This work proposes a novel Temporal Reversible architecture for SNNs (T-RevSNN) to jointly address the training and inference challenges by altering the for… ▽ More

    Submitted 26 May, 2024; originally announced May 2024.

    Comments: Accepted by ICML2024

  5. LEO Satellite Network Access in the Wild: Potentials, Experiences, and Challenges

    Authors: Sami Ma, Yi Ching Chou, Miao Zhang, Hao Fang, Haoyuan Zhao, Jiangchuan Liu, William I. Atlas

    Abstract: In the past three years, working with the Pacific Salmon Foundation and various First Nations groups, we have established Starlink-empowered wild salmon monitoring sites in remote Northern British Columbia, Canada. We report our experiences with the network services in these challenging environments, including deep woods and deep valleys, that lack infrastructural support with some close to Starli… ▽ More

    Submitted 10 May, 2024; originally announced May 2024.

    Comments: 8 pages, 6 figures

    ACM Class: C.2.1

  6. arXiv:2404.10343  [pdf, other

    cs.CV eess.IV

    The Ninth NTIRE 2024 Efficient Super-Resolution Challenge Report

    Authors: Bin Ren, Yawei Li, Nancy Mehta, Radu Timofte, Hongyuan Yu, Cheng Wan, Yuxin Hong, Bingnan Han, Zhuoyuan Wu, Yajun Zou, Yuqing Liu, Jizhe Li, Keji He, Chao Fan, Heng Zhang, Xiaolin Zhang, Xuanwu Yin, Kunlong Zuo, Bohao Liao, Peizhe Xia, Long Peng, Zhibo Du, Xin Di, Wangkai Li, Yang Wang , et al. (109 additional authors not shown)

    Abstract: This paper provides a comprehensive review of the NTIRE 2024 challenge, focusing on efficient single-image super-resolution (ESR) solutions and their outcomes. The task of this challenge is to super-resolve an input image with a magnification factor of x4 based on pairs of low and corresponding high-resolution images. The primary objective is to develop networks that optimize various aspects such… ▽ More

    Submitted 25 June, 2024; v1 submitted 16 April, 2024; originally announced April 2024.

    Comments: The report paper of NTIRE2024 Efficient Super-resolution, accepted by CVPRW2024

  7. arXiv:2404.09790  [pdf, other

    cs.CV

    NTIRE 2024 Challenge on Image Super-Resolution ($\times$4): Methods and Results

    Authors: Zheng Chen, Zongwei Wu, Eduard Zamfir, Kai Zhang, Yulun Zhang, Radu Timofte, Xiaokang Yang, Hongyuan Yu, Cheng Wan, Yuxin Hong, Zhijuan Huang, Yajun Zou, Yuan Huang, Jiamin Lin, Bingnan Han, Xianyu Guan, Yongsheng Yu, Daoan Zhang, Xuanwu Yin, Kunlong Zuo, **hua Hao, Kai Zhao, Kun Yuan, Ming Sun, Chao Zhou , et al. (63 additional authors not shown)

    Abstract: This paper reviews the NTIRE 2024 challenge on image super-resolution ($\times$4), highlighting the solutions proposed and the outcomes obtained. The challenge involves generating corresponding high-resolution (HR) images, magnified by a factor of four, from low-resolution (LR) inputs using prior information. The LR images originate from bicubic downsampling degradation. The aim of the challenge i… ▽ More

    Submitted 15 April, 2024; originally announced April 2024.

    Comments: NTIRE 2024 webpage: https://cvlai.net/ntire/2024. Code: https://github.com/zhengchen1999/NTIRE2024_ImageSR_x4

  8. arXiv:2404.01643  [pdf, other

    eess.IV cs.CV cs.LG

    A Closer Look at Spatial-Slice Features Learning for COVID-19 Detection

    Authors: Chih-Chung Hsu, Chia-Ming Lee, Yang Fan Chiang, Yi-Shiuan Chou, Chih-Yu Jiang, Shen-Chieh Tai, Chi-Han Tsai

    Abstract: Conventional Computed Tomography (CT) imaging recognition faces two significant challenges: (1) There is often considerable variability in the resolution and size of each CT scan, necessitating strict requirements for the input size and adaptability of models. (2) CT-scan contains large number of out-of-distribution (OOD) slices. The crucial features may only be present in specific spatial regions… ▽ More

    Submitted 20 April, 2024; v1 submitted 2 April, 2024; originally announced April 2024.

    Comments: Camera-ready version, accepted by DEF-AI-MIA workshop, in conjunted with CVPR2024

  9. arXiv:2404.00722  [pdf, other

    cs.CV cs.AI

    DRCT: Saving Image Super-resolution away from Information Bottleneck

    Authors: Chih-Chung Hsu, Chia-Ming Lee, Yi-Shiuan Chou

    Abstract: In recent years, Vision Transformer-based approaches for low-level vision tasks have achieved widespread success. Unlike CNN-based models, Transformers are more adept at capturing long-range dependencies, enabling the reconstruction of images utilizing non-local information. In the domain of super-resolution, Swin-transformer-based models have become mainstream due to their capability of global sp… ▽ More

    Submitted 15 April, 2024; v1 submitted 31 March, 2024; originally announced April 2024.

    Comments: Camera-ready version, NTIRE 2024 Image Super-resolution (x4)

  10. arXiv:2403.11230  [pdf, other

    eess.IV cs.CV cs.LG

    Simple 2D Convolutional Neural Network-based Approach for COVID-19 Detection

    Authors: Chih-Chung Hsu, Chia-Ming Lee, Yang Fan Chiang, Yi-Shiuan Chou, Chih-Yu Jiang, Shen-Chieh Tai, Chi-Han Tsai

    Abstract: This study explores the use of deep learning techniques for analyzing lung Computed Tomography (CT) images. Classic deep learning approaches face challenges with varying slice counts and resolutions in CT images, a diversity arising from the utilization of assorted scanning equipment. Typically, predictions are made on single slices which are then combined for a comprehensive outcome. Yet, this me… ▽ More

    Submitted 17 March, 2024; originally announced March 2024.

  11. arXiv:2312.06668  [pdf

    cs.CL cs.SD eess.AS

    Evaluating Self-supervised Speech Models on a Taiwanese Hokkien Corpus

    Authors: Yi-Hui Chou, Kalvin Chang, Meng-Ju Wu, Winston Ou, Alice Wen-Hsin Bi, Carol Yang, Bryan Y. Chen, Rong-Wei Pai, Po-Yen Yeh, Jo-Peng Chiang, Iu-Tshian Phoann, Winnie Chang, Chenxuan Cui, Noel Chen, Jiatong Shi

    Abstract: Taiwanese Hokkien is declining in use and status due to a language shift towards Mandarin in Taiwan. This is partly why it is a low resource language in NLP and speech research today. To ensure that the state of the art in speech processing does not leave Taiwanese Hokkien behind, we contribute a 1.5-hour dataset of Taiwanese Hokkien to ML-SUPERB's hidden set. Evaluating ML-SUPERB's suite of self-… ▽ More

    Submitted 5 December, 2023; originally announced December 2023.

    Comments: Accepted to ASRU 2023

  12. Acquiring Weak Annotations for Tumor Localization in Temporal and Volumetric Data

    Authors: Yu-Cheng Chou, Bowen Li, Deng-** Fan, Alan Yuille, Zongwei Zhou

    Abstract: Creating large-scale and well-annotated datasets to train AI algorithms is crucial for automated tumor detection and localization. However, with limited resources, it is challenging to determine the best type of annotations when annotating massive amounts of unlabeled data. To address this issue, we focus on polyps in colonoscopy videos and pancreatic tumors in abdominal CT scans; both application… ▽ More

    Submitted 20 February, 2024; v1 submitted 23 October, 2023; originally announced October 2023.

    Comments: Published in Machine Intelligence Research

    Journal ref: Mach. Intell. Res. (2024)

  13. arXiv:2308.06582  [pdf, other

    cs.NE

    Gated Attention Coding for Training High-performance and Efficient Spiking Neural Networks

    Authors: Xuerui Qiu, Rui-Jie Zhu, Yuhong Chou, Zhaorui Wang, Liang-jian Deng, Guoqi Li

    Abstract: Spiking neural networks (SNNs) are emerging as an energy-efficient alternative to traditional artificial neural networks (ANNs) due to their unique spike-based event-driven nature. Coding is crucial in SNNs as it converts external input stimuli into spatio-temporal feature sequences. However, most existing deep SNNs rely on direct coding that generates powerless spike representation and lacks the… ▽ More

    Submitted 4 June, 2024; v1 submitted 12 August, 2023; originally announced August 2023.

    Comments: Accepted by Proceedings of the AAAI Conference on Artificial Intelligence 38 (AAAI 24)

  14. arXiv:2308.04872  [pdf, other

    cs.CV

    Tracking Players in a Badminton Court by Two Cameras

    Authors: Young-Ching Chou, Shen-Ru Zhang, Bo-Wei Chen, Hong-Qi Chen, Cheng-Kuan Lin, Yu-Chee Tseng

    Abstract: This study proposes a simple method for multi-object tracking (MOT) of players in a badminton court. We leverage two off-the-shelf cameras, one on the top of the court and the other on the side of the court. The one on the top is to track players' trajectories, while the one on the side is to analyze the pixel features of players. By computing the correlations between adjacent frames and engaging… ▽ More

    Submitted 9 August, 2023; originally announced August 2023.

  15. arXiv:2308.03008  [pdf, other

    eess.IV cs.CV cs.LG

    Early Detection and Localization of Pancreatic Cancer by Label-Free Tumor Synthesis

    Authors: Bowen Li, Yu-Cheng Chou, Shuwen Sun, Hualin Qiao, Alan Yuille, Zongwei Zhou

    Abstract: Early detection and localization of pancreatic cancer can increase the 5-year survival rate for patients from 8.5% to 20%. Artificial intelligence (AI) can potentially assist radiologists in detecting pancreatic tumors at an early stage. Training AI models require a vast number of annotated examples, but the availability of CT scans obtaining early-stage tumors is constrained. This is because earl… ▽ More

    Submitted 5 August, 2023; originally announced August 2023.

    Comments: Big Task Small Data, 1001-AI, MICCAI Workshop, 2023

  16. arXiv:2307.11411  [pdf, other

    cs.CV cs.AI

    Deep Directly-Trained Spiking Neural Networks for Object Detection

    Authors: Qiaoyi Su, Yuhong Chou, Yifan Hu, Jianing Li, Shijie Mei, Ziyang Zhang, Guoqi Li

    Abstract: Spiking neural networks (SNNs) are brain-inspired energy-efficient models that encode information in spatiotemporal dynamics. Recently, deep SNNs trained directly have shown great success in achieving high performance on classification tasks with very few time steps. However, how to design a directly-trained SNN for the regression task of object detection still remains a challenging problem. To ad… ▽ More

    Submitted 26 July, 2023; v1 submitted 21 July, 2023; originally announced July 2023.

    Comments: Accepted by ICCV2023

  17. arXiv:2306.09607  [pdf, other

    cs.CL

    Listener Model for the PhotoBook Referential Game with CLIPScores as Implicit Reference Chain

    Authors: Shih-Lun Wu, Yi-Hui Chou, Liangze Li

    Abstract: PhotoBook is a collaborative dialogue game where two players receive private, partially-overlap** sets of images and resolve which images they have in common. It presents machines with a great challenge to learn how people build common ground around multimodal context to communicate effectively. Methods developed in the literature, however, cannot be deployed to real gameplay since they only tac… ▽ More

    Submitted 15 June, 2023; originally announced June 2023.

    Comments: Accepted to ACL 2023 main conference (short paper)

  18. arXiv:2305.12148  [pdf, other

    cs.LG

    Probabilistic Modeling: Proving the Lottery Ticket Hypothesis in Spiking Neural Network

    Authors: Man Yao, Yuhong Chou, Guangshe Zhao, Xiawu Zheng, Yonghong Tian, Bo Xu, Guoqi Li

    Abstract: The Lottery Ticket Hypothesis (LTH) states that a randomly-initialized large neural network contains a small sub-network (i.e., winning tickets) which, when trained in isolation, can achieve comparable performance to the large network. LTH opens up a new path for network pruning. Existing proofs of LTH in Artificial Neural Networks (ANNs) are based on continuous activation functions, such as ReLU,… ▽ More

    Submitted 20 May, 2023; originally announced May 2023.

    Comments: 22pages, 5 figures

  19. arXiv:2303.14334  [pdf, other

    cs.HC cs.AI cs.CL

    The Semantic Reader Project: Augmenting Scholarly Documents through AI-Powered Interactive Reading Interfaces

    Authors: Kyle Lo, Joseph Chee Chang, Andrew Head, Jonathan Bragg, Amy X. Zhang, Cassidy Trier, Chloe Anastasiades, Tal August, Russell Authur, Danielle Bragg, Erin Bransom, Isabel Cachola, Stefan Candra, Yoganand Chandrasekhar, Yen-Sung Chen, Evie Yu-Yen Cheng, Yvonne Chou, Doug Downey, Rob Evans, Raymond Fok, Fangzhou Hu, Regan Huff, Dongyeop Kang, Tae Soo Kim, Rodney Kinney , et al. (30 additional authors not shown)

    Abstract: Scholarly publications are key to the transfer of knowledge from scholars to others. However, research papers are information-dense, and as the volume of the scientific literature grows, the need for new technology to support the reading process grows. In contrast to the process of finding papers, which has been transformed by Internet technology, the experience of reading research papers has chan… ▽ More

    Submitted 23 April, 2023; v1 submitted 24 March, 2023; originally announced March 2023.

  20. arXiv:2303.13631  [pdf, other

    cs.SD cs.AI cs.CL eess.AS

    In-depth analysis of music structure as a text network

    Authors: **-Rui Tsai, Yen-Ting Chou, Nathan-Christopher Wang, Hui-Ling Chen, Hong-Yue Huang, Zih-Jia Luo, Tzay-Ming Hong

    Abstract: Music, enchanting and poetic, permeates every corner of human civilization. Although music is not unfamiliar to people, our understanding of its essence remains limited, and there is still no universally accepted scientific description. This is primarily due to music being regarded as a product of both reason and emotion, making it difficult to define. In this article, we focus on the fundamental… ▽ More

    Submitted 2 January, 2024; v1 submitted 21 March, 2023; originally announced March 2023.

    Comments: 7 pages, 8 figures

  21. arXiv:2212.13697  [pdf, other

    cs.NI

    Network Characteristics of LEO Satellite Constellations: A Starlink-Based Measurement from End Users

    Authors: Sami Ma, Yi Ching Chou, Haoyuan Zhao, Long Chen, Xiaoqiang Ma, Jiangchuan Liu

    Abstract: Low Earth orbit Satellite Networks (LSNs) have been advocated as a key infrastructure for truly global coverage in the forthcoming 6G. This paper presents our initial measurement results and observations on the end-to-end network characteristics of Starlink, arguably the largest LSN constellation to date. Our findings confirm that LSNs are a promising solution towards ubiquitous Internet coverage… ▽ More

    Submitted 27 December, 2022; originally announced December 2022.

    Comments: 12 pages, 20 figures, to be published in IEEE INFOCOM 2023

  22. Co-designing for a Hybrid Workplace Experience in Software Development

    Authors: Zhendong Wang, Yi-Hung Chou, Kayla Fathi, Tobias Schimmer, Peter Colligan, David Redmiles, Rafael Prikladnicki

    Abstract: With increasing demands for flexible work models, many IT organizations have adapted to hybrid work that promises enhanced team productivity as well as work satisfaction. To achieve productive engineering practice, collaborative product innovation, and effective mentorship in the ensuing hybrid work, we introduce a workshop approach on co-designing for a hybrid workplace experience and provide imp… ▽ More

    Submitted 19 December, 2022; originally announced December 2022.

    Comments: Accepted at IEEE Software

  23. Deep Gradient Learning for Efficient Camouflaged Object Detection

    Authors: Ge-Peng Ji, Deng-** Fan, Yu-Cheng Chou, Dengxin Dai, Alexander Liniger, Luc Van Gool

    Abstract: This paper introduces DGNet, a novel deep framework that exploits object gradient supervision for camouflaged object detection (COD). It decouples the task into two connected branches, i.e., a context and a texture encoder. The essential connection is the gradient-induced transition, representing a soft grou** between context and texture features. Benefiting from the simple but efficient framewo… ▽ More

    Submitted 8 August, 2022; v1 submitted 25 May, 2022; originally announced May 2022.

    Comments: Accepted by Machine Intelligence Research

    Journal ref: Machine Intelligence Research. 20, 92-108 (2023)

  24. arXiv:2204.04090  [pdf, other

    cs.LG

    Single-level Adversarial Data Synthesis based on Neural Tangent Kernels

    Authors: Yu-Rong Zhang, Ruei-Yang Su, Sheng Yen Chou, Shan-Hung Wu

    Abstract: Abstract Generative adversarial networks (GANs) have achieved impressive performance in data synthesis and have driven the development of many applications. However, GANs are known to be hard to train due to their bilevel objective, which leads to the problems of convergence, mode collapse, and gradient vanishing. In this paper, we propose a new generative model called the generative adversarial N… ▽ More

    Submitted 20 November, 2022; v1 submitted 8 April, 2022; originally announced April 2022.

  25. Video Polyp Segmentation: A Deep Learning Perspective

    Authors: Ge-Peng Ji, Guobao Xiao, Yu-Cheng Chou, Deng-** Fan, Kai Zhao, Geng Chen, Luc Van Gool

    Abstract: We present the first comprehensive video polyp segmentation (VPS) study in the deep learning era. Over the years, developments in VPS are not moving forward with ease due to the lack of large-scale fine-grained segmentation annotations. To address this issue, we first introduce a high-quality frame-by-frame annotated VPS dataset, named SUN-SEG, which contains 158,690 colonoscopy frames from the we… ▽ More

    Submitted 31 August, 2022; v1 submitted 27 March, 2022; originally announced March 2022.

    Comments: Accepted by Machine Intelligence Research 2022 (Project Page: https://github.com/GewelsJI/VPS)

    Journal ref: Machine Intelligence Research, vol. 19, no. 6, pp.531-549, 2022

  26. arXiv:2203.02399  [pdf, other

    cs.LG cs.AI

    Benchmarking Instance-Centric Counterfactual Algorithms for XAI: From White Box to Black Box

    Authors: Catarina Moreira, Yu-Liang Chou, Chihcheng Hsieh, Chun Ouyang, Joaquim Jorge, João Madeiras Pereira

    Abstract: This study investigates the impact of machine learning models on the generation of counterfactual explanations by conducting a benchmark evaluation over three different types of models: a decision tree (fully transparent, interpretable, white-box model), a random forest (semi-interpretable, grey-box model), and a neural network (fully opaque, black-box model). We tested the counterfactual generati… ▽ More

    Submitted 11 June, 2024; v1 submitted 4 March, 2022; originally announced March 2022.

  27. arXiv:2110.07957  [pdf, other

    eess.AS cs.CL cs.SD

    Don't speak too fast: The impact of data bias on self-supervised speech models

    Authors: Yen Meng, Yi-Hui Chou, Andy T. Liu, Hung-yi Lee

    Abstract: Self-supervised Speech Models (S3Ms) have been proven successful in many speech downstream tasks, like ASR. However, how pre-training data affects S3Ms' downstream behavior remains an unexplored issue. In this paper, we study how pre-training data affects S3Ms by pre-training models on biased datasets targeting different factors of speech, including gender, content, and prosody, and evaluate these… ▽ More

    Submitted 26 April, 2022; v1 submitted 15 October, 2021; originally announced October 2021.

    Comments: Accepted by ICASSP 2022

  28. arXiv:2108.13858  [pdf, other

    cs.LG cs.AI

    GRP-FED: Addressing Client Imbalance in Federated Learning via Global-Regularized Personalization

    Authors: Yen-Hsiu Chou, Shenda Hong, Chenxi Sun, Derun Cai, Moxian Song, Hongyan Li

    Abstract: Since data is presented long-tailed in reality, it is challenging for Federated Learning (FL) to train across decentralized clients as practical applications. We present Global-Regularized Personalization (GRP-FED) to tackle the data imbalanced issue by considering a single global model and multiple local models for each client. With adaptive aggregation, the global model treats multiple clients f… ▽ More

    Submitted 31 August, 2021; originally announced August 2021.

    Comments: (FL-ICML'21) International Workshop on Federated Learning for User Privacy and Data Confidentiality in Conjunction with ICML 2021

  29. arXiv:2107.05223  [pdf, other

    cs.SD cs.LG cs.MM eess.AS

    BERT-like Pre-training for Symbolic Piano Music Classification Tasks

    Authors: Yi-Hui Chou, I-Chun Chen, Chin-Jui Chang, Joann Ching, Yi-Hsuan Yang

    Abstract: This article presents a benchmark study of symbolic piano music classification using the masked language modelling approach of the Bidirectional Encoder Representations from Transformers (BERT). Specifically, we consider two types of MIDI data: MIDI scores, which are musical scores rendered directly into MIDI with no dynamics and precisely aligned with the metrical grid notated by its composer and… ▽ More

    Submitted 13 April, 2024; v1 submitted 12 July, 2021; originally announced July 2021.

    Comments: Accepted to Journal of Creative Music Systems

  30. arXiv:2105.10110  [pdf, other

    cs.CV

    Guidance and Teaching Network for Video Salient Object Detection

    Authors: Yingxia Jiao, Xiao Wang, Yu-Cheng Chou, Shouyuan Yang, Ge-Peng Ji, Rong Zhu, Ge Gao

    Abstract: Owing to the difficulties of mining spatial-temporal cues, the existing approaches for video salient object detection (VSOD) are limited in understanding complex and noisy scenarios, and often fail in inferring prominent objects. To alleviate such shortcomings, we propose a simple yet efficient architecture, termed Guidance and Teaching Network (GTNet), to independently distil effective spatial an… ▽ More

    Submitted 6 June, 2021; v1 submitted 20 May, 2021; originally announced May 2021.

    Comments: Accepted at IEEE ICIP 2021

  31. Progressively Normalized Self-Attention Network for Video Polyp Segmentation

    Authors: Ge-Peng Ji, Yu-Cheng Chou, Deng-** Fan, Geng Chen, Huazhu Fu, Debesh Jha, Ling Shao

    Abstract: Existing video polyp segmentation (VPS) models typically employ convolutional neural networks (CNNs) to extract features. However, due to their limited receptive fields, CNNs can not fully exploit the global temporal and spatial information in successive video frames, resulting in false-positive segmentation results. In this paper, we propose the novel PNS-Net (Progressively Normalized Self-attent… ▽ More

    Submitted 24 May, 2021; v1 submitted 18 May, 2021; originally announced May 2021.

    Comments: MICCAI 2021 (Provisional accept); Code: https://github.com/GewelsJI/PNS-Net

  32. Joint QoS-Aware Scheduling and Precoding for Massive MIMO Systems via Deep Reinforcement Learning

    Authors: Chih-Wei Huang, Yen-Cheng Chou, Hong-Yunn Chen, Cheng-Fu Chou

    Abstract: The rapid development of mobile networks proliferates the demands of high data rate, low latency, and high-reliability applications for the fifth-generation (5G) and beyond (B5G) mobile networks. Concurrently, the massive multiple-input-multiple-output (MIMO) technology is essential to realize the vision and requires coordination with resource management functions for high user experiences. Though… ▽ More

    Submitted 9 April, 2021; originally announced April 2021.

    Journal ref: IEEE Access, vol. 11, pp. 13243-13256, 2023

  33. arXiv:2103.04244  [pdf, other

    cs.AI cs.LG

    Counterfactuals and Causability in Explainable Artificial Intelligence: Theory, Algorithms, and Applications

    Authors: Yu-Liang Chou, Catarina Moreira, Peter Bruza, Chun Ouyang, Joaquim Jorge

    Abstract: There has been a growing interest in model-agnostic methods that can make deep learning models more transparent and explainable to a user. Some researchers recently argued that for a machine to achieve a certain degree of human-level explainability, this machine needs to provide human causally understandable explanations, also known as causability. A specific class of algorithms that have the pote… ▽ More

    Submitted 8 June, 2021; v1 submitted 6 March, 2021; originally announced March 2021.

  34. Contrast Adaptive Tissue Classification by Alternating Segmentation and Synthesis

    Authors: Dzung L. Pham, Yi-Yu Chou, Blake E. Dewey, Daniel S. Reich, John A. Butman, Snehashis Roy

    Abstract: Deep learning approaches to the segmentation of magnetic resonance images have shown significant promise in automating the quantitative analysis of brain images. However, a continuing challenge has been its sensitivity to the variability of acquisition protocols. Attempting to segment images that have different contrast properties from those within the training data generally leads to significantl… ▽ More

    Submitted 3 March, 2021; originally announced March 2021.

    Comments: 10 pages. MICCAI SASHIMI Workshop 2021

  35. arXiv:2007.10668  [pdf, other

    cs.AI cs.LG

    An Interpretable Probabilistic Approach for Demystifying Black-box Predictive Models

    Authors: Catarina Moreira, Yu-Liang Chou, Mythreyi Velmurugan, Chun Ouyang, Renuka Sindhgatta, Peter Bruza

    Abstract: The use of sophisticated machine learning models for critical decision making is faced with a challenge that these models are often applied as a "black-box". This has led to an increased interest in interpretable machine learning, where post hoc interpretation presents a useful mechanism for generating interpretations of complex learning models. In this paper, we propose a novel approach underpinn… ▽ More

    Submitted 21 July, 2020; originally announced July 2020.

  36. arXiv:2007.02235  [pdf, other

    cs.LG stat.ML

    Unbiased Risk Estimators Can Mislead: A Case Study of Learning with Complementary Labels

    Authors: Yu-Ting Chou, Gang Niu, Hsuan-Tien Lin, Masashi Sugiyama

    Abstract: In weakly supervised learning, unbiased risk estimator(URE) is a powerful tool for training classifiers when training and test data are drawn from different distributions. Nevertheless, UREs lead to overfitting in many problem settings when the models are complex like deep networks. In this paper, we investigate reasons for such overfitting by studying a weakly supervised problem called learning w… ▽ More

    Submitted 21 August, 2020; v1 submitted 5 July, 2020; originally announced July 2020.

    Comments: Accepted at ICML 2020

  37. arXiv:1805.04980  [pdf, other

    cs.CV

    Unifying and Merging Well-trained Deep Neural Networks for Inference Stage

    Authors: Yi-Min Chou, Yi-Ming Chan, Jia-Hong Lee, Chih-Yi Chiu, Chu-Song Chen

    Abstract: We propose a novel method to merge convolutional neural-nets for the inference stage. Given two well-trained networks that may have different architectures that handle different tasks, our method aligns the layers of the original networks and merges them into a unified model by sharing the representative codes of weights. The shared weights are further re-trained to fine-tune the performance of th… ▽ More

    Submitted 13 May, 2018; originally announced May 2018.

    Comments: To appear in the 27th International Joint Conference on Artificial Intelligence and the 23rd European Conference on Artificial Intelligence, 2018. (IJCAI-ECAI 2018)

  38. arXiv:1805.04262  [pdf, other

    cs.CV

    Stingray Detection of Aerial Images Using Augmented Training Images Generated by A Conditional Generative Model

    Authors: Yi-Min Chou, Chien-Hung Chen, Keng-Hao Liu, Chu-Song Chen

    Abstract: In this paper, we present an object detection method that tackles the stingray detection problem based on aerial images. In this problem, the images are aerially captured on a sea-surface area by using an Unmanned Aerial Vehicle (UAV), and the stingrays swimming under (but close to) the sea surface are the target we want to detect and locate. To this end, we use a deep object detection method, fas… ▽ More

    Submitted 25 June, 2018; v1 submitted 11 May, 2018; originally announced May 2018.

    Comments: to appear in CVPR 2018 Workshop (CVPR 2018 Workshop and Challenge: Automated Analysis of Marine Video for Environmental Monitoring)

  39. arXiv:1601.07021  [pdf

    cs.CV

    Polyhedron Volume-Ratio-based Classification for Image Recognition

    Authors: Qingxiang Feng, Jeng-Shyang Pan, Jar-Ferr Yang, Yang-Ting Chou

    Abstract: In this paper, a novel method, called polyhedron volume ratio classification (PVRC) is proposed for image recognition

    Submitted 26 January, 2016; originally announced January 2016.

  40. arXiv:1506.06366  [pdf

    cs.CE cs.AI cs.NE

    A Novel Method for Stock Forecasting based on Fuzzy Time Series Combined with the Longest Common/Repeated Sub-sequence

    Authors: He-Wen Chen, Zih-Ci Wang, Shu-Yu Kuo, Yao-Hsin Chou

    Abstract: Stock price forecasting is an important issue for investors since extreme accuracy in forecasting can bring about high profits. Fuzzy Time Series (FTS) and Longest Common/Repeated Sub-sequence (LCS/LRS) are two important issues for forecasting prices. However, to the best of our knowledge, there are no significant studies using LCS/LRS to predict stock prices. It is impossible that prices stay exa… ▽ More

    Submitted 21 June, 2015; originally announced June 2015.