Skip to main content

Showing 1–50 of 54 results for author: Hung, C

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.15193  [pdf, other

    cs.CL

    Reward Steering with Evolutionary Heuristics for Decoding-time Alignment

    Authors: Chia-Yu Hung, Navonil Majumder, Ambuj Mehrish, Soujanya Poria

    Abstract: The widespread applicability and increasing omnipresence of LLMs have instigated a need to align LLM responses to user and stakeholder preferences. Many preference optimization approaches have been proposed that fine-tune LLM parameters to achieve good alignment. However, such parameter tuning is known to interfere with model performance on many tasks. Moreover, kee** up with shifting user prefe… ▽ More

    Submitted 25 June, 2024; v1 submitted 21 June, 2024; originally announced June 2024.

  2. arXiv:2405.16450  [pdf, other

    cs.LG cs.AI cs.PL

    Synthesizing Programmatic Reinforcement Learning Policies with Large Language Model Guided Search

    Authors: Max Liu, Chan-Hung Yu, Wei-Hsu Lee, Cheng-Wei Hung, Yen-Chun Chen, Shao-Hua Sun

    Abstract: Programmatic reinforcement learning (PRL) has been explored for representing policies through programs as a means to achieve interpretability and generalization. Despite promising outcomes, current state-of-the-art PRL methods are hindered by sample inefficiency, necessitating tens of millions of program-environment interactions. To tackle this challenge, we introduce a novel LLM-guided search fra… ▽ More

    Submitted 26 May, 2024; originally announced May 2024.

  3. arXiv:2404.09956  [pdf, other

    cs.SD cs.AI cs.CL eess.AS

    Tango 2: Aligning Diffusion-based Text-to-Audio Generations through Direct Preference Optimization

    Authors: Navonil Majumder, Chia-Yu Hung, Deepanway Ghosal, Wei-Ning Hsu, Rada Mihalcea, Soujanya Poria

    Abstract: Generative multimodal content is increasingly prevalent in much of the content creation arena, as it has the potential to allow artists and media personnel to create pre-production mockups by quickly bringing their ideas to life. The generation of audio from text prompts is an important aspect of such processes in the music and film industry. Many of the recent diffusion-based text-to-audio models… ▽ More

    Submitted 16 April, 2024; v1 submitted 15 April, 2024; originally announced April 2024.

    Comments: https://github.com/declare-lab/tango

  4. arXiv:2404.08820  [pdf

    cs.CV cs.LG

    Single-image driven 3d viewpoint training data augmentation for effective wine label recognition

    Authors: Yueh-Cheng Huang, Hsin-Yi Chen, Cheng-Jui Hung, Jen-Hui Chuang, Jenq-Neng Hwang

    Abstract: Confronting the critical challenge of insufficient training data in the field of complex image recognition, this paper introduces a novel 3D viewpoint augmentation technique specifically tailored for wine label recognition. This method enhances deep learning model performance by generating visually realistic training samples from a single real-world wine label image, overcoming the challenges pose… ▽ More

    Submitted 12 April, 2024; originally announced April 2024.

  5. arXiv:2403.15759  [pdf

    cs.CY

    Deep Learning Approach to Forecasting COVID-19 Cases in Residential Buildings of Hong Kong Public Housing Estates: The Role of Environment and Sociodemographics

    Authors: E. Leung, J. Guan, KO. Kwok, CT. Hung, CC. Ching, KC. Chong, CHK. Yam, T. Sun, WH. Tsang, EK. Yeoh, A. Lee

    Abstract: Introduction: The current study investigates the complex association between COVID-19 and the studied districts' socioecology (e.g. internal and external built environment, sociodemographic profiles, etc.) to quantify their contributions to the early outbreaks and epidemic resurgence of COVID-19. Methods: We aligned the analytic model's architecture with the hierarchical structure of the resident'… ▽ More

    Submitted 23 March, 2024; originally announced March 2024.

  6. arXiv:2403.13842  [pdf

    cs.LG physics.soc-ph

    Analyzing the Variations in Emergency Department Boarding and Testing the Transferability of Forecasting Models across COVID-19 Pandemic Waves in Hong Kong: Hybrid CNN-LSTM approach to quantifying building-level socioecological risk

    Authors: Eman Leung, **g**g Guan, Kin On Kwok, CT Hung, CC. Ching, CK. Chung, Hector Tsang, EK Yeoh, Albert Lee

    Abstract: Emergency department's (ED) boarding (defined as ED waiting time greater than four hours) has been linked to poor patient outcomes and health system performance. Yet, effective forecasting models is rare before COVID-19, lacking during the peri-COVID era. Here, a hybrid convolutional neural network (CNN)-Long short-term memory (LSTM) model was applied to public-domain data sourced from Hong Kong's… ▽ More

    Submitted 17 March, 2024; originally announced March 2024.

  7. arXiv:2402.18883  [pdf, other

    cs.DS

    Efficient Processing of Subsequent Densest Subgraph Query

    Authors: Chia-Yang Hung, Chih-Ya Shen

    Abstract: Dense subgraph extraction is a fundamental problem in graph analysis and data mining, aimed at identifying cohesive and densely connected substructures within a given graph. It plays a crucial role in various domains, including social network analysis, biological network analysis, recommendation systems, and community detection. However, extracting a subgraph with the highest node similarity is a… ▽ More

    Submitted 29 February, 2024; originally announced February 2024.

    Comments: 11 pages

    MSC Class: 68W27

  8. arXiv:2401.15554  [pdf

    cs.CV

    Pericoronary adipose tissue feature analysis in CT calcium score images with comparison to coronary CTA

    Authors: Yingnan Song, Hao Wu, Juhwan Lee, Justin Kim, Ammar Hoori, Tao Hu, Vladislav Zimin, Mohamed Makhlouf, Sadeer Al-Kindi, Sanjay Rajagopalan, Chun-Ho Yun, Chung-Lieh Hung, David L. Wilson

    Abstract: We investigated the feasibility and advantages of using non-contrast CT calcium score (CTCS) images to assess pericoronary adipose tissue (PCAT) and its association with major adverse cardiovascular events (MACE). PCAT features from coronary CTA (CCTA) have been shown to be associated with cardiovascular risk but are potentially confounded by iodine. If PCAT in CTCS images can be similarly analyze… ▽ More

    Submitted 27 January, 2024; originally announced January 2024.

    Comments: 24 pages,10 figures

  9. arXiv:2401.11095  [pdf, other

    cs.HC cs.SD eess.AS

    SoundShift: Exploring Sound Manipulations for Accessible Mixed-Reality Awareness

    Authors: Ruei-Che Chang, Chia-Sheng Hung, Bing-Yu Chen, Dhruv Jain, Anhong Guo

    Abstract: Mixed-reality (MR) soundscapes blend real-world sound with virtual audio from hearing devices, presenting intricate auditory information that is hard to discern and differentiate. This is particularly challenging for blind or visually impaired individuals, who rely on sounds and descriptions in their everyday lives. To understand how complex audio information is consumed, we analyzed online forum… ▽ More

    Submitted 26 May, 2024; v1 submitted 19 January, 2024; originally announced January 2024.

    Comments: DIS 2024

  10. arXiv:2311.14966  [pdf, other

    cs.CL

    Walking a Tightrope -- Evaluating Large Language Models in High-Risk Domains

    Authors: Chia-Chien Hung, Wiem Ben Rim, Lindsay Frost, Lars Bruckner, Carolin Lawrence

    Abstract: High-risk domains pose unique challenges that require language models to provide accurate and safe responses. Despite the great success of large language models (LLMs), such as ChatGPT and its variants, their performance in high-risk domains remains unclear. Our study delves into an in-depth analysis of the performance of instruction-tuned LLMs, focusing on factual accuracy and safety adherence. T… ▽ More

    Submitted 25 November, 2023; originally announced November 2023.

    Comments: EMNLP 2023 Workshop on Benchmarking Generalisation in NLP (GenBench)

  11. arXiv:2311.07993  [pdf, other

    cs.CV

    Explicit Change Relation Learning for Change Detection in VHR Remote Sensing Images

    Authors: Dalong Zheng, Zebin Wu, Jia Liu, Chih-Cheng Hung, Zhihui Wei

    Abstract: Change detection has always been a concerned task in the interpretation of remote sensing images. It is essentially a unique binary classification task with two inputs, and there is a change relationship between these two inputs. At present, the mining of change relationship features is usually implicit in the network architectures that contain single-branch or two-branch encoders. However, due to… ▽ More

    Submitted 14 November, 2023; originally announced November 2023.

  12. arXiv:2310.14909  [pdf, other

    cs.CL cs.AI cs.LG

    Linking Surface Facts to Large-Scale Knowledge Graphs

    Authors: Gorjan Radevski, Kiril Gashteovski, Chia-Chien Hung, Carolin Lawrence, Goran Glavaš

    Abstract: Open Information Extraction (OIE) methods extract facts from natural language text in the form of ("subject"; "relation"; "object") triples. These facts are, however, merely surface forms, the ambiguity of which impedes their downstream usage; e.g., the surface phrase "Michael Jordan" may refer to either the former basketball player or the university professor. Knowledge Graphs (KGs), on the other… ▽ More

    Submitted 23 October, 2023; originally announced October 2023.

  13. arXiv:2310.08123  [pdf, other

    cs.CL

    Who Wrote it and Why? Prompting Large-Language Models for Authorship Verification

    Authors: Chia-Yu Hung, Zhiqiang Hu, Yujia Hu, Roy Ka-Wei Lee

    Abstract: Authorship verification (AV) is a fundamental task in natural language processing (NLP) and computational linguistics, with applications in forensic analysis, plagiarism detection, and identification of deceptive content. Existing AV techniques, including traditional stylometric and deep learning approaches, face limitations in terms of data requirements and lack of explainability. To address thes… ▽ More

    Submitted 12 October, 2023; originally announced October 2023.

    Comments: 7 pages,1 figure

  14. arXiv:2309.09658  [pdf

    cs.CL

    A Novel Method of Fuzzy Topic Modeling based on Transformer Processing

    Authors: Ching-Hsun Tseng, Shin-Jye Lee, Po-Wei Cheng, Chien Lee, Chih-Chieh Hung

    Abstract: Topic modeling is admittedly a convenient way to monitor markets trend. Conventionally, Latent Dirichlet Allocation, LDA, is considered a must-do model to gain this type of information. By given the merit of deducing keyword with token conditional probability in LDA, we can know the most possible or essential topic. However, the results are not intuitive because the given topics cannot wholly fit… ▽ More

    Submitted 18 September, 2023; originally announced September 2023.

    Comments: Asian Journal of Information and Communications, Vol.12, No. 1, 125-140

  15. arXiv:2306.15593  [pdf

    cs.CV

    Cardiac CT perfusion imaging of pericoronary adipose tissue (PCAT) highlights potential confounds in coronary CTA

    Authors: Hao Wu, Yingnan Song, Ammar Hoori, Ananya Subramaniam, Juhwan Lee, Justin Kim, Tao Hu, Sadeer Al-Kindi, Wei-Ming Huang, Chun-Ho Yun, Chung-Lieh Hung, Sanjay Rajagopalan, David L. Wilson

    Abstract: Features of pericoronary adipose tissue (PCAT) assessed from coronary computed tomography angiography (CCTA) are associated with inflammation and cardiovascular risk. As PCAT is vascularly connected with coronary vasculature, the presence of iodine is a potential confounding factor on PCAT HU and textures that has not been adequately investigated. Use dynamic cardiac CT perfusion (CCTP) to inform… ▽ More

    Submitted 27 June, 2023; originally announced June 2023.

    Comments: 13 pages, 8 figures

  16. arXiv:2305.12717  [pdf, other

    cs.CL cs.LG

    TADA: Efficient Task-Agnostic Domain Adaptation for Transformers

    Authors: Chia-Chien Hung, Lukas Lange, Jannik Strötgen

    Abstract: Intermediate training of pre-trained transformer-based language models on domain-specific data leads to substantial gains for downstream tasks. To increase efficiency and prevent catastrophic forgetting alleviated from full domain-adaptive pre-training, approaches such as adapters have been developed. However, these require additional parameters for each layer, and are criticized for their limited… ▽ More

    Submitted 22 May, 2023; originally announced May 2023.

    Comments: ACL-Findings 2023

  17. arXiv:2305.05139   

    cs.SD cs.MM eess.AS

    Temporal Convolution Network Based Onset Detection and Query by Humming System Design

    Authors: Yu Cheng Hung, Jian-Jiun Ding

    Abstract: Onsets are a key factor to split audio into several notes. In this paper, we ensemble multiple temporal convolution network (TCN) based model and utilize a restricted frequency range spectrogram to achieve more robust onset detection. Different from the present onset detection of QBH system which is only available in a clean scenario, our proposal of onset detection and speech enhancement can prev… ▽ More

    Submitted 7 June, 2023; v1 submitted 8 May, 2023; originally announced May 2023.

    Comments: This paper has been withdrawn by the author due to a crucial definition of probability threshold and several grammer and vocabulary mistakes

  18. arXiv:2305.03982  [pdf

    cs.SD cs.MM eess.AS

    Pitch Estimation by Denoising Preprocessor and Hybrid Estimation Model

    Authors: Yu Cheng Hung, ** Hung Chen, Jian Jiun Ding

    Abstract: Pitch estimation is to estimate the fundamental frequency and the midi number and plays a critical role in music signal analysis and vocal signal processing. In this work, we proposed a new architecture based on a learning-based enhancement preprocessor and a combination of several traditional and deep learning pitch estimation methods to achieve better pitch estimation performance in both noisy a… ▽ More

    Submitted 6 May, 2023; originally announced May 2023.

    Comments: From ICCE-Taiwan

  19. arXiv:2303.17388  [pdf, other

    cs.SE

    BPCE: A Prototype for Co-Evolution between Business Process Variants through Configurable Process Model

    Authors: Linyue Liu, Xi Guo, Chun Ouyang, Patrick C. K. Hung, Hong-Yu Zhang, Keqing He, Chen Mo, Zaiwen Feng

    Abstract: With the continuous development of business process management technology, the increasing business process models are usually owned by large enterprises. In large enterprises, different stakeholders may modify the same business process model. In order to better manage the changeability of processes, they adopt configurable business process models to manage process variants. However, the process va… ▽ More

    Submitted 30 March, 2023; originally announced March 2023.

    Comments: 18 pages , 11 figures

    MSC Class: 68N99 ACM Class: D.2.2

  20. arXiv:2303.03364  [pdf, other

    cs.RO cs.CV cs.LG

    Leveraging Scene Embeddings for Gradient-Based Motion Planning in Latent Space

    Authors: Jun Yamada, Chia-Man Hung, Jack Collins, Ioannis Havoutis, Ingmar Posner

    Abstract: Motion planning framed as optimisation in structured latent spaces has recently emerged as competitive with traditional methods in terms of planning success while significantly outperforming them in terms of computational speed. However, the real-world applicability of recent work in this domain remains limited by the need to express obstacle information directly in state-space, involving simple g… ▽ More

    Submitted 6 March, 2023; originally announced March 2023.

    Comments: Project website: https://amp-ls.github.io/

    Journal ref: IEEE International Conference on Robotics and Automation (ICRA), 2023

  21. arXiv:2211.14986  [pdf

    eess.IV cs.CV

    An Unpaired Cross-modality Segmentation Framework Using Data Augmentation and Hybrid Convolutional Networks for Segmenting Vestibular Schwannoma and Cochlea

    Authors: Yuzhou Zhuang, Hong Liu, Enmin Song, Coskun Cetinkaya, Chih-Cheng Hung

    Abstract: The crossMoDA challenge aims to automatically segment the vestibular schwannoma (VS) tumor and cochlea regions of unlabeled high-resolution T2 scans by leveraging labeled contrast-enhanced T1 scans. The 2022 edition extends the segmentation task by including multi-institutional scans. In this work, we proposed an unpaired cross-modality segmentation framework using data augmentation and hybrid con… ▽ More

    Submitted 27 November, 2022; originally announced November 2022.

    Comments: Accepted by BrainLes MICCAI proceedings

  22. Reaching Through Latent Space: From Joint Statistics to Path Planning in Manipulation

    Authors: Chia-Man Hung, Shaohong Zhong, Walter Goodwin, Oiwi Parker Jones, Martin Engelcke, Ioannis Havoutis, Ingmar Posner

    Abstract: We present a novel approach to path planning for robotic manipulators, in which paths are produced via iterative optimisation in the latent space of a generative model of robot poses. Constraints are incorporated through the use of constraint satisfaction classifiers operating on the same space. Optimisation leverages gradients through our learned models that provide a simple way to combine goal r… ▽ More

    Submitted 21 October, 2022; originally announced October 2022.

    Comments: 10 pages, 6 figures, 4 tables

    ACM Class: I.2.6; I.2.9; I.2.10

    Journal ref: IEEE Robotics and Automation Letters 7.2 (2022): 5334-5341

  23. arXiv:2210.07362  [pdf, other

    cs.CL

    Can Demographic Factors Improve Text Classification? Revisiting Demographic Adaptation in the Age of Transformers

    Authors: Chia-Chien Hung, Anne Lauscher, Dirk Hovy, Simone Paolo Ponzetto, Goran Glavaš

    Abstract: Demographic factors (e.g., gender or age) shape our language. Previous work showed that incorporating demographic factors can consistently improve performance for various NLP tasks with traditional NLP models. In this work, we investigate whether these previous findings still hold with state-of-the-art pretrained Transformer-based language models (PLMs). We use three common specialization methods… ▽ More

    Submitted 9 May, 2023; v1 submitted 13 October, 2022; originally announced October 2022.

    Comments: Findings of EACL 2023. arXiv admin note: text overlap with arXiv:2208.01029

  24. arXiv:2208.01029  [pdf, other

    cs.CL

    On the Limitations of Sociodemographic Adaptation with Transformers

    Authors: Chia-Chien Hung, Anne Lauscher, Dirk Hovy, Simone Paolo Ponzetto, Goran Glavaš

    Abstract: Sociodemographic factors (e.g., gender or age) shape our language. Previous work showed that incorporating specific sociodemographic factors can consistently improve performance for various NLP tasks in traditional NLP models. We investigate whether these previous findings still hold with state-of-the-art pretrained Transformers. We use three common specialization methods proven effective for inco… ▽ More

    Submitted 1 August, 2022; originally announced August 2022.

  25. arXiv:2206.03139  [pdf, other

    cs.LG cs.AI cs.CL

    Intra-agent speech permits zero-shot task acquisition

    Authors: Chen Yan, Federico Carnevale, Petko Georgiev, Adam Santoro, Aurelia Guy, Alistair Muldal, Chia-Chun Hung, Josh Abramson, Timothy Lillicrap, Gregory Wayne

    Abstract: Human language learners are exposed to a trickle of informative, context-sensitive language, but a flood of raw sensory data. Through both social language use and internal processes of rehearsal and practice, language learners are able to build high-level, semantic representations that explain their perceptions. Here, we take inspiration from such processes of "inner speech" in humans (Vygotsky, 1… ▽ More

    Submitted 7 June, 2022; originally announced June 2022.

  26. arXiv:2205.14981  [pdf, other

    cs.CL

    ZusammenQA: Data Augmentation with Specialized Models for Cross-lingual Open-retrieval Question Answering System

    Authors: Chia-Chien Hung, Tommaso Green, Robert Litschko, Tornike Tsereteli, Sotaro Takeshita, Marco Bombieri, Goran Glavaš, Simone Paolo Ponzetto

    Abstract: This paper introduces our proposed system for the MIA Shared Task on Cross-lingual Open-retrieval Question Answering (COQA). In this challenging scenario, given an input question the system has to gather evidence documents from a multilingual pool and generate from them an answer in the language of the question. We devised several approaches combining different model variants for three main compon… ▽ More

    Submitted 30 May, 2022; originally announced May 2022.

  27. arXiv:2205.10400  [pdf, other

    cs.CL

    Multi2WOZ: A Robust Multilingual Dataset and Conversational Pretraining for Task-Oriented Dialog

    Authors: Chia-Chien Hung, Anne Lauscher, Ivan Vulić, Simone Paolo Ponzetto, Goran Glavaš

    Abstract: Research on (multi-domain) task-oriented dialog (TOD) has predominantly focused on the English language, primarily due to the shortage of robust TOD datasets in other languages, preventing the systematic investigation of cross-lingual transfer for this crucial NLP application area. In this work, we introduce Multi2WOZ, a new multilingual multi-domain TOD dataset, derived from the well-established… ▽ More

    Submitted 20 May, 2022; originally announced May 2022.

    Comments: NAACL 2022

  28. arXiv:2205.07446  [pdf, other

    cs.CL cs.AI cs.LG

    Miutsu: NTU's TaskBot for the Alexa Prize

    Authors: Yen-Ting Lin, Hui-Chi Kuo, Ze-Song Xu, Ssu Chiu, Chieh-Chi Hung, Yi-Cheng Chen, Chao-Wei Huang, Yun-Nung Chen

    Abstract: This paper introduces Miutsu, National Taiwan University's Alexa Prize TaskBot, which is designed to assist users in completing tasks requiring multiple steps and decisions in two different domains -- home improvement and cooking. We overview our system design and architectural goals, and detail the proposed core elements, including question answering, task retrieval, social chatting, and various… ▽ More

    Submitted 16 May, 2022; originally announced May 2022.

  29. arXiv:2201.00008  [pdf, other

    cs.LG cs.AI

    A Lightweight and Accurate Spatial-Temporal Transformer for Traffic Forecasting

    Authors: Guanyao Li, Shuhan Zhong, S. -H. Gary Chan, Ruiyuan Li, Chih-Chieh Hung, Wen-Chih Peng

    Abstract: We study the forecasting problem for traffic with dynamic, possibly periodical, and joint spatial-temporal dependency between regions. Given the aggregated inflow and outflow traffic of regions in a city from time slots 0 to t-1, we predict the traffic at time t at any region. Prior arts in the area often consider the spatial and temporal dependencies in a decoupled manner or are rather computatio… ▽ More

    Submitted 3 May, 2022; v1 submitted 30 December, 2021; originally announced January 2022.

  30. arXiv:2110.08395  [pdf, other

    cs.CL

    DS-TOD: Efficient Domain Specialization for Task Oriented Dialog

    Authors: Chia-Chien Hung, Anne Lauscher, Simone Paolo Ponzetto, Goran Glavaš

    Abstract: Recent work has shown that self-supervised dialog-specific pretraining on large conversational datasets yields substantial gains over traditional language modeling (LM) pretraining in downstream task-oriented dialog (TOD). These approaches, however, exploit general dialogic corpora (e.g., Reddit) and thus presumably fail to reliably embed domain-specific knowledge useful for concrete downstream TO… ▽ More

    Submitted 20 May, 2022; v1 submitted 15 October, 2021; originally announced October 2021.

    Comments: Findings of ACL 2022

  31. arXiv:2107.05455  [pdf, ps, other

    cs.DM

    A Local Diagnosis Algorithm for Hypercube-like Networks under the BGM Diagnosis Model

    Authors: Cheng-Kuan Lin, Tzu-Liang Kung, Chun-Nan Hung, Yuan-Hsiang Teng

    Abstract: System diagnosis is process of identifying faulty nodes in a system. An efficient diagnosis is crucial for a multiprocessor system. The BGM diagnosis model is a modification of the PMC diagnosis model, which is a test-based diagnosis. In this paper, we present a specific structure and propose an algorithm for diagnosing a node in a system under the BGM model. We also give a polynomial-time algorit… ▽ More

    Submitted 8 June, 2022; v1 submitted 30 June, 2021; originally announced July 2021.

    Journal ref: Fundamenta Informaticae, Volume 185, Issue 4 (July 7, 2022) fi:7674

  32. arXiv:2104.06274  [pdf, other

    cs.DC

    Optimal Data Placement for Data-Sharing Scientific Workflows in Heterogeneous Edge-Cloud Computing Environments

    Authors: Xin Du, Songtao Tang, Zhihui Lu, Keke Gai, Jie Wu, Patrick C. K. Hung

    Abstract: The heterogeneous edge-cloud computing paradigm can provide a more optimal direction to deploy scientific workflows than traditional distributed computing or cloud computing environments. Due to the different sizes of scientific datasets and some of these datasets must keep private, it is still a difficult problem to finding an data placement strategy that can minimize data transmission as well as… ▽ More

    Submitted 13 April, 2021; originally announced April 2021.

  33. arXiv:2103.11881  [pdf, other

    cs.RO cs.LG

    Introspective Visuomotor Control: Exploiting Uncertainty in Deep Visuomotor Control for Failure Recovery

    Authors: Chia-Man Hung, Li Sun, Yizhe Wu, Ioannis Havoutis, Ingmar Posner

    Abstract: End-to-end visuomotor control is emerging as a compelling solution for robot manipulation tasks. However, imitation learning-based visuomotor control approaches tend to suffer from a common limitation, lacking the ability to recover from an out-of-distribution state caused by compounding errors. In this paper, instead of using tactile feedback or explicitly detecting the failure through vision, we… ▽ More

    Submitted 22 March, 2021; originally announced March 2021.

    Comments: 7 pages, 5 figures, 1 table

    ACM Class: I.2.9; I.2.10

  34. arXiv:2102.02446  [pdf, other

    cs.LG math.GT

    The Analysis from Nonlinear Distance Metric to Kernel-based Drug Prescription Prediction System

    Authors: Der-Chen Chang, Ophir Frieder, Chi-Feng Hung, Hao-Ren Yao

    Abstract: Distance metrics and their nonlinear variant play a crucial role in machine learning based real-world problem solving. We demonstrated how Euclidean and cosine distance measures differ not only theoretically but also in real-world medical application, namely, outcome prediction of drug prescription. Euclidean distance exhibits favorable properties in the local geometry problem. To this regard, Euc… ▽ More

    Submitted 23 February, 2021; v1 submitted 4 February, 2021; originally announced February 2021.

    Comments: Accepted to Journal of Nonlinear and Variational Analysis, JNVA 2021

  35. arXiv:2011.05755  [pdf, other

    q-bio.QM cs.DC eess.IV

    Cryo-RALib -- a modular library for accelerating alignment in cryo-EM

    Authors: Szu-Chi Chung, Cheng-Yu Hung, Huei-Lun Siao, Hung-Yi Wu, Wei-Hau Chang, I-** Tu

    Abstract: Thanks to automated cryo-EM and GPU-accelerated processing, single-particle cryo-EM has become a rapid structure determination method that permits capture of dynamical structures of molecules in solution, which has been recently demonstrated by the determination of COVID-19 spike protein in March, shortly after its breakout in late January 2020. This rapidity is critical for vaccine development in… ▽ More

    Submitted 25 February, 2021; v1 submitted 11 November, 2020; originally announced November 2020.

  36. Cross-Global Attention Graph Kernel Network Prediction of Drug Prescription

    Authors: Hao-Ren Yao, Der-Chen Chang, Ophir Frieder, Wendy Huang, I-Chia Liang, Chi-Feng Hung

    Abstract: We present an end-to-end, interpretable, deep-learning architecture to learn a graph kernel that predicts the outcome of chronic disease drug prescription. This is achieved through a deep metric learning collaborative with a Support Vector Machine objective using a graphical representation of Electronic Health Records. We formulate the predictive model as a binary graph classification problem with… ▽ More

    Submitted 4 August, 2020; originally announced August 2020.

    Comments: ACM-BCB 2020 (Full paper)

    Journal ref: Proceedings of the 11th ACM International Conference on Bioinformatics, Computational Biology and Health Informatics (BCB '20), September 21-24, 2020, Virtual Event, USA

  37. arXiv:2005.02220  [pdf

    cs.CV

    Learning of Art Style Using AI and Its Evaluation Based on Psychological Experiments

    Authors: Mai Cong Hung, Ryohei Nakatsu, Naoko Tosa, Takashi Kusumi, Koji Koyamada

    Abstract: GANs (Generative adversarial networks) is a new AI technology that can perform deep learning with less training data and has the capability of achieving transformation between two image sets. Using GAN we have carried out a comparison between several art sets with different art style. We have prepared several image sets; a flower photo set (A), an art image set (B1) of Impressionism drawings, an a… ▽ More

    Submitted 4 May, 2020; originally announced May 2020.

  38. arXiv:2005.01492  [pdf, other

    physics.soc-ph cs.LG stat.ML

    TRIPDECODER: Study Travel Time Attributes and Route Preferences of Metro Systems from Smart Card Data

    Authors: Xiancai Tian, Baihua Zheng, Yazhe Wang, Hsiao-Ting Huang, Chih-Chieh Hung

    Abstract: In this paper, we target at recovering the exact routes taken by commuters inside a metro system that arenot captured by an Automated Fare Collection (AFC) system and hence remain unknown. We strategicallypropose two inference tasks to handle the recovering, one to infer the travel time of each travel link thatcontributes to the total duration of any trip inside a metro network and the other to in… ▽ More

    Submitted 1 May, 2020; originally announced May 2020.

    ACM Class: I.5.1

  39. arXiv:2003.12175  [pdf, other

    cs.LG cs.SD eess.AS

    Incremental Learning Algorithm for Sound Event Detection

    Authors: Eunjeong Koh, Fatemeh Saki, Yinyi Guo, Cheng-Yu Hung, Erik Visser

    Abstract: This paper presents a new learning strategy for the Sound Event Detection (SED) system to tackle the issues of i) knowledge migration from a pre-trained model to a new target model and ii) learning new sound events without forgetting the previously learned ones without re-training from scratch. In order to migrate the previously learned knowledge from the source model to the target one, a neural a… ▽ More

    Submitted 26 March, 2020; originally announced March 2020.

    Comments: IEEE ICME 2020 Camera Ready Version

    Journal ref: IEEE ICME 2020

  40. arXiv:2003.08854  [pdf, other

    cs.RO cs.CV cs.LG

    Goal-Conditioned End-to-End Visuomotor Control for Versatile Skill Primitives

    Authors: Oliver Groth, Chia-Man Hung, Andrea Vedaldi, Ingmar Posner

    Abstract: Visuomotor control (VMC) is an effective means of achieving basic manipulation tasks such as pushing or pick-and-place from raw images. Conditioning VMC on desired goal states is a promising way of achieving versatile skill primitives. However, common conditioning schemes either rely on task-specific fine tuning - e.g. using one-shot imitation learning (IL) - or on sampling approaches using a forw… ▽ More

    Submitted 24 September, 2021; v1 submitted 19 March, 2020; originally announced March 2020.

    Comments: revised manuscript with additional baselines and generalisation experiments; 11 pages, 8 figures, 7 tables

    ACM Class: I.2.9; I.2.10

  41. arXiv:1910.14540  [pdf, other

    cs.RO

    Team NCTU: Toward AI-Driving for Autonomous Surface Vehicles -- From Duckietown to RobotX

    Authors: Yi-Wei Huang, Tzu-Kuan Chuang, Ni-Ching Lin, Yu-Chieh Hsiao, Pin-Wei Chen, Ching-Tang Hung, Shih-Hsing Liu, Hsiao-Sheng Chen, Ya-Hsiu Hsieh, Ching-Tang Hung, Yen-Hsiang Huang, Yu-Xuan Chen, Kuan-Lin Chen, Ya-Jou Lan, Chao-Chun Hsu, Chun-Yi Lin, Jhih-Ying Li, Jui-Te Huang, Yu-Jen Menn, Sin-Kiat Lim, Kim-Boon Lua, Chia-Hung Dylan Tsai, Chi-Fang Chen, Hsueh-Cheng Wang

    Abstract: Robotic software and hardware systems of autonomous surface vehicles have been developed in transportation, military, and ocean researches for decades. Previous efforts in RobotX Challenges 2014 and 2016 facilitates the developments for important tasks such as obstacle avoidance and docking. Team NCTU is motivated by the AI Driving Olympics (AI-DO) developed by the Duckietown community, and adopts… ▽ More

    Submitted 31 October, 2019; originally announced October 2019.

  42. arXiv:1910.06562  [pdf, other

    cs.LG stat.ML

    Compacting, Picking and Growing for Unforgetting Continual Learning

    Authors: Steven C. Y. Hung, Cheng-Hao Tu, Cheng-En Wu, Chien-Hung Chen, Yi-Ming Chan, Chu-Song Chen

    Abstract: Continual lifelong learning is essential to many applications. In this paper, we propose a simple but effective approach to continual deep learning. Our approach leverages the principles of deep model compression, critical weights selection, and progressive networks expansion. By enforcing their integration in an iterative manner, we introduce an incremental learning method that is scalable to the… ▽ More

    Submitted 30 October, 2019; v1 submitted 15 October, 2019; originally announced October 2019.

    Comments: To appear in NeurIPS 2019

  43. arXiv:1905.08413  [pdf

    cs.CV eess.IV

    Dual-branch residual network for lung nodule segmentation

    Authors: Haichao Cao, Hong Liu, Enmin Song, Chih-Cheng Hung, Guangzhi Ma, Xiangyang Xu, Renchao **, Jianguo Lu

    Abstract: An accurate segmentation of lung nodules in computed tomography (CT) images is critical to lung cancer analysis and diagnosis. However, due to the variety of lung nodules and the similarity of visual characteristics between nodules and their surroundings, a robust segmentation of nodules becomes a challenging problem. In this study, we propose the Dual-branch Residual Network (DB-ResNet) which is… ▽ More

    Submitted 20 May, 2019; originally announced May 2019.

    Comments: 24 pages, 6 figures

  44. arXiv:1905.03445  [pdf

    cs.CV eess.IV

    Two-Stage Convolutional Neural Network Architecture for Lung Nodule Detection

    Authors: Haichao Cao, Hong Liu, Enmin Song, Guangzhi Ma, Xiangyang Xu, Renchao **, Tengying Liu, Chih-Cheng Hung

    Abstract: Early detection of lung cancer is an effective way to improve the survival rate of patients. It is a critical step to have accurate detection of lung nodules in computed tomography (CT) images for the diagnosis of lung cancer. However, due to the heterogeneity of the lung nodules and the complexity of the surrounding environment, robust nodule detection has been a challenging task. In this study,… ▽ More

    Submitted 9 May, 2019; originally announced May 2019.

    Comments: 29 pages, 10 figures

  45. arXiv:1903.07164  [pdf, ps, other

    eess.SP cs.IR math.OC

    Linearly Constrained Smoothing Group Sparsity Solvers in Off-grid Model

    Authors: Cheng-Yu Hung, Mostafa Kaveh

    Abstract: In compressed sensing, the sensing matrix is assumed perfectly known. However, there exists perturbation in the sensing matrix in reality due to sensor offsets or noise disturbance. Directions-of-arrival (DoA) estimation with off-grid effect satisfies this situation, and can be formulated into a (non)convex optimization problem with linear inequalities constraints, which can be solved by the inter… ▽ More

    Submitted 3 June, 2019; v1 submitted 17 March, 2019; originally announced March 2019.

  46. arXiv:1903.07158  [pdf, ps, other

    eess.SP cs.IR math.OC

    Joint Block Low Rank and Sparse Matrix Recovery in Array Self-Calibration Off-Grid DoA Estimation

    Authors: Cheng-Yu Hung, Mostafa Kaveh

    Abstract: This letter addresses the estimation of directions-of-arrival (DoA) by a sensor array using a sparse model in the presence of array calibration errors and off-grid directions. The received signal utilizes previously used models for unknown errors in calibration and structured linear representation of the off-grid effect. A convex optimization problem is formulated with an objective function to pro… ▽ More

    Submitted 3 June, 2019; v1 submitted 17 March, 2019; originally announced March 2019.

  47. arXiv:1902.04043  [pdf, other

    cs.LG cs.MA stat.ML

    The StarCraft Multi-Agent Challenge

    Authors: Mikayel Samvelyan, Tabish Rashid, Christian Schroeder de Witt, Gregory Farquhar, Nantas Nardelli, Tim G. J. Rudner, Chia-Man Hung, Philip H. S. Torr, Jakob Foerster, Shimon Whiteson

    Abstract: In the last few years, deep multi-agent reinforcement learning (RL) has become a highly active area of research. A particularly challenging class of problems in this area is partially observable, cooperative, multi-agent learning, in which teams of agents must learn to coordinate their behaviour while conditioning only on their private observations. This is an attractive research area since such p… ▽ More

    Submitted 9 December, 2019; v1 submitted 11 February, 2019; originally announced February 2019.

  48. arXiv:1810.06721  [pdf, other

    cs.AI cs.LG

    Optimizing Agent Behavior over Long Time Scales by Transporting Value

    Authors: Chia-Chun Hung, Timothy Lillicrap, Josh Abramson, Yan Wu, Mehdi Mirza, Federico Carnevale, Arun Ahuja, Greg Wayne

    Abstract: Humans spend a remarkable fraction of waking life engaged in acts of "mental time travel". We dwell on our actions in the past and experience satisfaction or regret. More than merely autobiographical storytelling, we use these event recollections to change how we will act in similar scenarios in the future. This process endows us with a computationally important ability to link actions and consequ… ▽ More

    Submitted 21 December, 2018; v1 submitted 15 October, 2018; originally announced October 2018.

  49. arXiv:1804.01128  [pdf, other

    cs.AI

    Probing Physics Knowledge Using Tools from Developmental Psychology

    Authors: Luis Piloto, Ari Weinstein, Dhruva TB, Arun Ahuja, Mehdi Mirza, Greg Wayne, David Amos, Chia-chun Hung, Matt Botvinick

    Abstract: In order to build agents with a rich understanding of their environment, one key objective is to endow them with a grasp of intuitive physics; an ability to reason about three-dimensional objects, their dynamic interactions, and responses to forces. While some work on this problem has taken the approach of building in components such as ready-made physics engines, other research aims to extract ge… ▽ More

    Submitted 3 April, 2018; originally announced April 2018.

  50. arXiv:1803.10760  [pdf, other

    cs.LG stat.ML

    Unsupervised Predictive Memory in a Goal-Directed Agent

    Authors: Greg Wayne, Chia-Chun Hung, David Amos, Mehdi Mirza, Arun Ahuja, Agnieszka Grabska-Barwinska, Jack Rae, Piotr Mirowski, Joel Z. Leibo, Adam Santoro, Mevlana Gemici, Malcolm Reynolds, Tim Harley, Josh Abramson, Shakir Mohamed, Danilo Rezende, David Saxton, Adam Cain, Chloe Hillier, David Silver, Koray Kavukcuoglu, Matt Botvinick, Demis Hassabis, Timothy Lillicrap

    Abstract: Animals execute goal-directed behaviours despite the limited range and scope of their sensors. To cope, they explore environments and store memories maintaining estimates of important information that is not presently available. Recently, progress has been made with artificial intelligence (AI) agents that learn to perform tasks from sensory input, even at a human level, by merging reinforcement l… ▽ More

    Submitted 28 March, 2018; originally announced March 2018.