Skip to main content

Showing 1–47 of 47 results for author: Hsu, T

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.12123  [pdf, other

    cs.RO cs.AI cs.LG

    ChatEMG: Synthetic Data Generation to Control a Robotic Hand Orthosis for Stroke

    Authors: **gxi Xu, Runsheng Wang, Siqi Shang, Ava Chen, Lauren Winterbottom, To-Liang Hsu, Wenxi Chen, Khondoker Ahmed, Pedro Leandro La Rotta, Xinyue Zhu, Dawn M. Nilsen, Joel Stein, Matei Ciocarlie

    Abstract: Intent inferral on a hand orthosis for stroke patients is challenging due to the difficulty of data collection from impaired subjects. Additionally, EMG signals exhibit significant variations across different conditions, sessions, and subjects, making it hard for classifiers to generalize. Traditional approaches require a large labeled dataset from the new condition, session, or subject to train i… ▽ More

    Submitted 17 June, 2024; originally announced June 2024.

    Comments: 8 pages

  2. SciCapenter: Supporting Caption Composition for Scientific Figures with Machine-Generated Captions and Ratings

    Authors: Ting-Yao Hsu, Chieh-Yang Huang, Shih-Hong Huang, Ryan Rossi, Sungchul Kim, Tong Yu, C. Lee Giles, Ting-Hao K. Huang

    Abstract: Crafting effective captions for figures is important. Readers heavily depend on these captions to grasp the figure's message. However, despite a well-developed set of AI technologies for figures and captions, these have rarely been tested for usefulness in aiding caption writing. This paper introduces SciCapenter, an interactive system that puts together cutting-edge AI technologies for scientific… ▽ More

    Submitted 26 March, 2024; originally announced March 2024.

    Comments: CHI EA '24: Extended Abstracts of the 2024 CHI Conference on Human Factors in Computing Systems

  3. arXiv:2403.15363  [pdf, other

    eess.SY cs.LG

    Cascading Blackout Severity Prediction with Statistically-Augmented Graph Neural Networks

    Authors: Joe Gorka, Tim Hsu, Wenting Li, Yury Maximov, Line Roald

    Abstract: Higher variability in grid conditions, resulting from growing renewable penetration and increased incidence of extreme weather events, has increased the difficulty of screening for scenarios that may lead to catastrophic cascading failures. Traditional power-flow-based tools for assessing cascading blackout risk are too slow to properly explore the space of possible failures and load/generation pa… ▽ More

    Submitted 22 March, 2024; originally announced March 2024.

    Comments: Accepted to Power Systems Computation Conference (PSCC) 2024

  4. arXiv:2403.03516  [pdf, other

    cs.CL cs.IR

    Unsupervised Multilingual Dense Retrieval via Generative Pseudo Labeling

    Authors: Chao-Wei Huang, Chen-An Li, Tsu-Yuan Hsu, Chen-Yu Hsu, Yun-Nung Chen

    Abstract: Dense retrieval methods have demonstrated promising performance in multilingual information retrieval, where queries and documents can be in different languages. However, dense retrievers typically require a substantial amount of paired data, which poses even greater challenges in multilingual scenarios. This paper introduces UMR, an Unsupervised Multilingual dense Retriever trained without any pa… ▽ More

    Submitted 6 March, 2024; originally announced March 2024.

    Comments: Accepted to Findings of EACL 2024

  5. arXiv:2312.05472  [pdf, other

    cond-mat.mtrl-sci cs.LG

    Spectroscopy-Guided Discovery of Three-Dimensional Structures of Disordered Materials with Diffusion Models

    Authors: Hyuna Kwon, Tim Hsu, Wenyu Sun, Wonseok Jeong, Fikret Aydin, James Chapman, Xiao Chen, Matthew R. Carbone, Deyu Lu, Fei Zhou, Tuan Anh Pham

    Abstract: The ability to rapidly develop materials with desired properties has a transformative impact on a broad range of emerging technologies. In this work, we introduce a new framework based on the diffusion model, a recent generative machine learning method to predict 3D structures of disordered materials from a target property. For demonstration, we apply the model to identify the atomic structures of… ▽ More

    Submitted 9 December, 2023; originally announced December 2023.

  6. arXiv:2310.15405  [pdf, other

    cs.CL

    GPT-4 as an Effective Zero-Shot Evaluator for Scientific Figure Captions

    Authors: Ting-Yao Hsu, Chieh-Yang Huang, Ryan Rossi, Sungchul Kim, C. Lee Giles, Ting-Hao K. Huang

    Abstract: There is growing interest in systems that generate captions for scientific figures. However, assessing these systems output poses a significant challenge. Human evaluation requires academic expertise and is costly, while automatic evaluation depends on often low-quality author-written captions. This paper investigates using large language models (LLMs) as a cost-effective, reference-free method fo… ▽ More

    Submitted 23 October, 2023; originally announced October 2023.

    Comments: To Appear in EMNLP 2023 Findings

  7. arXiv:2310.01678  [pdf, other

    physics.comp-ph cs.LG

    Score dynamics: scaling molecular dynamics with picoseconds timestep via conditional diffusion model

    Authors: Tim Hsu, Babak Sadigh, Vasily Bulatov, Fei Zhou

    Abstract: We propose score dynamics (SD), a general framework for learning accelerated evolution operators with large timesteps from molecular-dynamics simulations. SD is centered around scores, or derivatives of the transition log-probability with respect to the dynamical degrees of freedom. The latter play the same role as force fields in MD but are used in denoising diffusion probability models to genera… ▽ More

    Submitted 6 March, 2024; v1 submitted 2 October, 2023; originally announced October 2023.

  8. arXiv:2309.14324  [pdf, other

    eess.AS cs.CL cs.LG cs.SD

    Towards General-Purpose Text-Instruction-Guided Voice Conversion

    Authors: Chun-Yi Kuan, Chen An Li, Tsu-Yuan Hsu, Tse-Yang Lin, Ho-Lam Chung, Kai-Wei Chang, Shuo-yiin Chang, Hung-yi Lee

    Abstract: This paper introduces a novel voice conversion (VC) model, guided by text instructions such as "articulate slowly with a deep tone" or "speak in a cheerful boyish voice". Unlike traditional methods that rely on reference utterances to determine the attributes of the converted speech, our model adds versatility and specificity to voice conversion. The proposed VC model is a neural codec language mo… ▽ More

    Submitted 16 January, 2024; v1 submitted 25 September, 2023; originally announced September 2023.

    Comments: Accepted to ASRU 2023

  9. arXiv:2309.06748  [pdf, other

    cs.CL cs.IR

    CONVERSER: Few-Shot Conversational Dense Retrieval with Synthetic Data Generation

    Authors: Chao-Wei Huang, Chen-Yu Hsu, Tsu-Yuan Hsu, Chen-An Li, Yun-Nung Chen

    Abstract: Conversational search provides a natural interface for information retrieval (IR). Recent approaches have demonstrated promising results in applying dense retrieval to conversational IR. However, training dense retrievers requires large amounts of in-domain paired data. This hinders the development of conversational dense retrievers, as abundant in-domain conversations are expensive to collect. In… ▽ More

    Submitted 13 September, 2023; originally announced September 2023.

    Comments: Accepted to SIGDIAL 2023

  10. arXiv:2305.07455  [pdf, other

    cs.CL cs.SD eess.AS

    Improving Cascaded Unsupervised Speech Translation with Denoising Back-translation

    Authors: Yu-Kuan Fu, Liang-Hsuan Tseng, Jiatong Shi, Chen-An Li, Tsu-Yuan Hsu, Shinji Watanabe, Hung-yi Lee

    Abstract: Most of the speech translation models heavily rely on parallel data, which is hard to collect especially for low-resource languages. To tackle this issue, we propose to build a cascaded speech translation system without leveraging any kind of paired data. We use fully unpaired data to train our unsupervised systems and evaluate our results on CoVoST 2 and CVSS. The results show that our work is co… ▽ More

    Submitted 12 May, 2023; originally announced May 2023.

  11. arXiv:2302.12757  [pdf, other

    eess.AS cs.CL cs.SD

    Ensemble knowledge distillation of self-supervised speech models

    Authors: Kuan-Po Huang, Tzu-hsun Feng, Yu-Kuan Fu, Tsu-Yuan Hsu, Po-Chieh Yen, Wei-Cheng Tseng, Kai-Wei Chang, Hung-yi Lee

    Abstract: Distilled self-supervised models have shown competitive performance and efficiency in recent years. However, there is a lack of experience in jointly distilling multiple self-supervised speech models. In our work, we performed Ensemble Knowledge Distillation (EKD) on various self-supervised speech models such as HuBERT, RobustHuBERT, and WavLM. We tried two different aggregation techniques, layerw… ▽ More

    Submitted 24 February, 2023; originally announced February 2023.

    Comments: Accepted by ICASSP 2023

  12. arXiv:2302.12324  [pdf, other

    cs.CL

    Summaries as Captions: Generating Figure Captions for Scientific Documents with Automated Text Summarization

    Authors: Chieh-Yang Huang, Ting-Yao Hsu, Ryan Rossi, Ani Nenkova, Sungchul Kim, Gromit Yeuk-Yin Chan, Eunyee Koh, Clyde Lee Giles, Ting-Hao 'Kenneth' Huang

    Abstract: Good figure captions help paper readers understand complex scientific figures. Unfortunately, even published papers often have poorly written captions. Automatic caption generation could aid paper writers by providing good starting captions that can be refined for better quality. Prior work often treated figure caption generation as a vision-to-language task. In this paper, we show that it can be… ▽ More

    Submitted 11 August, 2023; v1 submitted 23 February, 2023; originally announced February 2023.

    Comments: Accepted by INLG-2023

  13. arXiv:2301.07208  [pdf, other

    cs.LO

    Bounded Model Checking for Asynchronous Hyperproperties

    Authors: Tzu-Han Hsu, Borzoo Bonakdarpour, Bernd Finkbeiner, César Sánchez

    Abstract: Many types of attacks on confidentiality stem from the nondeterministic nature of the environment that computer programs operate in (e.g., schedulers and asynchronous communication channels). In this paper, we focus on verification of confidentiality in nondeterministic environments by reasoning about asynchronous hyperproperties. First, we generalize the temporal logic A-HLTL to allow nested traj… ▽ More

    Submitted 25 January, 2023; v1 submitted 17 January, 2023; originally announced January 2023.

    Comments: 34 pages

  14. arXiv:2301.06209  [pdf, other

    cs.LO

    Efficient Loop Conditions for Bounded Model Checking Hyperproperties

    Authors: Tzu-Han Hsu, César Sánchez, Sarai Sheinvald, Borzoo Bonakdarpour

    Abstract: Bounded model checking (BMC) is an effective technique for hunting bugs by incrementally exploring the state space of a system. To reason about infinite traces through a finite structure and to ultimately obtain completeness, BMC incorporates loop conditions that revisit previously observed states. This paper focuses on develo** loop conditions for BMC of HyperLTL- a temporal logic for hyperprop… ▽ More

    Submitted 26 January, 2023; v1 submitted 15 January, 2023; originally announced January 2023.

    Comments: 20 pages

  15. arXiv:2212.02421  [pdf, other

    cond-mat.mtrl-sci cs.LG physics.atom-ph

    Score-based denoising for atomic structure identification

    Authors: Tim Hsu, Babak Sadigh, Nicolas Bertin, Cheol Woo Park, James Chapman, Vasily Bulatov, Fei Zhou

    Abstract: We propose an effective method for removing thermal vibrations that complicate the task of analyzing complex dynamics in atomistic simulation of condensed matter. Our method iteratively subtracts thermal noises or perturbations in atomic positions using a denoising score function trained on synthetically noised but otherwise perfect crystal lattices. The resulting denoised structures clearly revea… ▽ More

    Submitted 3 May, 2023; v1 submitted 5 December, 2022; originally announced December 2022.

  16. arXiv:2211.16044  [pdf, other

    cs.SD cs.CL cs.CR cs.LG eess.AS

    Model Extraction Attack against Self-supervised Speech Models

    Authors: Tsu-Yuan Hsu, Chen-An Li, Tung-Yu Wu, Hung-yi Lee

    Abstract: Self-supervised learning (SSL) speech models generate meaningful representations of given clips and achieve incredible performance across various downstream tasks. Model extraction attack (MEA) often refers to an adversary stealing the functionality of the victim model with only query access. In this work, we study the MEA problem against SSL speech model with a small number of queries. We propose… ▽ More

    Submitted 8 October, 2023; v1 submitted 29 November, 2022; originally announced November 2022.

  17. arXiv:2211.09892  [pdf, other

    cs.CL cs.AI

    Summarizing Community-based Question-Answer Pairs

    Authors: Ting-Yao Hsu, Yoshi Suhara, Xiaolan Wang

    Abstract: Community-based Question Answering (CQA), which allows users to acquire their desired information, has increasingly become an essential component of online services in various domains such as E-commerce, travel, and dining. However, an overwhelming number of CQA pairs makes it difficult for users without particular intent to find useful information spread over CQA pairs. To help users quickly dige… ▽ More

    Submitted 17 November, 2022; originally announced November 2022.

    Comments: To appear in EMNLP 2022 main conference

  18. arXiv:2210.07978  [pdf, other

    cs.SD cs.CL eess.AS

    Improving generalizability of distilled self-supervised speech processing models under distorted settings

    Authors: Kuan-Po Huang, Yu-Kuan Fu, Tsu-Yuan Hsu, Fabian Ritter Gutierrez, Fan-Lin Wang, Liang-Hsuan Tseng, Yu Zhang, Hung-yi Lee

    Abstract: Self-supervised learned (SSL) speech pre-trained models perform well across various speech processing tasks. Distilled versions of SSL models have been developed to match the needs of on-device speech applications. Though having similar performance as original SSL models, distilled counterparts suffer from performance degradation even more than their original versions in distorted environments. Th… ▽ More

    Submitted 20 October, 2022; v1 submitted 14 October, 2022; originally announced October 2022.

    Comments: Accepted by IEEE SLT2022

  19. arXiv:2209.12900  [pdf, other

    cs.SD cs.AI cs.CL eess.AS

    The Efficacy of Self-Supervised Speech Models for Audio Representations

    Authors: Tung-Yu Wu, Chen-An Li, Tzu-Han Lin, Tsu-Yuan Hsu, Hung-Yi Lee

    Abstract: Self-supervised learning (SSL) speech models, which can serve as powerful upstream models to extract meaningful speech representations, have achieved unprecedented success in speech representation learning. However, their effectiveness on non-speech datasets is relatively less explored. In this work, we propose an ensemble framework, with a combination of ensemble techniques, to fuse SSL speech mo… ▽ More

    Submitted 31 January, 2023; v1 submitted 26 September, 2022; originally announced September 2022.

    Comments: to appear in Proceedings of Machine Learning Research (PMLR): NeurIPS 2021 Competition Track

  20. arXiv:2202.13033  [pdf

    cs.SI math.PR

    A New BAT and PageRank algorithm for Propagation Probability in Social Networks

    Authors: WC Yeh, CL Huang, TY Hsu, Z Liu, SY Tan

    Abstract: Social networks have increasingly become important and popular in modern times. Moreover, the influence of social networks plays a vital role in various organizations including government organizations, academic research or corporate organizations. Therefore, how to strategize the optimal propagation strategy in social networks has also become more important. By increasing the precision of evaluat… ▽ More

    Submitted 25 February, 2022; originally announced February 2022.

  21. arXiv:2201.12666  [pdf, other

    cs.LG cs.CR cs.IR

    Challenges and approaches to privacy preserving post-click conversion prediction

    Authors: Conor O'Brien, Arvind Thiagarajan, Sourav Das, Rafael Barreto, Chetan Verma, Tim Hsu, James Neufield, Jonathan J Hunt

    Abstract: Online advertising has typically been more personalized than offline advertising, through the use of machine learning models and real-time auctions for ad targeting. One specific task, predicting the likelihood of conversion (i.e.\ the probability a user will purchase the advertised product), is crucial to the advertising ecosystem for both targeting and pricing ads. Currently, these models are of… ▽ More

    Submitted 29 January, 2022; originally announced January 2022.

  22. arXiv:2110.11624  [pdf, other

    cs.CL cs.AI cs.CV

    SciCap: Generating Captions for Scientific Figures

    Authors: Ting-Yao Hsu, C. Lee Giles, Ting-Hao 'Kenneth' Huang

    Abstract: Researchers use figures to communicate rich, complex information in scientific papers. The captions of these figures are critical to conveying effective messages. However, low-quality figure captions commonly occur in scientific articles and may decrease understanding. In this paper, we propose an end-to-end neural framework to automatically generate informative, high-quality captions for scientif… ▽ More

    Submitted 25 October, 2021; v1 submitted 22 October, 2021; originally announced October 2021.

    Comments: To Appear in EMNLP 2021 Findings. The dataset is available at: https://github.com/tingyaohsu/SciCap

  23. arXiv:2110.05664  [pdf

    eess.IV cs.AI cs.CV physics.med-ph

    Accurate and Generalizable Quantitative Scoring of Liver Steatosis from Ultrasound Images via Scalable Deep Learning

    Authors: Bowen Li, Dar-In Tai, Ke Yan, Yi-Cheng Chen, Shiu-Feng Huang, Tse-Hwa Hsu, Wan-Ting Yu, **g Xiao, Le Lu, Adam P. Harrison

    Abstract: Background & Aims: Hepatic steatosis is a major cause of chronic liver disease. 2D ultrasound is the most widely used non-invasive tool for screening and monitoring, but associated diagnoses are highly subjective. We developed a scalable deep learning (DL) algorithm for quantitative scoring of liver steatosis from 2D ultrasound images. Approach & Results: Using retrospectively collected multi-vi… ▽ More

    Submitted 11 October, 2021; originally announced October 2021.

    Comments: Journal paper submission, 45 pages (main body: 28 pages, supplementary material: 17 pages)

  24. arXiv:2109.12989  [pdf, other

    cs.LO cs.CR

    HyperQB: A QBF-Based Bounded Model Checker for Hyperproperties

    Authors: Tzu-Han Hsu, Borzoo Bonakdarpour, César Sánchez

    Abstract: We present HyperQB, a push-button QBF-based bounded model checker for hyperproperties. HyperQB takes as input a NuSMV model and a formula expressed in the temporal logic HyperLTL. Our QBF-based technique allows HyperQB to seamlessly deal with quantifier alternations. Based on the selection of either bug hunting or synthesis, the instances of counterexamples (for negated formula) or witnesses (for… ▽ More

    Submitted 24 January, 2024; v1 submitted 21 September, 2021; originally announced September 2021.

  25. arXiv:2109.11576  [pdf, other

    cs.LG physics.comp-ph

    Efficient, Interpretable Graph Neural Network Representation for Angle-dependent Properties and its Application to Optical Spectroscopy

    Authors: Tim Hsu, Tuan Anh Pham, Nathan Keilbart, Stephen Weitzner, James Chapman, Penghao Xiao, S. Roger Qiu, Xiao Chen, Brandon C. Wood

    Abstract: Graph neural networks are attractive for learning properties of atomic structures thanks to their intuitive graph encoding of atoms and bonds. However, conventional encoding does not include angular information, which is critical for describing atomic arrangements in disordered systems. In this work, we extend the recently proposed ALIGNN encoding, which incorporates bond angles, to also include d… ▽ More

    Submitted 15 February, 2022; v1 submitted 23 September, 2021; originally announced September 2021.

  26. arXiv:2103.08290  [pdf, other

    cs.CV cs.LG

    DeepOPG: Improving Orthopantomogram Finding Summarization with Weak Supervision

    Authors: Tzu-Ming Harry Hsu, Yin-Chih Chelsea Wang

    Abstract: Clinical finding summaries from an orthopantomogram, or a dental panoramic radiograph, have significant potential to improve patient communication and speed up clinical judgments. While orthopantomogram is a first-line tool for dental examinations, no existing work has explored the summarization of findings from it. A finding summary has to find teeth in the imaging study and label the teeth with… ▽ More

    Submitted 6 July, 2021; v1 submitted 15 March, 2021; originally announced March 2021.

  27. arXiv:2010.10938  [pdf, other

    cs.CL cs.LG

    What makes multilingual BERT multilingual?

    Authors: Chi-Liang Liu, Tsung-Yuan Hsu, Yung-Sung Chuang, Hung-yi Lee

    Abstract: Recently, multilingual BERT works remarkably well on cross-lingual transfer tasks, superior to static non-contextualized word embeddings. In this work, we provide an in-depth experimental study to supplement the existing literature of cross-lingual ability. We compare the cross-lingual ability of non-contextualized and contextualized representation model with the same data. We found that datasize… ▽ More

    Submitted 20 October, 2020; originally announced October 2020.

    Comments: arXiv admin note: substantial text overlap with arXiv:2004.09205

  28. arXiv:2010.10041  [pdf, other

    cs.CL cs.AI

    Looking for Clues of Language in Multilingual BERT to Improve Cross-lingual Generalization

    Authors: Chi-Liang Liu, Tsung-Yuan Hsu, Yung-Sung Chuang, Chung-Yi Li, Hung-yi Lee

    Abstract: Token embeddings in multilingual BERT (m-BERT) contain both language and semantic information. We find that the representation of a language can be obtained by simply averaging the embeddings of the tokens of the language. Given this language representation, we control the output languages of multilingual BERT by manipulating the token embeddings, thus achieving unsupervised token translation. We… ▽ More

    Submitted 1 November, 2021; v1 submitted 20 October, 2020; originally announced October 2020.

    Comments: preprint

  29. arXiv:2009.08907  [pdf, other

    cs.FL cs.CR

    Bounded Model Checking for Hyperproperties

    Authors: Tzu-Han Hsu, Cesar Sanchez, Borzoo Bonakdarpour

    Abstract: Hyperproperties are properties of systems that relate multiple computation traces, including security and concurrency properties. This paper introduces a bounded model checking (BMC) algorithm for hyperproperties expressed in HyperLTL, which - to the best of our knowledge - is the first such algorithm. Just as the classic BMC technique for LTL primarily aims at finding bugs, our approach also targ… ▽ More

    Submitted 15 October, 2020; v1 submitted 18 September, 2020; originally announced September 2020.

  30. arXiv:2007.07196  [pdf, other

    cs.CL cs.AI

    Investigation of Sentiment Controllable Chatbot

    Authors: Hung-yi Lee, Cheng-Hao Ho, Chien-Fu Lin, Chiung-Chih Chang, Chih-Wei Lee, Yau-Shian Wang, Tsung-Yuan Hsu, Kuan-Yu Chen

    Abstract: Conventional seq2seq chatbot models attempt only to find sentences with the highest probabilities conditioned on the input sequences, without considering the sentiment of the output sentences. In this paper, we investigate four models to scale or adjust the sentiment of the chatbot response: a persona-based model, reinforcement learning, a plug and play model, and CycleGAN, all based on the seq2se… ▽ More

    Submitted 11 July, 2020; originally announced July 2020.

    Comments: arXiv admin note: text overlap with arXiv:1804.02504

  31. arXiv:2006.15229  [pdf, other

    cs.LG stat.ML

    CheXpert++: Approximating the CheXpert labeler for Speed,Differentiability, and Probabilistic Output

    Authors: Matthew B. A. McDermott, Tzu Ming Harry Hsu, Wei-Hung Weng, Marzyeh Ghassemi, Peter Szolovits

    Abstract: It is often infeasible or impossible to obtain ground truth labels for medical data. To circumvent this, one may build rule-based or other expert-knowledge driven labelers to ingest data and yield silver labels absent any ground-truth training data. One popular such labeler is CheXpert, a labeler that produces diagnostic labels for chest X-ray radiology reports. CheXpert is very useful, but is rel… ▽ More

    Submitted 26 June, 2020; originally announced June 2020.

    Comments: To appear at MLHC 2020

  32. arXiv:2006.13886  [pdf, other

    eess.IV cond-mat.mtrl-sci cs.CV

    Microstructure Generation via Generative Adversarial Network for Heterogeneous, Topologically Complex 3D Materials

    Authors: Tim Hsu, William K. Epting, Hokon Kim, Harry W. Abernathy, Gregory A. Hackett, Anthony D. Rollett, Paul A. Salvador, Elizabeth A. Holm

    Abstract: Using a large-scale, experimentally captured 3D microstructure dataset, we implement the generative adversarial network (GAN) framework to learn and generate 3D microstructures of solid oxide fuel cell electrodes. The generated microstructures are visually, statistically, and topologically realistic, with distributions of microstructural parameters, including volume fraction, particle size, surfac… ▽ More

    Submitted 22 June, 2020; originally announced June 2020.

    Comments: submitted to JOM

  33. arXiv:2004.09205  [pdf, other

    cs.CL

    A Study of Cross-Lingual Ability and Language-specific Information in Multilingual BERT

    Authors: Chi-Liang Liu, Tsung-Yuan Hsu, Yung-Sung Chuang, Hung-Yi Lee

    Abstract: Recently, multilingual BERT works remarkably well on cross-lingual transfer tasks, superior to static non-contextualized word embeddings. In this work, we provide an in-depth experimental study to supplement the existing literature of cross-lingual ability. We compare the cross-lingual ability of non-contextualized and contextualized representation model with the same data. We found that datasize… ▽ More

    Submitted 20 April, 2020; originally announced April 2020.

  34. arXiv:2003.08082  [pdf, other

    cs.LG cs.CV stat.ML

    Federated Visual Classification with Real-World Data Distribution

    Authors: Tzu-Ming Harry Hsu, Hang Qi, Matthew Brown

    Abstract: Federated Learning enables visual models to be trained on-device, bringing advantages for user privacy (data need never leave the device), but challenges in terms of data diversity and quality. Whilst typical models in the datacenter are trained using data that are independent and identically distributed (IID), data at source are typically far from IID. Furthermore, differing quantities of data ar… ▽ More

    Submitted 17 July, 2020; v1 submitted 18 March, 2020; originally announced March 2020.

  35. arXiv:2001.07672  [pdf, other

    cs.DS

    Streaming Complexity of Spanning Tree Computation

    Authors: Yi-Jun Chang, Martin Farach-Colton, Tsan-Sheng Hsu, Meng-Tsung Tsai

    Abstract: The semi-streaming model is a variant of the streaming model frequently used for the computation of graph problems. It allows the edges of an $n$-node input graph to be read sequentially in $p$ passes using $\tilde{O}(n)$ space. In this model, some graph problems, such as spanning trees and $k$-connectivity, can be exactly solved in a single pass; while other graph problems, such as triangle detec… ▽ More

    Submitted 21 January, 2020; originally announced January 2020.

    Comments: This is the full version of a conference paper to appear in the Proceedings of 37th International Symposium on Theoretical Aspects of Computer Science (STACS)

    ACM Class: F.2

  36. arXiv:1909.09587  [pdf, ps, other

    cs.CL cs.LG stat.ML

    Zero-shot Reading Comprehension by Cross-lingual Transfer Learning with Multi-lingual Language Representation Model

    Authors: Tsung-yuan Hsu, Chi-liang Liu, Hung-yi Lee

    Abstract: Because it is not feasible to collect training data for every language, there is a growing interest in cross-lingual transfer learning. In this paper, we systematically explore zero-shot cross-lingual transfer learning on reading comprehension tasks with a language representation model pre-trained on multi-lingual corpus. The experimental results show that with pre-trained language representation… ▽ More

    Submitted 15 September, 2019; originally announced September 2019.

  37. arXiv:1909.06335  [pdf, other

    cs.LG cs.CV stat.ML

    Measuring the Effects of Non-Identical Data Distribution for Federated Visual Classification

    Authors: Tzu-Ming Harry Hsu, Hang Qi, Matthew Brown

    Abstract: Federated Learning enables visual models to be trained in a privacy-preserving way using real-world data from mobile devices. Given their distributed nature, the statistics of the data across these devices is likely to differ significantly. In this work, we look at the effect such non-identical data distributions has on visual classification via Federated Learning. We propose a way to synthesize d… ▽ More

    Submitted 13 September, 2019; originally announced September 2019.

  38. arXiv:1907.12888  [pdf, other

    cs.CV cs.LG cs.MM

    CoachAI: A Project for Microscopic Badminton Match Data Collection and Tactical Analysis

    Authors: Tzu-Han Hsu, Ching-Hsuan Chen, Nyan ** Ju, Tsì-Uí İk, Wen-Chih Peng, Chih-Chuan Wang, Yu-Shuen Wang, Yuan-Hsiang Lin, Yu-Chee Tseng, Jiun-Long Huang, Yu-Tai Ching

    Abstract: Computer vision based object tracking has been used to annotate and augment sports video. For sports learning and training, video replay is often used in post-match review and training review for tactical analysis and movement analysis. For automatically and systematically competition data collection and tactical analysis, a project called CoachAI has been supported by the Ministry of Science and… ▽ More

    Submitted 12 July, 2019; originally announced July 2019.

  39. arXiv:1906.01764  [pdf, other

    cs.CL cs.AI cs.HC

    Visual Story Post-Editing

    Authors: Ting-Yao Hsu, Chieh-Yang Huang, Yen-Chia Hsu, Ting-Hao 'Kenneth' Huang

    Abstract: We introduce the first dataset for human edits of machine-generated visual stories and explore how these collected edits may be used for the visual story post-editing task. The dataset, VIST-Edit, includes 14,905 human edited versions of 2,981 machine-generated visual stories. The stories were generated by two state-of-the-art visual storytelling models, each aligned to 5 human-edited versions. We… ▽ More

    Submitted 4 June, 2019; originally announced June 2019.

    Comments: Accepted by ACL 2019

  40. arXiv:1904.02633  [pdf, other

    cs.CV cs.CL

    Clinically Accurate Chest X-Ray Report Generation

    Authors: Guanxiong Liu, Tzu-Ming Harry Hsu, Matthew McDermott, Willie Boag, Wei-Hung Weng, Peter Szolovits, Marzyeh Ghassemi

    Abstract: The automatic generation of radiology reports given medical radiographs has significant potential to operationally and improve clinical patient care. A number of prior works have focused on this problem, employing advanced methods from computer vision and natural language generation to produce readable reports. However, these works often fail to account for the particular nuances of the radiology… ▽ More

    Submitted 29 July, 2019; v1 submitted 4 April, 2019; originally announced April 2019.

  41. arXiv:1902.08327  [pdf, other

    cs.HC cs.CL

    On How Users Edit Computer-Generated Visual Stories

    Authors: Ting-Yao Hsu, Yen-Chia Hsu, Ting-Hao 'Kenneth' Huang

    Abstract: A significant body of research in Artificial Intelligence (AI) has focused on generating stories automatically, either based on prior story plots or input images. However, literature has little to say about how users would receive and use these stories. Given the quality of stories generated by modern AI algorithms, users will nearly inevitably have to edit these stories before putting them to rea… ▽ More

    Submitted 8 March, 2019; v1 submitted 21 February, 2019; originally announced February 2019.

    Comments: To appear in CHI'19 Late-Breaking Work on Human Factors in Computing Systems (CHI LBW 2019), 2019

  42. arXiv:1811.08615  [pdf, other

    cs.LG cs.CL

    Unsupervised Multimodal Representation Learning across Medical Images and Reports

    Authors: Tzu-Ming Harry Hsu, Wei-Hung Weng, Willie Boag, Matthew McDermott, Peter Szolovits

    Abstract: Joint embeddings between medical imaging modalities and associated radiology reports have the potential to offer significant benefits to the clinical community, ranging from cross-domain retrieval to conditional generation of reports to the broader goals of multimodal representation learning. In this work, we establish baseline joint embedding results measured via both local and global retrieval m… ▽ More

    Submitted 21 November, 2018; originally announced November 2018.

    Comments: Machine Learning for Health (ML4H) Workshop at NeurIPS 2018 arXiv:1811.07216

    Report number: ML4H/2018/215

  43. arXiv:1808.09351  [pdf, other

    cs.CV cs.GR eess.IV

    3D-Aware Scene Manipulation via Inverse Graphics

    Authors: Shunyu Yao, Tzu Ming Harry Hsu, Jun-Yan Zhu, Jiajun Wu, Antonio Torralba, William T. Freeman, Joshua B. Tenenbaum

    Abstract: We aim to obtain an interpretable, expressive, and disentangled scene representation that contains comprehensive structural and textural information for each object. Previous scene representations learned by neural networks are often uninterpretable, limited to a single object, or lacking 3D knowledge. In this work, we propose 3D scene de-rendering networks (3D-SDN) to address the above issues by… ▽ More

    Submitted 18 December, 2018; v1 submitted 28 August, 2018; originally announced August 2018.

    Comments: NeurIPS 2018. Code: https://github.com/ysymyth/3D-SDN Website: http://3dsdn.csail.mit.edu/

  44. arXiv:1805.07027  [pdf, other

    cs.IT

    Efficient Downlink Channel Reconstruction for FDD Multi-Antenna Systems

    Authors: Yu Han, Tien-Hao Hsu, Chao-Kai Wen, Kai-Kit Wong, Shi **

    Abstract: In this paper, we propose an efficient downlink channel reconstruction scheme for a frequency-division-duplex multi-antenna system by utilizing uplink channel state information combined with limited feedback. Based on the spatial reciprocity in a wireless channel, the downlink channel is reconstructed by using frequency-independent parameters. We first estimate the gains, delays, and angles during… ▽ More

    Submitted 17 May, 2018; originally announced May 2018.

  45. arXiv:1804.02504  [pdf, other

    cs.CL

    Scalable Sentiment for Sequence-to-sequence Chatbot Response with Performance Analysis

    Authors: Chih-Wei Lee, Yau-Shian Wang, Tsung-Yuan Hsu, Kuan-Yu Chen, Hung-Yi Lee, Lin-shan Lee

    Abstract: Conventional seq2seq chatbot models only try to find the sentences with the highest probabilities conditioned on the input sequences, without considering the sentiment of the output sentences. Some research works trying to modify the sentiment of the output sequences were reported. In this paper, we propose five models to scale or adjust the sentiment of the chatbot response: persona-based model,… ▽ More

    Submitted 6 April, 2018; originally announced April 2018.

  46. Optimized Random Deployment of Energy Harvesting Sensors for Field Reconstruction in Analog and Digital Forwarding Systems

    Authors: Teng-Cheng Hsu, Y. -W. Peter Hong, Tsang-Yi Wang

    Abstract: This work examines the large-scale deployment of energy harvesting sensors for the purpose of sensing and reconstruction of a spatially correlated Gaussian random field. The sensors are powered solely by energy harvested from the environment and are deployed randomly according to a spatially nonhomogeneous Poisson point process whose density depends on the energy arrival statistics at different lo… ▽ More

    Submitted 19 May, 2015; originally announced May 2015.

  47. arXiv:cs/0102009  [pdf, ps, other

    cs.DS cs.DM

    Optimal Augmentation for Bipartite Componentwise Biconnectivity in Linear Time

    Authors: Tsan-sheng Hsu, Ming-Yang Kao

    Abstract: A graph is componentwise biconnected if every connected component either is an isolated vertex or is biconnected. We present a linear-time algorithm for the problem of adding the smallest number of edges to make a bipartite graph componentwise biconnected while preserving its bipartiteness. This algorithm has immediate applications for protecting sensitive information in statistical tables.

    Submitted 9 February, 2001; originally announced February 2001.

    Comments: A preliminary version appeared in T. Asano, Y. Igarashi, H. Nagamochi, S. Miyano, and S. Suri, editors, Lecture Notes in Computer Science 1178: Proceedings of the 7th Annual International Symposium on Algorithms and Computation, pages 213--222. Springer-Verlag, New York, NY, 1996

    ACM Class: F.2.2; G.2.2