Showing 1–50 of 56 results for author: Hsieh, Y

Searching in archive cs.
  1. arXiv:2407.00031  [pdf, other]

    cs.DC cs.SE

    Supercharging Federated Learning with Flower and NVIDIA FLARE

    Authors: Holger R. Roth, Daniel J. Beutel, Yan Cheng, Javier Fernandez Marques, Heng Pan, Chester Chen, Zhihong Zhang, Yuhong Wen, Sean Yang, Isaac Yang, Yuan-Ting Hsieh, Ziyue Xu, Daguang Xu, Nicholas D. Lane, Andrew Feng

    Abstract: Several open-source systems, such as Flower and NVIDIA FLARE, have been developed in recent years while focusing on different aspects of federated learning (FL). Flower is dedicated to implementing a cohesive approach to FL, analytics, and evaluation. Over time, Flower has cultivated extensive strategies and algorithms tailored for FL application development, fostering a vibrant FL community in re…

    Submitted 21 May, 2024; originally announced July 2024.

  2. arXiv:2405.16557  [pdf, other]

    cs.LG cs.AI

    Scalable Numerical Embeddings for Multivariate Time Series: Enhancing Healthcare Data Representation Learning

    Authors: Chun-Kai Huang, Yi-Hsien Hsieh, Ta-Jung Chien, Li-Cheng Chien, Shao-Hua Sun, Tung-Hung Su, Jia-Horng Kao, Che Lin

    Abstract: Multivariate time series (MTS) data, when sampled irregularly and asynchronously, often present extensive missing values. Conventional methodologies for MTS analysis tend to rely on temporal embeddings based on timestamps that necessitate subsequent imputations, yet these imputed values frequently deviate substantially from their actual counterparts, thereby compromising prediction accuracy. Furth…

    Submitted 26 May, 2024; originally announced May 2024.

  3. arXiv:2403.02363  [pdf, other]

    cs.LG cs.AI

    Addressing Long-Tail Noisy Label Learning Problems: a Two-Stage Solution with Label Refurbishment Considering Label Rarity

    Authors: Ying-Hsuan Wu, Jun-Wei Hsieh, Li Xin, Shin-You Teng, Yi-Kuan Hsieh, Ming-Ching Chang

    Abstract: Real-world datasets commonly exhibit noisy labels and class imbalance, such as long-tailed distributions. While previous research addresses this issue by differentiating noisy and clean samples, reliance on information from predictions based on noisy long-tailed data introduces potential errors. To overcome the limitations of prior works, we introduce an effective two-stage approach by combining s…

    Submitted 4 March, 2024; originally announced March 2024.

  4. arXiv:2402.07792  [pdf, other]

    cs.LG cs.DC

    Empowering Federated Learning for Massive Models with NVIDIA FLARE

    Authors: Holger R. Roth, Ziyue Xu, Yuan-Ting Hsieh, Adithya Renduchintala, Isaac Yang, Zhihong Zhang, Yuhong Wen, Sean Yang, Kevin Lu, Kristopher Kersten, Camir Ricketts, Daguang Xu, Chester Chen, Yan Cheng, Andrew Feng

    Abstract: In the ever-evolving landscape of artificial intelligence (AI) and large language models (LLMs), handling and leveraging data effectively has become a critical challenge. Most state-of-the-art machine learning algorithms are data-centric. However, as the lifeblood of model performance, necessary data cannot always be centralized due to various factors such as privacy, regulation, geopolitics, copy…

    Submitted 12 February, 2024; originally announced February 2024.

  5. arXiv:2402.02998  [pdf, other]

    cs.LG stat.ML

    Careful with that Scalpel: Improving Gradient Surgery with an EMA

    Authors: Yu-Guan Hsieh, James Thornton, Eugene Ndiaye, Michal Klein, Marco Cuturi, Pierre Ablin

    Abstract: Beyond minimizing a single training loss, many deep learning estimation pipelines rely on an auxiliary objective to quantify and encourage desirable properties of the model (e.g. performance on another dataset, robustness, agreement with a prior). Although the simplest approach to incorporating an auxiliary loss is to sum it with the training loss as a regularizer, recent works have shown that one…

    Submitted 5 February, 2024; originally announced February 2024.

  6. arXiv:2312.16771  [pdf, other]

    cs.CV

    Scale-Aware Crowd Count Network with Annotation Error Correction

    Authors: Yi-Kuan Hsieh, Jun-Wei Hsieh, Yu-Chee Tseng, Ming-Ching Chang, Li Xin

    Abstract: Traditional crowd counting networks suffer from information loss when feature maps are downsized through pooling layers, leading to inaccuracies in counting crowds at a distance. Existing methods often assume correct annotations during training, disregarding the impact of noisy annotations, especially in crowded scenes. Furthermore, the use of a fixed Gaussian kernel fails to account for the varyi…

    Submitted 27 December, 2023; originally announced December 2023.

    Comments: 7 pages, 6 figures. arXiv admin note: text overlap with arXiv:2211.06835

  7. arXiv:2312.02213  [pdf, other]

    cs.LG cs.AI cs.DB stat.AP

    JarviX: A LLM No code Platform for Tabular Data Analysis and Optimization

    Authors: Shang-Ching Liu, ShengKun Wang, Wenqi Lin, Chung-Wei Hsiung, Yi-Chen Hsieh, Yu-Ping Cheng, Sian-Hong Luo, Tsungyao Chang, Jianwei Zhang

    Abstract: In this study, we introduce JarviX, a sophisticated data analytics framework. JarviX is designed to employ Large Language Models (LLMs) to facilitate an automated guide and execute high-precision data analyses on tabular datasets. This framework emphasizes the significance of varying column types, capitalizing on state-of-the-art LLMs to generate concise data insight summaries, propose relevant an…

    Submitted 3 December, 2023; originally announced December 2023.

  8. arXiv:2311.16706  [pdf, ps, other]

    cs.LG math.PR stat.ML

    Sinkhorn Flow: A Continuous-Time Framework for Understanding and Generalizing the Sinkhorn Algorithm

    Authors: Mohammad Reza Karimi, Ya-Ping Hsieh, Andreas Krause

    Abstract: Many problems in machine learning can be formulated as solving entropy-regularized optimal transport on the space of probability measures. The canonical approach involves the Sinkhorn iterates, renowned for their rich mathematical properties. Recently, the Sinkhorn algorithm has been recast within the mirror descent framework, thus benefiting from classical optimization theory insights. Here, we b…

    Submitted 28 November, 2023; originally announced November 2023.

  9. arXiv:2311.02374  [pdf, other]

    math.OC cs.LG

    Riemannian stochastic optimization methods avoid strict saddle points

    Authors: Ya-Ping Hsieh, Mohammad Reza Karimi, Andreas Krause, Panayotis Mertikopoulos

    Abstract: Many modern machine learning applications - from online principal component analysis to covariance matrix identification and dictionary learning - can be formulated as minimization problems on Riemannian manifolds, and are typically solved with a Riemannian stochastic gradient method (or some variant thereof). However, in many cases of interest, the resulting minimization problem is not geodesical…

    Submitted 4 November, 2023; originally announced November 2023.

    Comments: 27 pages, 3 figures

    MSC Class: Primary 62L20; 37N40; secondary 90C15; 90C48

  10. A Versatile Data Fabric for Advanced IoT-Based Remote Health Monitoring

    Authors: Italo Buleje, Vince S. Siu, Kuan Yu Hsieh, Nigel Hinds, Bing Dang, Erhan Bilal, Thanhnha Nguyen, Ellen E. Lee, Colin A. Depp, Jeffrey L. Rogers

    Abstract: This paper presents a data-centric and security-focused data fabric designed for digital health applications. With the increasing interest in digital health research, there has been a surge in the volume of Internet of Things (IoT) data derived from smartphones, wearables, and ambient sensors. Managing this vast amount of data, encompassing diverse data types and varying time scales, is crucial. M…

    Submitted 2 October, 2023; originally announced October 2023.

    Journal ref: 2023 IEEE International Conference on Digital Health (ICDH), Chicago, IL, USA, 2023, pp. 88-90

  11. arXiv:2309.14859  [pdf, other]

    cs.CV cs.AI cs.GR cs.LG

    Navigating Text-To-Image Customization: From LyCORIS Fine-Tuning to Model Evaluation

    Authors: Shih-Ying Yeh, Yu-Guan Hsieh, Zhidong Gao, Bernard B W Yang, Giyeong Oh, Yanmin Gong

    Abstract: Text-to-image generative models have garnered immense attention for their ability to produce high-fidelity images from text prompts. Among these, Stable Diffusion distinguishes itself as a leading open-source model in this fast-growing field. However, the intricacies of fine-tuning these models pose multiple challenges from new methodology integration to systematic evaluation. Addressing these iss…

    Submitted 11 March, 2024; v1 submitted 26 September, 2023; originally announced September 2023.

    Comments: In International Conference on Learning Representations 12 (ICLR 2024) [79 pages, 54 figures, 7 tables]

  12. arXiv:2309.09514  [pdf, other]

    cs.CV

    PanoMixSwap Panorama Mixing via Structural Swapping for Indoor Scene Understanding

    Authors: Yu-Cheng Hsieh, Cheng Sun, Suraj Dengale, Min Sun

    Abstract: The volume and diversity of training data are critical for modern deep learning-based methods. Compared to the massive amount of labeled perspective images, 360 panoramic images fall short in both volume and diversity. In this paper, we propose PanoMixSwap, a novel data augmentation technique specifically designed for indoor panoramic images. PanoMixSwap explicitly mixes various background styles,…

    Submitted 27 September, 2023; v1 submitted 18 September, 2023; originally announced September 2023.

    Comments: BMVC'23; project page: https://yuchenghsieh.github.io/PanoMixSwap

  13. arXiv:2306.09099  [pdf, other]

    cs.LG

    Unbalanced Diffusion Schrödinger Bridge

    Authors: Matteo Pariset, Ya-Ping Hsieh, Charlotte Bunne, Andreas Krause, Valentin De Bortoli

    Abstract: Schrödinger bridges (SBs) provide an elegant framework for modeling the temporal evolution of populations in physical, chemical, or biological systems. Such natural processes are commonly subject to changes in population size over time due to the emergence of new species or birth and death events. However, existing neural parameterizations of SBs such as diffusion Schrödinger bridges (DSBs) are re…

    Submitted 15 June, 2023; originally announced June 2023.

  14. arXiv:2305.12444  [pdf, other]

    quant-ph cs.CC

    On the Impossibility of General Parallel Fast-forwarding of Hamiltonian Simulation

    Authors: Nai-Hui Chia, Kai-Min Chung, Yao-Ching Hsieh, Han-Hsuan Lin, Yao-Ting Lin, Yu-Ching Shen

    Abstract: Hamiltonian simulation is one of the most important problems in the field of quantum computing. There have been extended efforts on designing algorithms for faster simulation, and the evolution time $T$ for the simulation turns out to largely affect algorithm runtime. While there are some specific types of Hamiltonians that can be fast-forwarded, i.e., simulated within time $o(T)$, for large enoug…

    Submitted 21 May, 2023; originally announced May 2023.

    Comments: 44 pages, 7 figures

  15. arXiv:2302.11419  [pdf, other]

    cs.LG q-bio.QM

    Aligned Diffusion Schrödinger Bridges

    Authors: Vignesh Ram Somnath, Matteo Pariset, Ya-Ping Hsieh, Maria Rodriguez Martinez, Andreas Krause, Charlotte Bunne

    Abstract: Diffusion Schrödinger bridges (DSB) have recently emerged as a powerful framework for recovering stochastic dynamics via their marginal observations at different time points. Despite numerous successful applications, existing algorithms for solving DSBs have so far failed to utilize the structure of aligned data, which naturally arises in many biological phenomena. In this paper, we propose a nove…

    Submitted 28 April, 2024; v1 submitted 22 February, 2023; originally announced February 2023.

  16. arXiv:2302.05831  [pdf, ps, other]

    econ.TH cs.SI

    On the Difficulty of Characterizing Network Formation with Endogenous Behavior

    Authors: Benjamin Golub, Yu-Chi Hsieh, Evan Sadler

    Abstract: Bolletta (2021, Math. Soc. Sci. 114:1-10) studies a model in which a network is strategically formed and then agents play a linear best-response investment game in it. The model is motivated by an application in which people choose both their study partners and their levels of educational effort. Agents have different one-dimensional types – private returns to effort. A main result…

    Submitted 22 February, 2023; v1 submitted 11 February, 2023; originally announced February 2023.

  17. arXiv:2301.05182  [pdf, other]

    cs.LG cs.AI stat.ML

    Thompson Sampling with Diffusion Generative Prior

    Authors: Yu-Guan Hsieh, Shiva Prasad Kasiviswanathan, Branislav Kveton, Patrick Blöbaum

    Abstract: In this work, we initiate the idea of using denoising diffusion models to learn priors for online decision making problems. Our special focus is on the meta-learning for bandit framework, with the goal of learning a strategy that performs well across bandit tasks of a same class. To this end, we train a diffusion model that learns the underlying task distribution and combine Thompson sampling with…

    Submitted 30 January, 2023; v1 submitted 12 January, 2023; originally announced January 2023.

  18. arXiv:2212.01287  [pdf, other]

    cs.CV cs.AI

    SARAS-Net: Scale and Relation Aware Siamese Network for Change Detection

    Authors: Chao-Peng Chen, Jun-Wei Hsieh, Ping-Yang Chen, Yi-Kuan Hsieh, Bor-Shiun Wang

    Abstract: Change detection (CD) aims to find the difference between two images at different times and outputs a change map to represent whether the region has changed or not. To achieve a better result in generating the change map, many State-of-The-Art (SoTA) methods design a deep learning model that has a powerful discriminative ability. However, these methods still get lower performance because they igno…

    Submitted 2 December, 2022; originally announced December 2022.

  19. arXiv:2211.12839  [pdf]

    q-fin.TR cs.CE

    Newly Developed Flexible Grid Trading Model Combined ANN and SSO algorithm

    Authors: Wei-Chang Yeh, Yu-Hsin Hsieh, Chia-Ling Huang

    Abstract: In modern society, the trading methods and strategies used in financial market have gradually changed from traditional on-site trading to electronic remote trading, and even online automatic trading performed by a pre-programmed computer programs because the continuous development of network and computer computing technology. The quantitative trading, which the main purpose is to automatically for…

    Submitted 5 September, 2022; originally announced November 2022.

  20. arXiv:2211.06835  [pdf, other]

    cs.CV cs.AI

    Scale-Aware Crowd Counting Using a Joint Likelihood Density Map and Synthetic Fusion Pyramid Network

    Authors: Yi-Kuan Hsieh, Jun-Wei Hsieh, Yu-Chee Tseng, Ming-Ching Chang, Bor-Shiun Wang

    Abstract: We develop a Synthetic Fusion Pyramid Network (SPF-Net) with a scale-aware loss function design for accurate crowd counting. Existing crowd-counting methods assume that the training annotation points were accurate and thus ignore the fact that noisy annotations can lead to large model-learning bias and counting error, especially for counting highly dense crowds that appear far away. To the best of…

    Submitted 2 January, 2023; v1 submitted 13 November, 2022; originally announced November 2022.

    Comments: 8 pages, 8 figures, 4 tables

  21. Health Guardian Platform: A technology stack to accelerate discovery in Digital Health research

    Authors: Bo Wen, Vince S. Siu, Italo Buleje, Kuan Yu Hsieh, Takashi Itoh, Lukas Zimmerli, Nigel Hinds, Elif Eyigoz, Bing Dang, Stefan von Cavallar, Jeffrey L. Rogers

    Abstract: This paper highlights the design philosophy and architecture of the Health Guardian, a platform developed by the IBM Digital Health team to accelerate discoveries of new digital biomarkers and development of digital health technologies. The Health Guardian allows for rapid translation of artificial intelligence (AI) research into cloud-based microservices that can be tested with data from clinical…

    Submitted 10 November, 2022; originally announced November 2022.

    Comments: 6 pages, 3 figures, https://ieeexplore.ieee.org/document/9861047

    Journal ref: IEEE International Conference on Digital Health (ICDH), 2022, pp. 40-46

  22. arXiv:2210.13867  [pdf, ps, other]

    cs.LG math.PR math.ST

    A Dynamical System View of Langevin-Based Non-Convex Sampling

    Authors: Mohammad Reza Karimi, Ya-Ping Hsieh, Andreas Krause

    Abstract: Non-convex sampling is a key challenge in machine learning, central to non-convex optimization in deep learning as well as to approximate probabilistic inference. Despite its significance, theoretically there remain many important challenges: Existing guarantees (1) typically only hold for the averaged iterates rather than the more desirable last iterates, (2) lack convergence metrics that capture…

    Submitted 13 March, 2023; v1 submitted 25 October, 2022; originally announced October 2022.

    Comments: typos corrected, references added

    MSC Class: 62D05

  23. arXiv:2210.13291  [pdf, other]

    cs.LG cs.AI cs.CV cs.NI cs.SE

    NVIDIA FLARE: Federated Learning from Simulation to Real-World

    Authors: Holger R. Roth, Yan Cheng, Yuhong Wen, Isaac Yang, Ziyue Xu, Yuan-Ting Hsieh, Kristopher Kersten, Ahmed Harouni, Can Zhao, Kevin Lu, Zhihong Zhang, Wenqi Li, Andriy Myronenko, Dong Yang, Sean Yang, Nicola Rieke, Abood Quraini, Chester Chen, Daguang Xu, Nic Ma, Prerna Dogra, Mona Flores, Andrew Feng

    Abstract: Federated learning (FL) enables building robust and generalizable AI models by leveraging diverse datasets from multiple collaborators without centralizing the data. We created NVIDIA FLARE as an open-source software development kit (SDK) to make it easier for data scientists to use FL in their research and real-world applications. The SDK includes solutions for state-of-the-art FL algorithms and…

    Submitted 28 April, 2023; v1 submitted 24 October, 2022; originally announced October 2022.

    Comments: Accepted at the International Workshop on Federated Learning, NeurIPS 2022, New Orleans, USA (https://federated-learning.org/fl-neurips-2022); Revised version v2: added Key Components list, system metrics for homomorphic encryption experiment; Extended v3 for journal submission

    Journal ref: IEEE Data Eng. Bull., Vol. 46, No. 1, 2023

  24. arXiv:2207.07105  [pdf, ps, other]

    stat.ML cs.LG math.OC

    Continuous-time Analysis for Variational Inequalities: An Overview and Desiderata

    Authors: Tatjana Chavdarova, Ya-Ping Hsieh, Michael I. Jordan

    Abstract: Algorithms that solve zero-sum games, multi-objective agent objectives, or, more generally, variational inequality (VI) problems are notoriously unstable on general problems. Owing to the increasing need for solving such problems in machine learning, this instability has been highlighted in recent years as a significant research challenge. In this paper, we provide an overview of recent progress i…

    Submitted 14 July, 2022; originally announced July 2022.

  25. arXiv:2206.06795  [pdf, other]

    math.OC cs.LG math.DS

    Riemannian stochastic approximation algorithms

    Authors: Mohammad Reza Karimi, Ya-Ping Hsieh, Panayotis Mertikopoulos, Andreas Krause

    Abstract: We examine a wide class of stochastic approximation algorithms for solving (stochastic) nonlinear problems on Riemannian manifolds. Such algorithms arise naturally in the study of Riemannian optimization, game theory and optimal transport, but their behavior is much less understood compared to the Euclidean case because of the lack of a global linear structure on the manifold. We overcome this dif…

    Submitted 27 December, 2022; v1 submitted 14 June, 2022; originally announced June 2022.

    Comments: 33 pages, 2 figures; a one-page abstract of this paper was presented in COLT 2022

    MSC Class: Primary 62L20; 37N40; secondary 90C15; 90C47; 90C48

  26. arXiv:2206.06015  [pdf, other]

    cs.GT cs.LG

    No-Regret Learning in Games with Noisy Feedback: Faster Rates and Adaptivity via Learning Rate Separation

    Authors: Yu-Guan Hsieh, Kimon Antonakopoulos, Volkan Cevher, Panayotis Mertikopoulos

    Abstract: We examine the problem of regret minimization when the learner is involved in a continuous game with other optimizing agents: in this case, if all players follow a no-regret algorithm, it is possible to achieve significantly lower regret relative to fully adversarial environments. We study this problem in the context of variationally stable games (a class of continuous games which includes all con…

    Submitted 17 March, 2023; v1 submitted 13 June, 2022; originally announced June 2022.

    Comments: In Advances in Neural Information Processing Systems 35 (NeurIPS 2022)

  27. arXiv:2206.04113  [pdf, other]

    math.OC cs.DC cs.LG cs.MA

    Push–Pull with Device Sampling

    Authors: Yu-Guan Hsieh, Yassine Laguel, Franck Iutzeler, Jérôme Malick

    Abstract: We consider decentralized optimization problems in which a number of agents collaborate to minimize the average of their local functions by exchanging over an underlying communication graph. Specifically, we place ourselves in an asynchronous model where only a random portion of nodes perform computation at each iteration, while the information exchange can be conducted between all the nodes and i…

    Submitted 17 March, 2023; v1 submitted 8 June, 2022; originally announced June 2022.

    Comments: In IEEE Transactions on Automatic Control

  28. arXiv:2206.04091  [pdf, other]

    stat.ML cs.LG

    Uplifting Bandits

    Authors: Yu-Guan Hsieh, Shiva Prasad Kasiviswanathan, Branislav Kveton

    Abstract: We introduce a multi-armed bandit model where the reward is a sum of multiple random variables, and each action only alters the distributions of some of them. After each action, the agent observes the realizations of all the variables. This model is motivated by marketing campaigns and recommender systems, where the variables represent outcomes on individual customers, such as clicks. We propose U…

    Submitted 8 June, 2022; originally announced June 2022.

  29. arXiv:2206.03922  [pdf, other]

    cs.GT cs.LG math.OC

    A unified stochastic approximation framework for learning in games

    Authors: Panayotis Mertikopoulos, Ya-Ping Hsieh, Volkan Cevher

    Abstract: We develop a flexible stochastic approximation framework for analyzing the long-run behavior of learning in games (both continuous and finite). The proposed analysis template incorporates a wide array of popular learning algorithms, including gradient-based methods, the exponential/multiplicative weights algorithm for learning in finite games, optimistic and bandit variants of the above, etc. In a…

    Submitted 3 July, 2023; v1 submitted 8 June, 2022; originally announced June 2022.

    Comments: 40 pages, 5 figures, 2 tables

    MSC Class: Primary 91A10; 91A26; secondary 68Q32; 68T02

  30. arXiv:2202.05722  [pdf, other]

    cs.LG q-bio.QM

    The Schrödinger Bridge between Gaussian Measures has a Closed Form

    Authors: Charlotte Bunne, Ya-Ping Hsieh, Marco Cuturi, Andreas Krause

    Abstract: The static optimal transport $(\mathrm{OT})$ problem between Gaussians seeks to recover an optimal map, or more generally a coupling, to morph a Gaussian into another. It has been well studied and applied to a wide variety of tasks. Here we focus on the dynamic formulation of OT, also known as the Schrödinger bridge (SB) problem, which has recently seen a surge of interest in machine learning due…

    Submitted 31 March, 2023; v1 submitted 11 February, 2022; originally announced February 2022.

  31. arXiv:2112.02538  [pdf, ps, other]

    eess.AS cs.SD

    Toward Real-World Voice Disorder Classification

    Authors: Heng-Cheng Kuo, Yu-Peng Hsieh, Huan-Hsin Tseng, Chi-Te Wang, Shih-Hau Fang, Yu Tsao

    Abstract: Objective: Voice disorders significantly compromise individuals' ability to speak in their daily lives. Without early diagnosis and treatment, these disorders may deteriorate drastically. Thus, automatic classification systems at home are desirable for people who are inaccessible to clinical disease assessments. However, the performance of such systems may be weakened due to the constrained resour…

    Submitted 26 April, 2023; v1 submitted 5 December, 2021; originally announced December 2021.

    Comments: Accepted by IEEE TBME (under an IEEE Open Access publishing Agreement)

  32. arXiv:2110.04795  [pdf, ps, other]

    cs.CR

    Isogeny-based Group Signatures and Accountable Ring Signatures in QROM

    Authors: Kai-Min Chung, Yao-Ching Hsieh, Mi-Ying Huang, Yu-Hsuan Huang, Tanja Lange, Bo-Yin Yang

    Abstract: We provide the first isogeny-based group signature (GS) and accountable ring signature (ARS) that are provably secure in the quantum random oracle model (QROM). We do so by building an intermediate primitive called openable sigma protocol and show that every such protocol gives rise to a secure ARS and GS. Additionally, the QROM security is guaranteed if the perfect unique-response property is sat…

    Submitted 2 November, 2022; v1 submitted 10 October, 2021; originally announced October 2021.

  33. arXiv:2109.00711  [pdf, other]

    cs.LG cond-mat.dis-nn

    Heterogeneous relational message passing networks for molecular dynamics simulations

    Authors: Zun Wang, Chong Wang, Sibo Zhao, Yong Xu, Shaogang Hao, Chang Yu Hsieh, Bing-Lin Gu, Wenhui Duan

    Abstract: With many frameworks based on message passing neural networks proposed to predict molecular and bulk properties, machine learning methods have tremendously shifted the paradigms of computational sciences underpinning physics, material science, chemistry, and biology. While existing machine learning models have yielded superior performances in many occasions, most of them model and process molecula…

    Submitted 2 September, 2021; originally announced September 2021.

  34. arXiv:2107.00127  [pdf, other]

    cs.RO

    SQRP: Sensing Quality-aware Robot Programming System for Non-expert Programmers

    Authors: Yi-Hsuan Hsieh, Pei-Chi Huang, Aloysius K Mok

    Abstract: Robot programming typically makes use of a set of mechanical skills that is acquired by machine learning. Because there is in general no guarantee that machine learning produces robot programs that are free of surprising behavior, the safe execution of a robot program must utilize monitoring modules that take sensor data as inputs in real time to ensure the correctness of the skill execution. Owin…

    Submitted 30 June, 2021; originally announced July 2021.

    Comments: 7 pages, 9 figures, 1 table; accepted for presentation in IEEE ICRA 2021 (IEEE International Conference on Robotics and Automation)

  35. arXiv:2105.13348  [pdf, other]

    math.OC cs.LG cs.MA

    Optimization in Open Networks via Dual Averaging

    Authors: Yu-Guan Hsieh, Franck Iutzeler, Jérôme Malick, Panayotis Mertikopoulos

    Abstract: In networks of autonomous agents (e.g., fleets of vehicles, scattered sensors), the problem of minimizing the sum of the agents' local functions has received a lot of interest. We tackle here this distributed optimization problem in the case of open networks when agents can join and leave the network at any time. Leveraging recent online optimization techniques, we propose and analyze the converge…

    Submitted 16 October, 2021; v1 submitted 27 May, 2021; originally announced May 2021.

    Comments: In 60th IEEE Conference on Decision and Control (CDC 2021); 7 pages, 1 figure

  36. arXiv:2105.07622  [pdf, other]

    cs.CL

    Ensemble-based Transfer Learning for Low-resource Machine Translation Quality Estimation

    Authors: Ting-Wei Wu, Yung-An Hsieh, Yi-Chieh Liu

    Abstract: Quality Estimation (QE) of Machine Translation (MT) is a task to estimate the quality scores for given translation outputs from an unknown MT system. However, QE scores for low-resource languages are usually intractable and hard to collect. In this paper, we focus on the Sentence-Level QE Shared Task of the Fifth Conference on Machine Translation (WMT20), but in a more challenging setting. We aim…

    Submitted 17 May, 2021; originally announced May 2021.

  37. arXiv:2104.12761  [pdf, other]

    cs.GT cs.LG math.OC

    Adaptive Learning in Continuous Games: Optimal Regret Bounds and Convergence to Nash Equilibrium

    Authors: Yu-Guan Hsieh, Kimon Antonakopoulos, Panayotis Mertikopoulos

    Abstract: In game-theoretic learning, several agents are simultaneously following their individual interests, so the environment is non-stationary from each player's perspective. In this context, the performance of a learning algorithm is often measured by its regret. However, no-regret algorithms are not created equal in terms of game-theoretic guarantees: depending on how they are tuned, some of them may…

    Submitted 16 October, 2021; v1 submitted 26 April, 2021; originally announced April 2021.

    Comments: In the 34th Annual Conference on Learning Theory (COLT 2021); 35 pages, 2 figures

  38. arXiv:2103.13495  [pdf, other]

    physics.app-ph cond-mat.mes-hall cs.LG eess.IV physics.data-an

    Machine Learning-based Automatic Graphene Detection with Color Correction for Optical Microscope Images

    Authors: Hui-Ying Siao, Siyu Qi, Zhi Ding, Chia-Yu Lin, Yu-Chiang Hsieh, Tse-Ming Chen

    Abstract: Graphene serves critical application and research purposes in various fields. However, fabricating high-quality and large quantities of graphene is time-consuming and it requires heavy human resource labor costs. In this paper, we propose a Machine Learning-based Automatic Graphene Detection Method with Color Correction (MLA-GDCC), a reliable and autonomous graphene detection from microscopic imag…

    Submitted 24 March, 2021; originally announced March 2021.

    Comments: 14 pages, 8 figures

  39. arXiv:2012.11579  [pdf, ps, other]

    cs.LG cs.MA math.OC

    Multi-Agent Online Optimization with Delays: Asynchronicity, Adaptivity, and Optimism

    Authors: Yu-Guan Hsieh, Franck Iutzeler, Jérôme Malick, Panayotis Mertikopoulos

    Abstract: In this paper, we provide a general framework for studying multi-agent online learning problems in the presence of delays and asynchronicities. Specifically, we propose and analyze a class of adaptive dual averaging schemes in which agents only need to accumulate gradient feedback received from the whole system, without requiring any between-agent coordination. In the single-agent case, the adapti…

    Submitted 16 April, 2022; v1 submitted 21 December, 2020; originally announced December 2020.

    Comments: Accepted by Journal of Machine Learning Research (JMLR)

  40. arXiv:2007.03795  [pdf, other]

    cs.LG math.OC stat.ML

    Conditional gradient methods for stochastically constrained convex minimization

    Authors: Maria-Luiza Vladarean, Ahmet Alacaoglu, Ya-Ping Hsieh, Volkan Cevher

    Abstract: We propose two novel conditional gradient-based methods for solving structured stochastic convex optimization problems with a large number of linear constraints. Instances of this template naturally arise from SDP-relaxations of combinatorial problems, which involve a number of constraints that is polynomial in the problem dimension. The most important feature of our framework is that only a subse…

    Submitted 7 July, 2020; originally announced July 2020.

  41. arXiv:2006.09065  [pdf, other]

    math.OC cs.LG stat.ML

    The limits of min-max optimization algorithms: convergence to spurious non-critical sets

    Authors: Ya-Ping Hsieh, Panayotis Mertikopoulos, Volkan Cevher

    Abstract: Compared to ordinary function minimization problems, min-max optimization algorithms encounter far greater challenges because of the existence of periodic cycles and similar phenomena. Even though some of these behaviors can be overcome in the convex-concave regime, the general case is considerably more difficult. On that account, we take an in-depth look at a comprehensive class of state-of-the a…

    Submitted 14 February, 2021; v1 submitted 16 June, 2020; originally announced June 2020.

  42. arXiv:2003.10162  [pdf, other

    math.OC cs.GT cs.LG

    Explore Aggressively, Update Conservatively: Stochastic Extragradient Methods with Variable Stepsize Scaling

    Authors: Yu-Guan Hsieh, Franck Iutzeler, Jérôme Malick, Panayotis Mertikopoulos

    Abstract: Owing to their stability and convergence speed, extragradient methods have become a staple for solving large-scale saddle-point problems in machine learning. The basic premise of these algorithms is the use of an extrapolation step before performing an update; thanks to this exploration step, extra-gradient methods overcome many of the non-convergence issues that plague gradient descent/ascent sch…

    Submitted 5 November, 2020; v1 submitted 23 March, 2020; originally announced March 2020.

    Comments: In Advances in Neural Information Processing Systems 33 (NeurIPS 2020); 29 pages, 5 figures

    MSC Class: 65K15; 62L20; 90C15; 90C33

  43. arXiv:2002.06063  [pdf, other

    cs.LG stat.ML

    Robust Reinforcement Learning via Adversarial training with Langevin Dynamics

    Authors: Parameswaran Kamalaruban, Yu-Ting Huang, Ya-Ping Hsieh, Paul Rolland, Cheng Shi, Volkan Cevher

    Abstract: We introduce a sampling perspective to tackle the challenging task of training robust Reinforcement Learning (RL) agents. Leveraging the powerful Stochastic Gradient Langevin Dynamics, we present a novel, scalable two-player RL algorithm, which is a sampling variant of the two-player policy gradient method. Our algorithm consistently outperforms existing baselines, in terms of generalization acros…

    Submitted 5 November, 2020; v1 submitted 14 February, 2020; originally announced February 2020.

  44. arXiv:2001.01538  [pdf, other

    eess.AS cs.SD

    Speech Enhancement based on Denoising Autoencoder with Multi-branched Encoders

    Authors: Cheng Yu, Ryandhimas E. Zezario, Syu-Siang Wang, Jonathan Sherman, Yi-Yen Hsieh, Xugang Lu, Hsin-Min Wang, Yu Tsao

    Abstract: Deep learning-based models have greatly advanced the performance of speech enhancement (SE) systems. However, two problems remain unsolved, which are closely related to model generalizability to noisy conditions: (1) mismatched noisy condition during testing, i.e., the performance is generally sub-optimal when models are tested with unseen noise types that are not involved in the training data; (2…

    Submitted 24 December, 2020; v1 submitted 6 January, 2020; originally announced January 2020.

  45. Interpretable Self-Attention Temporal Reasoning for Driving Behavior Understanding

    Authors: Yi-Chieh Liu, Yung-An Hsieh, Min-Hung Chen, Chao-Han Huck Yang, Jesper Tegner, Yi-Chang James Tsai

    Abstract: Performing driving behaviors based on causal reasoning is essential to ensure driving safety. In this work, we investigated how state-of-the-art 3D Convolutional Neural Networks (CNNs) perform on classifying driving behaviors based on causal reasoning. We proposed a perturbation-based visual explanation method to inspect the models' performance visually. By examining the video attention saliency,…

    Submitted 5 November, 2019; originally announced November 2019.

    Comments: Submitted to IEEE ICASSP 2020; Pytorch code will be released soon

    Journal ref: 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

  46. arXiv:1910.14540  [pdf, other

    cs.RO

    Team NCTU: Toward AI-Driving for Autonomous Surface Vehicles -- From Duckietown to RobotX

    Authors: Yi-Wei Huang, Tzu-Kuan Chuang, Ni-Ching Lin, Yu-Chieh Hsiao, Pin-Wei Chen, Ching-Tang Hung, Shih-Hsing Liu, Hsiao-Sheng Chen, Ya-Hsiu Hsieh, Ching-Tang Hung, Yen-Hsiang Huang, Yu-Xuan Chen, Kuan-Lin Chen, Ya-Jou Lan, Chao-Chun Hsu, Chun-Yi Lin, Jhih-Ying Li, Jui-Te Huang, Yu-Jen Menn, Sin-Kiat Lim, Kim-Boon Lua, Chia-Hung Dylan Tsai, Chi-Fang Chen, Hsueh-Cheng Wang

    Abstract: Robotic software and hardware systems of autonomous surface vehicles have been developed in transportation, military, and ocean researches for decades. Previous efforts in RobotX Challenges 2014 and 2016 facilitates the developments for important tasks such as obstacle avoidance and docking. Team NCTU is motivated by the AI Driving Olympics (AI-DO) developed by the Duckietown community, and adopts…

    Submitted 31 October, 2019; originally announced October 2019.

  47. arXiv:1909.04495  [pdf, other

    cs.IR cs.CL cs.CR cs.LG

    Natural Adversarial Sentence Generation with Gradient-based Perturbation

    Authors: Yu-Lun Hsieh, Minhao Cheng, Da-Cheng Juan, Wei Wei, Wen-Lian Hsu, Cho-Jui Hsieh

    Abstract: This work proposes a novel algorithm to generate natural language adversarial input for text classification models, in order to investigate the robustness of these models. It involves applying gradient-based perturbation on the sentence embeddings that are used as the features for the classifier, and learning a decoder for generation. We employ this method to a sentiment analysis model and verify…

    Submitted 6 September, 2019; originally announced September 2019.

  48. arXiv:1908.08465  [pdf, other

    math.OC cs.GT cs.LG

    On the convergence of single-call stochastic extra-gradient methods

    Authors: Yu-Guan Hsieh, Franck Iutzeler, Jérôme Malick, Panayotis Mertikopoulos

    Abstract: Variational inequalities have recently attracted considerable interest in machine learning as a flexible paradigm for models that go beyond ordinary loss function minimization (such as generative adversarial networks and related deep learning systems). In this setting, the optimal $\mathcal{O}(1/t)$ convergence rate for solving smooth monotone variational inequalities is achieved by the Extra-Grad…

    Submitted 11 February, 2020; v1 submitted 22 August, 2019; originally announced August 2019.

    Comments: In Advances in Neural Information Processing Systems 32 (NeurIPS 2019); 24 pages, 3 figures

    MSC Class: 65K15; 62L20; 90C15; 90C33

  49. arXiv:1811.02002  [pdf, ps, other

    cs.LG cs.GT stat.ML

    Finding Mixed Nash Equilibria of Generative Adversarial Networks

    Authors: Ya-Ping Hsieh, Chen Liu, Volkan Cevher

    Abstract: We reconsider the training objective of Generative Adversarial Networks (GANs) from the mixed Nash Equilibria (NE) perspective. Inspired by the classical prox methods, we develop a novel algorithmic framework for GANs via an infinite-dimensional two-player game and prove rigorous convergence rates to the mixed NE, resolving the longstanding problem that no provably convergent algorithm exists for…

    Submitted 23 October, 2018; originally announced November 2018.

  50. arXiv:1810.00846  [pdf, other

    cs.LG stat.ML

    Classification from Positive, Unlabeled and Biased Negative Data

    Authors: Yu-Guan Hsieh, Gang Niu, Masashi Sugiyama

    Abstract: In binary classification, there are situations where negative (N) data are too diverse to be fully labeled and we often resort to positive-unlabeled (PU) learning in these scenarios. However, collecting a non-representative N set that contains only a small portion of all possible N data can often be much easier in practice. This paper studies a novel classification framework which incorporates suc…

    Submitted 13 July, 2019; v1 submitted 1 October, 2018; originally announced October 2018.

    Comments: In Proceedings of the 36th International Conference on Machine Learning (ICML 2019)