Skip to main content

Showing 1–50 of 77 results for author: Tseng, H

Searching in archive cs. Search in all archives.
.
  1. arXiv:2405.02288  [pdf, other

    cs.CV cs.AI cs.RO

    Prospective Role of Foundation Models in Advancing Autonomous Vehicles

    Authors: Jianhua Wu, Bingzhao Gao, **cheng Gao, Jianhao Yu, Hongqing Chu, Qiankun Yu, Xun Gong, Yi Chang, H. Eric Tseng, Hong Chen, Jie Chen

    Abstract: With the development of artificial intelligence and breakthroughs in deep learning, large-scale Foundation Models (FMs), such as GPT, Sora, etc., have achieved remarkable results in many fields including natural language processing and computer vision. The application of FMs in autonomous driving holds considerable promise. For example, they can contribute to enhancing scene understanding and reas… ▽ More

    Submitted 17 May, 2024; v1 submitted 8 December, 2023; originally announced May 2024.

    Comments: 45 pages,8 figures

  2. arXiv:2404.09995  [pdf, other

    cs.CV cs.AI cs.LG

    Taming Latent Diffusion Model for Neural Radiance Field Inpainting

    Authors: Chieh Hubert Lin, Changil Kim, Jia-Bin Huang, Qinbo Li, Chih-Yao Ma, Johannes Kopf, Ming-Hsuan Yang, Hung-Yu Tseng

    Abstract: Neural Radiance Field (NeRF) is a representation for 3D reconstruction from multi-view images. Despite some recent work showing preliminary success in editing a reconstructed NeRF with diffusion prior, they remain struggling to synthesize reasonable geometry in completely uncovered regions. One major reason is the high diversity of synthetic contents from the diffusion model, which hinders the rad… ▽ More

    Submitted 15 April, 2024; originally announced April 2024.

    Comments: Project page: https://hubert0527.github.io/MALD-NeRF

  3. Real-time Neuron Segmentation for Voltage Imaging

    Authors: Yosuke Bando, Ramdas Pillai, Atsushi Kajita, Farhan Abdul Hakeem, Yves Quemener, Hua-an Tseng, Kiryl D. Piatkevich, Changyang Linghu, Xue Han, Edward S. Boyden

    Abstract: In voltage imaging, where the membrane potentials of individual neurons are recorded at from hundreds to thousand frames per second using fluorescence microscopy, data processing presents a challenge. Even a fraction of a minute of recording with a limited image size yields gigabytes of video data consisting of tens of thousands of frames, which can be time-consuming to process. Moreover, millisec… ▽ More

    Submitted 25 March, 2024; originally announced March 2024.

    Journal ref: IEEE International Conference on Bioinformatics and Biomedicine (BIBM), 813-818, 2023

  4. arXiv:2403.15577  [pdf, other

    cs.AI cs.RO eess.SY

    Autonomous Driving With Perception Uncertainties: Deep-Ensemble Based Adaptive Cruise Control

    Authors: Xiao Li, H. Eric Tseng, Anouck Girard, Ilya Kolmanovsky

    Abstract: Autonomous driving depends on perception systems to understand the environment and to inform downstream decision-making. While advanced perception systems utilizing black-box Deep Neural Networks (DNNs) demonstrate human-like comprehension, their unpredictable behavior and lack of interpretability may hinder their deployment in safety critical scenarios. In this paper, we develop an Ensemble of DN… ▽ More

    Submitted 22 March, 2024; originally announced March 2024.

  5. arXiv:2403.09216  [pdf

    cs.CY

    Unlocking the Potential of Open Government Data: Exploring the Strategic, Technical, and Application Perspectives of High-Value Datasets Opening in Taiwan

    Authors: Hsien-Lee Tseng, Anastasija Nikiforova

    Abstract: Today, data has an unprecedented value as it forms the basis for data-driven decision-making, including serving as an input for AI models, where the latter is highly dependent on the availability of the data. However, availability of data in an open data format creates a little added value, where the value of these data, i.e., their relevance to the real needs of the end user, is key. This is wher… ▽ More

    Submitted 14 March, 2024; originally announced March 2024.

    Comments: This paper has been accepted for publication in Proceedings of the 25th Annual International Conference on Digital Government Research and this is a pre-print version of the manuscript. It is posted here for your personal use. Not for redistribution

  6. arXiv:2403.01807  [pdf, other

    cs.CV

    ViewDiff: 3D-Consistent Image Generation with Text-to-Image Models

    Authors: Lukas Höllein, Aljaž Božič, Norman Müller, David Novotny, Hung-Yu Tseng, Christian Richardt, Michael Zollhöfer, Matthias Nießner

    Abstract: 3D asset generation is getting massive amounts of attention, inspired by the recent success of text-guided 2D content creation. Existing text-to-3D methods use pretrained text-to-image diffusion models in an optimization problem or fine-tune them on synthetic data, which often results in non-photorealistic 3D objects without backgrounds. In this paper, we present a method that leverages pretrained… ▽ More

    Submitted 4 March, 2024; originally announced March 2024.

    Comments: Accepted to CVPR 2024, project page: https://lukashoel.github.io/ViewDiff/, video: https://www.youtube.com/watch?v=SdjoCqHzMMk, code: https://github.com/facebookresearch/ViewDiff

  7. arXiv:2401.07464  [pdf, other

    quant-ph cs.CR cs.LG

    Quantum Privacy Aggregation of Teacher Ensembles (QPATE) for Privacy-preserving Quantum Machine Learning

    Authors: William Watkins, Heehwan Wang, Sangyoon Bae, Huan-Hsin Tseng, Jiook Cha, Samuel Yen-Chi Chen, Shinjae Yoo

    Abstract: The utility of machine learning has rapidly expanded in the last two decades and presents an ethical challenge. Papernot et. al. developed a technique, known as Private Aggregation of Teacher Ensembles (PATE) to enable federated learning in which multiple teacher models are trained on disjoint datasets. This study is the first to apply PATE to an ensemble of quantum neural networks (QNN) to pave a… ▽ More

    Submitted 14 January, 2024; originally announced January 2024.

  8. arXiv:2312.10880  [pdf, other

    cs.RO eess.SY

    Sharable Clothoid-based Continuous Motion Planning for Connected Automated Vehicles

    Authors: Sanghoon Oh, Qi Chen, H. Eric Tseng, Gaurav Pandey, Gabor Orosz

    Abstract: A continuous motion planning method for connected automated vehicles is considered for generating feasible trajectories in real-time using three consecutive clothoids. The proposed method reduces path planning to a small set of nonlinear algebraic equations such that the generated path can be efficiently checked for feasibility and collision. After path planning, velocity planning is executed whil… ▽ More

    Submitted 17 December, 2023; originally announced December 2023.

    Comments: 14 pages, 14 figures

  9. arXiv:2312.09429  [pdf

    eess.SP cs.LG

    Deep Learning-Enabled Swallowing Monitoring and Postoperative Recovery Biosensing System

    Authors: Chih-Ning Tsai, Pei-Wen Yang, Tzu-Yen Huang, Jung-Chih Chen, Hsin-Yi Tseng, Che-Wei Wu, Amrit Sarmah, Tzu-En Lin

    Abstract: This study introduces an innovative 3D printed dry electrode tailored for biosensing in postoperative recovery scenarios. Fabricated through a drop coating process, the electrode incorporates a novel 2D material.

    Submitted 24 November, 2023; originally announced December 2023.

    Comments: the abstract can't uploaded fully

    MSC Class: NA ACM Class: A.0

  10. arXiv:2311.18832  [pdf, other

    cs.CV

    Exploiting Diffusion Prior for Generalizable Dense Prediction

    Authors: Hsin-Ying Lee, Hung-Yu Tseng, Hsin-Ying Lee, Ming-Hsuan Yang

    Abstract: Contents generated by recent advanced Text-to-Image (T2I) diffusion models are sometimes too imaginative for existing off-the-shelf dense predictors to estimate due to the immitigable domain gap. We introduce DMP, a pipeline utilizing pre-trained T2I models as a prior for dense prediction tasks. To address the misalignment between deterministic prediction tasks and stochastic T2I models, we reform… ▽ More

    Submitted 2 April, 2024; v1 submitted 30 November, 2023; originally announced November 2023.

    Comments: To appear in CVPR 2024. Project page: https://shinying.github.io/dmp

  11. arXiv:2311.10041  [pdf, other

    cs.RO

    Interpretable Reinforcement Learning for Robotics and Continuous Control

    Authors: Rohan Paleja, Letian Chen, Yaru Niu, Andrew Silva, Zhaoxin Li, Songan Zhang, Chace Ritchie, Sugju Choi, Kimberlee Chestnut Chang, Hongtei Eric Tseng, Yan Wang, Subramanya Nageshrao, Matthew Gombolay

    Abstract: Interpretability in machine learning is critical for the safe deployment of learned policies across legally-regulated and safety-critical domains. While gradient-based approaches in reinforcement learning have achieved tremendous success in learning policies for continuous control problems such as robotics and autonomous driving, the lack of interpretability is a fundamental barrier to adoption. W… ▽ More

    Submitted 16 November, 2023; originally announced November 2023.

    Comments: arXiv admin note: text overlap with arXiv:2202.02352

  12. Single-Image 3D Human Digitization with Shape-Guided Diffusion

    Authors: Badour AlBahar, Shunsuke Saito, Hung-Yu Tseng, Changil Kim, Johannes Kopf, Jia-Bin Huang

    Abstract: We present an approach to generate a 360-degree view of a person with a consistent, high-resolution appearance from a single input image. NeRF and its variants typically require videos or images from different viewpoints. Most existing approaches taking monocular input either rely on ground-truth 3D scans for supervision or lack 3D consistency. While recent 3D generative models show promise of 3D… ▽ More

    Submitted 15 November, 2023; originally announced November 2023.

    Comments: SIGGRAPH Asia 2023. Project website: https://human-sgd.github.io/

  13. arXiv:2311.06673  [pdf, other

    cs.LG cs.AI cs.RO

    Dream to Adapt: Meta Reinforcement Learning by Latent Context Imagination and MDP Imagination

    Authors: Lu Wen, Songan Zhang, H. Eric Tseng, Huei Peng

    Abstract: Meta reinforcement learning (Meta RL) has been amply explored to quickly learn an unseen task by transferring previously learned knowledge from similar tasks. However, most state-of-the-art algorithms require the meta-training tasks to have a dense coverage on the task distribution and a great amount of data for each of them. In this paper, we propose MetaDreamer, a context-based Meta RL algorithm… ▽ More

    Submitted 11 November, 2023; originally announced November 2023.

  14. arXiv:2310.20561  [pdf, other

    cs.RO eess.SY math.OC

    Predictive Control for Autonomous Driving with Uncertain, Multi-modal Predictions

    Authors: Siddharth H. Nair, Hotae Lee, Eunhyek Joa, Yan Wang, H. Eric Tseng, Francesco Borrelli

    Abstract: We propose a Stochastic MPC (SMPC) formulation for path planning with autonomous vehicles in scenarios involving multiple agents with multi-modal predictions. The multi-modal predictions capture the uncertainty of urban driving in distinct modes/maneuvers (e.g., yield, keep speed) and driving trajectories (e.g., speed, turning radius), which are incorporated for multi-modal collision avoidance cha… ▽ More

    Submitted 31 October, 2023; originally announced October 2023.

    Comments: The first three authors contributed equally

  15. arXiv:2310.20148  [pdf, other

    cs.AI cs.RO eess.SY

    Decision-Making for Autonomous Vehicles with Interaction-Aware Behavioral Prediction and Social-Attention Neural Network

    Authors: Xiao Li, Kaiwen Liu, H. Eric Tseng, Anouck Girard, Ilya Kolmanovsky

    Abstract: Autonomous vehicles need to accomplish their tasks while interacting with human drivers in traffic. It is thus crucial to equip autonomous vehicles with artificial reasoning to better comprehend the intentions of the surrounding traffic, thereby facilitating the accomplishments of the tasks. In this work, we propose a behavioral model that encodes drivers' interacting intentions into latent social… ▽ More

    Submitted 31 October, 2023; v1 submitted 30 October, 2023; originally announced October 2023.

  16. arXiv:2310.20009  [pdf, other

    cs.GT

    Nash or Stackelberg? -- A comparative study for game-theoretic AV decision-making

    Authors: Brady Bateman, Ming Xin, H. Eric Tseng, Mushuang Liu

    Abstract: This paper studies game-theoretic decision-making for autonomous vehicles (AVs). A receding horizon multi-player game is formulated to model the AV decision-making problem. Two classes of games, including Nash game and Stackelber games, are developed respectively. For each of the two games, two solution settings, including pairwise games and multi-player games, are introduced, respectively, to sol… ▽ More

    Submitted 30 October, 2023; originally announced October 2023.

    Comments: 8 pages, submitted to ECC24

  17. arXiv:2310.15084  [pdf, other

    quant-ph cs.LG

    Quantum Federated Learning With Quantum Networks

    Authors: Tyler Wang, Huan-Hsin Tseng, Shinjae Yoo

    Abstract: A major concern of deep learning models is the large amount of data that is required to build and train them, much of which is reliant on sensitive and personally identifiable information that is vulnerable to access by third parties. Ideas of using the quantum internet to address this issue have been previously proposed, which would enable fast and completely secure online communications. Previou… ▽ More

    Submitted 23 October, 2023; originally announced October 2023.

  18. arXiv:2310.06973  [pdf, other

    quant-ph cs.LG

    Federated Quantum Machine Learning with Differential Privacy

    Authors: Rod Rofougaran, Shinjae Yoo, Huan-Hsin Tseng, Samuel Yen-Chi Chen

    Abstract: The preservation of privacy is a critical concern in the implementation of artificial intelligence on sensitive training data. There are several techniques to preserve data privacy but quantum computations are inherently more secure due to the no-cloning theorem, resulting in a most desirable computational platform on top of the potential quantum advantages. There have been prior works in protecti… ▽ More

    Submitted 10 October, 2023; originally announced October 2023.

    Comments: 5 pages, 7 figures

  19. arXiv:2309.14497  [pdf, other

    cs.AI cs.RO eess.SY

    Interaction-Aware Decision-Making for Autonomous Vehicles in Forced Merging Scenario Leveraging Social Psychology Factors

    Authors: Xiao Li, Kaiwen Liu, H. Eric Tseng, Anouck Girard, Ilya Kolmanovsky

    Abstract: Understanding the intention of vehicles in the surrounding traffic is crucial for an autonomous vehicle to successfully accomplish its driving tasks in complex traffic scenarios such as highway forced merging. In this paper, we consider a behavioral model that incorporates both social behaviors and personal objectives of the interacting drivers. Leveraging this model, we develop a receding-horizon… ▽ More

    Submitted 25 September, 2023; originally announced September 2023.

  20. arXiv:2309.04063  [pdf, other

    cs.CV

    INSURE: An Information Theory Inspired Disentanglement and Purification Model for Domain Generalization

    Authors: Xi Yu, Huan-Hsin Tseng, Shinjae Yoo, Haibin Ling, Yuewei Lin

    Abstract: Domain Generalization (DG) aims to learn a generalizable model on the unseen target domain by only training on the multiple observed source domains. Although a variety of DG methods have focused on extracting domain-invariant features, the domain-specific class-relevant features have attracted attention and been argued to benefit generalization to the unseen target domain. To take into account the… ▽ More

    Submitted 7 September, 2023; originally announced September 2023.

    Comments: 10 pages, 4 figures

  21. arXiv:2304.14404  [pdf, other

    cs.CV

    Motion-Conditioned Diffusion Model for Controllable Video Synthesis

    Authors: Tsai-Shien Chen, Chieh Hubert Lin, Hung-Yu Tseng, Tsung-Yi Lin, Ming-Hsuan Yang

    Abstract: Recent advancements in diffusion models have greatly improved the quality and diversity of synthesized content. To harness the expressive power of diffusion models, researchers have explored various controllable mechanisms that allow users to intuitively guide the content synthesis process. Although the latest efforts have primarily focused on video synthesis, there has been a lack of effective me… ▽ More

    Submitted 27 April, 2023; originally announced April 2023.

    Comments: Project page: https://tsaishien-chen.github.io/MCDiff/

  22. arXiv:2303.17598  [pdf, other

    cs.CV

    Consistent View Synthesis with Pose-Guided Diffusion Models

    Authors: Hung-Yu Tseng, Qinbo Li, Changil Kim, Suhib Alsisan, Jia-Bin Huang, Johannes Kopf

    Abstract: Novel view synthesis from a single image has been a cornerstone problem for many Virtual Reality applications that provide immersive experiences. However, most existing techniques can only synthesize novel views within a limited range of camera motion or fail to generate consistent and high-quality novel views under significant camera movement. In this work, we propose a pose-guided diffusion mode… ▽ More

    Submitted 30 March, 2023; originally announced March 2023.

    Comments: CVPR 2023. Project page: https://poseguided-diffusion.github.io/

  23. arXiv:2303.16280  [pdf, other

    cs.CV

    UVCGAN v2: An Improved Cycle-Consistent GAN for Unpaired Image-to-Image Translation

    Authors: Dmitrii Torbunov, Yi Huang, Huan-Hsin Tseng, Haiwang Yu, ** Huang, Shinjae Yoo, Meifeng Lin, Brett Viren, Yihui Ren

    Abstract: An unpaired image-to-image (I2I) translation technique seeks to find a map** between two domains of data in a fully unsupervised manner. While initial solutions to the I2I problem were provided by generative adversarial neural networks (GANs), diffusion models (DMs) currently hold the state-of-the-art status on the I2I translation benchmarks in terms of Frechet inception distance (FID). Yet, DMs… ▽ More

    Submitted 22 September, 2023; v1 submitted 28 March, 2023; originally announced March 2023.

  24. arXiv:2303.12861  [pdf, other

    eess.IV cs.LG eess.SP physics.bio-ph

    Parallel Diffusion Model-based Sparse-view Cone-beam Breast CT

    Authors: Wenjun Xia, Hsin Wu Tseng, Chuang Niu, Wenxiang Cong, Xiaohua Zhang, Shaohua Liu, Ruola Ning, Srinivasan Vedantham, Ge Wang

    Abstract: Breast cancer is the most prevalent cancer among women worldwide, and early detection is crucial for reducing its mortality rate and improving quality of life. Dedicated breast computed tomography (CT) scanners offer better image quality than mammography and tomosynthesis in general but at higher radiation dose. To enable breast CT for cancer screening, the challenge is to minimize the radiation d… ▽ More

    Submitted 28 January, 2024; v1 submitted 22 March, 2023; originally announced March 2023.

  25. arXiv:2303.09905  [pdf, other

    cs.CL

    More Robust Schema-Guided Dialogue State Tracking via Tree-Based Paraphrase Ranking

    Authors: A. Coca, B. H. Tseng, W. Lin, B. Byrne

    Abstract: The schema-guided paradigm overcomes scalability issues inherent in building task-oriented dialogue (TOD) agents with static ontologies. Instead of operating on dialogue context alone, agents have access to hierarchical schemas containing task-relevant natural language descriptions. Fine-tuned language models excel at schema-guided dialogue state tracking (DST) but are sensitive to the writing sty… ▽ More

    Submitted 17 March, 2023; originally announced March 2023.

    Comments: Accepted at EACL (Findings) 2023

  26. arXiv:2302.01798  [pdf, other

    cs.LG

    Interpretations of Domain Adaptations via Layer Variational Analysis

    Authors: Huan-Hsin Tseng, Hsin-Yi Lin, Kuo-Hsuan Hung, Yu Tsao

    Abstract: Transfer learning is known to perform efficiently in many applications empirically, yet limited literature reports the mechanism behind the scene. This study establishes both formal derivations and heuristic analysis to formulate the theory of transfer learning in deep learning. Our framework utilizing layer variational analysis proves that the success of transfer learning can be guaranteed with c… ▽ More

    Submitted 9 May, 2023; v1 submitted 3 February, 2023; originally announced February 2023.

    Comments: Published at ICLR 2023

  27. arXiv:2301.02239  [pdf, other

    cs.CV

    Robust Dynamic Radiance Fields

    Authors: Yu-Lun Liu, Chen Gao, Andreas Meuleman, Hung-Yu Tseng, Ayush Saraf, Changil Kim, Yung-Yu Chuang, Johannes Kopf, Jia-Bin Huang

    Abstract: Dynamic radiance field reconstruction methods aim to model the time-varying structure and appearance of a dynamic scene. Existing methods, however, assume that accurate camera poses can be reliably estimated by Structure from Motion (SfM) algorithms. These methods, thus, are unreliable as SfM algorithms often fail or produce erroneous poses on challenging videos with highly dynamic objects, poorly… ▽ More

    Submitted 21 March, 2023; v1 submitted 5 January, 2023; originally announced January 2023.

    Comments: CVPR 2023. Project page: https://robust-dynrf.github.io/

  28. arXiv:2211.12628  [pdf, other

    eess.SY cs.AI math.OC

    Safe Control and Learning Using Generalized Action Governor

    Authors: Nan Li, Yutong Li, Ilya Kolmanovsky, Anouck Girard, H. Eric Tseng, Dimitar Filev

    Abstract: This paper introduces the Generalized Action Governor, which is a supervisory scheme for augmenting a nominal closed-loop system with the capability of strictly handling constraints. After presenting its theory for general systems and introducing tailored design approaches for linear and discrete systems, we discuss its application to safe online learning, which aims to safely evolve control param… ▽ More

    Submitted 22 November, 2022; originally announced November 2022.

    Comments: 10 pages, 4 figures

  29. arXiv:2211.11997  [pdf, other

    cs.RO

    REFINE: Reachability-based Trajectory Design using Robust Feedback Linearization and Zonotopes

    Authors: **sun Liu, Yifei Shao, Lucas Lymburner, Hansen Qin, Vishrut Kaushik, Lena Trang, Ruiyang Wang, Vladimir Ivanovic, H. Eric Tseng, Ram Vasudevan

    Abstract: Performing real-time receding horizon motion planning for autonomous vehicles while providing safety guarantees remains difficult. This is because existing methods to accurately predict ego vehicle behavior under a chosen controller use online numerical integration that requires a fine time discretization and thereby adversely affects real-time performance. To address this limitation, several rece… ▽ More

    Submitted 21 November, 2022; originally announced November 2022.

  30. arXiv:2211.06508  [pdf, other

    cs.SD cs.LG eess.AS

    On the robustness of non-intrusive speech quality model by adversarial examples

    Authors: Hsin-Yi Lin, Huan-Hsin Tseng, Yu Tsao

    Abstract: It has been shown recently that deep learning based models are effective on speech quality prediction and could outperform traditional metrics in various perspectives. Although network models have potential to be a surrogate for complex human hearing perception, they may contain instabilities in predictions. This work shows that deep speech quality predictors can be vulnerable to adversarial pertu… ▽ More

    Submitted 11 November, 2022; originally announced November 2022.

  31. A Flexible-Frame-Rate Vision-Aided Inertial Object Tracking System for Mobile Devices

    Authors: Yo-Chung Lau, Kuan-Wei Tseng, I-Ju Hsieh, Hsiao-Ching Tseng, Yi-** Hung

    Abstract: Real-time object pose estimation and tracking is challenging but essential for emerging augmented reality (AR) applications. In general, state-of-the-art methods address this problem using deep neural networks which indeed yield satisfactory results. Nevertheless, the high computational cost of these methods makes them unsuitable for mobile devices where real-world applications usually take place.… ▽ More

    Submitted 22 October, 2022; originally announced October 2022.

  32. arXiv:2210.04400  [pdf

    cs.HC cs.AI

    Focus Plus: Detect Learner's Distraction by Web Camera in Distance Teaching

    Authors: Eason Chen, Yuen Hsien Tseng, Kuo-** Lo

    Abstract: Distance teaching has become popular these years because of the COVID-19 epidemic. However, both students and teachers face several challenges in distance teaching, like being easy to distract. We proposed Focus+, a system designed to detect learners' status with the latest AI technology from their web camera to solve such challenges. By doing so, teachers can know students' status, and students c… ▽ More

    Submitted 9 October, 2022; originally announced October 2022.

    Comments: 5 Pages, 4 Figures, 2021 National Chair Professorship Academic Series: Teaching and Learning in Pandemic Era

  33. arXiv:2208.12675  [pdf, other

    cs.CV

    Adaptively-Realistic Image Generation from Stroke and Sketch with Diffusion Model

    Authors: Shin-I Cheng, Yu-Jie Chen, Wei-Chen Chiu, Hung-Yu Tseng, Hsin-Ying Lee

    Abstract: Generating images from hand-drawings is a crucial and fundamental task in content creation. The translation is difficult as there exist infinite possibilities and the different users usually expect different outcomes. Therefore, we propose a unified framework supporting a three-dimensional control over the image synthesis from sketches and strokes based on diffusion models. Users can not only deci… ▽ More

    Submitted 1 September, 2022; v1 submitted 26 August, 2022; originally announced August 2022.

  34. arXiv:2208.07256  [pdf, other

    cs.RO cs.CV

    Multi-modal Transformer Path Prediction for Autonomous Vehicle

    Authors: Chia Hong Tseng, Jie Zhang, Min-Te Sun, Kazuya Sakai, Wei-Shinn Ku

    Abstract: Reasoning about vehicle path prediction is an essential and challenging problem for the safe operation of autonomous driving systems. There exist many research works for path prediction. However, most of them do not use lane information and are not based on the Transformer architecture. By utilizing different types of data collected from sensors equipped on the self-driving vehicles, we propose a… ▽ More

    Submitted 15 August, 2022; originally announced August 2022.

    Comments: 9 pages, 12 figures, and 5 tables

  35. arXiv:2208.03529  [pdf, other

    cs.RO math.OC

    Collision Avoidance for Dynamic Obstacles with Uncertain Predictions using Model Predictive Control

    Authors: Siddharth H. Nair, Eric H. Tseng, Francesco Borrelli

    Abstract: We propose a Model Predictive Control (MPC) for collision avoidance between an autonomous agent and dynamic obstacles with uncertain predictions. The collision avoidance constraints are imposed by enforcing positive distance between convex sets representing the agent and the obstacles, and tractably reformulating them using Lagrange duality. This approach allows for smooth collision avoidance cons… ▽ More

    Submitted 6 August, 2022; originally announced August 2022.

    Comments: Accepted to CDC'22

  36. arXiv:2207.13286  [pdf, other

    cs.CV

    Vector Quantized Image-to-Image Translation

    Authors: Yu-Jie Chen, Shin-I Cheng, Wei-Chen Chiu, Hung-Yu Tseng, Hsin-Ying Lee

    Abstract: Current image-to-image translation methods formulate the task with conditional generation models, leading to learning only the recolorization or regional changes as being constrained by the rich structural information provided by the conditional contexts. In this work, we propose introducing the vector quantization technique into the image-to-image translation framework. The vector quantized conte… ▽ More

    Submitted 27 July, 2022; originally announced July 2022.

  37. arXiv:2207.08240  [pdf, other

    eess.SY cs.AI cs.RO

    Robust Action Governor for Uncertain Piecewise Affine Systems with Non-convex Constraints and Safe Reinforcement Learning

    Authors: Yutong Li, Nan Li, H. Eric Tseng, Anouck Girard, Dimitar Filev, Ilya Kolmanovsky

    Abstract: The action governor is an add-on scheme to a nominal control loop that monitors and adjusts the control actions to enforce safety specifications expressed as pointwise-in-time state and control constraints. In this paper, we introduce the Robust Action Governor (RAG) for systems the dynamics of which can be represented using discrete-time Piecewise Affine (PWA) models with both parametric and addi… ▽ More

    Submitted 17 July, 2022; originally announced July 2022.

  38. arXiv:2206.01202  [pdf, other

    cs.CV cs.AI cs.LG

    Unveiling The Mask of Position-Information Pattern Through the Mist of Image Features

    Authors: Chieh Hubert Lin, Hsin-Ying Lee, Hung-Yu Tseng, Maneesh Singh, Ming-Hsuan Yang

    Abstract: Recent studies show that paddings in convolutional neural networks encode absolute position information which can negatively affect the model performance for certain tasks. However, existing metrics for quantifying the strength of positional information remain unreliable and frequently lead to erroneous results. To address this issue, we propose novel metrics for measuring (and visualizing) the en… ▽ More

    Submitted 2 June, 2022; originally announced June 2022.

  39. arXiv:2205.01252  [pdf, other

    cs.AR

    SIMD$^2$: A Generalized Matrix Instruction Set for Accelerating Tensor Computation beyond GEMM

    Authors: Yunan Zhang, Po-An Tsai, Hung-Wei Tseng

    Abstract: Matrix-multiplication units (MXUs) are now prevalent in every computing platform. The key attribute that makes MXUs so successful is the semiring structure, which allows tiling for both parallelism and data reuse. Nonetheless, matrix-multiplication is not the only algorithm with such attributes. We find that many algorithms share the same structure and differ in only the core operation; for exampl… ▽ More

    Submitted 31 August, 2022; v1 submitted 2 May, 2022; originally announced May 2022.

    Comments: To Appear in the 49th International Symposium on Computer Architecture (ISCA'22), June 18--22, 2022, New York, NY, USA

  40. arXiv:2112.07624  [pdf, other

    eess.SY cs.RO

    Interaction-Aware Trajectory Prediction and Planning for Autonomous Vehicles in Forced Merge Scenarios

    Authors: Kaiwen Liu, Nan Li, H. Eric Tseng, Ilya Kolmanovsky, Anouck Girard

    Abstract: Merging is, in general, a challenging task for both human drivers and autonomous vehicles, especially in dense traffic, because the merging vehicle typically needs to interact with other vehicles to identify or create a gap and safely merge into. In this paper, we consider the problem of autonomous vehicle control for forced merge scenarios. We propose a novel game-theoretic controller, called the… ▽ More

    Submitted 14 December, 2021; originally announced December 2021.

    Comments: 15 pages, 12 figures

  41. arXiv:2112.07552  [pdf, other

    cs.DB

    TCUDB: Accelerating Database with Tensor Processors

    Authors: Yu-Ching Hu, Yuliang Li, Hung-Wei Tseng

    Abstract: The emergence of novel hardware accelerators has powered the tremendous growth of machine learning in recent years. These accelerators deliver incomparable performance gains in processing high-volume matrix operators, particularly matrix multiplication, a core component of neural network training and inference. In this work, we explored opportunities of accelerating database systems using NVIDIA's… ▽ More

    Submitted 14 December, 2021; originally announced December 2021.

    Comments: 16 pages, 14 figures, to appear in the 2022 ACM SIGMOD/PODS International Conference on Management of Data (SIGMOD 2022)

  42. arXiv:2112.02538  [pdf, ps, other

    eess.AS cs.SD

    Toward Real-World Voice Disorder Classification

    Authors: Heng-Cheng Kuo, Yu-Peng Hsieh, Huan-Hsin Tseng, Chi-Te Wang, Shih-Hau Fang, Yu Tsao

    Abstract: Objective: Voice disorders significantly compromise individuals' ability to speak in their daily lives. Without early diagnosis and treatment, these disorders may deteriorate drastically. Thus, automatic classification systems at home are desirable for people who are inaccessible to clinical disease assessments. However, the performance of such systems may be weakened due to the constrained resour… ▽ More

    Submitted 26 April, 2023; v1 submitted 5 December, 2021; originally announced December 2021.

    Comments: Accepted by IEEE TBME (under an IEEE Open Access publishing Agreement)

  43. arXiv:2111.06316  [pdf, other

    cs.SD cs.LG eess.AS

    Unsupervised Noise Adaptive Speech Enhancement by Discriminator-Constrained Optimal Transport

    Authors: Hsin-Yi Lin, Huan-Hsin Tseng, Xugang Lu, Yu Tsao

    Abstract: This paper presents a novel discriminator-constrained optimal transport network (DOTN) that performs unsupervised domain adaptation for speech enhancement (SE), which is an essential regression task in speech processing. The DOTN aims to estimate clean references of noisy speech in a target domain, by exploiting the knowledge available from the source domain. The domain shift between training and… ▽ More

    Submitted 11 November, 2021; originally announced November 2021.

    Comments: Accepted at NeurIPS 2021

  44. arXiv:2109.09792  [pdf, other

    eess.SY cs.RO

    Stochastic MPC with Multi-modal Predictions for Traffic Intersections

    Authors: Siddharth H. Nair, Vijay Govindarajan, Theresa Lin, Chris Meissen, H. Eric Tseng, Francesco Borrelli

    Abstract: We propose a Stochastic MPC (SMPC) formulation for autonomous driving at traffic intersections which incorporates multi-modal predictions of surrounding vehicles for collision avoidance constraints. The multi-modal predictions are obtained with Gaussian Mixture Models (GMM) and constraints are formulated as chance-constraints. Our main theoretical contribution is a SMPC formulation that optimizes… ▽ More

    Submitted 25 February, 2022; v1 submitted 20 September, 2021; originally announced September 2021.

    Comments: Extended version of ITSC 2022 submission

  45. Improved Robustness and Safety for Pre-Adaptation of Meta Reinforcement Learning with Prior Regularization

    Authors: Lu Wen, Songan Zhang, H. Eric Tseng, Baljeet Singh, Dimitar Filev, Huei Peng

    Abstract: Meta Reinforcement Learning (Meta-RL) has seen substantial advancements recently. In particular, off-policy methods were developed to improve the data efficiency of Meta-RL techniques. \textit{Probabilistic embeddings for actor-critic RL} (PEARL) is a leading approach for multi-MDP adaptation problems. A major drawback of many existing Meta-RL methods, including PEARL, is that they do not explicit… ▽ More

    Submitted 9 February, 2023; v1 submitted 18 August, 2021; originally announced August 2021.

  46. arXiv:2107.05473  [pdf, other

    cs.DC cs.PF eess.SP

    GPTPU: Accelerating Applications using Edge Tensor Processing Units

    Authors: Kuan-Chieh Hsu, Hung-Wei Tseng

    Abstract: Neural network (NN) accelerators have been integrated into a wide-spectrum of computer systems to accommodate the rapidly growing demands for artificial intelligence (AI) and machine learning (ML) applications. NN accelerators share the idea of providing native hardware support for operations on multidimensional tensor data. Therefore, NN accelerators are theoretically tensor processors that can i… ▽ More

    Submitted 13 July, 2021; v1 submitted 22 June, 2021; originally announced July 2021.

    Comments: This paper is a pre-print of a paper in the 2021 SC, the International Conference for High Performance Computing, Networking, Storage and Analysis

  47. arXiv:2106.06191  [pdf, other

    cs.ET cond-mat.soft physics.app-ph q-bio.NC

    Structural evolution and on-demand growth of artificial synapses via field-directed polymerization

    Authors: Matteo Cucchi, Hans Kleemann, Hsin Tseng, Alexander Lee, Karl Leo

    Abstract: Interconnectivity, fault tolerance, and dynamic evolution of the circuitry are long sought-after objectives of bio-inspired engineering. Here, we propose dendritic transistors composed of organic semiconductors as building blocks for neuromorphic computing. These devices, owning to their voltage-triggered growth and resemblance to neural structures, respond to action potentials to achieve complex… ▽ More

    Submitted 11 June, 2021; originally announced June 2021.

  48. arXiv:2106.03719  [pdf, other

    cs.CV

    Incremental False Negative Detection for Contrastive Learning

    Authors: Tsai-Shien Chen, Wei-Chih Hung, Hung-Yu Tseng, Shao-Yi Chien, Ming-Hsuan Yang

    Abstract: Self-supervised learning has recently shown great potential in vision tasks through contrastive learning, which aims to discriminate each image, or instance, in the dataset. However, such instance-level learning ignores the semantic relationship among instances and sometimes undesirably repels the anchor from the semantically similar samples, termed as "false negatives". In this work, we show that… ▽ More

    Submitted 16 March, 2022; v1 submitted 7 June, 2021; originally announced June 2021.

    Comments: ICLR 2022

  49. arXiv:2105.13509  [pdf, other

    cs.CV

    Learning to Stylize Novel Views

    Authors: Hsin-** Huang, Hung-Yu Tseng, Saurabh Saini, Maneesh Singh, Ming-Hsuan Yang

    Abstract: We tackle a 3D scene stylization problem - generating stylized images of a scene from arbitrary novel views given a set of images of the same scene and a reference image of the desired style as inputs. Direct solution of combining novel view synthesis and stylization approaches lead to results that are blurry or not consistent across different views. We propose a point cloud-based method for consi… ▽ More

    Submitted 15 September, 2021; v1 submitted 27 May, 2021; originally announced May 2021.

    Comments: Project page: https://hhsin**.github.io/3d_scene_stylization/ Code: https://github.com/hhsin**/stylescene

  50. arXiv:2105.13016  [pdf, other

    cs.CV

    Stylizing 3D Scene via Implicit Representation and HyperNetwork

    Authors: Pei-Ze Chiang, Meng-Shiun Tsai, Hung-Yu Tseng, Wei-sheng Lai, Wei-Chen Chiu

    Abstract: In this work, we aim to address the 3D scene stylization problem - generating stylized images of the scene at arbitrary novel view angles. A straightforward solution is to combine existing novel view synthesis and image/video style transfer approaches, which often leads to blurry results or inconsistent appearance. Inspired by the high-quality results of the neural radiance fields (NeRF) method, w… ▽ More

    Submitted 16 January, 2022; v1 submitted 27 May, 2021; originally announced May 2021.

    Comments: Accepted to WACV2022; Project page: https://ztex08010518.github.io/3dstyletransfer/