Skip to main content

Showing 1–38 of 38 results for author: Liao, W

Searching in archive eess. Search in all archives.
.
  1. arXiv:2406.15751  [pdf, other

    cs.SD eess.AS

    Improving Unsupervised Clean-to-Rendered Guitar Tone Transformation Using GANs and Integrated Unaligned Clean Data

    Authors: Yu-Hua Chen, Woosung Choi, Wei-Hsiang Liao, Marco Martínez-Ramírez, Kin Wai Cheuk, Yuki Mitsufuji, Jyh-Shing Roger Jang, Yi-Hsuan Yang

    Abstract: Recent years have seen increasing interest in applying deep learning methods to the modeling of guitar amplifiers or effect pedals. Existing methods are mainly based on the supervised approach, requiring temporally-aligned data pairs of unprocessed and rendered audio. However, this approach does not scale well, due to the complicated process involved in creating the data pairs. A very recent work… ▽ More

    Submitted 22 June, 2024; originally announced June 2024.

    Comments: Accepted to DAFx 2024

  2. arXiv:2406.13674  [pdf, other

    eess.IV cs.CV

    Rethinking Abdominal Organ Segmentation (RAOS) in the clinical scenario: A robustness evaluation benchmark with challenging cases

    Authors: Xiangde Luo, Zihan Li, Shaoting Zhang, Wenjun Liao, Guotai Wang

    Abstract: Deep learning has enabled great strides in abdominal multi-organ segmentation, even surpassing junior oncologists on common cases or organs. However, robustness on corner cases and complex organs remains a challenging open problem for clinical adoption. To investigate model robustness, we collected and annotated the RAOS dataset comprising 413 CT scans ($\sim$80k 2D images, $\sim$8k 3D organ annot… ▽ More

    Submitted 19 June, 2024; originally announced June 2024.

    Comments: 10 pages, 1 figure, 6 tables, Early Accept to MICCAI 2024

  3. arXiv:2406.03822  [pdf, other

    cs.SD cs.CR eess.AS

    SilentCipher: Deep Audio Watermarking

    Authors: Mayank Kumar Singh, Naoya Takahashi, Weihsiang Liao, Yuki Mitsufuji

    Abstract: In the realm of audio watermarking, it is challenging to simultaneously encode imperceptible messages while enhancing the message capacity and robustness. Although recent advancements in deep learning-based methods bolster the message capacity and robustness over traditional methods, the encoded messages introduce audible artefacts that restricts their usage in professional settings. In this study… ▽ More

    Submitted 6 June, 2024; originally announced June 2024.

  4. arXiv:2405.18386  [pdf, other

    cs.SD cs.AI cs.LG cs.MM eess.AS

    Instruct-MusicGen: Unlocking Text-to-Music Editing for Music Language Models via Instruction Tuning

    Authors: Yixiao Zhang, Yukara Ikemiya, Woosung Choi, Naoki Murata, Marco A. Martínez-Ramírez, Liwei Lin, Gus Xia, Wei-Hsiang Liao, Yuki Mitsufuji, Simon Dixon

    Abstract: Recent advances in text-to-music editing, which employ text queries to modify music (e.g.\ by changing its style or adjusting instrumental components), present unique challenges and opportunities for AI-assisted music creation. Previous approaches in this domain have been constrained by the necessity to train specific editing models from scratch, which is both resource-intensive and inefficient; o… ▽ More

    Submitted 29 May, 2024; v1 submitted 28 May, 2024; originally announced May 2024.

    Comments: Code and demo are available at: https://github.com/ldzhangyx/instruct-musicgen

  5. arXiv:2403.10024  [pdf, other

    cs.SD cs.AI cs.LG cs.MM eess.AS

    MR-MT3: Memory Retaining Multi-Track Music Transcription to Mitigate Instrument Leakage

    Authors: Hao Hao Tan, Kin Wai Cheuk, Taemin Cho, Wei-Hsiang Liao, Yuki Mitsufuji

    Abstract: This paper presents enhancements to the MT3 model, a state-of-the-art (SOTA) token-based multi-instrument automatic music transcription (AMT) model. Despite SOTA performance, MT3 has the issue of instrument leakage, where transcriptions are fragmented across different instruments. To mitigate this, we propose MR-MT3, with enhancements including a memory retention mechanism, prior token sampling, a… ▽ More

    Submitted 15 March, 2024; originally announced March 2024.

  6. arXiv:2403.05246  [pdf, other

    eess.IV cs.CV

    LightM-UNet: Mamba Assists in Lightweight UNet for Medical Image Segmentation

    Authors: Weibin Liao, Yinghao Zhu, Xinyuan Wang, Chengwei Pan, Yasha Wang, Liantao Ma

    Abstract: UNet and its variants have been widely used in medical image segmentation. However, these models, especially those based on Transformer architectures, pose challenges due to their large number of parameters and computational loads, making them unsuitable for mobile health applications. Recently, State Space Models (SSMs), exemplified by Mamba, have emerged as competitive alternatives to CNN and Tr… ▽ More

    Submitted 11 March, 2024; v1 submitted 8 March, 2024; originally announced March 2024.

  7. arXiv:2402.06178  [pdf, other

    cs.SD cs.AI cs.MM eess.AS

    MusicMagus: Zero-Shot Text-to-Music Editing via Diffusion Models

    Authors: Yixiao Zhang, Yukara Ikemiya, Gus Xia, Naoki Murata, Marco A. Martínez-Ramírez, Wei-Hsiang Liao, Yuki Mitsufuji, Simon Dixon

    Abstract: Recent advances in text-to-music generation models have opened new avenues in musical creativity. However, music generation usually involves iterative refinements, and how to edit the generated music remains a significant challenge. This paper introduces a novel approach to the editing of music generated by such models, enabling the modification of specific attributes, such as genre, mood and inst… ▽ More

    Submitted 28 May, 2024; v1 submitted 8 February, 2024; originally announced February 2024.

    Comments: Accepted to IJCAI 2024

  8. arXiv:2312.15741  [pdf

    eess.SY cs.LG

    Improving the Accuracy and Interpretability of Neural Networks for Wind Power Forecasting

    Authors: Wenlong Liao, Fernando Porte-Agel, Jiannong Fang, Birgitte Bak-Jensen, Zhe Yang, Gonghao Zhang

    Abstract: Deep neural networks (DNNs) are receiving increasing attention in wind power forecasting due to their ability to effectively capture complex patterns in wind data. However, their forecasted errors are severely limited by the local optimal weight issue in optimization algorithms, and their forecasted behavior also lacks interpretability. To address these two challenges, this paper firstly proposes… ▽ More

    Submitted 25 December, 2023; originally announced December 2023.

    Comments: 10 pages, 10 figures

  9. arXiv:2312.09576  [pdf, other

    eess.IV cs.CV

    SegRap2023: A Benchmark of Organs-at-Risk and Gross Tumor Volume Segmentation for Radiotherapy Planning of Nasopharyngeal Carcinoma

    Authors: Xiangde Luo, Jia Fu, Yunxin Zhong, Shuolin Liu, Bing Han, Mehdi Astaraki, Simone Bendazzoli, Iuliana Toma-Dasu, Yiwen Ye, Ziyang Chen, Yong Xia, Yanzhou Su, ** Ye, Junjun He, Zhaohu Xing, Hongqiu Wang, Lei Zhu, Kaixiang Yang, Xin Fang, Zhiwei Wang, Chan Woong Lee, Sang Joon Park, Jaehee Chun, Constantin Ulrich, Klaus H. Maier-Hein , et al. (17 additional authors not shown)

    Abstract: Radiation therapy is a primary and effective NasoPharyngeal Carcinoma (NPC) treatment strategy. The precise delineation of Gross Tumor Volumes (GTVs) and Organs-At-Risk (OARs) is crucial in radiation treatment, directly impacting patient prognosis. Previously, the delineation of GTVs and OARs was performed by experienced radiation oncologists. Recently, deep learning has achieved promising results… ▽ More

    Submitted 15 December, 2023; originally announced December 2023.

    Comments: A challenge report of SegRap2023 (organized in conjunction with MICCAI2023)

  10. arXiv:2311.03863  [pdf

    eess.SY cs.LG

    An Explainable Framework for Machine learning-Based Reactive Power Optimization of Distribution Network

    Authors: Wenlong Liao, Benjamin Schäfer, Dalin Qin, Gonghao Zhang, Zhixian Wang, Zhe Yang

    Abstract: To reduce the heavy computational burden of reactive power optimization of distribution networks, machine learning models are receiving increasing attention. However, most machine learning models (e.g., neural networks) are usually considered as black boxes, making it challenging for power system operators to identify and comprehend potential biases or errors in the decision-making process of mach… ▽ More

    Submitted 7 November, 2023; originally announced November 2023.

    Comments: It was submitted to the 23rd Power Systems Computation Conference (PSCC 2024) on Sept.2023

  11. arXiv:2310.18629  [pdf

    cs.LG eess.SY

    Explainable Modeling for Wind Power Forecasting: A Glass-Box Approach with High Accuracy

    Authors: Wenlong Liao, Fernando Porte-Agel, Jiannong Fang, Birgitte Bak-Jensen, Guangchun Ruan, Zhe Yang

    Abstract: Machine learning models (e.g., neural networks) achieve high accuracy in wind power forecasting, but they are usually regarded as black boxes that lack interpretability. To address this issue, the paper proposes a glass-box approach that combines high accuracy with transparency for wind power forecasting. Specifically, the core is to sum up the feature effects by constructing shape functions, whic… ▽ More

    Submitted 26 February, 2024; v1 submitted 28 October, 2023; originally announced October 2023.

  12. arXiv:2310.13267  [pdf, other

    cs.CL cs.CV cs.LG cs.SD eess.AS

    On the Language Encoder of Contrastive Cross-modal Models

    Authors: Mengjie Zhao, Junya Ono, Zhi Zhong, Chieh-Hsin Lai, Yuhta Takida, Naoki Murata, Wei-Hsiang Liao, Takashi Shibuya, Hiromi Wakaki, Yuki Mitsufuji

    Abstract: Contrastive cross-modal models such as CLIP and CLAP aid various vision-language (VL) and audio-language (AL) tasks. However, there has been limited investigation of and improvement in their language encoder, which is the central component of encoding natural language descriptions of image/audio into vector representations. We extensively evaluate how unsupervised and supervised sentence embedding… ▽ More

    Submitted 20 October, 2023; originally announced October 2023.

  13. arXiv:2309.15717  [pdf, other

    eess.AS cs.LG cs.SD

    Timbre-Trap: A Low-Resource Framework for Instrument-Agnostic Music Transcription

    Authors: Frank Cwitkowitz, Kin Wai Cheuk, Woosung Choi, Marco A. Martínez-Ramírez, Keisuke Toyama, Wei-Hsiang Liao, Yuki Mitsufuji

    Abstract: In recent years, research on music transcription has focused mainly on architecture design and instrument-specific data acquisition. With the lack of availability of diverse datasets, progress is often limited to solo-instrument tasks such as piano transcription. Several works have explored multi-instrument transcription as a means to bolster the performance of models on low-resource tasks, but th… ▽ More

    Submitted 24 January, 2024; v1 submitted 27 September, 2023; originally announced September 2023.

    Comments: Accepted to ICASSP 2024

  14. arXiv:2309.06934  [pdf, other

    eess.AS cs.SD

    VRDMG: Vocal Restoration via Diffusion Posterior Sampling with Multiple Guidance

    Authors: Carlos Hernandez-Olivan, Koichi Saito, Naoki Murata, Chieh-Hsin Lai, Marco A. Martínez-Ramirez, Wei-Hsiang Liao, Yuki Mitsufuji

    Abstract: Restoring degraded music signals is essential to enhance audio quality for downstream music manipulation. Recent diffusion-based music restoration methods have demonstrated impressive performance, and among them, diffusion posterior sampling (DPS) stands out given its intrinsic properties, making it versatile across various restoration tasks. In this paper, we identify that there are potential iss… ▽ More

    Submitted 13 September, 2023; originally announced September 2023.

  15. arXiv:2308.06979  [pdf, other

    eess.AS cs.SD

    The Sound Demixing Challenge 2023 $\unicode{x2013}$ Music Demixing Track

    Authors: Giorgio Fabbro, Stefan Uhlich, Chieh-Hsin Lai, Woosung Choi, Marco Martínez-Ramírez, Weihsiang Liao, Igor Gadelha, Geraldo Ramos, Eddie Hsu, Hugo Rodrigues, Fabian-Robert Stöter, Alexandre Défossez, Yi Luo, Jianwei Yu, Dipam Chakraborty, Sharada Mohanty, Roman Solovyev, Alexander Stempkovskiy, Tatiana Habruseva, Nabarun Goswami, Tatsuya Harada, Minseok Kim, Jun Hyung Lee, Yuanliang Dong, Xinran Zhang , et al. (2 additional authors not shown)

    Abstract: This paper summarizes the music demixing (MDX) track of the Sound Demixing Challenge (SDX'23). We provide a summary of the challenge setup and introduce the task of robust music source separation (MSS), i.e., training MSS models in the presence of errors in the training data. We propose a formalization of the errors that can occur in the design of a training dataset for MSS systems and introduce t… ▽ More

    Submitted 19 April, 2024; v1 submitted 14 August, 2023; originally announced August 2023.

    Comments: Published in Transactions of the International Society for Music Information Retrieval (https://transactions.ismir.net/articles/10.5334/tismir.171)

    Journal ref: Transactions of the International Society for Music Information Retrieval, 7(1), pp.63-84, 2024

  16. arXiv:2308.05305  [pdf, other

    eess.IV cs.CV cs.LG

    From CNN to Transformer: A Review of Medical Image Segmentation Models

    Authors: Wenjian Yao, Jiajun Bai, Wei Liao, Yuheng Chen, Mengjuan Liu, Yao Xie

    Abstract: Medical image segmentation is an important step in medical image analysis, especially as a crucial prerequisite for efficient disease diagnosis and treatment. The use of deep learning for image segmentation has become a prevalent trend. The widely adopted approach currently is U-Net and its variants. Additionally, with the remarkable success of pre-trained models in natural language processing tas… ▽ More

    Submitted 9 August, 2023; originally announced August 2023.

    Comments: 18 pages, 8 figures

  17. arXiv:2307.04305  [pdf, other

    cs.SD cs.LG eess.AS

    Automatic Piano Transcription with Hierarchical Frequency-Time Transformer

    Authors: Keisuke Toyama, Taketo Akama, Yukara Ikemiya, Yuhta Takida, Wei-Hsiang Liao, Yuki Mitsufuji

    Abstract: Taking long-term spectral and temporal dependencies into account is essential for automatic piano transcription. This is especially helpful when determining the precise onset and offset for each note in the polyphonic piano content. In this case, we may rely on the capability of self-attention mechanism in Transformers to capture these long-term dependencies in the frequency and time axes. In this… ▽ More

    Submitted 9 July, 2023; originally announced July 2023.

    Comments: 8 pages, 6 figures, to be published in ISMIR2023

  18. arXiv:2307.02514  [pdf, other

    eess.AS cs.AI cs.SD

    Exploring Multimodal Approaches for Alzheimer's Disease Detection Using Patient Speech Transcript and Audio Data

    Authors: Hongmin Cai, Xiaoke Huang, Zhengliang Liu, Wenxiong Liao, Haixing Dai, Zihao Wu, Dajiang Zhu, Hui Ren, Quanzheng Li, Tianming Liu, Xiang Li

    Abstract: Alzheimer's disease (AD) is a common form of dementia that severely impacts patient health. As AD impairs the patient's language understanding and expression ability, the speech of AD patients can serve as an indicator of this disease. This study investigates various methods for detecting AD using patients' speech and transcripts data from the DementiaBank Pitt database. The proposed approach invo… ▽ More

    Submitted 5 July, 2023; originally announced July 2023.

  19. arXiv:2304.09100  [pdf, other

    cs.LG eess.SY

    Real Time Bearing Fault Diagnosis Based on Convolutional Neural Network and STM32 Microcontroller

    Authors: Wenhao Liao

    Abstract: With the rapid development of big data and edge computing, many researchers focus on improving the accuracy of bearing fault classification using deep learning models, and implementing the deep learning classification model on limited resource platforms such as STM32. To this end, this paper realizes the identification of bearing fault vibration signal based on convolutional neural network, the fa… ▽ More

    Submitted 14 April, 2023; originally announced April 2023.

    Comments: 6 pages, 9 figures

  20. arXiv:2211.02247  [pdf, other

    eess.AS cs.LG cs.SD

    Music Mixing Style Transfer: A Contrastive Learning Approach to Disentangle Audio Effects

    Authors: Junghyun Koo, Marco A. Martínez-Ramírez, Wei-Hsiang Liao, Stefan Uhlich, Kyogu Lee, Yuki Mitsufuji

    Abstract: We propose an end-to-end music mixing style transfer system that converts the mixing style of an input multitrack to that of a reference song. This is achieved with an encoder pre-trained with a contrastive objective to extract only audio effects related information from a reference music recording. All our models are trained in a self-supervised manner from an already-processed wet multitrack dat… ▽ More

    Submitted 11 April, 2023; v1 submitted 3 November, 2022; originally announced November 2022.

  21. arXiv:2208.11428  [pdf, other

    eess.AS cs.LG cs.SD eess.SP

    Automatic music mixing with deep learning and out-of-domain data

    Authors: Marco A. Martínez-Ramírez, Wei-Hsiang Liao, Giorgio Fabbro, Stefan Uhlich, Chihiro Nagashima, Yuki Mitsufuji

    Abstract: Music mixing traditionally involves recording instruments in the form of clean, individual tracks and blending them into a final mixture using audio effects and expert knowledge (e.g., a mixing engineer). The automation of music production tasks has become an emerging field in recent years, where rule-based methods and machine learning approaches have been explored. Nevertheless, the lack of dry o… ▽ More

    Submitted 29 August, 2022; v1 submitted 24 August, 2022; originally announced August 2022.

    Comments: 23rd International Society for Music Information Retrieval Conference (ISMIR), December, 2022. Source code, demo and audio examples: https://marco-martinez-sony.github.io/FxNorm-automix/ - added acknowledgements

  22. arXiv:2203.02106  [pdf, other

    eess.IV cs.CV

    Scribble-Supervised Medical Image Segmentation via Dual-Branch Network and Dynamically Mixed Pseudo Labels Supervision

    Authors: Xiangde Luo, Minhao Hu, Wenjun Liao, Shuwei Zhai, Tao Song, Guotai Wang, Shaoting Zhang

    Abstract: Medical image segmentation plays an irreplaceable role in computer-assisted diagnosis, treatment planning, and following-up. Collecting and annotating a large-scale dataset is crucial to training a powerful segmentation model, but producing high-quality segmentation masks is an expensive and time-consuming procedure. Recently, weakly-supervised learning that uses sparse annotations (points, scribb… ▽ More

    Submitted 3 March, 2022; originally announced March 2022.

    Comments: 11 pages, 4 figures,code is available: https://github.com/HiLab-git/WSL4MIS.This is a comprehensive study about scribble-supervised medical image segmentation based on the ACDC dataset

  23. arXiv:2111.07958  [pdf

    eess.SY cs.LG

    Short-Term Power Prediction for Renewable Energy Using Hybrid Graph Convolutional Network and Long Short-Term Memory Approach

    Authors: Wenlong Liao, Birgitte Bak-Jensen, Jayakrishnan Radhakrishna Pillai, Zhe Yang, Kuangpu Liu

    Abstract: Accurate short-term solar and wind power predictions play an important role in the planning and operation of power systems. However, the short-term power prediction of renewable energy has always been considered a complex regression problem, owing to the fluctuation and intermittence of output powers and the law of dynamic change with time due to local weather conditions, i.e. spatio-temporal corr… ▽ More

    Submitted 7 February, 2022; v1 submitted 15 November, 2021; originally announced November 2021.

    Comments: This paper was accepted the 22nd Power Systems Computation Conference (PSCC 2022)

  24. WORD: A large scale dataset, benchmark and clinical applicable study for abdominal organ segmentation from CT image

    Authors: Xiangde Luo, Wenjun Liao, Jianghong Xiao, Jieneng Chen, Tao Song, Xiaofan Zhang, Kang Li, Dimitris N. Metaxas, Guotai Wang, Shaoting Zhang

    Abstract: Whole abdominal organ segmentation is important in diagnosing abdomen lesions, radiotherapy, and follow-up. However, oncologists' delineating all abdominal organs from 3D volumes is time-consuming and very expensive. Deep learning-based medical image segmentation has shown the potential to reduce manual delineation efforts, but it still requires a large-scale fine annotated dataset for training, a… ▽ More

    Submitted 12 February, 2023; v1 submitted 2 November, 2021; originally announced November 2021.

    Comments: Accepted to Medical Image Analysis, dataset at: https://github.com/HiLab-git/WORD (we corrected the results or description in this version.)

  25. arXiv:2110.06525  [pdf, other

    cs.SD cs.LG eess.AS

    Automatic DJ Transitions with Differentiable Audio Effects and Generative Adversarial Networks

    Authors: Bo-Yu Chen, Wei-Han Hsu, Wei-Hsiang Liao, Marco A. Martínez Ramírez, Yuki Mitsufuji, Yi-Hsuan Yang

    Abstract: A central task of a Disc Jockey (DJ) is to create a mixset of mu-sic with seamless transitions between adjacent tracks. In this paper, we explore a data-driven approach that uses a generative adversarial network to create the song transition by learning from real-world DJ mixes. In particular, the generator of the model uses two differentiable digital signal processing components, an equalizer (EQ… ▽ More

    Submitted 17 February, 2022; v1 submitted 13 October, 2021; originally announced October 2021.

    Comments: To be published at ICASSP 2022

  26. arXiv:2108.03386  [pdf, other

    eess.SY

    Probabilistic Reach-Avoid Reachability in Nondeterministic Systems with Time-VaryingTargets and Obstacles

    Authors: Wei Liao, Taotao Liang, Xiaohui Wei, Qiaozhi Yin

    Abstract: The probabilistic reachability problems of nondeterministic systems are studied. Based on the existing studies, the definition of probabilistic reachable sets is generalized by taking into account time-varying target set and obstacle. A numerical method is proposed to compute probabilistic reachable sets. First, a scalar function in the state space is constructed by backward recursion and grid int… ▽ More

    Submitted 7 August, 2021; originally announced August 2021.

    Comments: 12 pages, 5 figures

  27. arXiv:2107.11941  [pdf, other

    eess.SY math.OC

    Computation of Reachable Sets Based on Hamilton-Jacobi-Bellman Equation with Running Cost Function

    Authors: Weiwei Liao, Tao Liang

    Abstract: A novel method for computing reachable sets is proposed in this paper. In the proposed method, a Hamilton-Jacobi-Bellman equation with running cost functionis numerically solved and the reachable sets of different time horizons are characterized by a family of non-zero level sets of the solution of the Hamilton-Jacobi-Bellman equation. In addition to the classical reachable set, by setting differe… ▽ More

    Submitted 16 May, 2022; v1 submitted 25 July, 2021; originally announced July 2021.

  28. Stability and Super-resolution of MUSIC and ESPRIT for Multi-snapshot Spectral Estimation

    Authors: Weilin Li, Zengying Zhu, Weiguo Gao, Wen**g Liao

    Abstract: This paper studies the spectral estimation problem of estimating the locations of a fixed number of point sources given multiple snapshots of Fourier measurements collected by a uniform array of sensors. We prove novel stability bounds for MUSIC and ESPRIT as a function of the noise standard deviation, number of snapshots, source amplitudes, and support. Our most general result is a perturbation b… ▽ More

    Submitted 22 September, 2022; v1 submitted 29 May, 2021; originally announced May 2021.

    Comments: 16 pages

  29. arXiv:2104.07200  [pdf, other

    eess.SY

    A Novel Unified Framework for Solving Reachability, Viability and Invariance Problems

    Authors: Wei Liao, Taotao Liang, Xiaohui Wei, Jizhou Lai

    Abstract: The level set method is a widely used tool for solving reachability and invariance problems. However, some shortcomings, such as the difficulties of handling dissipation function and constructing terminal conditions for solving the Hamilton-Jacobi partial differential equation, limit the application of the level set method in some problems with non-affine nonlinear systems and irregular target set… ▽ More

    Submitted 29 November, 2021; v1 submitted 14 April, 2021; originally announced April 2021.

    Comments: arXiv admin note: text overlap with arXiv:2101.09646

  30. arXiv:2102.03360  [pdf

    eess.SY

    Scenario Generation for Cooling, Heating, and Power Loads Using Generative Moment Matching Networks

    Authors: Wenlong Liao, Yusen Wang, Yuelong Wang, Kody Powell, Qi Liu, Zhe Yang

    Abstract: Scenario generations of cooling, heating, and power loads are of great significance for the economic operation and stability analysis of integrated energy systems. In this paper, a novel deep generative network is proposed to model cooling, heating, and power load curves based on a generative moment matching networks (GMMN) where an auto-encoder transforms high-dimensional load curves into low-dim… ▽ More

    Submitted 21 April, 2022; v1 submitted 5 February, 2021; originally announced February 2021.

    Comments: This paper was accepted by CSEE Journal of Power and Energy Systems in June 2021

  31. arXiv:2101.10025  [pdf

    cs.LG cs.AI eess.SY

    A Review of Graph Neural Networks and Their Applications in Power Systems

    Authors: Wenlong Liao, Birgitte Bak-Jensen, Jayakrishnan Radhakrishna Pillai, Yuelong Wang, Yusen Wang

    Abstract: Deep neural networks have revolutionized many machine learning tasks in power systems, ranging from pattern recognition to signal processing. The data in these tasks is typically represented in Euclidean domains. Nevertheless, there is an increasing number of applications in power systems, where data are collected from non-Euclidean domains and represented as graph-structured data with high dimens… ▽ More

    Submitted 12 June, 2021; v1 submitted 25 January, 2021; originally announced January 2021.

  32. arXiv:2101.09646  [pdf, other

    eess.SY

    An Improved Level Set Method for Reachability Problems in Differential Games

    Authors: Wei Liao, Taotao Liang, Pengwen Xiong, Chen Wang, Aiguo Song, Peter X. Liu

    Abstract: This study focuses on reachability problems in differential games. An improved level set method for computing reachable tubes is proposed in this paper. The reachable tube is described as a sublevel set of a value function, which is the viscosity solution of a Hamilton-Jacobi equation with running cost. We generalize the concept of reachable tubes and propose a new class of reachable tubes, which… ▽ More

    Submitted 16 May, 2022; v1 submitted 23 January, 2021; originally announced January 2021.

    Comments: 9 pages, 13 figures

  33. arXiv:2011.06209  [pdf, other

    eess.SY

    Recursive Regret Matching: A General Method for Solving Time-invariant Nonlinear Zero-sum Differential Games

    Authors: Wei Liao, Xiaohui Wei, Jizhou Lai

    Abstract: In this paper, a new method is proposed to compute the rolling Nash equilibrium of the time-invariant nonlinear two-person zero-sum differential games. The idea is to discretize the time to transform a differential game into a sequential game with several steps, and by introducing state-value function, transform the sequential game into a recursion consisting of several normal-form games, finally,… ▽ More

    Submitted 12 November, 2020; originally announced November 2020.

    Comments: 18 pages, 9 figures

    MSC Class: 91-08; 93-08

  34. arXiv:2009.13148  [pdf, other

    eess.IV cs.CV

    Automated Pancreas Segmentation Using Multi-institutional Collaborative Deep Learning

    Authors: Pochuan Wang, Chen Shen, Holger R. Roth, Dong Yang, Daguang Xu, Masahiro Oda, Kazunari Misawa, Po-Ting Chen, Kao-Lang Liu, Wei-Chih Liao, Weichung Wang, Kensaku Mori

    Abstract: The performance of deep learning-based methods strongly relies on the number of datasets used for training. Many efforts have been made to increase the data in the medical image analysis field. However, unlike photography images, it is hard to generate centralized databases to collect medical images because of numerous technical, legal, and privacy issues. In this work, we study the use of federat… ▽ More

    Submitted 28 September, 2020; originally announced September 2020.

    Comments: Accepted by MICCAI DCL Workshop 2020

  35. Coupled Convolutional Neural Network with Adaptive Response Function Learning for Unsupervised Hyperspectral Super-Resolution

    Authors: Ke Zheng, Lianru Gao, Wenzhi Liao, Danfeng Hong, Bing Zhang, Ximin Cui, Jocelyn Chanussot

    Abstract: Due to the limitations of hyperspectral imaging systems, hyperspectral imagery (HSI) often suffers from poor spatial resolution, thus hampering many applications of the imagery. Hyperspectral super-resolution refers to fusing HSI and MSI to generate an image with both high spatial and high spectral resolutions. Recently, several new methods have been proposed to solve this fusion problem, and most… ▽ More

    Submitted 28 July, 2020; originally announced July 2020.

    Journal ref: IEEE Transactions on Geoscience and Remote Sensing,2020

  36. arXiv:2004.13316  [pdf, other

    cs.CV cs.LG eess.IV

    SCRDet++: Detecting Small, Cluttered and Rotated Objects via Instance-Level Feature Denoising and Rotation Loss Smoothing

    Authors: Xue Yang, Junchi Yan, Wenlong Liao, Xiaokang Yang, ** Tang, Tao He

    Abstract: Small and cluttered objects are common in real-world which are challenging for detection. The difficulty is further pronounced when the objects are rotated, as traditional detectors often routinely locate the objects in horizontal bounding box such that the region of interest is contaminated with background or nearby interleaved objects. In this paper, we first innovatively introduce the idea of d… ▽ More

    Submitted 28 April, 2022; v1 submitted 28 April, 2020; originally announced April 2020.

    Comments: 15 pages, 12 figures, 11 tables, accepted by TPAMI

  37. arXiv:1509.02604  [pdf, ps, other

    cs.DC cs.LG eess.SY

    Asynchronous Distributed ADMM for Large-Scale Optimization- Part II: Linear Convergence Analysis and Numerical Performance

    Authors: Tsung-Hui Chang, Wei-Cheng Liao, Mingyi Hong, Xiangfeng Wang

    Abstract: The alternating direction method of multipliers (ADMM) has been recognized as a versatile approach for solving modern large-scale machine learning and signal processing problems efficiently. When the data size and/or the problem dimension is large, a distributed version of ADMM can be used, which is capable of distributing the computation load and the data set to a network of computing nodes. Unfo… ▽ More

    Submitted 8 September, 2015; originally announced September 2015.

    Comments: submitted for publication, 28 pages

  38. arXiv:1509.02597  [pdf, ps, other

    cs.DC cs.LG eess.SY

    Asynchronous Distributed ADMM for Large-Scale Optimization- Part I: Algorithm and Convergence Analysis

    Authors: Tsung-Hui Chang, Mingyi Hong, Wei-Cheng Liao, Xiangfeng Wang

    Abstract: Aiming at solving large-scale learning problems, this paper studies distributed optimization methods based on the alternating direction method of multipliers (ADMM). By formulating the learning problem as a consensus problem, the ADMM can be used to solve the consensus problem in a fully parallel fashion over a computer network with a star topology. However, traditional synchronized computation do… ▽ More

    Submitted 19 February, 2016; v1 submitted 8 September, 2015; originally announced September 2015.

    Comments: 37 pages