Skip to main content

Showing 1–50 of 57 results for author: Qi, J

Searching in archive eess. Search in all archives.
.
  1. arXiv:2311.03557  [pdf, other

    cs.LG cs.CV eess.IV

    Spatio-Temporal Similarity Measure based Multi-Task Learning for Predicting Alzheimer's Disease Progression using MRI Data

    Authors: Xulong Wang, Yu Zhang, Menghui Zhou, Tong Liu, Jun Qi, Po Yang

    Abstract: Identifying and utilising various biomarkers for tracking Alzheimer's disease (AD) progression have received many recent attentions and enable hel** clinicians make the prompt decisions. Traditional progression models focus on extracting morphological biomarkers in regions of interest (ROIs) from MRI/PET images, such as regional average cortical thickness and regional volume. They are effective… ▽ More

    Submitted 6 November, 2023; originally announced November 2023.

  2. arXiv:2307.11436  [pdf, other

    math.OC cs.LG eess.SY math.AP

    Neural Operators for PDE Backstep** Control of First-Order Hyperbolic PIDE with Recycle and Delay

    Authors: Jie Qi, **g Zhang, Miroslav Krstic

    Abstract: The recently introduced DeepONet operator-learning framework for PDE control is extended from the results for basic hyperbolic and parabolic PDEs to an advanced hyperbolic class that involves delays on both the state and the system output or input. The PDE backstep** design produces gain functions that are outputs of a nonlinear operator, map** functions on a spatial domain into functions on a… ▽ More

    Submitted 14 June, 2024; v1 submitted 21 July, 2023; originally announced July 2023.

    Comments: 20 pages

    Journal ref: Systems & Control Letters, 2024

  3. arXiv:2307.11424  [pdf, ps, other

    math.OC eess.SY math.AP physics.class-ph physics.flu-dyn

    Robust stabilization of $2 \times 2$ first-order hyperbolic PDEs with uncertain input delay

    Authors: **g Zhang, Jie Qi

    Abstract: A backstep**-based compensator design is developed for a system of $2\times2$ first-order linear hyperbolic partial differential equations (PDE) in the presence of an uncertain long input delay at boundary. We introduce a transport PDE to represent the delayed input, which leads to three coupled first-order hyperbolic PDEs. A novel backstep** transformation, composed of two Volterra transforma… ▽ More

    Submitted 21 July, 2023; originally announced July 2023.

  4. arXiv:2307.04212  [pdf, other

    math.AP eess.SY math.OC physics.class-ph physics.flu-dyn

    Delay-Adaptive Control of First-order Hyperbolic PIDEs

    Authors: Shanshan Wang, Jie Qi, Miroslav Krstic

    Abstract: We develop a delay-adaptive controller for a class of first-order hyperbolic partial integro-differential equations (PIDEs) with an unknown input delay. By employing a transport PDE to represent delayed actuator states, the system is transformed into a transport partial differential equation (PDE) with unknown propagation speed cascaded with a PIDE. A parameter update law is designed using a Lyapu… ▽ More

    Submitted 9 July, 2023; originally announced July 2023.

  5. arXiv:2307.03727  [pdf, ps, other

    math.OC eess.SY math.AP physics.class-ph physics.flu-dyn

    Bilateral boundary control of an input delayed 2-D reaction-diffusion equation

    Authors: Dandan Guan, Yanmei Chen, Jie Qi, Linglong Du

    Abstract: In this paper, a delay compensation design method based on PDE backstep** is developed for a two-dimensional reaction-diffusion partial differential equation (PDE) with bilateral input delays. The PDE is defined in a rectangular domain, and the bilateral control is imposed on a pair of opposite sides of the rectangle. To represent the delayed bilateral inputs, we introduce two 2-D transport PDEs… ▽ More

    Submitted 7 July, 2023; originally announced July 2023.

    Comments: 11 pages, 3 figures(including 8 sub-figures)

  6. arXiv:2306.07090  [pdf, other

    eess.AS cs.SD q-bio.QM

    Parameter-efficient Dysarthric Speech Recognition Using Adapter Fusion and Householder Transformation

    Authors: **zi Qi, Hugo Van hamme

    Abstract: In dysarthric speech recognition, data scarcity and the vast diversity between dysarthric speakers pose significant challenges. While finetuning has been a popular solution, it can lead to overfitting and low parameter efficiency. Adapter modules offer a better solution, with their small size and easy applicability. Additionally, Adapter Fusion can facilitate knowledge transfer from multiple learn… ▽ More

    Submitted 12 June, 2023; originally announced June 2023.

    Comments: Accepted by Interspeech 2023

  7. arXiv:2305.12838  [pdf, other

    eess.AS cs.SD

    An Enhanced Res2Net with Local and Global Feature Fusion for Speaker Verification

    Authors: Yafeng Chen, Siqi Zheng, Hui Wang, Luyao Cheng, Qian Chen, Jiajun Qi

    Abstract: Effective fusion of multi-scale features is crucial for improving speaker verification performance. While most existing methods aggregate multi-scale features in a layer-wise manner via simple operations, such as summation or concatenation. This paper proposes a novel architecture called Enhanced Res2Net (ERes2Net), which incorporates both local and global feature fusion techniques to improve the… ▽ More

    Submitted 3 August, 2023; v1 submitted 22 May, 2023; originally announced May 2023.

  8. arXiv:2305.00127  [pdf, other

    cs.LG cs.AI eess.SY

    Optimal Scheduling in IoT-Driven Smart Isolated Microgrids Based on Deep Reinforcement Learning

    Authors: Jiaju Qi, Lei Lei, Kan Zheng, Simon X. Yang, Xuemin, Shen

    Abstract: In this paper, we investigate the scheduling issue of diesel generators (DGs) in an Internet of Things (IoT)-Driven isolated microgrid (MG) by deep reinforcement learning (DRL). The renewable energy is fully exploited under the uncertainty of renewable generation and load demand. The DRL agent learns an optimal policy from history renewable and load data of previous days, where the policy can gene… ▽ More

    Submitted 28 April, 2023; originally announced May 2023.

  9. arXiv:2302.00676  [pdf

    physics.optics eess.SY physics.app-ph

    Enhancing Light Extraction of Organic Light Emitting Diodes by Deep-Groove High-index Dielectric Nanomesh Using Large-area Nanoimprint

    Authors: Ji Qi, Wei Ding, Qi Zhang, Yuxuan Wang, Hao Chen, Stephen Y. Chou

    Abstract: To solve the conventional conflict between maintaining good charge transport property and achieving high light extraction efficiency when using micro/nanostructure patterned substrates to extract light from organic light emitting diodes (OLEDs), we developed a novel OLED structure, termed High-index Deep-Groove Dielectric Nanomesh OLED (HDNM-OLED), fabricated by large-area nanoimprint lithography… ▽ More

    Submitted 31 January, 2023; originally announced February 2023.

    Comments: arXiv admin note: text overlap with arXiv:2302.00044

  10. arXiv:2211.13939  [pdf, other

    cs.SD cs.LG eess.AS

    Efficient Incremental Text-to-Speech on GPUs

    Authors: Muyang Du, Chuan Liu, Jiaxing Qi, Junjie Lai

    Abstract: Incremental text-to-speech, also known as streaming TTS, has been increasingly applied to online speech applications that require ultra-low response latency to provide an optimal user experience. However, most of the existing speech synthesis pipelines deployed on GPU are still non-incremental, which uncovers limitations in high-concurrency scenarios, especially when the pipeline is built with end… ▽ More

    Submitted 5 December, 2022; v1 submitted 25 November, 2022; originally announced November 2022.

    Comments: 5 pages, 4 figures

  11. arXiv:2210.13144  [pdf, other

    eess.AS cs.SD

    Weak-Supervised Dysarthria-invariant Features for Spoken Language Understanding using an FHVAE and Adversarial Training

    Authors: **zi Qi, Hugo Van hamme

    Abstract: The scarcity of training data and the large speaker variation in dysarthric speech lead to poor accuracy and poor speaker generalization of spoken language understanding systems for dysarthric speech. Through work on the speech features, we focus on improving the model generalization ability with limited dysarthric data. Factorized Hierarchical Variational Auto-Encoders (FHVAE) trained unsupervise… ▽ More

    Submitted 24 October, 2022; originally announced October 2022.

  12. arXiv:2210.06382  [pdf, other

    eess.AS cs.AI cs.LG cs.SD eess.SP

    An Ensemble Teacher-Student Learning Approach with Poisson Sub-sampling to Differential Privacy Preserving Speech Recognition

    Authors: Chao-Han Huck Yang, Jun Qi, Sabato Marco Siniscalchi, Chin-Hui Lee

    Abstract: We propose an ensemble learning framework with Poisson sub-sampling to effectively train a collection of teacher models to issue some differential privacy (DP) guarantee for training data. Through boosting under DP, a student model derived from the training data suffers little model degradation from the models trained with no privacy protection. Our proposed solution leverages upon two mechanisms,… ▽ More

    Submitted 12 October, 2022; originally announced October 2022.

    Comments: Accepted to ISCA, ISCSLP 2022, Singapore. 5 Pages

  13. arXiv:2205.12459  [pdf, other

    cs.CV eess.IV

    A CNN with Noise Inclined Module and Denoise Framework for Hyperspectral Image Classification

    Authors: Zhiqiang Gong, ** Zhong, Jiahao Qi, Panhe Hu

    Abstract: Deep Neural Networks have been successfully applied in hyperspectral image classification. However, most of prior works adopt general deep architectures while ignore the intrinsic structure of the hyperspectral image, such as the physical noise generation. This would make these deep models unable to generate discriminative features and provide impressive classification performance. To leverage suc… ▽ More

    Submitted 24 May, 2022; originally announced May 2022.

    Journal ref: IET Image Processing, 2022

  14. arXiv:2205.09987  [pdf, other

    cs.RO eess.SY

    Model Predictive Manipulation of Compliant Objects with Multi-Objective Optimizer and Adversarial Network for Occlusion Compensation

    Authors: Jiaming Qi, Dongyu Li, Yufeng Gao, Peng Zhou, David Navarro-Alarcon

    Abstract: The robotic manipulation of compliant objects is currently one of the most active problems in robotics due to its potential to automate many important applications. Despite the progress achieved by the robotics community in recent years, the 3D sha** of these types of materials remains an open research problem. In this paper, we propose a new vision-based controller to automatically regulate the… ▽ More

    Submitted 20 May, 2022; originally announced May 2022.

  15. arXiv:2203.07659  [pdf

    eess.IV cs.CV

    Breast Cancer Molecular Subtypes Prediction on Pathological Images with Discriminative Patch Selecting and Multi-Instance Learning

    Authors: Hong Liu, Wen-Dong Xu, Zi-Hao Shang, Xiang-Dong Wang, Hai-Yan Zhou, Ke-Wen Ma, Huan Zhou, Jia-Lin Qi, Jia-Rui Jiang, Li-Lan Tan, Hui-Min Zeng, Hui-Juan Cai, Kuan-Song Wang, Yue-Liang Qian

    Abstract: Molecular subtypes of breast cancer are important references to personalized clinical treatment. For cost and labor savings, only one of the patient's paraffin blocks is usually selected for subsequent immunohistochemistry (IHC) to obtain molecular subtypes. Inevitable sampling error is risky due to tumor heterogeneity and could result in a delay in treatment. Molecular subtype prediction from con… ▽ More

    Submitted 15 March, 2022; originally announced March 2022.

  16. arXiv:2203.06031  [pdf, other

    cs.LG cs.AI cs.SD eess.AS

    Exploiting Low-Rank Tensor-Train Deep Neural Networks Based on Riemannian Gradient Descent With Illustrations of Speech Processing

    Authors: Jun Qi, Chao-Han Huck Yang, Pin-Yu Chen, Javier Tejedor

    Abstract: This work focuses on designing low complexity hybrid tensor networks by considering trade-offs between the model complexity and practical performance. Firstly, we exploit a low-rank tensor-train deep neural network (TT-DNN) to build an end-to-end deep learning pipeline, namely LR-TT-DNN. Secondly, a hybrid model combining LR-TT-DNN with a convolutional neural network (CNN), which is denoted as CNN… ▽ More

    Submitted 11 March, 2022; originally announced March 2022.

    Comments: 10 pages, 10 Figures

  17. arXiv:2203.03550  [pdf, other

    cs.CL cs.AI cs.DC cs.NE eess.AS

    When BERT Meets Quantum Temporal Convolution Learning for Text Classification in Heterogeneous Computing

    Authors: Chao-Han Huck Yang, Jun Qi, Samuel Yen-Chi Chen, Yu Tsao, Pin-Yu Chen

    Abstract: The rapid development of quantum computing has demonstrated many unique characteristics of quantum advantages, such as richer feature representation and more secured protection on model parameters. This work proposes a vertical federated learning architecture based on variational quantum circuits to demonstrate the competitive performance of a quantum-enhanced pre-trained BERT model for text class… ▽ More

    Submitted 17 February, 2022; originally announced March 2022.

    Comments: Accepted to ICASSP 2022

  18. arXiv:2202.06727   

    cs.LG eess.SY

    STG-GAN: A spatiotemporal graph generative adversarial networks for short-term passenger flow prediction in urban rail transit systems

    Authors: **lei Zhang, Hua Li, Lixing Yang, Guangyin **, Jianguo Qi, Ziyou Gao

    Abstract: Short-term passenger flow prediction is an important but challenging task for better managing urban rail transit (URT) systems. Some emerging deep learning models provide good insights to improve short-term prediction accuracy. However, there exist many complex spatiotemporal dependencies in URT systems. Most previous methods only consider the absolute error between ground truth and predictions as… ▽ More

    Submitted 16 August, 2023; v1 submitted 10 February, 2022; originally announced February 2022.

    Comments: There are some errors that might mislead readers for this version. There is no new version right now

    ACM Class: E.0

  19. arXiv:2201.10609  [pdf, other

    cs.SD cs.LG eess.AS

    Exploiting Hybrid Models of Tensor-Train Networks for Spoken Command Recognition

    Authors: Jun Qi, Javier Tejedor

    Abstract: This work aims to design a low complexity spoken command recognition (SCR) system by considering different trade-offs between the number of model parameters and classification accuracy. More specifically, we exploit a deep hybrid architecture of a tensor-train (TT) network to build an end-to-end SRC pipeline. Our command recognition system, namely CNN+(TT-DNN), is composed of convolutional layers… ▽ More

    Submitted 11 January, 2022; originally announced January 2022.

    Comments: Accepted in Proc. ICASSP 2022

  20. arXiv:2201.01443  [pdf, other

    eess.IV cs.CV physics.med-ph

    Neural KEM: A Kernel Method with Deep Coefficient Prior for PET Image Reconstruction

    Authors: Siqi Li, Kuang Gong, Ramsey D. Badawi, Edward J. Kim, **yi Qi, Guobao Wang

    Abstract: Image reconstruction of low-count positron emission tomography (PET) data is challenging. Kernel methods address the challenge by incorporating image prior information in the forward model of iterative PET image reconstruction. The kernelized expectation-maximization (KEM) algorithm has been developed and demonstrated to be effective and easy to implement. A common approach for a further improveme… ▽ More

    Submitted 24 October, 2022; v1 submitted 4 January, 2022; originally announced January 2022.

    Comments: arXiv admin note: text overlap with arXiv:2110.01174

  21. arXiv:2112.09216  [pdf, other

    eess.IV cs.CV

    A Deep-Learning Framework for Improving COVID-19 CT Image Quality and Diagnostic Accuracy

    Authors: Garvit Goel, **gyuan Qi, Wu-chun Feng, Guohua Cao

    Abstract: We present a deep-learning based computing framework for fast-and-accurate CT (DL-FACT) testing of COVID-19. Our CT-based DL framework was developed to improve the testing speed and accuracy of COVID-19 (plus its variants) via a DL-based approach for CT image enhancement and classification. The image enhancement network is adapted from DDnet, short for DenseNet and Deconvolution based network. To… ▽ More

    Submitted 16 December, 2021; originally announced December 2021.

    Comments: 10 pages

  22. arXiv:2112.01697  [pdf, other

    cs.CV cs.CL cs.LG cs.SD eess.AS

    LMR-CBT: Learning Modality-fused Representations with CB-Transformer for Multimodal Emotion Recognition from Unaligned Multimodal Sequences

    Authors: Ziwang Fu, Feng Liu, Hanyang Wang, Siyuan Shen, Jiahao Zhang, Jiayin Qi, Xiangling Fu, Aimin Zhou

    Abstract: Learning modality-fused representations and processing unaligned multimodal sequences are meaningful and challenging in multimodal emotion recognition. Existing approaches use directional pairwise attention or a message hub to fuse language, visual, and audio modalities. However, those approaches introduce information redundancy when fusing features and are inefficient without considering the comp… ▽ More

    Submitted 2 December, 2021; originally announced December 2021.

    Comments: 9 pages ,Figure 2, Table 5

  23. Delay-Compensated Distributed PDE Control of Traffic with Connected/Automated Vehicles

    Authors: Jie Qi, Shurong Mo, Miroslav Krstic

    Abstract: We develop an input delay-compensating design for stabilization of an Aw-Rascle-Zhang (ARZ) traffic model in congested regime which is governed by a $2\times 2$ first-order hyperbolic nonlinear PDE. The traffic flow consists of both adaptive cruise control-equipped (ACC-equipped) and manually-driven vehicles. The control input is the time gap of ACC-equipped and connected vehicles, which is subjec… ▽ More

    Submitted 2 September, 2022; v1 submitted 19 July, 2021; originally announced July 2021.

  24. arXiv:2106.10359  [pdf, other

    eess.IV cs.CV physics.med-ph

    Direct Reconstruction of Linear Parametric Images from Dynamic PET Using Nonlocal Deep Image Prior

    Authors: Kuang Gong, Ciprian Catana, **yi Qi, Quanzheng Li

    Abstract: Direct reconstruction methods have been developed to estimate parametric images directly from the measured PET sinograms by combining the PET imaging model and tracer kinetics in an integrated framework. Due to limited counts received, signal-to-noise-ratio (SNR) and resolution of parametric images produced by direct reconstruction frameworks are still limited. Recently supervised deep learning me… ▽ More

    Submitted 18 June, 2021; originally announced June 2021.

    Comments: 10 pages, 10 figures

  25. arXiv:2106.07337  [pdf, other

    eess.AS

    Speech Disorder Classification Using Extended Factorized Hierarchical Variational Auto-encoders

    Authors: **zi Qi, Hugo Van hamme

    Abstract: Objective speech disorder classification for speakers with communication difficulty is desirable for diagnosis and administering therapy. With the current state of speech technology, it is evident to propose neural networks for this application. But neural network model training is hampered by a lack of labeled disordered speech data. In this research, we apply an extended version of Factorized Hi… ▽ More

    Submitted 14 June, 2021; originally announced June 2021.

    Comments: 5 pages, 2 figures, submitted to INTERSPEECH2021

  26. arXiv:2106.02424  [pdf, other

    cs.RO eess.SY

    Contour Moments Based Manipulation of Composite Rigid-Deformable Objects with Finite Time Model Estimation and Shape/Position Control

    Authors: Jiaming Qi, Guangfu Ma, Jihong Zhu, Peng Zhou, Yueyong Lyu, Haibo Zhang, David Navarro-Alarcon

    Abstract: The robotic manipulation of composite rigid-deformable objects (i.e. those with mixed non-homogeneous stiffness properties) is a challenging problem with clear practical applications that, despite the recent progress in the field, it has not been sufficiently studied in the literature. To deal with this issue, in this paper we propose a new visual servoing method that has the capability to manipul… ▽ More

    Submitted 4 June, 2021; originally announced June 2021.

  27. arXiv:2104.00230  [pdf, other

    eess.AS

    Bidirectional Multiscale Feature Aggregation for Speaker Verification

    Authors: Jiajun Qi, Wu Guo, Bin Gu

    Abstract: In this paper, we propose a novel bidirectional multiscale feature aggregation (BMFA) network with attentional fusion modules for text-independent speaker verification. The feature maps from different stages of the backbone network are iteratively combined and refined in both a bottom-up and top-down manner. Furthermore, instead of simple concatenation or element-wise addition of feature maps from… ▽ More

    Submitted 31 March, 2021; originally announced April 2021.

  28. arXiv:2012.00803  [pdf, other

    eess.SY

    Generator Parameter Estimation by Q-Learning Based on PMU Measurements

    Authors: Seyyed Rashid Khazeiynasab, Junjian Qi, Issa Batarseh

    Abstract: In this paper, a novel Q-learning based approach is proposed for estimating the parameters of synchronous generators using PMU measurements. Event playback is used to generate model outputs under different parameters for training the agent in Q-learning. We assume that the exact values of some parameters in the model are not known by the agent in Q-learning. Then, an optimal history-dependent poli… ▽ More

    Submitted 1 December, 2020; originally announced December 2020.

  29. arXiv:2010.13309  [pdf, other

    cs.SD cs.LG cs.NE eess.AS quant-ph

    Decentralizing Feature Extraction with Quantum Convolutional Neural Network for Automatic Speech Recognition

    Authors: Chao-Han Huck Yang, Jun Qi, Samuel Yen-Chi Chen, Pin-Yu Chen, Sabato Marco Siniscalchi, Xiaoli Ma, Chin-Hui Lee

    Abstract: We propose a novel decentralized feature extraction approach in federated learning to address privacy-preservation issues for speech recognition. It is built upon a quantum convolutional neural network (QCNN) composed of a quantum circuit encoder for feature extraction, and a recurrent neural network (RNN) based end-to-end acoustic model (AM). To enhance model parameter protection in a decentraliz… ▽ More

    Submitted 12 February, 2021; v1 submitted 25 October, 2020; originally announced October 2020.

    Comments: Accepted to IEEE ICASSP 2021. Code is available: https://github.com/huckiyang/QuantumSpeech-QCNN

    Journal ref: 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

  30. arXiv:2010.10919   

    eess.AS cs.SD

    Multi-task Metric Learning for Text-independent Speaker Verification

    Authors: Yafeng Chen, Wu Guo, **g**g Shi, Jiajun Qi, Tan Liu

    Abstract: In this work, we introduce metric learning (ML) to enhance the deep embedding learning for text-independent speaker verification (SV). Specifically, the deep speaker embedding network is trained with conventional cross entropy loss and auxiliary pair-based ML loss function. For the auxiliary ML task, training samples of a mini-batch are first arranged into pairs, then positive and negative pairs a… ▽ More

    Submitted 22 March, 2023; v1 submitted 21 October, 2020; originally announced October 2020.

    Comments: Not a particularly high-quality work, so we request withdrawal

  31. arXiv:2010.07540  [pdf, other

    eess.SY

    Multi-Objective PMU Allocation for Resilient Power System Monitoring

    Authors: Hamed Haggi, Wei Sun, Junjian Qi

    Abstract: Phasor measurement units (PMUs) enable better system monitoring and security enhancement in smart grids. In order to enhance power system resilience against outages and blackouts caused by extreme weather events or man-made attacks, it remains a major challenge to determine the optimal number and location of PMUs. In this paper, a multi-objective resilient PMU placement (MORPP) problem is formulat… ▽ More

    Submitted 15 October, 2020; originally announced October 2020.

    Comments: IEEE PES General Meeting 2020

  32. arXiv:2010.06248   

    eess.AS

    Exploring Universal Speech Attributes for Speaker Verification with an Improved Cross-stitch Network

    Authors: Jiajun Qi, Wu Guo, **g**g Shi, Yafeng Chen, Tan Liu

    Abstract: The universal speech attributes for x-vector based speaker verification (SV) are addressed in this paper. The manner and place of articulation form the fundamental speech attribute unit (SAU), and then new speech attribute (NSA) units for acoustic modeling are generated by tied tri-SAU states. An improved cross-stitch network is adopted as a multitask learning (MTL) framework for integrating these… ▽ More

    Submitted 31 May, 2023; v1 submitted 13 October, 2020; originally announced October 2020.

    Comments: Not a particularly high-quality work, so we request withdrawal

  33. arXiv:2009.14155  [pdf, other

    eess.SY

    Resilience Analysis and Cascading FailureModeling of Power Systems under Extreme Temperatures

    Authors: Seyyed Rashid Khazeiynasab, Junjian Qi

    Abstract: In this paper, we propose an AC power flow based cascading failure model that explicitly considers external weather conditions, extreme temperatures in particular, and evaluates the impact of extreme temperature on the initiation and propagation of cascading blackouts. Specifically, load and dynamic line rating changes are modeled due to temperature disturbance, the probabilities for transmission… ▽ More

    Submitted 29 September, 2020; originally announced September 2020.

  34. arXiv:2009.01003  [pdf, other

    cs.CL cs.SD eess.AS

    Variational Inference-Based Dropout in Recurrent Neural Networks for Slot Filling in Spoken Language Understanding

    Authors: Jun Qi, Xu Liu, Javier Tejedor

    Abstract: This paper proposes to generalize the variational recurrent neural network (RNN) with variational inference (VI)-based dropout regularization employed for the long short-term memory (LSTM) cells to more advanced RNN architectures like gated recurrent unit (GRU) and bi-directional LSTM/GRU. The new variational RNNs are employed for slot filling, which is an intriguing but challenging task in spoken… ▽ More

    Submitted 23 August, 2020; originally announced September 2020.

    Comments: conference paper, 5 pages

  35. arXiv:2008.07281  [pdf, ps, other

    eess.AS cs.LG cs.SD eess.SP stat.ML

    On Mean Absolute Error for Deep Neural Network Based Vector-to-Vector Regression

    Authors: Jun Qi, Jun Du, Sabato Marco Siniscalchi, Xiaoli Ma, Chin-Hui Lee

    Abstract: In this paper, we exploit the properties of mean absolute error (MAE) as a loss function for the deep neural network (DNN) based vector-to-vector regression. The goal of this work is two-fold: (i) presenting performance bounds of MAE, and (ii) demonstrating new properties of MAE that make it more appropriate than mean squared error (MSE) as a loss function for DNN based vector-to-vector regression… ▽ More

    Submitted 12 August, 2020; originally announced August 2020.

    Journal ref: IEEE Signal Processing Letters, 2020

  36. arXiv:2008.06896  [pdf, other

    cs.RO eess.SY

    Adaptive Shape Servoing of Elastic Rods using Parameterized Regression Features and Auto-Tuning Motion Controls

    Authors: Jiaming Qi, Guangtao Ran, Bohui Wang, Jian Liu, Wanyu Ma, Peng Zhou, David Navarro-Alarcon

    Abstract: The robotic manipulation of deformable linear objects has shown great potential in a wide range of real-world applications. However, it presents many challenges due to the objects' complex nonlinearity and high-dimensional configuration. In this paper, we propose a new shape servoing framework to automatically manipulate elastic rods through visual feedback. Our new method uses parameterized regre… ▽ More

    Submitted 9 September, 2023; v1 submitted 16 August, 2020; originally announced August 2020.

    Comments: 8 pages, 12 figures

  37. arXiv:2008.05459  [pdf, other

    cs.LG eess.SP stat.ML

    Analyzing Upper Bounds on Mean Absolute Errors for Deep Neural Network Based Vector-to-Vector Regression

    Authors: Jun Qi, Jun Du, Sabato Marco Siniscalchi, Xiaoli Ma, Chin-Hui Lee

    Abstract: In this paper, we show that, in vector-to-vector regression utilizing deep neural networks (DNNs), a generalized loss of mean absolute error (MAE) between the predicted and expected feature vectors is upper bounded by the sum of an approximation error, an estimation error, and an optimization error. Leveraging upon error decomposition techniques in statistical learning theory and non-convex optimi… ▽ More

    Submitted 4 August, 2020; originally announced August 2020.

    Journal ref: IEEE Transactions on Signal Processing, Vol 68, pp. 3411-3422, 2020

  38. arXiv:2007.13024  [pdf, other

    eess.AS cs.CL cs.LG cs.NE cs.SD

    Exploring Deep Hybrid Tensor-to-Vector Network Architectures for Regression Based Speech Enhancement

    Authors: Jun Qi, Hu Hu, Yannan Wang, Chao-Han Huck Yang, Sabato Marco Siniscalchi, Chin-Hui Lee

    Abstract: This paper investigates different trade-offs between the number of model parameters and enhanced speech qualities by employing several deep tensor-to-vector regression models for speech enhancement. We find that a hybrid architecture, namely CNN-TT, is capable of maintaining a good quality performance with a reduced model parameter size. CNN-TT is composed of several convolutional layers at the bo… ▽ More

    Submitted 2 August, 2020; v1 submitted 25 July, 2020; originally announced July 2020.

    Comments: Accepted to InterSpeech 2020

  39. arXiv:2004.12097  [pdf, other

    cs.RO eess.SY

    A Lyapunov-Stable Adaptive Method to Approximate Sensorimotor Models for Sensor-Based Control

    Authors: David Navarro-Alarcon, Jiaming Qi, Jihong Zhu, Andrea Cherubini

    Abstract: In this article, we present a new scheme that approximates unknown sensorimotor models of robots by using feedback signals only. The formulation of the uncalibrated sensor-based regulation problem is first formulated, then, we develop a computational method that distributes the model estimation problem amongst multiple adaptive units that specialise in a local sensorimotor map. Different from trad… ▽ More

    Submitted 4 July, 2020; v1 submitted 25 April, 2020; originally announced April 2020.

    Comments: 19 pages, 15 figures

  40. arXiv:2003.13917  [pdf, other

    eess.AS cs.CL cs.CR cs.LG cs.SD

    Characterizing Speech Adversarial Examples Using Self-Attention U-Net Enhancement

    Authors: Chao-Han Huck Yang, Jun Qi, Pin-Yu Chen, Xiaoli Ma, Chin-Hui Lee

    Abstract: Recent studies have highlighted adversarial examples as ubiquitous threats to the deep neural network (DNN) based speech recognition systems. In this work, we present a U-Net based attention model, U-Net$_{At}$, to enhance adversarial speech signals. Specifically, we evaluate the model performance by interpretable speech recognition metrics and discuss the model performance by the augmented advers… ▽ More

    Submitted 31 December, 2021; v1 submitted 30 March, 2020; originally announced March 2020.

    Comments: The authors have revised some annotations in Table 4 to improve the clarity. The authors thank reading feedbacks from Jonathan Le Roux. The first draft was finished in August 2019. Accepted to IEEE ICASSP 2020

    Journal ref: 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

  41. Enhanced Adversarial Strategically-Timed Attacks against Deep Reinforcement Learning

    Authors: Chao-Han Huck Yang, Jun Qi, Pin-Yu Chen, Yi Ouyang, I-Te Danny Hung, Chin-Hui Lee, Xiaoli Ma

    Abstract: Recent deep neural networks based techniques, especially those equipped with the ability of self-adaptation in the system level such as deep reinforcement learning (DRL), are shown to possess many advantages of optimizing robot learning systems (e.g., autonomous navigation and continuous robot arm control.) However, the learning-based systems and the associated models may be threatened by the risk… ▽ More

    Submitted 20 February, 2020; originally announced February 2020.

    Comments: Accepted to IEEE ICASSP 2020

    Journal ref: 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

  42. arXiv:2002.00544  [pdf, other

    eess.AS cs.CL cs.LG cs.NE cs.SD

    Tensor-to-Vector Regression for Multi-channel Speech Enhancement based on Tensor-Train Network

    Authors: Jun Qi, Hu Hu, Yannan Wang, Chao-Han Huck Yang, Sabato Marco Siniscalchi, Chin-Hui Lee

    Abstract: We propose a tensor-to-vector regression approach to multi-channel speech enhancement in order to address the issue of input size explosion and hidden-layer size expansion. The key idea is to cast the conventional deep neural network (DNN) based vector-to-vector regression formulation under a tensor-train network (TTN) framework. TTN is a recently emerged solution for compact representation of dee… ▽ More

    Submitted 2 February, 2020; originally announced February 2020.

    Comments: Accepted to ICASSP 2020. Update reproducible code

    Journal ref: IEEE ICASSP 2020

  43. arXiv:2001.10529  [pdf

    eess.AS cs.LG cs.NE cs.SD

    Submodular Rank Aggregation on Score-based Permutations for Distributed Automatic Speech Recognition

    Authors: Jun Qi, Chao-Han Huck Yang, Javier Tejedor

    Abstract: Distributed automatic speech recognition (ASR) requires to aggregate outputs of distributed deep neural network (DNN)-based models. This work studies the use of submodular functions to design a rank aggregation on score-based permutations, which can be used for distributed ASR systems in both supervised and unsupervised modes. Specifically, we compose an aggregation rank function based on the Lova… ▽ More

    Submitted 27 January, 2020; originally announced January 2020.

    Comments: Accepted to ICASSP 2020. Please download the pdf to view Figure 1. arXiv admin note: substantial text overlap with arXiv:1707.01166

    Journal ref: 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

  44. arXiv:1911.08415  [pdf, other

    eess.SP cs.LG

    GMAN: A Graph Multi-Attention Network for Traffic Prediction

    Authors: Chuanpan Zheng, Xiaoliang Fan, Cheng Wang, Jianzhong Qi

    Abstract: Long-term traffic prediction is highly challenging due to the complexity of traffic systems and the constantly changing nature of many impacting factors. In this paper, we focus on the spatio-temporal factors, and propose a graph multi-attention network (GMAN) to predict traffic conditions for time steps ahead at different locations on a road network graph. GMAN adapts an encoder-decoder architect… ▽ More

    Submitted 25 November, 2019; v1 submitted 11 November, 2019; originally announced November 2019.

    Comments: AAAI 2020 paper

  45. arXiv:1910.09487  [pdf, other

    eess.SY math.OC

    Robust Dynamic State Estimation of Synchronous Machines with Asymptotic State Estimation Error Performance Guarantees

    Authors: Sebastian Nugroho, Ahmad F. Taha, Junjian Qi

    Abstract: A robust observer for performing power system dynamic state estimation (DSE) of a synchronous generator is proposed. The observer is developed using the concept of $\mathcal{L}_{\infty}$ stability for uncertain, nonlinear dynamic generator models. We use this concept to (i) design a simple, scalable, and robust dynamic state estimator and (ii) obtain a performance guarantee on the state estimation… ▽ More

    Submitted 17 February, 2020; v1 submitted 21 October, 2019; originally announced October 2019.

    Comments: IEEE Transactions on Power Systems, In Press. V2: Fixed some typos in the appendix

  46. Dynamic State Estimation of Synchronous Machines Using Robust Cubature Kalman Filter Against Complex Measurement Noise Statistics

    Authors: Yang Li, **g Li, Liang Chen, Junjian Qi, Guoqing Li

    Abstract: Cubature Kalman Filter (CKF) has good performance when handling nonlinear dynamic state estimations. However, it cannot work well in non-Gaussian noise and bad data environment due to the lack of auto-adaptive ability to measure noise statistics on line. In order to address the problem of behavioral decline and divergence when measure noise statistics deviate prior noise statistics, a new robust C… ▽ More

    Submitted 21 July, 2019; originally announced July 2019.

    Comments: Accepted by Transactions of China Electrotechnical Society, in Chinese

    Journal ref: Transactions of China Electrotechnical Society 34 (2019) 3651-3660

  47. arXiv:1907.01831  [pdf, other

    cs.DB eess.SP

    GeoPrune: Efficiently Finding Shareable Vehicles Based on Geometric Properties

    Authors: Yixin Xu, Jianzhong Qi, Renata Borovica-Gajic, Lars Kulik

    Abstract: On-demand ride-sharing is rapidly growing.Matching trip requests to vehicles efficiently is critical for the service quality of ride-sharing. To match trip requests with vehicles, a prune-and-select scheme is commonly used. The pruning stage identifies feasible vehicles that can satisfy the trip constraints (e.g., trip time). The selection stage selects the optimal one(s) from the feasible vehicle… ▽ More

    Submitted 19 October, 2019; v1 submitted 3 July, 2019; originally announced July 2019.

  48. Robust Cubature Kalman Filter for Dynamic State Estimation of Synchronous Machines under Unknown Measurement Noise Statistics

    Authors: Yang Li, **g Li, Junjian Qi, Liang Chen

    Abstract: Kalman-type filtering techniques including cubature Kalman filter (CKF) does not work well in non-Gaussian environments, especially in the presence of outliers. To solve this problem, Huber's M-estimation based robust CKF (RCKF) is proposed for synchronous machines by combining the Huber's M-estimation theory with the classical CKF, which is capable of co** with the deterioration in performance… ▽ More

    Submitted 19 February, 2019; originally announced February 2019.

    Comments: Accepted by IEEE Access

    Journal ref: IEEE Access 7 (2019) 29139-29148

  49. arXiv:1902.06025  [pdf, other

    eess.SY math.OC

    Characterizing the Nonlinearity of Power System Generator Models

    Authors: Sebastian A. Nugroho, Ahmad F. Taha, Junjian Qi

    Abstract: Power system dynamics are naturally nonlinear. The nonlinearity stems from power flows, generator dynamics, and electromagnetic transients. Characterizing the nonlinearity of the dynamical power system model is useful for designing superior estimation and control methods, providing better situational awareness and system stability. In this paper, we consider the synchronous generator model with a… ▽ More

    Submitted 18 June, 2019; v1 submitted 15 February, 2019; originally announced February 2019.

    Comments: To Appear in 2019 American Control Conference, July 10--12, Philadelphia, PA V2 includes a correction for a citation

  50. arXiv:1802.09071  [pdf, other

    eess.SY

    Robust Control for Renewable-Integrated Power Networks Considering Input Bound Constraints and Worst-Case Uncertainty Measure

    Authors: Ahmad F. Taha, Mohammadhafez Bazrafshan, Sebastian Nugroho, Nikolaos Gatsis, Junjian Qi

    Abstract: Uncertainty from renewable energy and loads is one of the major challenges for stable grid operation. Various approaches have been explored to remedy these uncertainties. In this paper, we design centralized or decentralized state-feedback controllers for generators while considering worst-case uncertainty. Specifically, this paper introduces the notion of $\mathcal{L}_{\infty}$ robust control and… ▽ More

    Submitted 15 July, 2019; v1 submitted 25 February, 2018; originally announced February 2018.

    Comments: IEEE Transactions on Control of Network Systems, Special Issue on Analysis, Control and Optimization of Energy System Networks