Skip to main content

Showing 1–20 of 20 results for author: Ni, J

Searching in archive eess. Search in all archives.
.
  1. arXiv:2406.08380  [pdf, other

    cs.CL cs.SD eess.AS

    Towards Unsupervised Speech Recognition Without Pronunciation Models

    Authors: Junrui Ni, Liming Wang, Yang Zhang, Kaizhi Qian, Heting Gao, Mark Hasegawa-Johnson, Chang D. Yoo

    Abstract: Recent advancements in supervised automatic speech recognition (ASR) have achieved remarkable performance, largely due to the growing availability of large transcribed speech corpora. However, most languages lack sufficient paired speech and text data to effectively train these systems. In this article, we tackle the challenge of develo** ASR systems without paired speech and text corpora by pro… ▽ More

    Submitted 12 June, 2024; originally announced June 2024.

    Comments: This work has been submitted to the IEEE for possible publication

  2. arXiv:2404.15349  [pdf, other

    eess.SP cs.LG cs.MM

    A Survey on Multimodal Wearable Sensor-based Human Action Recognition

    Authors: Jianyuan Ni, Hao Tang, Syed Tousiful Haque, Yan Yan, Anne H. H. Ngu

    Abstract: The combination of increased life expectancy and falling birth rates is resulting in an aging population. Wearable Sensor-based Human Activity Recognition (WSHAR) emerges as a promising assistive technology to support the daily lives of older individuals, unlocking vast potential for human-centric applications. However, recent surveys in WSHAR have been limited, focusing either solely on deep lear… ▽ More

    Submitted 14 April, 2024; originally announced April 2024.

    Comments: Multimodal Survey for Wearable Sensor-based Human Action Recognition

  3. arXiv:2403.16476  [pdf

    eess.IV

    A Method for Target Detection Based on Mmw Radar and Vision Fusion

    Authors: Ming Zong, Jiaying Wu, Zhanyu Zhu, **gen Ni

    Abstract: An efficient and accurate traffic monitoring system often takes advantages of multi-sensor detection to ensure the safety of urban traffic, promoting the accuracy and robustness of target detection and tracking. A method for target detection using Radar-Vision Fusion Path Aggregation Fully Convolutional One-Stage Network (RV-PAFCOS) is proposed in this paper, which is extended from Fully Convoluti… ▽ More

    Submitted 25 March, 2024; originally announced March 2024.

  4. arXiv:2312.06904  [pdf, other

    eess.SY

    Enhanced Q-Learning Approach to Finite-Time Reachability with Maximum Probability for Probabilistic Boolean Control Networks

    Authors: Hongyue Fan, **gjie Ni, Fangfei Li

    Abstract: In this paper, we investigate the problem of controlling probabilistic Boolean control networks (PBCNs) to achieve reachability with maximum probability in the finite time horizon. We address three questions: 1) finding control policies that achieve reachability with maximum probability under fixed, and particularly, varied finite time horizon, 2) leveraging prior knowledge to solve question 1) wi… ▽ More

    Submitted 11 December, 2023; originally announced December 2023.

  5. arXiv:2311.07912  [pdf, other

    cs.CV eess.SP

    Detection of Small Targets in Sea Clutter Based on RepVGG and Continuous Wavelet Transform

    Authors: **gchen Ni, Haoru Li, Lilin Xu, **g Liang

    Abstract: Constructing a high-performance target detector under the background of sea clutter is always necessary and important. In this work, we propose a RepVGGA0-CWT detector, where RepVGG is a residual network that gains a high detection accuracy. Different from traditional residual networks, RepVGG keeps an acceptable calculation speed. Giving consideration to both accuracy and speed, the RepVGGA0 is s… ▽ More

    Submitted 14 November, 2023; originally announced November 2023.

  6. Stochastic Deep Koopman Model for Quality Propagation Analysis in Multistage Manufacturing Systems

    Authors: Zhiyi Chen, Harshal Maske, Huanyi Shui, Devesh Upadhyay, Michael Hopka, Joseph Cohen, Xingjian Lai, Xun Huan, Jun Ni

    Abstract: The modeling of multistage manufacturing systems (MMSs) has attracted increased attention from both academia and industry. Recent advancements in deep learning methods provide an opportunity to accomplish this task with reduced cost and expertise. This study introduces a stochastic deep Koopman (SDK) framework to model the complex behavior of MMSs. Specifically, we present a novel application of K… ▽ More

    Submitted 18 September, 2023; originally announced September 2023.

    Journal ref: Journal of Manufacturing Systems 71 (2023) 609-619

  7. arXiv:2304.04950  [pdf, other

    eess.SY

    Reinforcement Learning Based Minimum State-flipped Control for the Reachability of Boolean Control Networks

    Authors: **gjie Ni, Fangfei Li

    Abstract: To realize reachability as well as reduce control costs of Boolean Control Networks (BCNs) with state-flipped control, a reinforcement learning based method is proposed to obtain flip kernels and the optimal policy with minimal flip** actions to realize reachability. The method proposed is model-free and of low computational complexity. In particular, Q-learning (QL), fast QL, and small memory Q… ▽ More

    Submitted 10 April, 2023; originally announced April 2023.

  8. arXiv:2304.03489  [pdf, other

    eess.SY

    Deep Reinforcement Learning Based Optimal Infinite-Horizon Control of Probabilistic Boolean Control Networks

    Authors: **gjie Ni, Fangfei Li, Zheng-Guang Wu

    Abstract: In this paper, a deep reinforcement learning based method is proposed to obtain optimal policies for optimal infinite-horizon control of probabilistic Boolean control networks (PBCNs). Compared with the existing literatures, the proposed method is model-free, namely, the system model and the initial states needn't to be known. Meanwhile, it is suitable for large-scale PBCNs. First, we establish th… ▽ More

    Submitted 7 April, 2023; originally announced April 2023.

  9. arXiv:2303.14581  [pdf

    cs.LG eess.SP eess.SY

    Shapley-based Explainable AI for Clustering Applications in Fault Diagnosis and Prognosis

    Authors: Joseph Cohen, Xun Huan, Jun Ni

    Abstract: Data-driven artificial intelligence models require explainability in intelligent manufacturing to streamline adoption and trust in modern industry. However, recently developed explainable artificial intelligence (XAI) techniques that estimate feature contributions on a model-agnostic level such as SHapley Additive exPlanations (SHAP) have not yet been evaluated for semi-supervised fault diagnosis… ▽ More

    Submitted 25 March, 2023; originally announced March 2023.

    Comments: 23 pages with 8 figures

  10. arXiv:2303.12982  [pdf

    cs.LG eess.SP eess.SY

    Fault Prognosis of Turbofan Engines: Eventual Failure Prediction and Remaining Useful Life Estimation

    Authors: Joseph Cohen, Xun Huan, Jun Ni

    Abstract: In the era of industrial big data, prognostics and health management is essential to improve the prediction of future failures to minimize inventory, maintenance, and human costs. Used for the 2021 PHM Data Challenge, the new Commercial Modular Aero-Propulsion System Simulation dataset from NASA is an open-source benchmark containing simulated turbofan engine units flown under realistic flight con… ▽ More

    Submitted 22 March, 2023; originally announced March 2023.

    Comments: Preprint with 10 pages, 5 figures. Submitted to International Journal of Prognostics and Health Management (IJPHM)

    Journal ref: International Journal of Prognostics and Health Management 14 (2023) 3486

  11. arXiv:2303.03600  [pdf, other

    cs.CL cs.LG cs.SD eess.AS

    Adaptive Knowledge Distillation between Text and Speech Pre-trained Models

    Authors: **jie Ni, Yukun Ma, Wen Wang, Qian Chen, Dianwen Ng, Han Lei, Trung Hieu Nguyen, Chong Zhang, Bin Ma, Erik Cambria

    Abstract: Learning on a massive amount of speech corpus leads to the recent success of many self-supervised speech models. With knowledge distillation, these models may also benefit from the knowledge encoded by language models that are pre-trained on rich sources of texts. The distillation process, however, is challenging due to the modal disparity between textual and speech embedding spaces. This paper st… ▽ More

    Submitted 6 March, 2023; originally announced March 2023.

  12. arXiv:2302.14597  [pdf, other

    cs.SD eess.AS

    deHuBERT: Disentangling Noise in a Self-supervised Model for Robust Speech Recognition

    Authors: Dianwen Ng, Ruixi Zhang, Jia Qi Yip, Zhao Yang, **jie Ni, Chong Zhang, Yukun Ma, Chongjia Ni, Eng Siong Chng, Bin Ma

    Abstract: Existing self-supervised pre-trained speech models have offered an effective way to leverage massive unannotated corpora to build good automatic speech recognition (ASR). However, many current models are trained on a clean corpus from a single source, which tends to do poorly when noise is present during testing. Nonetheless, it is crucial to overcome the adverse influence of noise for real-world… ▽ More

    Submitted 28 February, 2023; originally announced February 2023.

    Comments: Accepted by ICASSP 2023

  13. arXiv:2204.09224  [pdf, other

    cs.SD cs.AI eess.AS

    ContentVec: An Improved Self-Supervised Speech Representation by Disentangling Speakers

    Authors: Kaizhi Qian, Yang Zhang, Heting Gao, Junrui Ni, Cheng-I Lai, David Cox, Mark Hasegawa-Johnson, Shiyu Chang

    Abstract: Self-supervised learning in speech involves training a speech representation network on a large-scale unannotated speech corpus, and then applying the learned representations to downstream tasks. Since the majority of the downstream tasks of SSL learning in speech largely focus on the content information in speech, the most desirable speech representations should be able to disentangle unwanted va… ▽ More

    Submitted 23 June, 2022; v1 submitted 20 April, 2022; originally announced April 2022.

  14. arXiv:2203.15863  [pdf, other

    eess.AS cs.AI cs.CL

    WAVPROMPT: Towards Few-Shot Spoken Language Understanding with Frozen Language Models

    Authors: Heting Gao, Junrui Ni, Kaizhi Qian, Yang Zhang, Shiyu Chang, Mark Hasegawa-Johnson

    Abstract: Large-scale auto-regressive language models pretrained on massive text have demonstrated their impressive ability to perform new natural language tasks with only a few text examples, without the need for fine-tuning. Recent studies further show that such a few-shot learning ability can be extended to the text-image setting by training an encoder to encode the images into embeddings functioning lik… ▽ More

    Submitted 13 April, 2022; v1 submitted 29 March, 2022; originally announced March 2022.

    Comments: submitted to INTERSPEECH 2022

  15. arXiv:2203.15796  [pdf, other

    eess.AS cs.AI

    Unsupervised Text-to-Speech Synthesis by Unsupervised Automatic Speech Recognition

    Authors: Junrui Ni, Liming Wang, Heting Gao, Kaizhi Qian, Yang Zhang, Shiyu Chang, Mark Hasegawa-Johnson

    Abstract: An unsupervised text-to-speech synthesis (TTS) system learns to generate speech waveforms corresponding to any written sentence in a language by observing: 1) a collection of untranscribed speech waveforms in that language; 2) a collection of texts written in that language without access to any transcribed speech. Develo** such a system can significantly improve the availability of speech techno… ▽ More

    Submitted 15 August, 2022; v1 submitted 29 March, 2022; originally announced March 2022.

    Comments: INTERSPEECH 2022

  16. arXiv:2111.12483  [pdf, other

    eess.IV cs.CV

    LDP-Net: An Unsupervised Pansharpening Network Based on Learnable Degradation Processes

    Authors: Jiahui Ni, Zhimin Shao, Zhongzhou Zhang, Mingzheng Hou, Jiliu Zhou, Leyuan Fang, Yi Zhang

    Abstract: Pansharpening in remote sensing image aims at acquiring a high-resolution multispectral (HRMS) image directly by fusing a low-resolution multispectral (LRMS) image with a panchromatic (PAN) image. The main concern is how to effectively combine the rich spectral information of LRMS image with the abundant spatial information of PAN image. Recently, many methods based on deep learning have been prop… ▽ More

    Submitted 24 November, 2021; originally announced November 2021.

  17. arXiv:2009.00385  [pdf

    cs.RO eess.SY

    Autonomous Formula Racecar: Overall System Design and Experimental Validation

    Authors: Hanqing Tian, Jun Ni, Zirui Li, Jibin Hu

    Abstract: This paper develops and summarizes the work of building the autonomous integrated system including perception system and vehicle dynamic controller for a formula student autonomous racecar. We propose a system framework combining X-by-wired modification, perception & motion planning and vehicle dynamic control as a template of FSAC racecar which can be easily replicated. A LIDAR-vision cooperating… ▽ More

    Submitted 1 September, 2020; originally announced September 2020.

    Comments: 10 pages

  18. arXiv:2007.09372  [pdf

    cs.RO eess.SY

    Learning based Predictive Error Estimation and Compensator Design for Autonomous Vehicle Path Tracking

    Authors: Chaoyang Jiang, Hanqing Tian, Jibin Hu, Jiankun Zhai, Chao Wei, Jun Ni

    Abstract: Model predictive control (MPC) is widely used for path tracking of autonomous vehicles due to its ability to handle various types of constraints. However, a considerable predictive error exists because of the error of mathematics model or the model linearization. In this paper, we propose a framework combining the MPC with a learning-based error estimator and a feedforward compensator to improve t… ▽ More

    Submitted 18 July, 2020; originally announced July 2020.

    Comments: 5 pages, 8 figures,ICIEA 2020 paper

  19. arXiv:2006.00234  [pdf, other

    cs.LG cs.CV eess.IV stat.ML

    Integrating global spatial features in CNN based Hyperspectral/SAR imagery classification

    Authors: Fan Zhang, MinChao Yan, Chen Hu, Jun Ni, Fei Ma

    Abstract: The land cover classification has played an important role in remote sensing because it can intelligently identify things in one huge remote sensing image to reduce the work of humans. However, a lot of classification methods are designed based on the pixel feature or limited spatial feature of the remote sensing image, which limits the classification accuracy and universality of their methods. Th… ▽ More

    Submitted 15 June, 2020; v1 submitted 30 May, 2020; originally announced June 2020.

  20. arXiv:2003.12040  [pdf, other

    cs.CV cs.LG eess.IV

    Pseudo-Labeling for Small Lesion Detection on Diabetic Retinopathy Images

    Authors: Qilei Chen, ** Liu, **g Ni, Yu Cao, Benyuan Liu, Honggang Zhang

    Abstract: Diabetic retinopathy (DR) is a primary cause of blindness in working-age people worldwide. About 3 to 4 million people with diabetes become blind because of DR every year. Diagnosis of DR through color fundus images is a common approach to mitigate such problem. However, DR diagnosis is a difficult and time consuming task, which requires experienced clinicians to identify the presence and signific… ▽ More

    Submitted 26 March, 2020; originally announced March 2020.