Skip to main content

Showing 1–16 of 16 results for author: Ju, C

Searching in archive eess. Search in all archives.
.
  1. arXiv:2403.11074  [pdf, other

    cs.CV cs.AI cs.MM cs.SD eess.AS

    Audio-Visual Segmentation via Unlabeled Frame Exploitation

    Authors: **xiang Liu, Yikun Liu, Fei Zhang, Chen Ju, Ya Zhang, Yanfeng Wang

    Abstract: Audio-visual segmentation (AVS) aims to segment the sounding objects in video frames. Although great progress has been witnessed, we experimentally reveal that current methods reach marginal performance gain within the use of the unlabeled frames, leading to the underutilization issue. To fully explore the potential of the unlabeled frames for AVS, we explicitly divide them into two categories bas… ▽ More

    Submitted 16 March, 2024; originally announced March 2024.

    Comments: Accepted by CVPR 2024

  2. Post-Training Embedding Alignment for Decoupling Enrollment and Runtime Speaker Recognition Models

    Authors: Chenyang Gao, Brecht Desplanques, Chelsea J. -T. Ju, Aman Chadha, Andreas Stolcke

    Abstract: Automated speaker identification (SID) is a crucial step for the personalization of a wide range of speech-enabled services. Typical SID systems use a symmetric enrollment-verification framework with a single model to derive embeddings both offline for voice profiles extracted from enrollment utterances, and online from runtime utterances. Due to the distinct circumstances of enrollment and runtim… ▽ More

    Submitted 22 January, 2024; originally announced January 2024.

    Comments: Accepted to ICASSP 2024

  3. arXiv:2307.13236  [pdf, other

    cs.SD cs.CV cs.LG cs.MM eess.AS

    Audio-aware Query-enhanced Transformer for Audio-Visual Segmentation

    Authors: **xiang Liu, Chen Ju, Chaofan Ma, Yanfeng Wang, Yu Wang, Ya Zhang

    Abstract: The goal of the audio-visual segmentation (AVS) task is to segment the sounding objects in the video frames using audio cues. However, current fusion-based methods have the performance limitations due to the small receptive field of convolution and inadequate fusion of audio-visual features. To overcome these issues, we propose a novel \textbf{Au}dio-aware query-enhanced \textbf{TR}ansformer (AuTR… ▽ More

    Submitted 24 July, 2023; originally announced July 2023.

    Comments: arXiv admin note: text overlap with arXiv:2305.11019

  4. Score-Based Data Generation for EEG Spatial Covariance Matrices: Towards Boosting BCI Performance

    Authors: Ce Ju, Reinmar Josef Kobler, Cuntai Guan

    Abstract: The efficacy of Electroencephalogram (EEG) classifiers can be augmented by increasing the quantity of available data. In the case of geometric deep learning classifiers, the input consists of spatial covariance matrices derived from EEGs. In order to synthesize these spatial covariance matrices and facilitate future improvements of geometric deep learning classifiers, we propose a generative model… ▽ More

    Submitted 15 December, 2023; v1 submitted 22 February, 2023; originally announced February 2023.

    Comments: 7 pages, 4 figures; This work has been accepted by the 2023 45th Annual International Conference of the IEEE Engineering in Medicine & Biology Conference (IEEE EMBC 2023'). Copyright will be transferred without notice, after which this version may no longer be accessible

    ACM Class: I.2.0

  5. arXiv:2211.02641  [pdf, ps, other

    eess.SP cs.AI cs.LG

    Graph Neural Networks on SPD Manifolds for Motor Imagery Classification: A Perspective from the Time-Frequency Analysis

    Authors: Ce Ju, Cuntai Guan

    Abstract: The motor imagery (MI) classification has been a prominent research topic in brain-computer interfaces based on electroencephalography (EEG). Over the past few decades, the performance of MI-EEG classifiers has seen gradual enhancement. In this study, we amplify the geometric deep learning-based MI-EEG classifiers from the perspective of time-frequency analysis, introducing a new architecture call… ▽ More

    Submitted 20 August, 2023; v1 submitted 25 October, 2022; originally announced November 2022.

    Comments: 15 pages, 5 figures, 6 Tables; This work has been accepted by the IEEE Transactions on Neural Networks and Learning Systems, 2023. Copyright will be transferred without notice, after which this version may no longer be accessible

    ACM Class: I.2.0

  6. Adversarial Reweighting for Speaker Verification Fairness

    Authors: Minho **, Chelsea J. -T. Ju, Zeya Chen, Yi-Chieh Liu, Jasha Droppo, Andreas Stolcke

    Abstract: We address performance fairness for speaker verification using the adversarial reweighting (ARW) method. ARW is reformulated for speaker verification with metric learning, and shown to improve results across different subgroups of gender and nationality, without requiring annotation of subgroups in the training data. An adversarial network learns a weight for each training sample in the batch so t… ▽ More

    Submitted 15 July, 2022; originally announced July 2022.

    Journal ref: Proc. Interspeech, Sept. 2022, pp. 4800-4804

  7. arXiv:2206.12772  [pdf, other

    cs.CV cs.MM cs.SD eess.AS

    Exploiting Transformation Invariance and Equivariance for Self-supervised Sound Localisation

    Authors: **xiang Liu, Chen Ju, Weidi Xie, Ya Zhang

    Abstract: We present a simple yet effective self-supervised framework for audio-visual representation learning, to localize the sound source in videos. To understand what enables to learn useful representations, we systematically investigate the effects of data augmentations, and reveal that (1) composition of data augmentations plays a critical role, i.e. explicitly encouraging the audio-visual representat… ▽ More

    Submitted 15 August, 2022; v1 submitted 25 June, 2022; originally announced June 2022.

    Comments: Camera-ready Version for ACMMM 2022, Project page is https://**xiang-liu.github.io/SSL-TIE/

  8. arXiv:2202.02472  [pdf, ps, other

    eess.SP cs.CV cs.LG eess.IV

    Tensor-CSPNet: A Novel Geometric Deep Learning Framework for Motor Imagery Classification

    Authors: Ce Ju, Cuntai Guan

    Abstract: Deep learning (DL) has been widely investigated in a vast majority of applications in electroencephalography (EEG)-based brain-computer interfaces (BCIs), especially for motor imagery (MI) classification in the past five years. The mainstream DL methodology for the MI-EEG classification exploits the temporospatial patterns of EEG signals using convolutional neural networks (CNNs), which have remar… ▽ More

    Submitted 23 September, 2022; v1 submitted 4 February, 2022; originally announced February 2022.

    Comments: 15 pages, 10 figures, 12 tables; This work has been accepted by the IEEE Transactions on Neural Networks and Learning Systems. Copyright will be transferred without notice, after which this version may no longer be accessible

    ACM Class: I.2.0

  9. arXiv:2201.05745  [pdf, other

    cs.LG cs.AI eess.SP

    Deep Optimal Transport for Domain Adaptation on SPD Manifolds

    Authors: Ce Ju, Cuntai Guan

    Abstract: The machine learning community has shown increasing interest in addressing the domain adaptation problem on symmetric positive definite (SPD) manifolds. This interest is primarily driven by the complexities of neuroimaging data generated from brain signals, which often exhibit shifts in data distribution across recording sessions. These neuroimaging data, represented by signal covariance matrices,… ▽ More

    Submitted 3 June, 2024; v1 submitted 14 January, 2022; originally announced January 2022.

    Comments: This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible

    ACM Class: I.2.0

  10. arXiv:2106.10169  [pdf, other

    cs.LG cs.CL cs.SD eess.AS

    Fusion of Embeddings Networks for Robust Combination of Text Dependent and Independent Speaker Recognition

    Authors: Ruirui Li, Chelsea J. -T. Ju, Zeya Chen, Hongda Mao, Oguz Elibol, Andreas Stolcke

    Abstract: By implicitly recognizing a user based on his/her speech input, speaker identification enables many downstream applications, such as personalized system behavior and expedited shop** checkouts. Based on whether the speech content is constrained or not, both text-dependent (TD) and text-independent (TI) speaker recognition models may be used. We wish to combine the advantages of both types of mod… ▽ More

    Submitted 18 June, 2021; originally announced June 2021.

  11. arXiv:2102.03745  [pdf, other

    eess.SY math.OC

    Hierarchically Coordinated Energy Management for A Regional Multi-microgrid Community

    Authors: Chengquan Ju

    Abstract: This paper proposes a novel hierarchically coordinated energy management system (EMS) for a regional community (e.g., residential area, campus, industrial park, etc.) comprising multiple small-scale microgrids (MGs) (e.g., houses, buildings, etc.). It aims to minimize the total operational cost of the MG community and maximize the individual benefit of each MG simultaneously. At the local level in… ▽ More

    Submitted 7 February, 2021; originally announced February 2021.

    Comments: 11 pages

  12. arXiv:2011.03682  [pdf, other

    cs.SD eess.AS

    Non-local convolutional neural networks (nlcnn) for speaker recognition

    Authors: Haici Yang, Hongda Mao, Ruirui Li, Chelsea J. T. Ju, Oguz Elibol

    Abstract: Speaker recognition is the process of identifying a speaker based on the voice. The technology has attracted more attention with the recent increase in popularity of smart voice assistants, such as Amazon Alexa. In the past few years, various convolutional neural network (CNN) based speaker recognition algorithms have been proposed and achieved satisfactory performance. However, convolutional oper… ▽ More

    Submitted 19 May, 2021; v1 submitted 6 November, 2020; originally announced November 2020.

  13. Federated Transfer Learning for EEG Signal Classification

    Authors: Ce Ju, Dashan Gao, Ravikiran Mane, Ben Tan, Yang Liu, Cuntai Guan

    Abstract: The success of deep learning (DL) methods in the Brain-Computer Interfaces (BCI) field for classification of electroencephalographic (EEG) recordings has been restricted by the lack of large datasets. Privacy concerns associated with EEG signals limit the possibility of constructing a large EEG-BCI dataset by the conglomeration of multiple small ones for jointly training machine learning models. H… ▽ More

    Submitted 25 January, 2021; v1 submitted 26 April, 2020; originally announced April 2020.

    Comments: 6 pages, 2 figures, Accepted for IEEE Engineering in Medicine and Biology Society (EMBC) 2020 GitHub: https://github.com/DashanGao/Federated-Transfer-Leraning-for-EEG

    ACM Class: I.5.4

    Journal ref: 2020 42nd Annual International Conference of the IEEE Engineering in Medicine & Biology Society (EMBC), Montreal, QC, Canada, 2020, pp. 3040-3045

  14. arXiv:2002.08602  [pdf, other

    cs.RO eess.SY

    A Hybrid Systems-based Hierarchical Control Architecture for Heterogeneous Field Robot Teams

    Authors: Chanyoung Ju, Hyoung Il Son

    Abstract: Field robot systems have recently been applied to a wide range of research fields. Making such systems more automated, advanced, and activated requires cooperation among heterogeneous robots. Classic control theory is inefficient in managing large-scale complex dynamic systems. Therefore, the supervisory control theory based on discrete event system needs to be introduced to overcome this limitati… ▽ More

    Submitted 20 February, 2020; originally announced February 2020.

    Comments: 23pages, 19 figures, submitted for publication

  15. arXiv:2002.07630  [pdf

    math.OC cs.LG eess.SY

    Extending iLQR method with control delay

    Authors: Cheng Ju, Yan Qin, Chunjiang Fu

    Abstract: Iterative linear quadradic regulator(iLQR) has become a benchmark method to deal with nonlinear stochastic optimal control problem. However, it does not apply to delay system. In this paper, we extend the iLQR theory and prove new theorem in case of input signal with fixed delay. Which could be beneficial for machine learning or optimal control application to real time robot or human assistive dev… ▽ More

    Submitted 15 February, 2020; originally announced February 2020.

  16. arXiv:1909.05784  [pdf, other

    eess.SP cs.AI

    HHHFL: Hierarchical Heterogeneous Horizontal Federated Learning for Electroencephalography

    Authors: Dashan Gao, Ce Ju, Xiguang Wei, Yang Liu, Tianjian Chen, Qiang Yang

    Abstract: Electroencephalography (EEG) classification techniques have been widely studied for human behavior and emotion recognition tasks. But it is still a challenging issue since the data may vary from subject to subject, may change over time for the same subject, and maybe heterogeneous. Recent years, increasing privacy-preserving demands poses new challenges to this task. The data heterogeneity, as wel… ▽ More

    Submitted 10 September, 2020; v1 submitted 11 September, 2019; originally announced September 2019.

    Comments: 5 pages, 6 figures, Accepted for International Workshop on Federated Machine Learning for User Privacy and Data Confidentiality in Conjunction with IJCAI 2019 (FL-IJCAI'2019)

    ACM Class: I.2.6; I.2.11