Showing 1–50 of 114 results for author: Vu, T

Searching in archive eess.
  1. Joint Optimization of Switching Point and Power Control in Dynamic TDD Cell-Free Massive MIMO

    Authors: Martin Andersson, Tung T. Vu, Pål Frenger, Erik G. Larsson

    Abstract: We consider a cell-free massive multiple-input multiple-output (CFmMIMO) network operating in dynamic time division duplex (DTDD). The switching point between the uplink (UL) and downlink (DL) data transmission phases can be adapted dynamically to the instantaneous quality-of-service (QoS) requirements in order to improve energy efficiency (EE). To this end, we formulate a problem of optimizing th…

    Submitted 20 June, 2024; originally announced June 2024.

    Comments: Presented at the Asilomar Conference on Signals, Systems, and Computers 2023

  2. arXiv:2406.06406  [pdf, other]

    cs.CL cs.SD eess.AS

    Controlling Emotion in Text-to-Speech with Natural Language Prompts

    Authors: Thomas Bott, Florian Lux, Ngoc Thang Vu

    Abstract: In recent years, prompting has quickly become one of the standard ways of steering the outputs of generative machine learning models, due to its intuitive use of natural language. In this work, we propose a system conditioned on embeddings derived from an emotionally rich text that serves as prompt. Thereby, a joint representation of speaker and prompt embeddings is integrated at several points wi…

    Submitted 11 June, 2024; v1 submitted 10 June, 2024; originally announced June 2024.

    Comments: accepted at Interspeech 2024

  3. arXiv:2406.06403  [pdf, other]

    cs.CL cs.LG cs.SD eess.AS

    Meta Learning Text-to-Speech Synthesis in over 7000 Languages

    Authors: Florian Lux, Sarina Meyer, Lyonel Behringer, Frank Zalkow, Phat Do, Matt Coler, Emanuël A. P. Habets, Ngoc Thang Vu

    Abstract: In this work, we take on the challenging task of building a single text-to-speech synthesis system that is capable of generating speech in over 7000 languages, many of which lack sufficient data for traditional TTS development. By leveraging a novel integration of massively multilingual pretraining and meta learning to approximate language representations, our approach enables zero-shot speech syn…

    Submitted 10 June, 2024; originally announced June 2024.

    Comments: accepted at Interspeech 2024

  4. arXiv:2404.10922  [pdf, other]

    cs.CL cs.SD eess.AS

    Teaching a Multilingual Large Language Model to Understand Multilingual Speech via Multi-Instructional Training

    Authors: Pavel Denisov, Ngoc Thang Vu

    Abstract: Recent advancements in language modeling have led to the emergence of Large Language Models (LLMs) capable of various natural language processing tasks. Despite their success in text-based tasks, applying LLMs to the speech domain remains limited and challenging. This paper presents BLOOMZMMS, a novel model that integrates a multilingual LLM with a multilingual speech encoder, aiming to harness th…

    Submitted 16 April, 2024; originally announced April 2024.

    Comments: NAACL Findings 2024

  5. arXiv:2403.17913  [pdf, ps, other]

    eess.SP

    Enhancing Indoor and Outdoor THz Communications with Beyond Diagonal-IRS: Optimization and Performance Analysis

    Authors: Asad Mahmood, Thang X. Vu, Symeon Chatzinotas, Björn Ottersten

    Abstract: This work investigates the application of Beyond Diagonal Intelligent Reflective Surface (BD-IRS) to enhance THz downlink communication systems, operating in a hybrid: reflective and transmissive mode, to simultaneously provide services to indoor and outdoor users. We propose an optimization framework that jointly optimizes the beamforming vectors and phase shifts in the hybrid reflective/transmis…

    Submitted 9 May, 2024; v1 submitted 26 March, 2024; originally announced March 2024.

  6. Real-time hybrid controls of energy storage and load shedding for integrated power and energy systems of ships

    Authors: Linh Vu, Thai-Thanh Nguyen, Bang Le-Huy Nguyen, Md Isfakul Anam, Tuyen Vu

    Abstract: This paper presents an original energy management methodology to enhance the resilience of ship power systems. The integration of various energy storage systems (ESS), including battery energy storage systems (BESS) and super-capacitor energy storage systems (SCESS), in modern ship power systems poses challenges in designing an efficient energy management system (EMS). The EMS proposed in this pap…

    Submitted 2 March, 2024; originally announced March 2024.

    Comments: 15 pages, 17 figures

    Journal ref: Electric Power Systems Research, volume 229, pages 110191, year 2024

  7. Cell-Free Massive MIMO with Multi-Antenna Users and Phase Misalignments: A Novel Partially Coherent Transmission Framework

    Authors: Unnikrishnan Kunnath Ganesan, Tung Thanh Vu, Erik G. Larsson

    Abstract: Cell-free massive multiple-input multiple-output (MIMO) is a promising technology for next-generation communication systems. This work proposes a novel partially coherent (PC) transmission framework to cope with the challenge of phase misalignment among the access points (APs), which is important for unlocking the full potential of cell-free massive MIMO technology. With the PC operation, the APs…

    Submitted 3 April, 2024; v1 submitted 1 March, 2024; originally announced March 2024.

    Comments: 17 pages, 10 figures. Published in IEEE Open Journal of the Communications Society

  8. arXiv:2401.05425  [pdf, other]

    eess.SP cs.LG

    An Unobtrusive and Lightweight Ear-worn System for Continuous Epileptic Seizure Detection

    Authors: Abdul Aziz, Nhat Pham, Neel Vora, Cody Reynolds, Jaime Lehnen, Pooja Venkatesh, Zhuoran Yao, Jay Harvey, Tam Vu, Kan Ding, Phuc Nguyen

    Abstract: Epilepsy is one of the most common neurological diseases globally, affecting around 50 million people worldwide. Fortunately, up to 70 percent of people with epilepsy could live seizure-free if properly diagnosed and treated, and a reliable technique to monitor the onset of seizures could improve the quality of life of patients who are constantly facing the fear of random seizure attacks. The scal…

    Submitted 1 January, 2024; originally announced January 2024.

  9. arXiv:2401.02701  [pdf, ps, other]

    cs.IT eess.SP

    Joint User Association and Power Control for Cell-Free Massive MIMO

    Authors: Chongzheng Hao, Tung Thanh Vu, Hien Quoc Ngo, Minh N. Dao, Xiaoyu Dang, Chenghua Wang, Michail Matthaiou

    Abstract: This work proposes novel approaches that jointly design user equipment (UE) association and power control (PC) in a downlink user-centric cell-free massive multiple-input multiple-output (CFmMIMO) network, where each UE is only served by a set of access points (APs) for reducing the fronthaul signalling and computational complexity. In order to maximize the sum spectral efficiency (SE) of the UEs,…

    Submitted 20 May, 2024; v1 submitted 5 January, 2024; originally announced January 2024.

    Comments: minor revision of the previous version

  10. arXiv:2312.17738  [pdf, other]

    eess.SY

    Physics-informed Graphical Neural Network for Power System State Estimation

    Authors: Quang-Ha Ngo, Bang L. H. Nguyen, Tuyen V. Vu, Jianhua Zhang, Tuan Ngo

    Abstract: State estimation is highly critical for accurately observing the dynamic behavior of the power grids and minimizing risks from cyber threats. However, existing state estimation methods encounter challenges in accurately capturing power system dynamics, primarily because of limitations in encoding the grid topology and sparse measurements. This paper proposes a physics-informed graphical learning s…

    Submitted 29 December, 2023; originally announced December 2023.

    Comments: 11 pages, 17 figures, journal accepted

  11. arXiv:2312.11127  [pdf, other]

    cs.IT eess.SP

    User-centric Flexible Resource Management Framework for LEO Satellites with Fully Regenerative Payload

    Authors: Sovit Bhandari, Thang X. Vu, Symeon Chatzinotas

    Abstract: The regenerative capabilities of next-generation satellite systems offer a novel approach to design low earth orbit (LEO) satellite communication systems, enabling full flexibility in bandwidth and spot beam management, power control, and onboard data processing. These advancements allow the implementation of intelligent spatial multiplexing techniques, addressing the ever-increasing demand for fu…

    Submitted 18 December, 2023; originally announced December 2023.

    Comments: To appear in IEEE JSAC

  12. arXiv:2311.07199  [pdf, ps, other]

    eess.SP

    Joint Computation and Communication Resource Optimization for Beyond Diagonal UAV-IRS Empowered MEC Networks

    Authors: Asad Mahmood, Thang X. Vu, Wali Ullah Khan, Symeon Chatzinotas, Björn Ottersten

    Abstract: Recent advancements in 6G systems signal a leap towards universal connectivity and ultra-reliable, low-latency communications for real-time data devices. Yet, these advancements encounter obstacles such as limited device battery life and computational power, along with urban signal blockages. To counter these, Intelligent Reconfigurable Surfaces (IRS) within Mobile Edge Cloud (MEC) infrastructures…

    Submitted 15 March, 2024; v1 submitted 13 November, 2023; originally announced November 2023.

  13. arXiv:2311.05049  [pdf, ps, other]

    eess.SP

    Constrained Independent Vector Analysis with Reference for Multi-Subject fMRI Analysis

    Authors: Trung Vu, Francisco Laport, Hanlu Yang, Vince D. Calhoun, Tulay Adali

    Abstract: Independent component analysis (ICA) is now a widely used solution for the analysis of multi-subject functional magnetic resonance imaging (fMRI) data. Independent vector analysis (IVA) generalizes ICA to multiple datasets, i.e., to multi-subject data, and in addition to higher-order statistical information in ICA, it leverages the statistical dependence across the datasets as an additional type o…

    Submitted 8 November, 2023; originally announced November 2023.

    Comments: 11 pages

  14. Controllable Generation of Artificial Speaker Embeddings through Discovery of Principal Directions

    Authors: Florian Lux, Pascal Tilli, Sarina Meyer, Ngoc Thang Vu

    Abstract: Customizing voice and speaking style in a speech synthesis system with intuitive and fine-grained controls is challenging, given that little data with appropriate labels is available. Furthermore, editing an existing human's voice also comes with ethical concerns. In this paper, we propose a method to generate artificial speaker embeddings that cannot be linked to a real human while offering intui…

    Submitted 26 October, 2023; originally announced October 2023.

    Comments: Published at ISCA Interspeech 2023 https://www.isca-speech.org/archive/interspeech_2023/lux23_interspeech.html

  15. arXiv:2310.17499  [pdf, other]

    cs.CL cs.LG eess.AS

    The IMS Toucan System for the Blizzard Challenge 2023

    Authors: Florian Lux, Julia Koch, Sarina Meyer, Thomas Bott, Nadja Schauffler, Pavel Denisov, Antje Schweitzer, Ngoc Thang Vu

    Abstract: For our contribution to the Blizzard Challenge 2023, we improved on the system we submitted to the Blizzard Challenge 2021. Our approach entails a rule-based text-to-phoneme processing system that includes rule-based disambiguation of homographs in the French language. It then transforms the phonemes to spectrograms as intermediate representations using a fast and efficient non-autoregressive synt…

    Submitted 26 October, 2023; originally announced October 2023.

    Comments: Published at the Blizzard Challenge Workshop 2023, colocated with the Speech Synthesis Workshop 2023, a satellite event of Interspeech 2023

  16. arXiv:2310.12574  [pdf]

    eess.IV cs.CV

    A reproducible 3D convolutional neural network with dual attention module (3D-DAM) for Alzheimer's disease classification

    Authors: Thanh Phuong Vu, Tien Nhat Nguyen, N. Minh Nhat Hoang, Gia Minh Hoang

    Abstract: Alzheimer's disease is one of the most common types of neurodegenerative disease, characterized by the accumulation of amyloid-beta plaque and tau tangles. Recently, deep learning approaches have shown promise in Alzheimer's disease diagnosis. In this study, we propose a reproducible model that utilizes a 3D convolutional neural network with a dual attention module for Alzheimer's disease classifi…

    Submitted 4 March, 2024; v1 submitted 19 October, 2023; originally announced October 2023.

  17. arXiv:2310.10549  [pdf, other]

    cs.NI eess.SP

    Applications of Distributed Machine Learning for the Internet-of-Things: A Comprehensive Survey

    Authors: Mai Le, Thien Huynh-The, Tan Do-Duy, Thai-Hoc Vu, Won-Joo Hwang, Quoc-Viet Pham

    Abstract: The emergence of new services and applications in emerging wireless networks (e.g., beyond 5G and 6G) has shown a growing demand for the usage of artificial intelligence (AI) in the Internet of Things (IoT). However, the proliferation of massive IoT connections and the availability of computing resources distributed across future IoT systems have strongly demanded the development of distributed AI…

    Submitted 16 October, 2023; originally announced October 2023.

  18. arXiv:2310.06103  [pdf, other]

    cs.CL cs.SD eess.AS

    Leveraging Multilingual Self-Supervised Pretrained Models for Sequence-to-Sequence End-to-End Spoken Language Understanding

    Authors: Pavel Denisov, Ngoc Thang Vu

    Abstract: A number of methods have been proposed for End-to-End Spoken Language Understanding (E2E-SLU) using pretrained models, however their evaluation often lacks multilingual setup and tasks that require prediction of lexical fillers, such as slot filling. In this work, we propose a unified method that integrates multilingual pretrained speech and text models and performs E2E-SLU on six datasets in four…

    Submitted 9 October, 2023; originally announced October 2023.

    Comments: IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU) 2023

  19. arXiv:2310.01024  [pdf, other]

    cs.IT cs.NI eess.SP

    Joint Source-Channel Coding System for 6G Communication: Design, Prototype and Future Directions

    Authors: Xinchao Zhong, Sean Longyu Ma, Hong-fu Chou, Arsham Mostaani, Thang X. Vu, Symeon Chatzinotas

    Abstract: The goal of semantic communication is to surpass optimal Shannon's criterion regarding a notable problem for future communication which lies in the integration of collaborative efforts between the intelligence of the transmission source and the joint design of source coding and channel coding. The convergence of scholarly investigation and applicable products in the field of semantic communication…

    Submitted 2 October, 2023; originally announced October 2023.

    Comments: 14 pages, 9 figures, Journal

  20. VoicePAT: An Efficient Open-source Evaluation Toolkit for Voice Privacy Research

    Authors: Sarina Meyer, Xiaoxiao Miao, Ngoc Thang Vu

    Abstract: Speaker anonymization is the task of modifying a speech recording such that the original speaker cannot be identified anymore. Since the first Voice Privacy Challenge in 2020, along with the release of a framework, the popularity of this research topic is continually increasing. However, the comparison and combination of different anonymization approaches remains challenging due to the complexity…

    Submitted 21 December, 2023; v1 submitted 14 September, 2023; originally announced September 2023.

    Comments: Accepted by OJSP-ICASSP 2024 https://ieeexplore.ieee.org/document/10365329

  21. arXiv:2306.14380  [pdf, ps, other]

    eess.SP

    A New Optimal Subpattern Assignment (OSPA) Metric for Multi-target Filtering

    Authors: Tuyet Vu

    Abstract: This paper proposes and evaluates a new metric. This metric will overcome a limitation of the Optimal Subpattern Assignment (OSPA) metric mentioned by Schuhmacher et al.: the OSPA distance between two sets of points is insensitive to the case where one is empty. This proposed metric, called Complete OSPA (COSPA), retains all the advantages of the OSPA metric for evaluating the performance of mu…

    Submitted 25 June, 2023; originally announced June 2023.

  22. Multi-agent Deep Reinforcement Learning for Distributed Load Restoration

    Authors: Linh Vu, Tuyen Vu, Thanh-Long Vu, Anurag Srivastava

    Abstract: This paper addresses the load restoration problem after power outage events. Our primary proposed methodology is using multi-agent deep reinforcement learning to optimize the load restoration process in distribution systems, modeled as networked microgrids, via determining the optimal operational sequence of circuit breakers (switches). An innovative invalid action masking technique is incorporate…

    Submitted 24 June, 2023; originally announced June 2023.

    Comments: 12 pages, 19 figures, journal under review

  23. A Cyber-HIL for Investigating Control Systems in Ship Cyber Physical Systems under Communication Issues and Cyber Attacks

    Authors: Linh Vu, Lam Nguyen, Mahmoud Abdelaal, Tuyen Vu, Osama Mohammed

    Abstract: This paper presents a novel Cyber-Hardware-in-the-Loop (Cyber-HIL) platform for assessing control operation in ship cyber-physical systems. The proposed platform employs cutting-edge technologies, including Docker containers, real-time simulator OPAL-RT, and network emulator ns3, to create a secure and controlled testing and deployment environment for investigating the potential impact of cybe…

    Submitted 25 August, 2023; v1 submitted 24 June, 2023; originally announced June 2023.

    Comments: 10 pages, 16 figures, journal under review

    Journal ref: IEEE Transactions on Industry Applications, vol. 60, no. 2, pp. 2142-2152, March-April 2024

  24. arXiv:2305.04519  [pdf, other]

    eess.SP

    Joint Optimization of 3D Placement and Radio Resource Allocation for per-UAV Sum Rate Maximization

    Authors: Asad Mahmood, Thang X. Vu, Symeon Chatzinotas, Björn Ottersten

    Abstract: Unmanned aerial vehicles (UAV) have emerged as a practical solution that provides on-demand services to users in areas where the terrestrial network is non-existent or temporarily unavailable, e.g., due to natural disasters or network congestion. In general, UAVs' user-serving capacity is typically constrained by their limited battery life and the finite communication resources that highly impact…

    Submitted 8 May, 2023; originally announced May 2023.

  25. arXiv:2305.00769  [pdf, other]

    eess.SP cs.LG

    Multi-scale Transformer-based Network for Emotion Recognition from Multi Physiological Signals

    Authors: Tu Vu, Van Thong Huynh, Soo-Hyung Kim

    Abstract: This paper presents an efficient Multi-scale Transformer-based approach for the task of Emotion recognition from Physiological data, which has gained widespread attention in the research community due to the vast amount of information that can be extracted from these signals using modern sensors and machine learning techniques. Our approach involves applying a Multi-modal technique combined with s…

    Submitted 7 May, 2023; v1 submitted 1 May, 2023; originally announced May 2023.

  26. arXiv:2304.13008  [pdf, other]

    eess.SP

    Artificial Intelligence for Satellite Communication and Non-Terrestrial Networks: A Survey

    Authors: G. Fontanesi, F. Ortíz, E. Lagunas, V. Monzon Baeza, M. Á. Vázquez, J. A. Vásquez-Peralvo, M. Minardi, H. N. Vu, P. J. Honnaiah, C. Lacoste, Y. Drif, T. S. Abdu, G. Eappen, J. Rehman, L. M. Garcés-Socorrás, W. A. Martins, P. Henarejos, H. Al-Hraishawi, J. C. Merlano Duncan, T. X. Vu, S. Chatzinotas

    Abstract: This paper surveys the application and development of Artificial Intelligence (AI) in Satellite Communication (SatCom) and Non-Terrestrial Networks (NTN). We first present a comprehensive list of use cases, the relative challenges and the main AI tools capable of addressing those challenges. For each use case, we present the main motivation, a system description, the available non-AI solutions and…

    Submitted 25 April, 2023; originally announced April 2023.

  27. arXiv:2304.12798  [pdf, other]

    eess.SP

    Multi-Objective Optimization for 3D Placement and Resource Allocation in OFDMA-based Multi-UAV Networks

    Authors: Asad Mahmood, Thang X. Vu, Shree Krishna Sharma, Symeon Chatzinotas, Björn Ottersten

    Abstract: This work considers the orthogonal frequency division multiple access (OFDMA) technology that enables multiple unmanned aerial vehicles (multi-UAV) communication systems to provide on-demand services. The main aim of this work is to derive the optimal allocation of radio resources, 3D placement of UAVs, and user association matrices. To achieve the desired objectives, we decoupled the original joi…

    Submitted 25 April, 2023; originally announced April 2023.

  28. arXiv:2304.04478  [pdf, other]

    cs.CL cs.SD eess.AS

    Oh, Jeez! or Uh-huh? A Listener-aware Backchannel Predictor on ASR Transcriptions

    Authors: Daniel Ortega, Chia-Yu Li, Ngoc Thang Vu

    Abstract: This paper presents our latest investigation on modeling backchannel in conversations. Motivated by a proactive backchanneling theory, we aim at developing a system which acts as a proactive listener by inserting backchannels, such as continuers and assessment, to influence speakers. Our model takes into account not only lexical and acoustic cues, but also introduces the simple and novel idea of u…

    Submitted 10 April, 2023; originally announced April 2023.

    Comments: Published in ICASSP 2020

  29. Solving Differential-Algebraic Equations in Power System Dynamic Analysis with Quantum Computing

    Authors: Huynh Trung Thanh Tran, Hieu T. Nguyen, Long T. Vu, Samuel T. Ojetola

    Abstract: Power system dynamics are generally modeled by high dimensional nonlinear differential-algebraic equations (DAEs) given a large number of components forming the network. These DAEs' complexity can grow exponentially due to the increasing penetration of distributed energy resources, whereas their computation time becomes sensitive due to the increasing interconnection of the power grid with other e…

    Submitted 1 March, 2024; v1 submitted 19 February, 2023; originally announced February 2023.

    Comments: This version was uploaded as an incorrect replacement, and was intended as a replacement of arXiv:2306.01961. I need to withdraw this paper to upload it as a replacement of the correct paper

    Journal ref: Energy Conversion and Economics, Volume 5, Issue 1, Feb 2024, pages 40-53

  30. arXiv:2302.02711  [pdf, other]

    cs.LG eess.SY

    Network-Aided Intelligent Traffic Steering in 6G O-RAN: A Multi-Layer Optimization Framework

    Authors: Van-Dinh Nguyen, Thang X. Vu, Nhan Thanh Nguyen, Dinh C. Nguyen, Markku Juntti, Nguyen Cong Luong, Dinh Thai Hoang, Diep N. Nguyen, Symeon Chatzinotas

    Abstract: To enable an intelligent, programmable and multi-vendor radio access network (RAN) for 6G networks, considerable efforts have been made in standardization and development of open RAN (O-RAN). So far, however, the applicability of O-RAN in controlling and optimizing RAN functions has not been widely investigated. In this paper, we jointly optimize the flow-split distribution, congestion control and…

    Submitted 29 May, 2023; v1 submitted 6 February, 2023; originally announced February 2023.

    Comments: 15 pages, 10 figures. A short version will be submitted to IEEE GLOBECOM 2023

  31. arXiv:2301.01628  [pdf, other]

    cs.IT cs.MA eess.SY

    Task-Effective Compression of Observations for the Centralized Control of a Multi-agent System Over Bit-Budgeted Channels

    Authors: Arsham Mostaani, Thang X. Vu, Symeon Chatzinotas, Bjorn Ottersten

    Abstract: We consider a task-effective quantization problem that arises when multiple agents are controlled via a centralized controller (CC). While agents have to communicate their observations to the CC for decision-making, the bit-budgeted communications of agent-CC links may limit the task-effectiveness of the system which is measured by the system's average sum of stage costs/rewards. As a result, each…

    Submitted 4 January, 2023; originally announced January 2023.

  32. arXiv:2211.02930  [pdf]

    eess.SY cs.LG

    1-D Convolutional Graph Convolutional Networks for Fault Detection in Distributed Energy Systems

    Authors: Bang L. H. Nguyen, Tuyen Vu, Thai-Thanh Nguyen, Mayank Panwar, Rob Hovsapian

    Abstract: This paper presents a 1-D convolutional graph neural network for fault detection in microgrids. The combination of 1-D convolutional neural networks (1D-CNN) and graph convolutional networks (GCN) helps extract both spatial-temporal correlations from the voltage measurements in microgrids. The fault detection scheme includes fault event detection, fault type and phase classification, and fault loc…

    Submitted 5 November, 2022; originally announced November 2022.

    Comments: arXiv admin note: text overlap with arXiv:2210.15177

  33. arXiv:2211.02928  [pdf]

    eess.SY

    Hierarchical Control of Grid-Connected Hydrogen Electrolyzer Providing Grid Services

    Authors: Bang L. H. Nguyen, Mayank Panwar, Rob Hovsapian, Yashodhan Agalgaokar, Tuyen Vu

    Abstract: This paper presents the operation modes and control architecture of the grid-connected hydrogen electrolyzer systems for the provision of frequency and voltage supports. The analysis is focused on the primary and secondary loops in the hierarchical control scheme. At the power converter inner control loop, the voltage- and current-control modes are analyzed. At the primary level, the droop and opp…

    Submitted 5 November, 2022; originally announced November 2022.

  34. arXiv:2211.02592  [pdf]

    eess.SY

    A Large-Scale Study of a Sleep Tracking and Improving Device with Closed-loop and Personalized Real-time Acoustic Stimulation

    Authors: Anh Nguyen, Galen Pogoncheff, Ban Xuan Dong, Nam Bui, Hoang Truong, Nhat Pham, Linh Nguyen, Hoang Huu Nguyen, Sy Duong-Quy, Sangtae Ha, Tam Vu

    Abstract: Various intervention therapies ranging from pharmaceutical to hi-tech tailored solutions have been available to treat difficulty in falling asleep commonly caused by insomnia in modern life. However, current techniques largely remain ill-suited, ineffective, and unreliable due to their lack of precise real-time sleep tracking, in-time feedback on the therapies, an ability to keep people asleep dur…

    Submitted 4 November, 2022; originally announced November 2022.

    Comments: 33 pages, 8 figures

  35. arXiv:2210.15177  [pdf]

    eess.SP

    Spatial-Temporal Recurrent Graph Neural Networks for Fault Diagnostics in Power Distribution Systems

    Authors: Bang Nguyen, Tuyen Vu, Thai-Thanh Nguyen, Mayank Panwar, Rob Hovsapian

    Abstract: Fault diagnostics are extremely important to decide proper actions toward fault isolation and system restoration. The growing integration of inverter-based distributed energy resources imposes strong influences on fault detection using traditional overcurrent relays. This paper utilizes emerging graph learning techniques to build new temporal recurrent graph neural network models for fault diagn…

    Submitted 27 October, 2022; originally announced October 2022.

  36. arXiv:2210.12223  [pdf, other]

    cs.CL cs.SD eess.AS

    Low-Resource Multilingual and Zero-Shot Multispeaker TTS

    Authors: Florian Lux, Julia Koch, Ngoc Thang Vu

    Abstract: While neural methods for text-to-speech (TTS) have shown great advances in modeling multiple speakers, even in zero-shot settings, the amount of data needed for those approaches is generally not feasible for the vast majority of the world's over 6,000 spoken languages. In this work, we bring together the tasks of zero-shot voice cloning and multilingual low-resource TTS. Using the language agnosti…

    Submitted 21 October, 2022; originally announced October 2022.

    Comments: Accepted to AACL 2022

  37. arXiv:2210.11642  [pdf, other]

    cs.CL cs.SD eess.AS

    Improving Semi-supervised End-to-end Automatic Speech Recognition using CycleGAN and Inter-domain Losses

    Authors: Chia-Yu Li, Ngoc Thang Vu

    Abstract: We propose a novel method that combines CycleGAN and inter-domain losses for semi-supervised end-to-end automatic speech recognition. Inter-domain loss targets the extraction of an intermediate shared representation of speech and text inputs using a shared network. CycleGAN uses cycle-consistent loss and the identity mapping loss to preserve relevant characteristics of the input feature after conv…

    Submitted 20 October, 2022; originally announced October 2022.

    Comments: 6 pages + 2 references, 6 figures, accepted by SLT2022

  38. arXiv:2210.08829  [pdf, other]

    eess.SY

    Intelligent Traffic Steering in Beyond 5G Open RAN based on LSTM Traffic Prediction

    Authors: Fatemeh Kavehmadavani, Van-Dinh Nguyen, Thang X. Vu, Symeon Chatzinotas

    Abstract: Open radio access network (ORAN) Alliance offers a disaggregated RAN functionality built using open interface specifications between blocks. To efficiently support various competing services, namely enhanced mobile broadband (eMBB) and ultra-reliable and low-latency (uRLLC), the ORAN Alliance has introduced a standard approach toward more virtualized, open and intelligent networks. To rea…

    Submitted 17 October, 2022; originally announced October 2022.

  39. arXiv:2210.07002  [pdf, other]

    cs.SD cs.CL eess.AS

    Anonymizing Speech with Generative Adversarial Networks to Preserve Speaker Privacy

    Authors: Sarina Meyer, Pascal Tilli, Pavel Denisov, Florian Lux, Julia Koch, Ngoc Thang Vu

    Abstract: In order to protect the privacy of speech data, speaker anonymization aims for hiding the identity of a speaker by changing the voice in speech recordings. This typically comes with a privacy-utility trade-off between protection of individuals and usability of the data for downstream applications. One of the challenges in this context is to create non-existent voices that sound as natural as possi…

    Submitted 20 October, 2022; v1 submitted 13 October, 2022; originally announced October 2022.

    Comments: IEEE Spoken Language Technology Workshop 2022

  40. arXiv:2210.04041  [pdf, ps, other]

    cs.IT cs.LG eess.SP

    Almost-lossless compression of a low-rank random tensor

    Authors: Minh Thanh Vu

    Abstract: In this work, we establish an asymptotic limit of almost-lossless compression of a random, finite alphabet tensor which admits a low-rank canonical polyadic decomposition.

    Submitted 23 October, 2022; v1 submitted 8 October, 2022; originally announced October 2022.

    Comments: This version fixes typos and adds some remarks

    MSC Class: 68P30; 15A69

  41. arXiv:2209.07385  [pdf, other]

    eess.SY

    Resilient Communication Scheme for Distributed Decision of Interconnecting Networks of Microgrids

    Authors: Thanh Long Vu, Sayak Mukherjee, Veronica Adetola

    Abstract: Networking of microgrids can provide the operational flexibility needed for the increasing number of DERs deployed at the distribution level and supporting end-use demand when there is loss of the bulk power system. But, networked microgrids are vulnerable to cyber-physical attacks and faults due to the complex interconnections. As such, it is necessary to design resilient control systems to suppo…

    Submitted 15 September, 2022; originally announced September 2022.

  42. arXiv:2209.05969  [pdf]

    eess.SY

    Integrated Multiport Bidirectional DC-DC Converter for HEV/FCV Applications

    Authors: Bang Le-Huy Nguyen, Honnyong Cha, Tuyen Vu, Thai-Thanh Nguyen

    Abstract: This paper proposes a novel integrated multiport bidirectional dc-dc converter to interface the battery, the ultra-capacitor, the fuel cell, or other energy sources with the dc-link capacitor of the hybrid energy systems such as the hybrid electric vehicle (HEV) and fuel cell vehicle (FCV) applications. The proposed converter can be applied to the distributed generation systems which include local…

    Submitted 13 September, 2022; originally announced September 2022.

  43. arXiv:2209.05967  [pdf]

    eess.SY

    Power Converter Topologies for Electrolyzer Applications to Enable Electric Grid Services

    Authors: Bang L. H. Nguyen, Mayank Panwar, Rob Hovsapian, Kazunori Nagasawa, Tuyen V. Vu

    Abstract: Hydrogen electrolyzers, with their operational flexibility, can be configured as smart dynamic loads which can provide grid services and facilitate the integration of more renewable energy sources into the electrical grid. However, to enable this ability, the electrolyzer system should be able to control both active and reactive power in coordination with the low-level controller of the electrolyz…

    Submitted 13 September, 2022; originally announced September 2022.

  44. arXiv:2209.05962  [pdf]

    eess.SY

    Integrated Multiport Back-to-Back Power Converter for Type-4 Wind Turbine Generator with Hybrid Energy Storage System

    Authors: Bang Le-Huy Nguyen, Thai-Thanh Nguyen, Van-Long Pham, Tuyen Vu, Mayank Panwar, Rob Hovsapian

    Abstract: This paper proposes a novel integrated multiport bidirectional back-to-back power converter for a type-4 wind turbine that accommodates a battery and supercapacitor for energy storage. The circuit topology reduces 4 switches compared to the traditional configuration. Moreover, owing to the dual-buck structure embedded in the phase leg, the circuitry has no short-circuit path, therefore it withstan…

    Submitted 13 September, 2022; originally announced September 2022.

  45. arXiv:2208.00097  [pdf, other]

    stat.AP cs.LG eess.IV physics.data-an stat.ME

    Robust Rayleigh Regression Method for SAR Image Processing in Presence of Outliers

    Authors: B. G. Palm, F. M. Bayer, R. Machado, M. I. Pettersson, V. T. Vu, R. J. Cintra

    Abstract: The presence of outliers (anomalous values) in synthetic aperture radar (SAR) data and the misspecification in statistical image models may result in inaccurate inferences. To avoid such issues, the Rayleigh regression model based on a robust estimation process is proposed as a more realistic approach to model this type of data. This paper aims at obtaining Rayleigh regression model parameter esti…

    Submitted 29 July, 2022; originally announced August 2022.

    Comments: 17 pages, 5 figures, 4 tables

    Journal ref: IEEE Transactions on Geoscience and Remote Sensing, v. 60, 2021

  46. arXiv:2207.11400  [pdf, other]

    eess.IV eess.SP physics.data-an stat.AP stat.ME

    Wavelength-Resolution SAR Ground Scene Prediction Based on Image Stack

    Authors: B. G. Palm, D. I. Alves, M. I. Pettersson, V. T. Vu, R. Machado, R. J. Cintra, F. M. Bayer, P. Dammert, H. Hellsten

    Abstract: This paper presents five different statistical methods for ground scene prediction (GSP) in wavelength-resolution synthetic aperture radar (SAR) images. The GSP image can be used as a reference image in a change detection algorithm yielding a high probability of detection and low false alarm rate. The predictions are based on image stacks, which are composed of images from the same scene acquired…

    Submitted 22 July, 2022; originally announced July 2022.

    Comments: 15 pages, 8 figures, 3 tables

    Journal ref: Sensors 2020, 20(7)

  47. arXiv:2207.05549  [pdf, other]

    eess.AS cs.CL cs.LG cs.SD

    PoeticTTS -- Controllable Poetry Reading for Literary Studies

    Authors: Julia Koch, Florian Lux, Nadja Schauffler, Toni Bernhart, Felix Dieterle, Jonas Kuhn, Sandra Richter, Gabriel Viehhauser, Ngoc Thang Vu

    Abstract: Speech synthesis for poetry is challenging due to specific intonation patterns inherent to poetic speech. In this work, we propose an approach to synthesise poems with almost human like naturalness in order to enable literary scholars to systematically examine hypotheses on the interplay between text, spoken realisation, and the listener's perception of poems. To meet these special requirements fo…

    Submitted 18 October, 2022; v1 submitted 11 July, 2022; originally announced July 2022.

    Comments: Presented at Interspeech 2022

  48. arXiv:2207.04834  [pdf, other]

    cs.SD cs.CR cs.LG eess.AS

    Speaker Anonymization with Phonetic Intermediate Representations

    Authors: Sarina Meyer, Florian Lux, Pavel Denisov, Julia Koch, Pascal Tilli, Ngoc Thang Vu

    Abstract: In this work, we propose a speaker anonymization pipeline that leverages high quality automatic speech recognition and synthesis systems to generate speech conditioned on phonetic transcriptions and anonymized speaker embeddings. Using phones as the intermediate representation ensures near complete elimination of speaker identity information from the input while preserving the original phonetic co…

    Submitted 11 July, 2022; originally announced July 2022.

    Comments: Accepted at Interspeech 2022

  49. arXiv:2206.12229  [pdf, other]

    cs.SD cs.CL eess.AS

    Exact Prosody Cloning in Zero-Shot Multispeaker Text-to-Speech

    Authors: Florian Lux, Julia Koch, Ngoc Thang Vu

    Abstract: The cloning of a speaker's voice using an untranscribed reference sample is one of the great advances of modern neural text-to-speech (TTS) methods. Approaches for mimicking the prosody of a transcribed reference audio have also been proposed recently. In this work, we bring these two tasks together for the first time through utterance level normalization in conjunction with an utterance level spe…

    Submitted 21 October, 2022; v1 submitted 24 June, 2022; originally announced June 2022.

    Comments: Accepted to IEEE SLT 2022

  50. arXiv:2206.10832  [pdf, ps, other]

    math.OC eess.SP

    On Local Linear Convergence of Projected Gradient Descent for Unit-Modulus Least Squares

    Authors: Trung Vu, Raviv Raich, Xiao Fu

    Abstract: The unit-modulus least squares (UMLS) problem has a wide spectrum of applications in signal processing, e.g., phase-only beamforming, phase retrieval, radar code design, and sensor network localization. Scalable first-order methods such as projected gradient descent (PGD) have recently been studied as a simple yet efficient approach to solving the UMLS problem. Existing results on the convergence…

    Submitted 1 July, 2022; v1 submitted 22 June, 2022; originally announced June 2022.

    Comments: 16 pages