Skip to main content

Showing 1–50 of 186 results for author: Kim, D

Searching in archive eess. Search in all archives.
.
  1. arXiv:2406.13248  [pdf, other

    cs.IT eess.SP

    Overlay Space-Air-Ground Integrated Networks with SWIPT-Empowered Aerial Communications

    Authors: Anuradha Verma, Pankaj Kumar Sharma, Pawan Kumar, Dong In Kim

    Abstract: In this article, we consider overlay space-air-ground integrated networks (OSAGINs) where a low earth orbit (LEO) satellite communicates with ground users (GUs) with the assistance of an energy-constrained coexisting air-to-air (A2A) network. Particularly, a non-linear energy harvester with a hybrid SWIPT utilizing both power-splitting and time-switching energy harvesting (EH) techniques is employ… ▽ More

    Submitted 19 June, 2024; originally announced June 2024.

    Comments: 36 pages, 14 figures, This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible

  2. arXiv:2406.11427  [pdf, other

    eess.AS cs.AI cs.CL cs.LG cs.SD

    DiTTo-TTS: Efficient and Scalable Zero-Shot Text-to-Speech with Diffusion Transformer

    Authors: Keon Lee, Dong Won Kim, Jaehyeon Kim, Jaewoong Cho

    Abstract: Large-scale diffusion models have shown outstanding generative abilities across multiple modalities including images, videos, and audio. However, text-to-speech (TTS) systems typically involve domain-specific modeling factors (e.g., phonemes and phoneme-level durations) to ensure precise temporal alignments between text and speech, which hinders the efficiency and scalability of diffusion models f… ▽ More

    Submitted 17 June, 2024; originally announced June 2024.

  3. arXiv:2406.08714  [pdf, other

    eess.SP

    Real-time Digital RF Emulation -- II: A Near Memory Custom Accelerator

    Authors: Mandovi Mukherjee, Xiangyu Mao, Nael Rahman, Coleman DeLude, Joe Driscoll, Sudarshan Sharma, Payman Behnam, Uday Kamal, Jongseok Woo, Daehyun Kim, Sharjeel Khan, Jianming Tong, Jamin Seo, Prachi Sinha, Madhavan Swaminathan, Tushar Krishna, Santosh Pande, Justin Romberg, Saibal Mukhopadhyay

    Abstract: A near memory hardware accelerator, based on a novel direct path computational model, for real-time emulation of radio frequency systems is demonstrated. Our evaluation of hardware performance uses both application-specific integrated circuits (ASIC) and field programmable gate arrays (FPGA) methodologies: 1). The ASIC testchip implementation, using TSMC 28nm CMOS, leverages distributed autonomous… ▽ More

    Submitted 12 June, 2024; originally announced June 2024.

  4. arXiv:2406.02562  [pdf, other

    eess.AS cs.AI cs.CL

    Gated Low-rank Adaptation for personalized Code-Switching Automatic Speech Recognition on the low-spec devices

    Authors: Gwantae Kim, Bokyeung Lee, Donghyeon Kim, Hanseok Ko

    Abstract: In recent times, there has been a growing interest in utilizing personalized large models on low-spec devices, such as mobile and CPU-only devices. However, utilizing a personalized large model in the on-device is inefficient, and sometimes limited due to computational cost. To tackle the problem, this paper presents the weights separation method to minimize on-device model weights using parameter… ▽ More

    Submitted 23 April, 2024; originally announced June 2024.

    Comments: Table 2 is revised

    Journal ref: ICASSP 2024 Workshop(HSCMA 2024) paper

  5. arXiv:2405.18503  [pdf, other

    cs.SD cs.LG eess.AS

    SoundCTM: Uniting Score-based and Consistency Models for Text-to-Sound Generation

    Authors: Koichi Saito, Dongjun Kim, Takashi Shibuya, Chieh-Hsin Lai, Zhi Zhong, Yuhta Takida, Yuki Mitsufuji

    Abstract: Sound content is an indispensable element for multimedia works such as video games, music, and films. Recent high-quality diffusion-based sound generation models can serve as valuable tools for the creators. However, despite producing high-quality sounds, these models often suffer from slow inference speeds. This drawback burdens creators, who typically refine their sounds through trial and error… ▽ More

    Submitted 10 June, 2024; v1 submitted 28 May, 2024; originally announced May 2024.

    Comments: Audio samples: https://koichi-saito-sony.github.io/soundctm/. Codes: https://github.com/sony/soundctm. Checkpoints: https://huggingface.co/Sony/soundctm

  6. arXiv:2405.18012  [pdf, other

    cs.CV eess.IV

    Flow-Assisted Motion Learning Network for Weakly-Supervised Group Activity Recognition

    Authors: Muhammad Adi Nugroho, Sangmin Woo, Sumin Lee, **young Park, Yooseung Wang, Donguk Kim, Changick Kim

    Abstract: Weakly-Supervised Group Activity Recognition (WSGAR) aims to understand the activity performed together by a group of individuals with the video-level label and without actor-level labels. We propose Flow-Assisted Motion Learning Network (Flaming-Net) for WSGAR, which consists of the motion-aware actor encoder to extract actor features and the two-pathways relation module to infer the interaction… ▽ More

    Submitted 28 May, 2024; originally announced May 2024.

  7. arXiv:2405.11133  [pdf

    eess.IV cs.CV

    XCAT-3.0: A Comprehensive Library of Personalized Digital Twins Derived from CT Scans

    Authors: Lavsen Dahal, Mobina Ghojoghnejad, Dhrubajyoti Ghosh, Yubraj Bhandari, David Kim, Fong Chi Ho, Fakrul Islam Tushar, Sheng Luoa, Kyle J. Lafata, Ehsan Abadi, Ehsan Samei, Joseph Y. Lo, W. Paul Segars

    Abstract: Virtual Imaging Trials (VIT) offer a cost-effective and scalable approach for evaluating medical imaging technologies. Computational phantoms, which mimic real patient anatomy and physiology, play a central role in VIT. However, the current libraries of computational phantoms face limitations, particularly in terms of sample size and diversity. Insufficient representation of the population hampers… ▽ More

    Submitted 1 June, 2024; v1 submitted 17 May, 2024; originally announced May 2024.

  8. arXiv:2405.05107  [pdf, other

    cs.ET cs.AR eess.SY

    Leveraging AES Padding: dBs for Nothing and FEC for Free in IoT Systems

    Authors: Jongchan Woo, Vipindev Adat Vasudevan, Benjamin D. Kim, Rafael G. L. D'Oliveira, Alejandro Cohen, Thomas Stahlbuhk, Ken R. Duffy, Muriel Médard

    Abstract: The Internet of Things (IoT) represents a significant advancement in digital technology, with its rapidly growing network of interconnected devices. This expansion, however, brings forth critical challenges in data security and reliability, especially under the threat of increasing cyber vulnerabilities. Addressing the security concerns, the Advanced Encryption Standard (AES) is commonly employed… ▽ More

    Submitted 8 May, 2024; originally announced May 2024.

  9. arXiv:2404.18705  [pdf, other

    cs.IT eess.SP

    Wireless Information and Energy Transfer in the Era of 6G Communications

    Authors: Constantinos Psomas, Konstantinos Ntougias, Nikita Shanin, Dongfang Xu, Kenneth MacSporran Mayer, Nguyen Minh Tran, Laura Cottatellucci, Kae Won Choi, Dong In Kim, Robert Schober, Ioannis Krikidis

    Abstract: Wireless information and energy transfer (WIET) represents an emerging paradigm which employs controllable transmission of radio-frequency signals for the dual purpose of data communication and wireless charging. As such, WIET is widely regarded as an enabler of envisioned 6G use cases that rely on energy-sustainable Internet-of-Things (IoT) networks, such as smart cities and smart grids. Meeting… ▽ More

    Submitted 16 May, 2024; v1 submitted 29 April, 2024; originally announced April 2024.

    Comments: Proceedings of the IEEE, 36 pages, 33 figures

  10. arXiv:2404.17585  [pdf, other

    cs.HC cs.AI cs.LG eess.SP

    NeuroNet: A Novel Hybrid Self-Supervised Learning Framework for Sleep Stage Classification Using Single-Channel EEG

    Authors: Cheol-Hui Lee, Hakseung Kim, Hyun-jee Han, Min-Kyung Jung, Byung C. Yoon, Dong-Joo Kim

    Abstract: The classification of sleep stages is a pivotal aspect of diagnosing sleep disorders and evaluating sleep quality. However, the conventional manual scoring process, conducted by clinicians, is time-consuming and prone to human bias. Recent advancements in deep learning have substantially propelled the automation of sleep stage classification. Nevertheless, challenges persist, including the need fo… ▽ More

    Submitted 13 May, 2024; v1 submitted 10 April, 2024; originally announced April 2024.

    Comments: 14 pages, 4 figures

  11. arXiv:2404.15333  [pdf, other

    eess.SP cs.LG

    EB-GAME: A Game-Changer in ECG Heartbeat Anomaly Detection

    Authors: JuneYoung Park, Da Young Kim, Yunsoo Kim, Jisu Yoo, Tae Joon Kim

    Abstract: Cardiologists use electrocardiograms (ECG) for the detection of arrhythmias. However, continuous monitoring of ECG signals to detect cardiac abnormal-ities requires significant time and human resources. As a result, several deep learning studies have been conducted in advance for the automatic detection of arrhythmia. These models show relatively high performance in supervised learning, but are no… ▽ More

    Submitted 8 April, 2024; originally announced April 2024.

  12. arXiv:2404.14140  [pdf, other

    eess.SP

    Generative Artificial Intelligence Assisted Wireless Sensing: Human Flow Detection in Practical Communication Environments

    Authors: Jiacheng Wang, Hongyang Du, Dusit Niyato, Zehui Xiong, Jiawen Kang, Bo Ai, Zhu Han, Dong In Kim

    Abstract: Groundbreaking applications such as ChatGPT have heightened research interest in generative artificial intelligence (GAI). Essentially, GAI excels not only in content generation but also in signal processing, offering support for wireless sensing. Hence, we introduce a novel GAI-assisted human flow detection system (G-HFD). Rigorously, G-HFD first uses channel state information (CSI) to estimate t… ▽ More

    Submitted 22 April, 2024; originally announced April 2024.

  13. arXiv:2404.01661  [pdf, other

    cs.RO eess.SY

    Interaction-Aware Vehicle Motion Planning with Collision Avoidance Constraints in Highway Traffic

    Authors: Dongryul Kim, Hyeonjeong Kim, Kyoungseok Han

    Abstract: This paper proposes collision-free optimal trajectory planning for autonomous vehicles in highway traffic, where vehicles need to deal with the interaction among each other. To address this issue, a novel optimal control framework is suggested, which couples the trajectory of surrounding vehicles with collision avoidance constraints. Additionally, we describe a trajectory optimization technique un… ▽ More

    Submitted 2 April, 2024; originally announced April 2024.

  14. arXiv:2403.17420  [pdf, other

    cs.CV cs.MM cs.SD eess.AS

    Learning to Visually Localize Sound Sources from Mixtures without Prior Source Knowledge

    Authors: Dong** Kim, Sung ** Um, Sangmin Lee, Jung Uk Kim

    Abstract: The goal of the multi-sound source localization task is to localize sound sources from the mixture individually. While recent multi-sound source localization methods have shown improved performance, they face challenges due to their reliance on prior information about the number of objects to be separated. In this paper, to overcome this limitation, we present a novel multi-sound source localizati… ▽ More

    Submitted 26 March, 2024; originally announced March 2024.

    Comments: Accepted at CVPR 2024

  15. arXiv:2403.16477  [pdf, other

    cs.IT eess.SP

    Safeguarding Next Generation Multiple Access Using Physical Layer Security Techniques: A Tutorial

    Authors: Lu Lv, Dongyang Xu, Rose Qingyang Hu, Yinghui Ye, Long Yang, Xianfu Lei, Xianbin Wang, Dong In Kim, Arumugam Nallanathan

    Abstract: Driven by the ever-increasing requirements of ultra-high spectral efficiency, ultra-low latency, and massive connectivity, the forefront of wireless research calls for the design of advanced next generation multiple access schemes to facilitate provisioning of these stringent demands. This inspires the embrace of non-orthogonal multiple access (NOMA) in future wireless communication networks. Neve… ▽ More

    Submitted 21 May, 2024; v1 submitted 25 March, 2024; originally announced March 2024.

    Comments: Invited paper by Proceedings of the IEEE

  16. arXiv:2403.11104  [pdf, other

    eess.SY

    Deep Neural Network NMPC for Computationally Tractable Optimal Power Management of Hybrid Electric Vehicle

    Authors: Suyong Park, Duc Giap Nguyen, **rak Park, Dohee Kim, Jeong Soo Eo, Kyoungseok Han

    Abstract: This study presents a method for deep neural network nonlinear model predictive control (DNN-MPC) to reduce computational complexity, and we show its practical utility through its application in optimizing the energy management of hybrid electric vehicles (HEVs). For optimal power management of HEVs, we first design the online NMPC to collect the data set, and the deep neural network is trained to… ▽ More

    Submitted 17 March, 2024; originally announced March 2024.

    Comments: 6 pages, 10 figures, 3 tables, 2024 ACC conference (accepted)

  17. arXiv:2403.08187  [pdf, other

    cs.CL cs.SD eess.AS

    Automatic Speech Recognition (ASR) for the Diagnosis of pronunciation of Speech Sound Disorders in Korean children

    Authors: Taekyung Ahn, Yeonjung Hong, Younggon Im, Do Hyung Kim, Dayoung Kang, Joo Won Jeong, Jae Won Kim, Min Jung Kim, Ah-ra Cho, Dae-Hyun Jang, Hosung Nam

    Abstract: This study presents a model of automatic speech recognition (ASR) designed to diagnose pronunciation issues in children with speech sound disorders (SSDs) to replace manual transcriptions in clinical procedures. Since ASR models trained for general purposes primarily predict input speech into real words, employing a well-known high-performance ASR model for evaluating pronunciation in children wit… ▽ More

    Submitted 12 March, 2024; originally announced March 2024.

    Comments: 12 pages, 2 figures

    ACM Class: I.2.7

  18. arXiv:2403.04925  [pdf, ps, other

    cs.IT eess.SP

    Near Field Communications for DMA-NOMA Networks

    Authors: Zheng Zhang, Yuanwei Liu, Zhaolin Wang, Jian Chen, Dong In Kim

    Abstract: A novel near-field transmission framework is proposed for dynamic metasurface antenna (DMA)-enabled non-orthogonal multiple access (NOMA) networks. The base station (BS) exploits the hybrid beamforming to communicate with multiple near users (NUs) and far users (FUs) using the NOMA principle. Based on this framework, two novel beamforming schemes are proposed. 1) For the case of the grouped users… ▽ More

    Submitted 7 March, 2024; originally announced March 2024.

    Comments: 13 pages

  19. arXiv:2402.09756  [pdf, other

    cs.NI eess.SP

    Mixture of Experts for Network Optimization: A Large Language Model-enabled Approach

    Authors: Hongyang Du, Guangyuan Liu, Yi**g Lin, Dusit Niyato, Jiawen Kang, Zehui Xiong, Dong In Kim

    Abstract: Optimizing various wireless user tasks poses a significant challenge for networking systems because of the expanding range of user requirements. Despite advancements in Deep Reinforcement Learning (DRL), the need for customized optimization tasks for individual users complicates develo** and applying numerous DRL models, leading to substantial computation resource and energy consumption and can… ▽ More

    Submitted 15 February, 2024; originally announced February 2024.

  20. arXiv:2401.13936  [pdf, ps, other

    eess.SY

    Learning-based sensing and computing decision for data freshness in edge computing-enabled networks

    Authors: Sinwoong Yun, Dongsun Kim, Chanwon Park, Jemin Lee

    Abstract: As the demand on artificial intelligence (AI)-based applications increases, the freshness of sensed data becomes crucial in the wireless sensor networks. Since those applications require a large amount of computation for processing the sensed data, it is essential to offload the computation load to the edge computing (EC) server. In this paper, we propose the sensing and computing decision (SCD) a… ▽ More

    Submitted 24 January, 2024; originally announced January 2024.

    Comments: 15 pages

  21. arXiv:2401.11429  [pdf, ps, other

    cs.IT eess.SP

    Joint Downlink and Uplink Optimization for RIS-Aided FDD MIMO Communication Systems

    Authors: Gyoseung Lee, Hyeongtaek Lee, Donghwan Kim, Jaehoon Chung, A. Lee. Swindlehurst, Junil Choi

    Abstract: This paper investigates reconfigurable intelligent surface (RIS)-aided frequency division duplexing (FDD) communication systems. Since the downlink and uplink signals are simultaneously transmitted in FDD, the phase shifts at the RIS should be designed to support both transmissions. Considering a single-user multiple-input multiple-output system, we formulate a weighted sum-rate maximization probl… ▽ More

    Submitted 21 January, 2024; originally announced January 2024.

    Comments: Accepted to IEEE Transactions on Wireless Communications

  22. arXiv:2401.06422  [pdf, other

    eess.SP eess.SY

    Joint Mechanical and Electrical Adjustment of IRS-aided LEO Satellite MIMO Communications

    Authors: Doyoung Kim, Seongah Jeong

    Abstract: In this letter, we propose a joint mechanical and electrical adjustment of intelligent reflecting surface (IRS) for the performance improvements of low-earth orbit (LEO) satellite multiple-input multiple-output (MIMO) communications. In particular, we construct a three-dimensional (3D) MIMO channel model for the mechanically-tilted IRS in general deployment, and consider two types of scenarios wit… ▽ More

    Submitted 20 June, 2024; v1 submitted 12 January, 2024; originally announced January 2024.

    Comments: 5 pages, 6 figures

  23. arXiv:2401.02216  [pdf, ps, other

    eess.SY math.OC

    Harnessing Membership Function Dynamics for Stability Analysis of T-S Fuzzy Systems

    Authors: Donghwan Lee, Do-Wan Kim

    Abstract: The main goal of this paper is to develop a new linear matrix inequality (LMI) condition for the asymptotic stability of continuous-time Takagi-Sugeno (T-S) fuzzy systems. A key advantage of this new condition is its independence from the bounds on the time-derivatives of the membership functions, a requirement present in the existing approaches. This is achieved by introducing a novel fuzzy Lyapu… ▽ More

    Submitted 13 June, 2024; v1 submitted 4 January, 2024; originally announced January 2024.

    Comments: arXiv admin note: substantial text overlap with arXiv:2309.06841

  24. arXiv:2312.09736  [pdf, other

    cs.CL cs.SD eess.AS

    HEAR: Hearing Enhanced Audio Response for Video-grounded Dialogue

    Authors: Sunjae Yoon, Dahyun Kim, Eunseop Yoon, Hee Suk Yoon, Junyeong Kim, Chnag D. Yoo

    Abstract: Video-grounded Dialogue (VGD) aims to answer questions regarding a given multi-modal input comprising video, audio, and dialogue history. Although there have been numerous efforts in develo** VGD systems to improve the quality of their responses, existing systems are competent only to incorporate the information in the video and text and tend to struggle in extracting the necessary information f… ▽ More

    Submitted 15 December, 2023; originally announced December 2023.

    Comments: EMNLP 2023, 14 pages, 13 figures

  25. arXiv:2312.09461  [pdf, other

    eess.SP cs.HC cs.LG

    Improving Generalization of Drowsiness State Classification by Domain-Specific Normalization

    Authors: Dong-Young Kim, Dong-Kyun Han, Seo-Hyeon Park, Geun-Deok Jang, Seong-Whan Lee

    Abstract: Abnormal driver states, particularly have been major concerns for road safety, emphasizing the importance of accurate drowsiness detection to prevent accidents. Electroencephalogram (EEG) signals are recognized for their effectiveness in monitoring a driver's mental state by monitoring brain activities. However, the challenge lies in the requirement for prior calibration due to the variation of EE… ▽ More

    Submitted 14 November, 2023; originally announced December 2023.

    Comments: Submitted to 2024 12th IEEE International Winter Conference on Brain-Computer Interface

  26. arXiv:2312.06985  [pdf, ps, other

    eess.SP

    Ergodic Secrecy Rate Analysis for LEO Satellite Downlink Networks

    Authors: Daeun Kim, Namyoon Lee

    Abstract: Satellite networks are recognized as an effective solution to ensure seamless connectivity worldwide, catering to a diverse range of applications. However, the broad coverage and broadcasting nature of satellite networks also expose them to security challenges. Despite these challenges, there is a lack of analytical understanding addressing the secrecy performance of these networks. This paper pre… ▽ More

    Submitted 12 December, 2023; originally announced December 2023.

  27. arXiv:2311.17923  [pdf, other

    eess.AS cs.HC

    Enhanced Generative Adversarial Networks for Unseen Word Generation from EEG Signals

    Authors: Young-Eun Lee, Seo-Hyun Lee, Soowon Kim, Jung-Sun Lee, Deok-Seon Kim, Seong-Whan Lee

    Abstract: Recent advances in brain-computer interface (BCI) technology, particularly based on generative adversarial networks (GAN), have shown great promise for improving decoding performance for BCI. Within the realm of Brain-Computer Interfaces (BCI), GANs find application in addressing many areas. They serve as a valuable tool for data augmentation, which can solve the challenge of limited data availabi… ▽ More

    Submitted 13 November, 2023; originally announced November 2023.

    Comments: 5 pages, 2 figures

  28. arXiv:2311.06523  [pdf, other

    cs.NI eess.SP

    Generative AI for Space-Air-Ground Integrated Networks (SAGIN)

    Authors: Ruichen Zhang, Hongyang Du, Dusit Niyato, Jiawen Kang, Zehui Xiong, Abbas Jamalipour, ** Zhang, Dong In Kim

    Abstract: Recently, generative AI technologies have emerged as a significant advancement in artificial intelligence field, renowned for their language and image generation capabilities. Meantime, space-air-ground integrated network (SAGIN) is an integral part of future B5G/6G for achieving ubiquitous connectivity. Inspired by this, this article explores an integration of generative AI in SAGIN, focusing on… ▽ More

    Submitted 11 November, 2023; originally announced November 2023.

    Comments: 9page, 5 figures

  29. arXiv:2309.17008  [pdf, other

    eess.SP eess.SY

    Energy-Efficient Secure Offloading System Designed via UAV-Mounted Intelligent Reflecting Surface for Resilience Enhancement

    Authors: Doyoung Kim, Seongah Jeong, **kyu Kang

    Abstract: With increasing interest in mmWave and THz communication systems, an unmanned aerial vehicle (UAV)-mounted intelligent reflecting surface (IRS) has been suggested as a key enabling technology to establish robust line-of-sight (LoS) connections with ground nodes owing to their free mobility and high altitude, especially for emergency and disaster response. This paper investigates a secure offloadin… ▽ More

    Submitted 29 September, 2023; originally announced September 2023.

    Comments: 11 pages, 5 figures

  30. arXiv:2309.14668  [pdf

    physics.optics cs.GR eess.IV physics.app-ph physics.comp-ph

    Depolarized Holography with Polarization-multiplexing Metasurface

    Authors: Seung-Woo Nam, Young** Kim, Dongyeon Kim, Yoonchan Jeong

    Abstract: The evolution of computer-generated holography (CGH) algorithms has prompted significant improvements in the performances of holographic displays. Nonetheless, they start to encounter a limited degree of freedom in CGH optimization and physical constraints stemming from the coherent nature of holograms. To surpass the physical limitations, we consider polarization as a new degree of freedom by uti… ▽ More

    Submitted 26 September, 2023; originally announced September 2023.

    Comments: 15 pages, 13 figures, to be published in SIGGRAPH Asia 2023

  31. arXiv:2309.11988  [pdf, ps, other

    math.OC eess.SY

    Relaxed Conditions for Parameterized Linear Matrix Inequality in the Form of Nested Fuzzy Summations

    Authors: Do Wan Kim, Donghwan Lee

    Abstract: The aim of this study is to investigate less conservative conditions for parameterized linear matrix inequalities (PLMIs) that are formulated as nested fuzzy summations. Such PLMIs are commonly encountered in stability analysis and control design problems for Takagi-Sugeno (T-S) fuzzy systems. Utilizing the weighted inequality of arithmetic and geometric means (AM-GM inequality), we develop new, l… ▽ More

    Submitted 18 December, 2023; v1 submitted 21 September, 2023; originally announced September 2023.

    Comments: This work has been submitted to IEEE Transactions on Systems, Man and Cybernetics: Systems for possible publications

  32. arXiv:2309.10460  [pdf, ps, other

    eess.SP

    Coverage Analysis of Dynamic Coordinated Beamforming for LEO Satellite Downlink Networks

    Authors: Daeun Kim, Jeonghun Park, Namyoon Lee

    Abstract: In this paper, we investigate the coverage performance of downlink satellite networks employing dynamic coordinated beamforming. Our approach involves modeling the spatial arrangement of satellites and users using Poisson point processes situated on concentric spheres. We derive analytical expressions for the coverage probability, which take into account the in-cluster geometry of the coordinated… ▽ More

    Submitted 19 September, 2023; originally announced September 2023.

  33. arXiv:2309.06841  [pdf, ps, other

    eess.SY cs.AI

    On the Local Quadratic Stability of T-S Fuzzy Systems in the Vicinity of the Origin

    Authors: Donghwan Lee, Do Wan Kim

    Abstract: The main goal of this paper is to introduce new local stability conditions for continuous-time Takagi-Sugeno (T-S) fuzzy systems. These stability conditions are based on linear matrix inequalities (LMIs) in combination with quadratic Lyapunov functions. Moreover, they integrate information on the membership functions at the origin and effectively leverage the linear structure of the underlying non… ▽ More

    Submitted 13 September, 2023; v1 submitted 13 September, 2023; originally announced September 2023.

  34. arXiv:2309.02616  [pdf, other

    eess.IV cs.LG cs.NI

    Generative AI-aided Joint Training-free Secure Semantic Communications via Multi-modal Prompts

    Authors: Hongyang Du, Guangyuan Liu, Dusit Niyato, Jiayi Zhang, Jiawen Kang, Zehui Xiong, Bo Ai, Dong In Kim

    Abstract: Semantic communication (SemCom) holds promise for reducing network resource consumption while achieving the communications goal. However, the computational overheads in jointly training semantic encoders and decoders-and the subsequent deployment in network devices-are overlooked. Recent advances in Generative artificial intelligence (GAI) offer a potential solution. The robust learning abilities… ▽ More

    Submitted 5 September, 2023; originally announced September 2023.

  35. arXiv:2308.13202  [pdf, other

    eess.SP

    Joint Band Assignment and Beam Management using Hierarchical Reinforcement Learning for Multi-Band Communication

    Authors: Dohyun Kim, Miguel R. Castellanos, Robert W. Heath Jr

    Abstract: Multi-band operation in wireless networks can improve data rates by leveraging the benefits of propagation in different frequency ranges. Distinctive beam management procedures in different bands complicate band assignment because they require considering not only the channel quality but also the associated beam management overhead. Reinforcement learning (RL) is a promising approach for multi-ban… ▽ More

    Submitted 25 August, 2023; originally announced August 2023.

  36. arXiv:2308.08442  [pdf, other

    cs.CL cs.SD eess.AS

    Mitigating the Exposure Bias in Sentence-Level Grapheme-to-Phoneme (G2P) Transduction

    Authors: Eunseop Yoon, Hee Suk Yoon, Dhananjaya Gowda, SooHwan Eom, Daehyeok Kim, John Harvill, Heting Gao, Mark Hasegawa-Johnson, Chanwoo Kim, Chang D. Yoo

    Abstract: Text-to-Text Transfer Transformer (T5) has recently been considered for the Grapheme-to-Phoneme (G2P) transduction. As a follow-up, a tokenizer-free byte-level model based on T5 referred to as ByT5, recently gave promising results on word-level G2P conversion by representing each input character with its corresponding UTF-8 encoding. Although it is generally understood that sentence-level or parag… ▽ More

    Submitted 16 August, 2023; originally announced August 2023.

    Comments: INTERSPEECH 2023

  37. arXiv:2308.07593  [pdf, other

    cs.CV cs.MM eess.AS eess.IV

    AKVSR: Audio Knowledge Empowered Visual Speech Recognition by Compressing Audio Knowledge of a Pretrained Model

    Authors: Jeong Hun Yeo, Minsu Kim, Jeongsoo Choi, Dae Hoe Kim, Yong Man Ro

    Abstract: Visual Speech Recognition (VSR) is the task of predicting spoken words from silent lip movements. VSR is regarded as a challenging task because of the insufficient information on lip movements. In this paper, we propose an Audio Knowledge empowered Visual Speech Recognition framework (AKVSR) to complement the insufficient speech information of visual modality by using audio modality. Different fro… ▽ More

    Submitted 11 January, 2024; v1 submitted 15 August, 2023; originally announced August 2023.

    Comments: Accepted by IEEE Transactions on Multimedia

  38. arXiv:2308.05384  [pdf, other

    cs.NI eess.SP

    Enhancing Deep Reinforcement Learning: A Tutorial on Generative Diffusion Models in Network Optimization

    Authors: Hongyang Du, Ruichen Zhang, Yinqiu Liu, Jiacheng Wang, Yi**g Lin, Zonghang Li, Dusit Niyato, Jiawen Kang, Zehui Xiong, Shuguang Cui, Bo Ai, Haibo Zhou, Dong In Kim

    Abstract: Generative Diffusion Models (GDMs) have emerged as a transformative force in the realm of Generative Artificial Intelligence (GenAI), demonstrating their versatility and efficacy across various applications. The ability to model complex data distributions and generate high-quality samples has made GDMs particularly effective in tasks such as image generation and reinforcement learning. Furthermore… ▽ More

    Submitted 8 May, 2024; v1 submitted 10 August, 2023; originally announced August 2023.

    Comments: This paper has been accepted by IEEE Communications Surveys & Tutorials (COMST)

  39. arXiv:2308.05063  [pdf, other

    cs.CR cs.AR cs.IT eess.SY

    CERMET: Coding for Energy Reduction with Multiple Encryption Techniques -- $It's\ easy\ being\ green$

    Authors: Jongchan Woo, Vipindev Adat Vasudevan, Benjamin Kim, Alejandro Cohen, Rafael G. L. D'Oliveira, Thomas Stahlbuhk, Muriel Médard

    Abstract: This paper presents CERMET, an energy-efficient hardware architecture designed for hardware-constrained cryptosystems. CERMET employs a base cryptosystem in conjunction with network coding to provide both information-theoretic and computational security while reducing energy consumption per bit. This paper introduces the hardware architecture for the system and explores various optimizations to en… ▽ More

    Submitted 9 August, 2023; originally announced August 2023.

  40. arXiv:2308.04717  [pdf, other

    eess.SY

    Fully Decentralized Peer-to-Peer Community Grid with Dynamic and Congestion Pricing

    Authors: Hien Thanh Doan, Truong Hoang Bao Huy, Daehee Kim, Hongseok Kim

    Abstract: Peer-to-peer (P2P) electricity markets enable prosumers to minimize their costs, which has been extensively studied in recent research. However, there are several challenges with P2P trading when physical network constraints are also included. Moreover, most studies use fixed prices for grid power prices without considering dynamic grid pricing, and equity for all participants. This policy may neg… ▽ More

    Submitted 9 August, 2023; originally announced August 2023.

  41. arXiv:2308.01831  [pdf, other

    cs.CL eess.AS eess.SP

    Many-to-Many Spoken Language Translation via Unified Speech and Text Representation Learning with Unit-to-Unit Translation

    Authors: Minsu Kim, Jeongsoo Choi, Dahun Kim, Yong Man Ro

    Abstract: In this paper, we propose a method to learn unified representations of multilingual speech and text with a single model, especially focusing on the purpose of speech synthesis. We represent multilingual speech audio with speech units, the quantized representations of speech features encoded from a self-supervised speech model. Therefore, we can focus on their linguistic content by treating the aud… ▽ More

    Submitted 3 August, 2023; originally announced August 2023.

  42. arXiv:2307.16706  [pdf, ps, other

    eess.SY cs.AI

    Continuous-Time Distributed Dynamic Programming for Networked Multi-Agent Markov Decision Processes

    Authors: Donghwan Lee, Han-Dong Lim, Do Wan Kim

    Abstract: The main goal of this paper is to investigate continuous-time distributed dynamic programming (DP) algorithms for networked multi-agent Markov decision problems (MAMDPs). In our study, we adopt a distributed multi-agent framework where individual agents have access only to their own rewards, lacking insights into the rewards of other agents. Moreover, each agent has the ability to share its parame… ▽ More

    Submitted 13 June, 2024; v1 submitted 31 July, 2023; originally announced July 2023.

  43. arXiv:2307.12644  [pdf, other

    eess.IV cs.AI cs.CV cs.LG eess.SP

    Remote Bio-Sensing: Open Source Benchmark Framework for Fair Evaluation of rPPG

    Authors: Dae-Yeol Kim, Eunsu Goh, KwangKee Lee, JongEui Chae, JongHyeon Mun, Junyeong Na, Chae-bong Sohn, Do-Yup Kim

    Abstract: rPPG (Remote photoplethysmography) is a technology that measures and analyzes BVP (Blood Volume Pulse) by using the light absorption characteristics of hemoglobin captured through a camera. Analyzing the measured BVP can derive various physiological signals such as heart rate, stress level, and blood pressure, which can be applied to various applications such as telemedicine, remote patient monito… ▽ More

    Submitted 18 August, 2023; v1 submitted 24 July, 2023; originally announced July 2023.

    Comments: 20 pages, 10 figures

    MSC Class: 68T45; 68T07 ACM Class: I.4.9; I.5.4; I.2

  44. arXiv:2307.12254  [pdf, other

    cs.NI eess.SP

    Semantic Communication-Empowered Vehicle Count Prediction for Traffic Management

    Authors: Sachin Kadam, Dong In Kim

    Abstract: Vehicle count prediction is an important aspect of smart city traffic management. Most major roads are monitored by cameras with computing and transmitting capabilities. These cameras provide data to the central traffic controller (CTC), which is in charge of traffic control management. In this paper, we propose a joint CNN-LSTM-based semantic communication (SemCom) model in which the semantic enc… ▽ More

    Submitted 2 January, 2024; v1 submitted 23 July, 2023; originally announced July 2023.

    Comments: Accepted for publication in WCNC 2024 - IEEE Wireless Communications and Networking Conference, Dubai, United Arab Emirates (UAE), April 2024

  45. arXiv:2307.10550  [pdf

    cs.SD cs.LG eess.AS

    SC VALL-E: Style-Controllable Zero-Shot Text to Speech Synthesizer

    Authors: Daegyeom Kim, Seongho Hong, Yong-Hoon Choi

    Abstract: Expressive speech synthesis models are trained by adding corpora with diverse speakers, various emotions, and different speaking styles to the dataset, in order to control various characteristics of speech and generate the desired voice. In this paper, we propose a style control (SC) VALL-E model based on the neural codec language model (called VALL-E), which follows the structure of the generativ… ▽ More

    Submitted 19 July, 2023; originally announced July 2023.

  46. arXiv:2307.10538  [pdf, other

    eess.SP

    Power Allocation for Device-to-Device Interference Channel Using Truncated Graph Transformers

    Authors: Dohoon Kim, Shenghui Song

    Abstract: Power control for the device-to-device interference channel with single-antenna transceivers has been widely analyzed with both model-based methods and learning-based approaches. Although the learning-based approaches, i.e., datadriven and model-driven, offer performance improvement, the widely adopted graph neural network suffers from learning the heterophilous power distribution of the interfere… ▽ More

    Submitted 23 July, 2023; v1 submitted 19 July, 2023; originally announced July 2023.

    Comments: 6 pages, 5 figures. Accepted in IEEE International Mediterranean Conference on Communications and Networking

  47. arXiv:2307.10263  [pdf, ps, other

    cs.IT cs.NI eess.SP eess.SY

    Dynamic Joint Scheduling of Anycast Transmission and Modulation in Hybrid Unicast-Multicast SWIPT-Based IoT Sensor Networks

    Authors: Do-Yup Kim, Chae-Bong Sohn, Hyun-Suk Lee

    Abstract: The separate receiver architecture with a time- or power-splitting mode, widely used for simultaneous wireless information and power transfer (SWIPT), has a major drawback: Energy-intensive local oscillators and mixers need to be installed in the information decoding (ID) component to downconvert radio frequency (RF) signals to baseband signals, resulting in high energy consumption. As a solution… ▽ More

    Submitted 17 July, 2023; originally announced July 2023.

    Comments: 29 pages, 13 figures (eps)

    MSC Class: 90B36; 68M20; 90B15 ACM Class: C.2.3; I.2.8

  48. arXiv:2307.07123  [pdf, other

    cs.CV eess.IV

    Improved Flood Insights: Diffusion-Based SAR to EO Image Translation

    Authors: Minseok Seo, Youngtack Oh, Doyi Kim, Dongmin Kang, Yeji Choi

    Abstract: Driven by rapid climate change, the frequency and intensity of flood events are increasing. Electro-Optical (EO) satellite imagery is commonly utilized for rapid response. However, its utilities in flood situations are hampered by issues such as cloud cover and limitations during nighttime, making accurate assessment of damage challenging. Several alternative flood detection techniques utilizing S… ▽ More

    Submitted 13 July, 2023; originally announced July 2023.

    Comments: 10 pages, 6 figures

    Report number: 10

  49. arXiv:2307.00541  [pdf, ps, other

    cs.LG cs.AI cs.DC eess.SP

    Collaborative Policy Learning for Dynamic Scheduling Tasks in Cloud-Edge-Terminal IoT Networks Using Federated Reinforcement Learning

    Authors: Do-Yup Kim, Da-Eun Lee, Ji-Wan Kim, Hyun-Suk Lee

    Abstract: In this paper, we examine cloud-edge-terminal IoT networks, where edges undertake a range of typical dynamic scheduling tasks. In these IoT networks, a central policy for each task can be constructed at a cloud server. The central policy can be then used by the edges conducting the task, thereby mitigating the need for them to learn their own policy from scratch. Furthermore, this central policy c… ▽ More

    Submitted 2 July, 2023; originally announced July 2023.

    Comments: 14 pages, 16 figures, IEEEtran.cls

    MSC Class: 68M20; 68T05; 68T07 ACM Class: C.2.1; C.2.4; I.2.8; I.2.11

  50. arXiv:2306.16739  [pdf, other

    eess.SP

    Sparse RF Lens Antenna Array Design for AoA Estimation in Wideband Systems: Placement Optimization and Performance Analysis

    Authors: Joo-Hyun Jo, Jae-Nam Shim, Chan-Byoung Chae, Dong Ku Kim, Robert W. Heath Jr

    Abstract: In this paper, we propose a novel architecture for a lens antenna array (LAA) designed to work with a small number of antennas and enable angle-of-arrival (AoA) estimation for advanced 5G vehicle-to-everything (V2X) use cases that demand wider bandwidths and higher data rates. We derive a received signal in terms of optical analysis to consider the variability of the focal region for different car… ▽ More

    Submitted 29 June, 2023; originally announced June 2023.

    Comments: 15 pages, 10 figures