Skip to main content

Showing 1–50 of 293 results for author: Kim, I

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.16695  [pdf, other

    cs.CV

    Geometry-Aware Score Distillation via 3D Consistent Noising and Gradient Consistency Modeling

    Authors: Min-Seop Kwak, Donghoon Ahn, Ines Hyeonsu Kim, **-Hwa Kim, Seungryong Kim

    Abstract: Score distillation sampling (SDS), the methodology in which the score from pretrained 2D diffusion models is distilled into 3D representation, has recently brought significant advancements in text-to-3D generation task. However, this approach is still confronted with critical geometric inconsistency problems such as the Janus problem. Starting from a hypothesis that such inconsistency problems may… ▽ More

    Submitted 30 June, 2024; v1 submitted 24 June, 2024; originally announced June 2024.

  2. arXiv:2406.16042  [pdf, other

    cs.CV

    Pose-Diversified Augmentation with Diffusion Model for Person Re-Identification

    Authors: Inès Hyeonsu Kim, JoungBin Lee, Soowon Son, Woojeong **, Kyusun Cho, Junyoung Seo, Min-Seop Kwak, Seokju Cho, JeongYeol Baek, Byeongwon Lee, Seungryong Kim

    Abstract: Person re-identification (Re-ID) often faces challenges due to variations in human poses and camera viewpoints, which significantly affect the appearance of individuals across images. Existing datasets frequently lack diversity and scalability in these aspects, hindering the generalization of Re-ID models to new camera systems. Previous methods have attempted to address these issues through data a… ▽ More

    Submitted 23 June, 2024; originally announced June 2024.

    Comments: The project page is available at https://ku-cvlab.github.io/Diff-ID/

  3. arXiv:2406.13964  [pdf, other

    cs.NI

    Hierarchical Micro-Segmentations for Zero-Trust Services via Large Language Model (LLM)-enhanced Graph Diffusion

    Authors: Yinqiu Liu, Guangyuan Liu, Hongyang Du, Dusit Niyato, Jiawen Kang, Zehui Xiong, Dong In Kim, Xuemin Shen

    Abstract: In the rapidly evolving Next-Generation Networking (NGN) era, the adoption of zero-trust architectures has become increasingly crucial to protect security. However, provisioning zero-trust services in NGNs poses significant challenges, primarily due to the environmental complexity and dynamics. Motivated by these challenges, this paper explores efficient zero-trust service provisioning using hiera… ▽ More

    Submitted 19 June, 2024; originally announced June 2024.

    Comments: 13 pages

  4. arXiv:2406.13248  [pdf, other

    cs.IT eess.SP

    Overlay Space-Air-Ground Integrated Networks with SWIPT-Empowered Aerial Communications

    Authors: Anuradha Verma, Pankaj Kumar Sharma, Pawan Kumar, Dong In Kim

    Abstract: In this article, we consider overlay space-air-ground integrated networks (OSAGINs) where a low earth orbit (LEO) satellite communicates with ground users (GUs) with the assistance of an energy-constrained coexisting air-to-air (A2A) network. Particularly, a non-linear energy harvester with a hybrid SWIPT utilizing both power-splitting and time-switching energy harvesting (EH) techniques is employ… ▽ More

    Submitted 19 June, 2024; originally announced June 2024.

    Comments: 36 pages, 14 figures, This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible

  5. arXiv:2406.04772  [pdf, other

    cs.LG cs.AI cs.CV

    REP: Resource-Efficient Prompting for On-device Continual Learning

    Authors: Sungho Jeon, Xinyue Ma, Kwang In Kim, Myeongjae Jeon

    Abstract: On-device continual learning (CL) requires the co-optimization of model accuracy and resource efficiency to be practical. This is extremely challenging because it must preserve accuracy while learning new tasks with continuously drifting data and maintain both high energy and memory efficiency to be deployable on real-world devices. Typically, a CL method leverages one of two types of backbone net… ▽ More

    Submitted 7 June, 2024; originally announced June 2024.

    Comments: 19 pages, 10 figures

  6. arXiv:2405.19912  [pdf, other

    stat.ML cs.LG

    Robust Kernel Hypothesis Testing under Data Corruption

    Authors: Antonin Schrab, Ilmun Kim

    Abstract: We propose two general methods for constructing robust permutation tests under data corruption. The proposed tests effectively control the non-asymptotic type I error under data corruption, and we prove their consistency in power under minimal conditions. This contributes to the practical deployment of hypothesis tests for real-world applications with potential adversarial attacks. One of our meth… ▽ More

    Submitted 30 May, 2024; originally announced May 2024.

    Comments: 26 pages, 2 figures, 2 algorithms

  7. arXiv:2405.19704  [pdf, other

    stat.ML cs.LG stat.ME

    Enhancing Sufficient Dimension Reduction via Hellinger Correlation

    Authors: Seungbeom Hong, Ilmun Kim, Jun Song

    Abstract: In this work, we develop a new theory and method for sufficient dimension reduction (SDR) in single-index models, where SDR is a sub-field of supervised dimension reduction based on conditional independence. Our work is primarily motivated by the recent introduction of the Hellinger correlation as a dependency measure. Utilizing this measure, we develop a method capable of effectively detecting th… ▽ More

    Submitted 30 May, 2024; originally announced May 2024.

  8. arXiv:2405.12472  [pdf, ps, other

    cs.NI

    Optimizing Generative AI Networking: A Dual Perspective with Multi-Agent Systems and Mixture of Experts

    Authors: Ruichen Zhang, Hongyang Du, Dusit Niyato, Jiawen Kang, Zehui Xiong, ** Zhang, Dong In Kim

    Abstract: In the continued development of next-generation networking and artificial intelligence content generation (AIGC) services, the integration of multi-agent systems (MAS) and the mixture of experts (MoE) frameworks is becoming increasingly important. Motivated by this, this article studies the contrasting and converging of MAS and MoE in AIGC-enabled networking. First, we discuss the architectural de… ▽ More

    Submitted 20 May, 2024; originally announced May 2024.

    Comments: 9 pages, 4 figures

  9. arXiv:2405.10272  [pdf, other

    cs.CV cs.AI cs.SD eess.AS eess.IV

    Faces that Speak: Jointly Synthesising Talking Face and Speech from Text

    Authors: Youngjoon Jang, Ji-Hoon Kim, Junseok Ahn, Doyeop Kwak, Hong-Sun Yang, Yoon-Cheol Ju, Il-Hwan Kim, Byeong-Yeol Kim, Joon Son Chung

    Abstract: The goal of this work is to simultaneously generate natural talking faces and speech outputs from text. We achieve this by integrating Talking Face Generation (TFG) and Text-to-Speech (TTS) systems into a unified framework. We address the main challenges of each task: (1) generating a range of head poses representative of real-world scenarios, and (2) ensuring voice consistency despite variations… ▽ More

    Submitted 16 May, 2024; originally announced May 2024.

    Comments: CVPR 2024

  10. arXiv:2405.04907  [pdf, other

    cs.NI

    Empowering Wireless Networks with Artificial Intelligence Generated Graph

    Authors: Jiacheng Wang, Yinqiu Liu, Hongyang Du, Dusit Niyato, Jiawen Kang, Haibo Zhou, Dong In Kim

    Abstract: In wireless communications, transforming network into graphs and processing them using deep learning models, such as Graph Neural Networks (GNNs), is one of the mainstream network optimization approaches. While effective, the generative AI (GAI) shows stronger capabilities in graph analysis, processing, and generation, than conventional methods such as GNN, offering a broader exploration space for… ▽ More

    Submitted 8 May, 2024; originally announced May 2024.

  11. arXiv:2405.04198  [pdf, other

    cs.CR

    Enhancing Physical Layer Communication Security through Generative AI with Mixture of Experts

    Authors: Changyuan Zhao, Hongyang Du, Dusit Niyato, Jiawen Kang, Zehui Xiong, Dong In Kim, Xuemin, Shen, Khaled B. Letaief

    Abstract: AI technologies have become more widely adopted in wireless communications. As an emerging type of AI technologies, the generative artificial intelligence (GAI) gains lots of attention in communication security. Due to its powerful learning ability, GAI models have demonstrated superiority over conventional AI methods. However, GAI still has several limitations, including high computational comple… ▽ More

    Submitted 7 May, 2024; originally announced May 2024.

    Comments: 9 pages, 4 figures

  12. arXiv:2404.18705  [pdf, other

    cs.IT eess.SP

    Wireless Information and Energy Transfer in the Era of 6G Communications

    Authors: Constantinos Psomas, Konstantinos Ntougias, Nikita Shanin, Dongfang Xu, Kenneth MacSporran Mayer, Nguyen Minh Tran, Laura Cottatellucci, Kae Won Choi, Dong In Kim, Robert Schober, Ioannis Krikidis

    Abstract: Wireless information and energy transfer (WIET) represents an emerging paradigm which employs controllable transmission of radio-frequency signals for the dual purpose of data communication and wireless charging. As such, WIET is widely regarded as an enabler of envisioned 6G use cases that rely on energy-sustainable Internet-of-Things (IoT) networks, such as smart cities and smart grids. Meeting… ▽ More

    Submitted 16 May, 2024; v1 submitted 29 April, 2024; originally announced April 2024.

    Comments: Proceedings of the IEEE, 36 pages, 33 figures

  13. arXiv:2404.16356  [pdf, other

    cs.NI cs.AI cs.LG

    Integration of Mixture of Experts and Multimodal Generative AI in Internet of Vehicles: A Survey

    Authors: Minrui Xu, Dusit Niyato, Jiawen Kang, Zehui Xiong, Abbas Jamalipour, Yuguang Fang, Dong In Kim, Xuemin, Shen

    Abstract: Generative AI (GAI) can enhance the cognitive, reasoning, and planning capabilities of intelligent modules in the Internet of Vehicles (IoV) by synthesizing augmented datasets, completing sensor data, and making sequential decisions. In addition, the mixture of experts (MoE) can enable the distributed and collaborative execution of AI models without performance degradation between connected vehicl… ▽ More

    Submitted 25 April, 2024; originally announced April 2024.

  14. arXiv:2404.12168  [pdf, other

    cs.CV cs.AI

    Real-World Efficient Blind Motion Deblurring via Blur Pixel Discretization

    Authors: Insoo Kim, Jae Seok Choi, Geonseok Seo, Kinam Kwon, **woo Shin, Hyong-Euk Lee

    Abstract: As recent advances in mobile camera technology have enabled the capability to capture high-resolution images, such as 4K images, the demand for an efficient deblurring model handling large motion has increased. In this paper, we discover that the image residual errors, i.e., blur-sharp pixel differences, can be grouped into some categories according to their motion blur type and how complex their… ▽ More

    Submitted 18 April, 2024; originally announced April 2024.

    Comments: CVPR2024 Camera-Ready

  15. arXiv:2404.09134  [pdf, ps, other

    cs.NI cs.LG

    Generative AI Agents with Large Language Model for Satellite Networks via a Mixture of Experts Transmission

    Authors: Ruichen Zhang, Hongyang Du, Yinqiu Liu, Dusit Niyato, Jiawen Kang, Zehui Xiong, Abbas Jamalipour, Dong In Kim

    Abstract: In response to the needs of 6G global communications, satellite communication networks have emerged as a key solution. However, the large-scale development of satellite communication networks is constrained by the complex system models, whose modeling is challenging for massive users. Moreover, transmission interference between satellites and users seriously affects communication performance. To s… ▽ More

    Submitted 29 June, 2024; v1 submitted 13 April, 2024; originally announced April 2024.

    Comments: 15 pages, 10 figures

  16. arXiv:2404.08396  [pdf, other

    cs.IT

    Joint Computation Offloading and Target Tracking in Integrated Sensing and Communication Enabled UAV Networks

    Authors: Trinh Van Chien, Mai Dinh Cong, Nguyen Cong Luong, Tri Nhu Do, Dong In Kim, Symeon Chatzinotas

    Abstract: In this paper, we investigate a joint computation offloading and target tracking in Integrated Sensing and Communication (ISAC)-enabled unmanned aerial vehicle (UAV) network. Therein, the UAV has a computing task that is partially offloaded to the ground UE for execution. Meanwhile, the UAV uses the offloading bit sequence to estimate the velocity of a ground target based on an autocorrelation fun… ▽ More

    Submitted 12 April, 2024; originally announced April 2024.

    Comments: 5 pages, 3 figures, 1 table. Accepted by IEEE Communications Letters

  17. arXiv:2404.03321  [pdf, other

    cs.NI

    Fusion of Mixture of Experts and Generative Artificial Intelligence in Mobile Edge Metaverse

    Authors: Guangyuan Liu, Hongyang Du, Dusit Niyato, Jiawen Kang, Zehui Xiong, Abbas Jamalipour, Shiwen Mao, Dong In Kim

    Abstract: In the digital transformation era, Metaverse offers a fusion of virtual reality (VR), augmented reality (AR), and web technologies to create immersive digital experiences. However, the evolution of the Metaverse is slowed down by the challenges of content creation, scalability, and dynamic user interaction. Our study investigates an integration of Mixture of Experts (MoE) models with Generative Ar… ▽ More

    Submitted 4 April, 2024; originally announced April 2024.

  18. arXiv:2404.01954  [pdf, other

    cs.CL cs.AI

    HyperCLOVA X Technical Report

    Authors: Kang Min Yoo, Jaegeun Han, Sookyo In, Heewon Jeon, Jisu Jeong, Jaewook Kang, Hyunwook Kim, Kyung-Min Kim, Munhyong Kim, Sungju Kim, Donghyun Kwak, Hanock Kwak, Se Jung Kwon, Bado Lee, Dongsoo Lee, Gichang Lee, Jooho Lee, Baeseong Park, Seong** Shin, Joonsang Yu, Seolki Baek, Sumin Byeon, Eungsup Cho, Dooseok Choe, Jeesung Han , et al. (371 additional authors not shown)

    Abstract: We introduce HyperCLOVA X, a family of large language models (LLMs) tailored to the Korean language and culture, along with competitive capabilities in English, math, and coding. HyperCLOVA X was trained on a balanced mix of Korean, English, and code data, followed by instruction-tuning with high-quality human-annotated datasets while abiding by strict safety guidelines reflecting our commitment t… ▽ More

    Submitted 13 April, 2024; v1 submitted 2 April, 2024; originally announced April 2024.

    Comments: 44 pages; updated authors list and fixed author names

  19. arXiv:2404.01583  [pdf, other

    cs.NI

    Defining Problem from Solutions: Inverse Reinforcement Learning (IRL) and Its Applications for Next-Generation Networking

    Authors: Yinqiu Liu, Ruichen Zhang, Hongyang Du, Dusit Niyato, Jiawen Kang, Zehui Xiong, Dong In Kim

    Abstract: Performance optimization is a critical concern in networking, on which Deep Reinforcement Learning (DRL) has achieved great success. Nonetheless, DRL training relies on precisely defined reward functions, which formulate the optimization objective and indicate the positive/negative progress towards the optimal. With the ever-increasing environmental complexity and human participation in Next-Gener… ▽ More

    Submitted 1 April, 2024; originally announced April 2024.

    Comments: 9 pages

  20. arXiv:2403.16477  [pdf, other

    cs.IT eess.SP

    Safeguarding Next Generation Multiple Access Using Physical Layer Security Techniques: A Tutorial

    Authors: Lu Lv, Dongyang Xu, Rose Qingyang Hu, Yinghui Ye, Long Yang, Xianfu Lei, Xianbin Wang, Dong In Kim, Arumugam Nallanathan

    Abstract: Driven by the ever-increasing requirements of ultra-high spectral efficiency, ultra-low latency, and massive connectivity, the forefront of wireless research calls for the design of advanced next generation multiple access schemes to facilitate provisioning of these stringent demands. This inspires the embrace of non-orthogonal multiple access (NOMA) in future wireless communication networks. Neve… ▽ More

    Submitted 21 May, 2024; v1 submitted 25 March, 2024; originally announced March 2024.

    Comments: Invited paper by Proceedings of the IEEE

  21. PECI-Net: Bolus segmentation from video fluoroscopic swallowing study images using preprocessing ensemble and cascaded inference

    Authors: Dougho Park, Younghun Kim, Harim Kang, Junmyeoung Lee, **young Choi, Taeyeon Kim, Sangeok Lee, Seokil Son, Minsol Kim, Injung Kim

    Abstract: Bolus segmentation is crucial for the automated detection of swallowing disorders in videofluoroscopic swallowing studies (VFSS). However, it is difficult for the model to accurately segment a bolus region in a VFSS image because VFSS images are translucent, have low contrast and unclear region boundaries, and lack color information. To overcome these challenges, we propose PECI-Net, a network arc… ▽ More

    Submitted 21 March, 2024; originally announced March 2024.

    Comments: 20 pages, 8 figures,

    Journal ref: Computers in Biology and Medicine (2024)

  22. arXiv:2403.13237  [pdf, ps, other

    cs.CR math.OC

    Graph Attention Network-based Block Propagation with Optimal AoI and Reputation in Web 3.0

    Authors: Jiana Liao, **bo Wen, Jiawen Kang, Changyan Yi, Yang Zhang, Yutao Jiao, Dusit Niyato, Dong In Kim, Shengli Xie

    Abstract: Web 3.0 is recognized as a pioneering paradigm that empowers users to securely oversee data without reliance on a centralized authority. Blockchains, as a core technology to realize Web 3.0, can facilitate decentralized and transparent data management. Nevertheless, the evolution of blockchain-enabled Web 3.0 is still in its nascent phase, grappling with challenges such as ensuring efficiency and… ▽ More

    Submitted 8 May, 2024; v1 submitted 19 March, 2024; originally announced March 2024.

  23. arXiv:2403.08277  [pdf, other

    cs.CV

    VIGFace: Virtual Identity Generation Model for Face Image Synthesis

    Authors: Minsoo Kim, Min-Cheol Sagong, Gi Pyo Nam, Junghyun Cho, Ig-Jae Kim

    Abstract: Deep learning-based face recognition continues to face challenges due to its reliance on huge datasets obtained from web crawling, which can be costly to gather and raise significant real-world privacy concerns. To address this issue, we propose VIGFace, a novel framework capable of generating synthetic facial images. Initially, we train the face recognition model using a real face dataset and cre… ▽ More

    Submitted 13 March, 2024; originally announced March 2024.

  24. arXiv:2403.08256  [pdf, other

    cs.CV

    IG-FIQA: Improving Face Image Quality Assessment through Intra-class Variance Guidance robust to Inaccurate Pseudo-Labels

    Authors: Minsoo Kim, Gi Pyo Nam, Haksub Kim, Haesol Park, Ig-Jae Kim

    Abstract: In the realm of face image quality assesment (FIQA), method based on sample relative classification have shown impressive performance. However, the quality scores used as pseudo-labels assigned from images of classes with low intra-class variance could be unrelated to the actual quality in this method. To address this issue, we present IG-FIQA, a novel approach to guide FIQA training, introducing… ▽ More

    Submitted 13 March, 2024; originally announced March 2024.

  25. arXiv:2403.04925  [pdf, ps, other

    cs.IT eess.SP

    Near Field Communications for DMA-NOMA Networks

    Authors: Zheng Zhang, Yuanwei Liu, Zhaolin Wang, Jian Chen, Dong In Kim

    Abstract: A novel near-field transmission framework is proposed for dynamic metasurface antenna (DMA)-enabled non-orthogonal multiple access (NOMA) networks. The base station (BS) exploits the hybrid beamforming to communicate with multiple near users (NUs) and far users (FUs) using the NOMA principle. Based on this framework, two novel beamforming schemes are proposed. 1) For the case of the grouped users… ▽ More

    Submitted 7 March, 2024; originally announced March 2024.

    Comments: 13 pages

  26. arXiv:2402.18062  [pdf, other

    cs.RO cs.AI

    Generative AI for Unmanned Vehicle Swarms: Challenges, Applications and Opportunities

    Authors: Guangyuan Liu, Nguyen Van Huynh, Hongyang Du, Dinh Thai Hoang, Dusit Niyato, Kun Zhu, Jiawen Kang, Zehui Xiong, Abbas Jamalipour, Dong In Kim

    Abstract: With recent advances in artificial intelligence (AI) and robotics, unmanned vehicle swarms have received great attention from both academia and industry due to their potential to provide services that are difficult and dangerous to perform by humans. However, learning and coordinating movements and actions for a large number of unmanned vehicles in complex and dynamic environments introduce signif… ▽ More

    Submitted 28 February, 2024; originally announced February 2024.

    Comments: 23 pages

  27. arXiv:2402.13553  [pdf, other

    cs.CR

    Generative AI for Secure Physical Layer Communications: A Survey

    Authors: Changyuan Zhao, Hongyang Du, Dusit Niyato, Jiawen Kang, Zehui Xiong, Dong In Kim, Xuemin, Shen, Khaled B. Letaief

    Abstract: Generative Artificial Intelligence (GAI) stands at the forefront of AI innovation, demonstrating rapid advancement and unparalleled proficiency in generating diverse content. Beyond content creation, GAI has significant analytical abilities to learn complex data distribution, offering numerous opportunities to resolve security issues. In the realm of security from physical layer perspectives, trad… ▽ More

    Submitted 21 February, 2024; originally announced February 2024.

    Comments: 22pages, 8figs

  28. arXiv:2402.09756  [pdf, other

    cs.NI eess.SP

    Mixture of Experts for Network Optimization: A Large Language Model-enabled Approach

    Authors: Hongyang Du, Guangyuan Liu, Yi**g Lin, Dusit Niyato, Jiawen Kang, Zehui Xiong, Dong In Kim

    Abstract: Optimizing various wireless user tasks poses a significant challenge for networking systems because of the expanding range of user requirements. Despite advancements in Deep Reinforcement Learning (DRL), the need for customized optimization tasks for individual users complicates develo** and applying numerous DRL models, leading to substantial computation resource and energy consumption and can… ▽ More

    Submitted 15 February, 2024; originally announced February 2024.

  29. arXiv:2402.06942  [pdf, other

    cs.NI

    Toward Scalable Generative AI via Mixture of Experts in Mobile Edge Networks

    Authors: Jiacheng Wang, Hongyang Du, Dusit Niyato, Jiawen Kang, Zehui Xiong, Dong In Kim, Khaled B. Letaief

    Abstract: The advancement of generative artificial intelligence (GAI) has driven revolutionary applications like ChatGPT. The widespread of these applications relies on the mixture of experts (MoE), which contains multiple experts and selectively engages them for each task to lower operation costs while maintaining performance. Despite MoE, GAI faces challenges in resource consumption when deployed on user… ▽ More

    Submitted 10 February, 2024; originally announced February 2024.

  30. arXiv:2402.02972  [pdf, other

    cs.CV cs.LG

    Retrieval-Augmented Score Distillation for Text-to-3D Generation

    Authors: Junyoung Seo, Susung Hong, Wooseok Jang, Inès Hyeonsu Kim, Minseop Kwak, Doyup Lee, Seungryong Kim

    Abstract: Text-to-3D generation has achieved significant success by incorporating powerful 2D diffusion models, but insufficient 3D prior knowledge also leads to the inconsistency of 3D geometry. Recently, since large-scale multi-view datasets have been released, fine-tuning the diffusion model on the multi-view datasets becomes a mainstream to solve the 3D inconsistency problem. However, it has confronted… ▽ More

    Submitted 2 May, 2024; v1 submitted 5 February, 2024; originally announced February 2024.

    Comments: Accepted to ICML 2024 / Project Page: https://ku-cvlab.github.io/ReDream/

  31. arXiv:2401.10095  [pdf, other

    quant-ph cs.IT cs.LG

    Learning shallow quantum circuits

    Authors: Hsin-Yuan Huang, Yunchao Liu, Michael Broughton, Isaac Kim, Anurag Anshu, Zeph Landau, Jarrod R. McClean

    Abstract: Despite fundamental interests in learning quantum circuits, the existence of a computationally efficient algorithm for learning shallow quantum circuits remains an open question. Because shallow quantum circuits can generate distributions that are classically hard to sample from, existing learning algorithms do not apply. In this work, we present a polynomial-time classical algorithm for learning… ▽ More

    Submitted 18 January, 2024; originally announced January 2024.

    Comments: 10 pages, 14 figures (7 inline; 7 floating) + 76-page appendix

    Journal ref: In Proceedings of the 56th Annual ACM Symposium on Theory of Computing (STOC 2024)

  32. arXiv:2401.07764  [pdf, other

    cs.AI cs.NI

    When Large Language Model Agents Meet 6G Networks: Perception, Grounding, and Alignment

    Authors: Minrui Xu, Dusit Niyato, Jiawen Kang, Zehui Xiong, Shiwen Mao, Zhu Han, Dong In Kim, Khaled B. Letaief

    Abstract: AI agents based on multimodal large language models (LLMs) are expected to revolutionize human-computer interaction and offer more personalized assistant services across various domains like healthcare, education, manufacturing, and entertainment. Deploying LLM agents in 6G networks enables users to access previously expensive AI assistant services via mobile devices democratically, thereby reduci… ▽ More

    Submitted 16 February, 2024; v1 submitted 15 January, 2024; originally announced January 2024.

  33. arXiv:2401.06386  [pdf, other

    cs.MM

    Generative AI-enabled Mobile Tactical Multimedia Networks: Distribution, Generation, and Perception

    Authors: Minrui Xu, Dusit Niyato, Jiawen Kang, Zehui Xiong, Song Guo, Yuguang Fang, Dong In Kim

    Abstract: Mobile multimedia networks (MMNs) demonstrate great potential in delivering low-latency and high-quality entertainment and tactical applications, such as short-video sharing, online conferencing, and battlefield surveillance. For instance, in tactical surveillance of battlefields, scalability and sustainability are indispensable for maintaining large-scale military multimedia applications in MMNs.… ▽ More

    Submitted 12 January, 2024; originally announced January 2024.

  34. arXiv:2312.12467  [pdf, other

    cs.LG cs.AI cs.CE

    Learning Flexible Body Collision Dynamics with Hierarchical Contact Mesh Transformer

    Authors: Youn-Yeol Yu, Jeongwhan Choi, Woo** Cho, Kook** Lee, Nayong Kim, Kiseok Chang, Chang-Seung Woo, Ilho Kim, Seok-Woo Lee, Joon-Young Yang, Sooyoung Yoon, Noseong Park

    Abstract: Recently, many mesh-based graph neural network (GNN) models have been proposed for modeling complex high-dimensional physical systems. Remarkable achievements have been made in significantly reducing the solving time compared to traditional numerical solvers. These methods are typically designed to i) reduce the computational cost in solving physical dynamics and/or ii) propose techniques to enhan… ▽ More

    Submitted 25 March, 2024; v1 submitted 19 December, 2023; originally announced December 2023.

    Comments: Accepted at ICLR 2024

  35. arXiv:2312.12063  [pdf, other

    cs.NI cs.AI cs.GT

    Resource-efficient Generative Mobile Edge Networks in 6G Era: Fundamentals, Framework and Case Study

    Authors: Bingkun Lai, **bo Wen, Jiawen Kang, Hongyang Du, Jiangtian Nie, Changyan Yi, Dong In Kim, Shengli Xie

    Abstract: As the next-generation wireless communication system, Sixth-Generation (6G) technologies are emerging, enabling various mobile edge networks that can revolutionize wireless communication and connectivity. By integrating Generative Artificial Intelligence (GAI) with mobile edge networks, generative mobile edge networks possess immense potential to enhance the intelligence and efficiency of wireless… ▽ More

    Submitted 19 December, 2023; originally announced December 2023.

  36. arXiv:2312.05594  [pdf, other

    cs.NI cs.AI

    Generative AI for Physical Layer Communications: A Survey

    Authors: Nguyen Van Huynh, Jiacheng Wang, Hongyang Du, Dinh Thai Hoang, Dusit Niyato, Diep N. Nguyen, Dong In Kim, Khaled B. Letaief

    Abstract: The recent evolution of generative artificial intelligence (GAI) leads to the emergence of groundbreaking applications such as ChatGPT, which not only enhances the efficiency of digital content production, such as text, audio, video, or even network traffic data, but also enriches its diversity. Beyond digital content creation, GAI's capability in analyzing complex data distributions offers great… ▽ More

    Submitted 9 December, 2023; originally announced December 2023.

  37. arXiv:2311.17952  [pdf, other

    cs.CV

    Synchronizing Vision and Language: Bidirectional Token-Masking AutoEncoder for Referring Image Segmentation

    Authors: Minhyeok Lee, Dogyoon Lee, Jungho Lee, Suhwan Cho, Heeseung Choi, Ig-Jae Kim, Sangyoun Lee

    Abstract: Referring Image Segmentation (RIS) aims to segment target objects expressed in natural language within a scene at the pixel level. Various recent RIS models have achieved state-of-the-art performance by generating contextual tokens to model multimodal features from pretrained encoders and effectively fusing them using transformer-based cross-modal attention. While these methods match language feat… ▽ More

    Submitted 29 November, 2023; originally announced November 2023.

  38. arXiv:2311.17094  [pdf, other

    cs.LG cs.CV

    In Search of a Data Transformation That Accelerates Neural Field Training

    Authors: Junwon Seo, Sangyoon Lee, Kwang In Kim, Jaeho Lee

    Abstract: Neural field is an emerging paradigm in data representation that trains a neural network to approximate the given signal. A key obstacle that prevents its widespread adoption is the encoding speed-generating neural fields requires an overfitting of a neural network, which can take a significant number of SGD steps to reach the desired fidelity level. In this paper, we delve into the impacts of dat… ▽ More

    Submitted 26 March, 2024; v1 submitted 28 November, 2023; originally announced November 2023.

    Comments: CVPR 2024

  39. arXiv:2311.11094  [pdf, other

    cs.NI

    User-Centric Interactive AI for Distributed Diffusion Model-based AI-Generated Content

    Authors: Hongyang Du, Ruichen Zhang, Dusit Niyato, Jiawen Kang, Zehui Xiong, Shuguang Cui, Xuemin Shen, Dong In Kim

    Abstract: Distributed Artificial Intelligence-Generated Content (AIGC) has attracted increasing attention. However, it faces two significant challenges: how to maximize the subjective Quality of Experience (QoE) and how to enhance the energy efficiency, which are particularly pronounced in widely adopted Generative Diffusion Model (GDM)-based AIGC services for image generation. In this paper, we propose a n… ▽ More

    Submitted 18 November, 2023; originally announced November 2023.

  40. arXiv:2311.06523  [pdf, other

    cs.NI eess.SP

    Generative AI for Space-Air-Ground Integrated Networks (SAGIN)

    Authors: Ruichen Zhang, Hongyang Du, Dusit Niyato, Jiawen Kang, Zehui Xiong, Abbas Jamalipour, ** Zhang, Dong In Kim

    Abstract: Recently, generative AI technologies have emerged as a significant advancement in artificial intelligence field, renowned for their language and image generation capabilities. Meantime, space-air-ground integrated network (SAGIN) is an integral part of future B5G/6G for achieving ubiquitous connectivity. Inspired by this, this article explores an integration of generative AI in SAGIN, focusing on… ▽ More

    Submitted 11 November, 2023; originally announced November 2023.

    Comments: 9page, 5 figures

  41. arXiv:2311.03784  [pdf, other

    cs.CV cs.LG

    UP-NeRF: Unconstrained Pose-Prior-Free Neural Radiance Fields

    Authors: Injae Kim, Minhyuk Choi, Hyunwoo J. Kim

    Abstract: Neural Radiance Field (NeRF) has enabled novel view synthesis with high fidelity given images and camera poses. Subsequent works even succeeded in eliminating the necessity of pose priors by jointly optimizing NeRF and camera pose. However, these works are limited to relatively simple settings such as photometrically consistent and occluder-free image collections or a sequence of images from a vid… ▽ More

    Submitted 7 November, 2023; v1 submitted 7 November, 2023; originally announced November 2023.

    Comments: Neural Information Processing Systems (NeurIPS), 2023. The code is available at https://github.com/mlvlab/UP-NeRF

  42. arXiv:2311.03001  [pdf, other

    cs.LG stat.ML

    Variational Weighting for Kernel Density Ratios

    Authors: Sangwoong Yoon, Frank C. Park, Gunsu S Yun, Iljung Kim, Yung-Kyun Noh

    Abstract: Kernel density estimation (KDE) is integral to a range of generative and discriminative tasks in machine learning. Drawing upon tools from the multidimensional calculus of variations, we derive an optimal weight function that reduces bias in standard kernel density estimates for density ratios, leading to improved estimates of prediction posteriors and information-theoretic measures. In the proces… ▽ More

    Submitted 6 November, 2023; originally announced November 2023.

    Comments: NeurIPS 2023

  43. arXiv:2311.00947  [pdf, other

    cs.NI

    The Age of Generative AI and AI-Generated Everything

    Authors: Hongyang Du, Dusit Niyato, Jiawen Kang, Zehui Xiong, ** Zhang, Shuguang Cui, Xuemin Shen, Shiwen Mao, Zhu Han, Abbas Jamalipour, H. Vincent Poor, Dong In Kim

    Abstract: Generative AI (GAI) has emerged as a significant advancement in artificial intelligence, renowned for its language and image generation capabilities. This paper presents ``AI-Generated Everything'' (AIGX), a concept that extends GAI beyond mere content creation to real-time adaptation and control across diverse technological domains. In networking, AIGX collaborates closely with physical, data lin… ▽ More

    Submitted 1 November, 2023; originally announced November 2023.

  44. arXiv:2310.19043  [pdf, other

    math.ST cs.CR cs.LG stat.ME stat.ML

    Differentially Private Permutation Tests: Applications to Kernel Methods

    Authors: Ilmun Kim, Antonin Schrab

    Abstract: Recent years have witnessed growing concerns about the privacy of sensitive data. In response to these concerns, differential privacy has emerged as a rigorous framework for privacy protection, gaining widespread recognition in both academic and industrial circles. While substantial progress has been made in private data analysis, existing methods often suffer from impracticality or a significant… ▽ More

    Submitted 7 January, 2024; v1 submitted 29 October, 2023; originally announced October 2023.

  45. arXiv:2310.00488  [pdf, other

    cs.LG cs.AI

    On Memorization and Privacy Risks of Sharpness Aware Minimization

    Authors: Young In Kim, Pratiksha Agrawal, Johannes O. Royset, Rajiv Khanna

    Abstract: In many recent works, there is an increased focus on designing algorithms that seek flatter optima for neural network loss optimization as there is empirical evidence that it leads to better generalization performance in many datasets. In this work, we dissect these performance gains through the lens of data memorization in overparameterized models. We define a new metric that helps us identify wh… ▽ More

    Submitted 3 January, 2024; v1 submitted 30 September, 2023; originally announced October 2023.

  46. arXiv:2309.14072  [pdf, other

    cs.CV

    BoIR: Box-Supervised Instance Representation for Multi-Person Pose Estimation

    Authors: Uyoung Jeong, Seungryul Baek, Hyung ** Chang, Kwang In Kim

    Abstract: Single-stage multi-person human pose estimation (MPPE) methods have shown great performance improvements, but existing methods fail to disentangle features by individual instances under crowded scenes. In this paper, we propose a bounding box-level instance representation learning called BoIR, which simultaneously solves instance detection, instance disentanglement, and instance-keypoint associati… ▽ More

    Submitted 2 November, 2023; v1 submitted 25 September, 2023; originally announced September 2023.

    Comments: Accepted to BMVC 2023, 19 pages including the appendix, 6 figures, 7 tables

  47. arXiv:2309.12047  [pdf, other

    cs.CV cs.GR eess.IV

    Self-Calibrating, Fully Differentiable NLOS Inverse Rendering

    Authors: Kiseok Choi, Inchul Kim, Dongyoung Choi, Julio Marco, Diego Gutierrez, Min H. Kim

    Abstract: Existing time-resolved non-line-of-sight (NLOS) imaging methods reconstruct hidden scenes by inverting the optical paths of indirect illumination measured at visible relay surfaces. These methods are prone to reconstruction artifacts due to inversion ambiguities and capture noise, which are typically mitigated through the manual selection of filtering functions and parameters. We introduce a fully… ▽ More

    Submitted 25 September, 2023; v1 submitted 21 September, 2023; originally announced September 2023.

    Journal ref: Proceedings of ACM SIGGRAPH Asia 2023 (December 2023)

  48. arXiv:2309.02616  [pdf, other

    eess.IV cs.LG cs.NI

    Generative AI-aided Joint Training-free Secure Semantic Communications via Multi-modal Prompts

    Authors: Hongyang Du, Guangyuan Liu, Dusit Niyato, Jiayi Zhang, Jiawen Kang, Zehui Xiong, Bo Ai, Dong In Kim

    Abstract: Semantic communication (SemCom) holds promise for reducing network resource consumption while achieving the communications goal. However, the computational overheads in jointly training semantic encoders and decoders-and the subsequent deployment in network devices-are overlooked. Recent advances in Generative artificial intelligence (GAI) offer a potential solution. The robust learning abilities… ▽ More

    Submitted 5 September, 2023; originally announced September 2023.

  49. arXiv:2308.16099  [pdf, ps, other

    cs.IT

    Rate-Splitting for CF Massive MIMO Systems With Channel Aging

    Authors: Jiakang Zheng, Jiayi Zhang, Hongyang Du, Dusit Niyato, Dong In Kim, Bo Ai

    Abstract: The cell-free (CF) massive multiple-input multiple-output (MIMO) system is considered a cutting-edge technology in next-generation mobile communication due to its ability to provide high-performance coverage seamlessly and uniformly. This paper aims to mitigate the negative impact of channel aging due to the movement of user equipment in CF massive MIMO systems by utilizing rate-splitting (RS) tec… ▽ More

    Submitted 30 August, 2023; originally announced August 2023.

    Comments: 6 pages, 5 figures, accepted by IEEE Transactions on Vehicular Technology

  50. arXiv:2308.11920  [pdf, other

    cs.CV cs.AI

    Concept Bottleneck with Visual Concept Filtering for Explainable Medical Image Classification

    Authors: Injae Kim, Jongha Kim, Joonmyung Choi, Hyunwoo J. Kim

    Abstract: Interpretability is a crucial factor in building reliable models for various medical applications. Concept Bottleneck Models (CBMs) enable interpretable image classification by utilizing human-understandable concepts as intermediate targets. Unlike conventional methods that require extensive human labor to construct the concept set, recent works leveraging Large Language Models (LLMs) for generati… ▽ More

    Submitted 23 August, 2023; originally announced August 2023.

    Comments: Accepted to MedAGI Workshop at MICCAI 2023 (Oral Presentation)