Skip to main content

Showing 1–13 of 13 results for author: Hwang, R

Searching in archive cs. Search in all archives.
.
  1. arXiv:2404.08847  [pdf, other

    cs.IR cs.CR cs.LG

    LazyDP: Co-Designing Algorithm-Software for Scalable Training of Differentially Private Recommendation Models

    Authors: Juntaek Lim, Youngeun Kwon, Ranggi Hwang, Kiwan Maeng, G. Edward Suh, Minsoo Rhu

    Abstract: Differential privacy (DP) is widely being employed in the industry as a practical standard for privacy protection. While private training of computer vision or natural language processing applications has been studied extensively, the computational challenges of training of recommender systems (RecSys) with DP have not been explored. In this work, we first present our detailed characterization of… ▽ More

    Submitted 12 April, 2024; originally announced April 2024.

    Journal ref: Published at 29th ACM International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS-29), 2024

  2. arXiv:2308.12066  [pdf, other

    cs.LG cs.AI cs.AR

    Pre-gated MoE: An Algorithm-System Co-Design for Fast and Scalable Mixture-of-Expert Inference

    Authors: Ranggi Hwang, Jianyu Wei, Shijie Cao, Changho Hwang, Xiaohu Tang, Ting Cao, Mao Yang

    Abstract: Large language models (LLMs) based on transformers have made significant strides in recent years, the success of which is driven by scaling up their model size. Despite their high algorithmic performance, the computational and memory requirements of LLMs present unprecedented challenges. To tackle the high compute requirements of LLMs, the Mixture-of-Experts (MoE) architecture was introduced which… ▽ More

    Submitted 27 April, 2024; v1 submitted 23 August, 2023; originally announced August 2023.

  3. arXiv:2212.03280  [pdf, other

    cs.NI

    Optimizing Resource Allocation with High-Reliability Constraint for Multicasting Automotive Messages in 5G NR C-V2X Networks

    Authors: Kuan-Lin Chen, Wei-Yu Chen, Ren-Hung Hwang

    Abstract: Cellular vehicle-to-everything (C-V2X) has been continuously evolving since Release 14 of the 3rd Generation Partnership Project (3GPP) for future autonomous vehicles. Apart from automotive safety, 5G NR further bring new capabilities to C-V2X for autonomous driving, such as real-time local update, and coordinated driving. These capabilities rely on the provision of low latency and high reliabilit… ▽ More

    Submitted 29 September, 2022; originally announced December 2022.

    Comments: 13 pages, submitted to IEEE Transactions on Vehicular Technology

    MSC Class: C.2

  4. arXiv:2209.01349  [pdf, other

    cs.NI cs.CR

    Towards the Age of Intelligent Vehicular Networks for Connected and Autonomous Vehicles in 6G

    Authors: Van-Linh Nguyen, Ren-Hung Hwang, Po-Ching Lin, Abhishek Vyas, Van-Tao Nguyen

    Abstract: Twenty-two years after the advent of the first-generation vehicular network, i.e., dedicated short-range communications (DSRC) standard/IEEE 802.11p, the vehicular technology market has become very competitive with a new player, Cellular Vehicle-to-Everything (C-V2X). Currently, C-V2X technology likely dominates the race because of the big advantages of comprehensive coverage and high throughput/r… ▽ More

    Submitted 3 September, 2022; originally announced September 2022.

  5. arXiv:2208.12392  [pdf, other

    cs.AR cs.AI cs.CR cs.LG

    DiVa: An Accelerator for Differentially Private Machine Learning

    Authors: Beomsik Park, Ranggi Hwang, Dongho Yoon, Yoonhyuk Choi, Minsoo Rhu

    Abstract: The widespread deployment of machine learning (ML) is raising serious concerns on protecting the privacy of users who contributed to the collection of training data. Differential privacy (DP) is rapidly gaining momentum in the industry as a practical standard for privacy protection. Despite DP's importance, however, little has been explored within the computer systems community regarding the impli… ▽ More

    Submitted 25 August, 2022; originally announced August 2022.

    Comments: Accepted for publication at the 55th IEEE/ACM International Symposium on Microarchitecture (MICRO-55), 2022

  6. arXiv:2203.00158  [pdf, other

    cs.AR cs.AI cs.LG

    GROW: A Row-Stationary Sparse-Dense GEMM Accelerator for Memory-Efficient Graph Convolutional Neural Networks

    Authors: Ranggi Hwang, Minhoo Kang, Jiwon Lee, Dongyun Kam, Youngjoo Lee, Minsoo Rhu

    Abstract: Graph convolutional neural networks (GCNs) have emerged as a key technology in various application domains where the input data is relational. A unique property of GCNs is that its two primary execution stages, aggregation and combination, exhibit drastically different dataflows. Consequently, prior GCN accelerators tackle this research space by casting the aggregation and combination stages as a… ▽ More

    Submitted 30 November, 2022; v1 submitted 28 February, 2022; originally announced March 2022.

    Comments: Accepted for publication at the 29th IEEE International Symposium on High-Performance Computer Architecture (HPCA), 2023

  7. arXiv:2111.04941  [pdf, other

    math.OC cs.AI cs.LG math.NA physics.comp-ph

    Solving PDE-constrained Control Problems Using Operator Learning

    Authors: Rakhoon Hwang, Jae Yong Lee, ** Young Shin, Hyung Ju Hwang

    Abstract: The modeling and control of complex physical systems are essential in real-world problems. We propose a novel framework that is generally applicable to solving PDE-constrained optimal control problems by introducing surrogate models for PDE solution operators with special regularizers. The procedure of the proposed framework is divided into two phases: solution operator learning for PDE constraint… ▽ More

    Submitted 26 December, 2023; v1 submitted 8 November, 2021; originally announced November 2021.

    Comments: 15 pages, 12 figures. Published as a conference paper at Thirty-Sixth AAAI Conference on Artificial Intelligence (AAAI 2022)

    MSC Class: 68U07

  8. Security and privacy for 6G: A survey on prospective technologies and challenges

    Authors: Van-Linh Nguyen, Po-Ching Lin, Bo-Chao Cheng, Ren-Hung Hwang, Ying-Dar Lin

    Abstract: Sixth-generation (6G) mobile networks will have to cope with diverse threats on a space-air-ground integrated network environment, novel technologies, and an accessible user information explosion. However, for now, security and privacy issues for 6G remain largely in concept. This survey provides a systematic overview of security and privacy issues based on prospective technologies for 6G in the p… ▽ More

    Submitted 31 August, 2021; v1 submitted 26 August, 2021; originally announced August 2021.

    Comments: 45 pages, 28 figures, accepted at IEEE Communications Surveys and Tutorials, 2021

  9. arXiv:2107.05015  [pdf

    cs.NI

    Offloading Optimization with Delay Distribution in the 3-tier Federated Cloud, Edge, and Fog Systems

    Authors: Ren-Hung Hwang, Yuan-Cheng Lai, Ying-Dar Lin

    Abstract: Mobile edge computing and fog computing are promising techniques providing computation service closer to users to achieve lower latency. In this work, we study the optimal offloading strategy in the three-tier federated computation offloading system. We first present queueing models and closed-form solutions for computing the service delay distribution and the probability of the delay of a task ex… ▽ More

    Submitted 11 July, 2021; originally announced July 2021.

    Comments: submitted to IEEE Globecom 2021

  10. arXiv:2005.05968  [pdf, other

    cs.DC cs.IR cs.LG

    Centaur: A Chiplet-based, Hybrid Sparse-Dense Accelerator for Personalized Recommendations

    Authors: Ranggi Hwang, Taehun Kim, Youngeun Kwon, Minsoo Rhu

    Abstract: Personalized recommendations are the backbone machine learning (ML) algorithm that powers several important application domains (e.g., ads, e-commerce, etc) serviced from cloud datacenters. Sparse embedding layers are a crucial building block in designing recommendations yet little attention has been paid in properly accelerating this important ML algorithm. This paper first provides a detailed wo… ▽ More

    Submitted 12 May, 2020; originally announced May 2020.

    Comments: Accepted for publication at the 47th IEEE/ACM International Symposium on Computer Architecture (ISCA-47), 2020

  11. arXiv:1911.02723  [pdf, ps, other

    cs.LG stat.ML

    Option Compatible Reward Inverse Reinforcement Learning

    Authors: Rakhoon Hwang, Han** Lee, Hyung Ju Hwang

    Abstract: Reinforcement learning in complex environments is a challenging problem. In particular, the success of reinforcement learning algorithms depends on a well-designed reward function. Inverse reinforcement learning (IRL) solves the problem of recovering reward functions from expert demonstrations. In this paper, we solve a hierarchical inverse reinforcement learning problem within the options framewo… ▽ More

    Submitted 18 January, 2021; v1 submitted 6 November, 2019; originally announced November 2019.

    Comments: This paper is under consideration at Pattern Recognition Letters

  12. arXiv:1903.05470  [pdf, other

    cs.CR

    Preventing the attempts of abusing cheap-hosting Web-servers for monetization attacks

    Authors: Van-Linh Nguyen, Po-Ching Lin, Ren-Hung Hwang

    Abstract: Over the past decades, the web is always one of the most popular targets of hackers. Today, along with the popular usage of open sources such as Wordpress and Joomla, the explosion of the vulnerabilities in such frameworks causes the websites using them to face numerous security threats. Unfortunately, many clients and small companies may not be aware of these serious security threats and call a r… ▽ More

    Submitted 13 March, 2019; v1 submitted 13 March, 2019; originally announced March 2019.

  13. arXiv:1902.02905  [pdf, other

    physics.med-ph cs.AI cs.CV cs.LG q-bio.NC

    Mobile Artificial Intelligence Technology for Detecting Macula Edema and Subretinal Fluid on OCT Scans: Initial Results from the DATUM alpha Study

    Authors: Stephen G. Odaibo, Mikelson MomPremier, Richard Y. Hwang, Salman J. Yousuf, Steven L. Williams, Joshua Grant

    Abstract: Artificial Intelligence (AI) is necessary to address the large and growing deficit in retina and healthcare access globally. And mobile AI diagnostic platforms running in the Cloud may effectively and efficiently distribute such AI capability. Here we sought to evaluate the feasibility of Cloud-based mobile artificial intelligence for detection of retinal disease. And to evaluate the accuracy of a… ▽ More

    Submitted 12 February, 2019; v1 submitted 7 February, 2019; originally announced February 2019.

    Comments: Initial results of the DATUM alpha Study were initially presented on August 13th 2018 in the Keynote Address at the 116th National Medical Association Annual Meeting & Scientific Assembly's New Innovations in Ophthalmology Session. The results were also presented on September 21st 2018 in a Podium Lecture during Alumni Day at the University of Michigan--Ann Arbor Kellogg Eye Center