Skip to main content

Showing 1–50 of 50 results for author: Oh, S

Searching in archive eess. Search in all archives.
.
  1. arXiv:2405.08530  [pdf, other

    eess.IV

    Parameter-Efficient Instance-Adaptive Neural Video Compression

    Authors: Hyunmo Yang, Seungjun Oh, Eunbyung Park

    Abstract: Learning-based Neural Video Codecs (NVCs) have emerged as a compelling alternative to the standard video codecs, demonstrating promising performance, and simple and easily maintainable pipelines. However, NVCs often fall short of compression performance and occasionally exhibit poor generalization capability due to inference-only compression scheme and their dependence on training data. The instan… ▽ More

    Submitted 11 June, 2024; v1 submitted 14 May, 2024; originally announced May 2024.

    Comments: 23 pages, 13 figures

  2. arXiv:2404.15374  [pdf, other

    eess.SP cs.LG

    Minimum Description Feature Selection for Complexity Reduction in Machine Learning-based Wireless Positioning

    Authors: Myeung Suk Oh, Anindya Bijoy Das, Taejoon Kim, David J. Love, Christopher G. Brinton

    Abstract: Recently, deep learning approaches have provided solutions to difficult problems in wireless positioning (WP). Although these WP algorithms have attained excellent and consistent performance against complex channel environments, the computational complexity coming from processing high-dimensional features can be prohibitive for mobile applications. In this work, we design a novel positioning neura… ▽ More

    Submitted 21 April, 2024; originally announced April 2024.

    Comments: This paper has been accepted for the publication in IEEE Journal on Selected Areas in Communications. arXiv admin note: text overlap with arXiv:2402.09580

  3. arXiv:2404.04096  [pdf, other

    cs.IT eess.SP

    Machine Learning-Aided Cooperative Localization under Dense Urban Environment

    Authors: Hoon Lee, Hong Ki Kim, Seung Hyun Oh, Sang Hyun Lee

    Abstract: Future wireless network technology provides automobiles with the connectivity feature to consolidate the concept of vehicular networks that collaborate on conducting cooperative driving tasks. The full potential of connected vehicles, which promises road safety and quality driving experience, can be leveraged if machine learning models guarantee the robustness in performing core functions includin… ▽ More

    Submitted 5 April, 2024; originally announced April 2024.

  4. arXiv:2402.17127  [pdf, other

    cs.SD eess.AS

    Experimental Study: Enhancing Voice Spoofing Detection Models with wav2vec 2.0

    Authors: Taein Kang, Soyul Han, Sunmook Choi, Jae** Seo, Sanghyeok Chung, Seungeun Lee, Seungsang Oh, Il-Youp Kwak

    Abstract: Conventional spoofing detection systems have heavily relied on the use of handcrafted features derived from speech data. However, a notable shift has recently emerged towards the direct utilization of raw speech waveforms, as demonstrated by methods like SincNet filters. This shift underscores the demand for more sophisticated audio sample features. Moreover, the success of deep learning models, p… ▽ More

    Submitted 26 February, 2024; originally announced February 2024.

    Comments: 5 pages

    MSC Class: 00A71 ACM Class: I.2.6

  5. arXiv:2402.09580  [pdf, other

    cs.LG eess.SP

    Complexity Reduction in Machine Learning-Based Wireless Positioning: Minimum Description Features

    Authors: Myeung Suk Oh, Anindya Bijoy Das, Taejoon Kim, David J. Love, Christopher G. Brinton

    Abstract: A recent line of research has been investigating deep learning approaches to wireless positioning (WP). Although these WP algorithms have demonstrated high accuracy and robust performance against diverse channel conditions, they also have a major drawback: they require processing high-dimensional features, which can be prohibitive for mobile applications. In this work, we design a positioning neur… ▽ More

    Submitted 14 February, 2024; originally announced February 2024.

    Comments: This paper has been accepted in IEEE International Conference on Communications (ICC) 2024

  6. arXiv:2401.03690  [pdf

    physics.med-ph eess.IV q-bio.QM

    So You Want to Image Myelin Using MRI: Magnetic Susceptibility Source Separation for Myelin Imaging

    Authors: Jongho Lee, Sooyeon Ji, Se-Hong Oh

    Abstract: In MRI, researchers have long endeavored to effectively visualize myelin distribution in the brain, a pursuit with significant implications for both scientific research and clinical applications. Over time, various methods such as myelin water imaging, magnetization transfer imaging, and relaxometric imaging have been developed, each carrying distinct advantages and limitations. Recently, an innov… ▽ More

    Submitted 28 March, 2024; v1 submitted 8 January, 2024; originally announced January 2024.

    Comments: Accepted to Magnetic Resonance in Medical Sciences

  7. arXiv:2312.10880  [pdf, other

    cs.RO eess.SY

    Sharable Clothoid-based Continuous Motion Planning for Connected Automated Vehicles

    Authors: Sanghoon Oh, Qi Chen, H. Eric Tseng, Gaurav Pandey, Gabor Orosz

    Abstract: A continuous motion planning method for connected automated vehicles is considered for generating feasible trajectories in real-time using three consecutive clothoids. The proposed method reduces path planning to a small set of nonlinear algebraic equations such that the generated path can be efficiently checked for feasibility and collision. After path planning, velocity planning is executed whil… ▽ More

    Submitted 17 December, 2023; originally announced December 2023.

    Comments: 14 pages, 14 figures

  8. arXiv:2312.01638  [pdf, other

    eess.IV cs.CV

    J-Net: Improved U-Net for Terahertz Image Super-Resolution

    Authors: Woon-Ha Yeo, Seung-Hwan Jung, Seung Jae Oh, Inhee Maeng, Eui Su Lee, Han-Cheol Ryu

    Abstract: Terahertz (THz) waves are electromagnetic waves in the 0.1 to 10 THz frequency range, and THz imaging is utilized in a range of applications, including security inspections, biomedical fields, and the non-destructive examination of materials. However, THz images have low resolution due to the long wavelength of THz waves. Therefore, improving the resolution of THz images is one of the current hot… ▽ More

    Submitted 4 December, 2023; originally announced December 2023.

  9. arXiv:2307.04292  [pdf, other

    eess.AS cs.AI

    A Demand-Driven Perspective on Generative Audio AI

    Authors: Sangshin Oh, Minsung Kang, Hyeongi Moon, Keunwoo Choi, Ben Sangbae Chon

    Abstract: To achieve successful deployment of AI research, it is crucial to understand the demands of the industry. In this paper, we present the results of a survey conducted with professional audio engineers, in order to determine research priorities and define various research tasks. We also summarize the current challenges in audio quality and controllability based on the survey. Our analysis emphasizes… ▽ More

    Submitted 9 July, 2023; originally announced July 2023.

    Comments: 10 pages, 7 figures

  10. arXiv:2306.12789  [pdf

    cs.SD eess.AS

    Russian assimilatory palatalization is incomplete neutralization

    Authors: Se** Oh, Jason A. Shaw, Karthik Durvasula, Alexei Kochotov

    Abstract: Incomplete neutralization refers to phonetic traces of underlying contrasts in phonologically neutralizing contexts. The present study examines one such context: Russian assimilatory palatalization in C+j sequences. Russian contrasts plain and palatalized consonants, with the plain consonants having a secondary articulation involving retraction of the tongue dorsum (velarization/uvularization). Ho… ▽ More

    Submitted 22 June, 2023; originally announced June 2023.

  11. arXiv:2306.10841  [pdf, other

    cs.LG cs.AI cs.CR eess.SY

    Blockchain-Enabled Federated Learning: A Reference Architecture Design, Implementation, and Verification

    Authors: Eunsu Goh, Dae-Yeol Kim, Kwangkee Lee, Suyeong Oh, Jong-Eui Chae, Do-Yup Kim

    Abstract: This paper presents a novel reference architecture for blockchain-enabled federated learning (BCFL), a state-of-the-art approach that amalgamates the strengths of federated learning and blockchain technology.We define smart contract functions, stakeholders and their roles, and the use of interplanetary file system (IPFS) as key components of BCFL and conduct a comprehensive analysis. In traditiona… ▽ More

    Submitted 22 November, 2023; v1 submitted 19 June, 2023; originally announced June 2023.

    Comments: 14 pages, 15 figures, 3 tables

    MSC Class: 68T01 (Primary) 68M14; 94A60 (Secondary) ACM Class: I.2.6; I.2.11

  12. arXiv:2306.09807  [pdf, other

    eess.AS cs.LG cs.SD

    FALL-E: A Foley Sound Synthesis Model and Strategies

    Authors: Minsung Kang, Sangshin Oh, Hyeongi Moon, Kyungyun Lee, Ben Sangbae Chon

    Abstract: This paper introduces FALL-E, a foley synthesis system and its training/inference strategies. The FALL-E model employs a cascaded approach comprising low-resolution spectrogram generation, spectrogram super-resolution, and a vocoder. We trained every sound-related model from scratch using our extensive datasets, and utilized a pre-trained language model. We conditioned the model with dataset-speci… ▽ More

    Submitted 10 August, 2023; v1 submitted 16 June, 2023; originally announced June 2023.

    Comments: 5 pages, 3 figures

  13. Dynamic and Robust Sensor Selection Strategies for Wireless Positioning with TOA/RSS Measurement

    Authors: Myeung Suk Oh, Seyyedali Hosseinalipour, Taejoon Kim, David J. Love, James V. Krogmeier, Christopher G. Brinton

    Abstract: Emerging wireless applications are requiring ever more accurate location-positioning from sensor measurements. In this paper, we develop sensor selection strategies for 3D wireless positioning based on time of arrival (TOA) and received signal strength (RSS) measurements to handle two distinct scenarios: (i) known approximated target location, for which we conduct dynamic sensor selection to minim… ▽ More

    Submitted 30 April, 2023; originally announced May 2023.

    Comments: This paper has been accepted to IEEE Transactions on Vehicular Technology for future publication

  14. arXiv:2304.12200  [pdf, other

    eess.SP cs.CR cs.DC cs.IT cs.LG math.NA

    SplitAMC: Split Learning for Robust Automatic Modulation Classification

    Authors: Jihoon Park, Seungeun Oh, Seong-Lyun Kim

    Abstract: Automatic modulation classification (AMC) is a technology that identifies a modulation scheme without prior signal information and plays a vital role in various applications, including cognitive radio and link adaptation. With the development of deep learning (DL), DL-based AMC methods have emerged, while most of them focus on reducing computational complexity in a centralized structure. This cent… ▽ More

    Submitted 17 April, 2023; originally announced April 2023.

    Comments: to be presented at IEEE VTC2023-Spring

  15. arXiv:2304.06237  [pdf, other

    cs.LG eess.SP

    Deep learning based ECG segmentation for delineation of diverse arrhythmias

    Authors: Chankyu Joung, Mi** Kim, Tae** Paik, Seong-Ho Kong, Seung-Young Oh, Won Kyeong Jeon, Jae-hu Jeon, Joong-Sik Hong, Wan-Joong Kim, Woong Kook, Myung-** Cha, Otto van Koert

    Abstract: Accurate delineation of key waveforms in an ECG is a critical initial step in extracting relevant features to support the diagnosis and treatment of heart conditions. Although deep learning based methods using a segmentation model to locate the P, QRS, and T waves have shown promising results, their ability to handle signals exhibiting arrhythmia remains unclear. This study builds on existing rese… ▽ More

    Submitted 6 September, 2023; v1 submitted 12 April, 2023; originally announced April 2023.

  16. A Decentralized Pilot Assignment Algorithm for Scalable O-RAN Cell-Free Massive MIMO

    Authors: Myeung Suk Oh, Anindya Bijoy Das, Seyyedali Hosseinalipour, Taejoon Kim, David J. Love, Christopher G. Brinton

    Abstract: Radio access networks (RANs) in monolithic architectures have limited adaptability to supporting different network scenarios. Recently, open-RAN (O-RAN) techniques have begun adding enormous flexibility to RAN implementations. O-RAN is a natural architectural fit for cell-free massive multiple-input multiple-output (CFmMIMO) systems, where many geographically-distributed access points (APs) are em… ▽ More

    Submitted 1 April, 2024; v1 submitted 11 January, 2023; originally announced January 2023.

    Comments: The journal version of this paper is published in IEEE Journal on Selected Areas in Communications

  17. arXiv:2212.07939  [pdf, other

    cs.CL cs.LG cs.SD eess.AS

    RWEN-TTS: Relation-aware Word Encoding Network for Natural Text-to-Speech Synthesis

    Authors: Shinhyeok Oh, HyeongRae Noh, Yoonseok Hong, Insoo Oh

    Abstract: With the advent of deep learning, a huge number of text-to-speech (TTS) models which produce human-like speech have emerged. Recently, by introducing syntactic and semantic information w.r.t the input text, various approaches have been proposed to enrich the naturalness and expressiveness of TTS models. Although these strategies showed impressive results, they still have some limitations in utiliz… ▽ More

    Submitted 15 December, 2022; originally announced December 2022.

    Comments: Accepted to AAAI 2023

  18. arXiv:2209.06305  [pdf

    physics.optics eess.IV

    Ptychographic lens-less polarization microscopy

    Authors: Jeongsoo Kim, Seungri Song, Bora Kim, Mirae Park, Seung Jae Oh, Daesuk Kim, Barry Cense, Yong-Min Huh, Joo Yong Lee, Chulmin Joo

    Abstract: Birefringence, an inherent characteristic of optically anisotropic materials, is widely utilized in various imaging applications ranging from material characterizations to clinical diagnosis. Polarized light microscopy enables high-resolution, high-contrast imaging of optically anisotropic specimens, but it is associated with mechanical rotations of polarizer/analyzer and relatively complex optica… ▽ More

    Submitted 13 September, 2022; originally announced September 2022.

    Comments: 18 pages, 10 figures, author names corrected

  19. arXiv:2207.10760  [pdf, ps, other

    cs.SD cs.AI cs.MM eess.AS

    A Proposal for Foley Sound Synthesis Challenge

    Authors: Keunwoo Choi, Sangshin Oh, Minsung Kang, Brian McFee

    Abstract: "Foley" refers to sound effects that are added to multimedia during post-production to enhance its perceived acoustic properties, e.g., by simulating the sounds of footsteps, ambient environmental sounds, or visible objects on the screen. While foley is traditionally produced by foley artists, there is increasing interest in automatic or machine-assisted techniques building upon recent advances in… ▽ More

    Submitted 21 July, 2022; originally announced July 2022.

  20. arXiv:2207.10324  [pdf, other

    eess.IV cs.CV cs.LG

    Enhancing Generative Networks for Chest Anomaly Localization through Automatic Registration-Based Unpaired-to-Pseudo-Paired Training Data Translation

    Authors: Kyungsu Kim, Seong Je Oh, Chae Yeon Lim, Ju Hwan Lee, Tae Uk Kim, Myung ** Chung

    Abstract: Image translation based on a generative adversarial network (GAN-IT) is a promising method for the precise localization of abnormal regions in chest X-ray images (AL-CXR) even without the pixel-level annotation. However, heterogeneous unpaired datasets undermine existing methods to extract key features and distinguish normal from abnormal cases, resulting in inaccurate and unstable AL-CXR. To addr… ▽ More

    Submitted 15 June, 2024; v1 submitted 21 July, 2022; originally announced July 2022.

  21. arXiv:2206.14984  [pdf, other

    eess.AS cs.SD

    TTS-by-TTS 2: Data-selective augmentation for neural speech synthesis using ranking support vector machine with variational autoencoder

    Authors: Eunwoo Song, Ryuichi Yamamoto, Ohsung Kwon, Chan-Ho Song, Min-Jae Hwang, Suhyeon Oh, Hyun-Wook Yoon, **-Seob Kim, Jae-Min Kim

    Abstract: Recent advances in synthetic speech quality have enabled us to train text-to-speech (TTS) systems by using synthetic corpora. However, merely increasing the amount of synthetic data is not always advantageous for improving training efficiency. Our aim in this study is to selectively choose synthetic data that are beneficial to the training process. In the proposed method, we first adopt a variatio… ▽ More

    Submitted 29 June, 2022; originally announced June 2022.

    Comments: Accepted to the conference of INTERSPEECH 2022

  22. arXiv:2206.13504  [pdf, other

    eess.IV cs.CV cs.LG

    AI-based computer-aided diagnostic system of chest digital tomography synthesis: Demonstrating comparative advantage with X-ray-based AI systems

    Authors: Kyung-Su Kim, Ju Hwan Lee, Seong Je Oh, Myung ** Chung

    Abstract: Compared with chest X-ray (CXR) imaging, which is a single image projected from the front of the patient, chest digital tomosynthesis (CDTS) imaging can be more advantageous for lung lesion detection because it acquires multiple images projected from multiple angles of the patient. Various clinical comparative analysis and verification studies have been reported to demonstrate this, but there were… ▽ More

    Submitted 18 June, 2022; originally announced June 2022.

    Comments: Kyung-Su Kim, Ju Hwan Lee, and Seong Je Oh have contributed equally to this work as the co-first author. Kyung-Su Kim ([email protected]) and Myung ** Chung ([email protected]) have contributed equally to this work as the co-corresponding author

  23. arXiv:2206.13385  [pdf, other

    eess.IV cs.CV cs.LG

    3D unsupervised anomaly detection and localization through virtual multi-view projection and reconstruction: Clinical validation on low-dose chest computed tomography

    Authors: Kyung-Su Kim, Seong Je Oh, Ju Hwan Lee, Myung ** Chung

    Abstract: Computer-aided diagnosis for low-dose computed tomography (CT) based on deep learning has recently attracted attention as a first-line automatic testing tool because of its high accuracy and low radiation exposure. However, existing methods rely on supervised learning, imposing an additional burden to doctors for collecting disease data or annotating spatial labels for network training, consequent… ▽ More

    Submitted 18 June, 2022; originally announced June 2022.

    Comments: Kyung-Su Kim and Seong Je Oh have contributed equally to this work as the co-first author. Kyung-Su Kim ([email protected]) and Myung ** Chung ([email protected]) have contributed equally to this work as the co-corresponding author

  24. arXiv:2205.04104  [pdf, other

    eess.AS cs.AI

    ReCAB-VAE: Gumbel-Softmax Variational Inference Based on Analytic Divergence

    Authors: Sangshin Oh, Seyun Um, Hong-Goo Kang

    Abstract: The Gumbel-softmax distribution, or Concrete distribution, is often used to relax the discrete characteristics of a categorical distribution and enable back-propagation through differentiable reparameterization. Although it reliably yields low variance gradients, it still relies on a stochastic sampling process for optimization. In this work, we present a relaxed categorical analytic bound (ReCAB)… ▽ More

    Submitted 9 May, 2022; originally announced May 2022.

  25. arXiv:2202.04328  [pdf, other

    cs.SD eess.AS

    CAU_KU team's submission to ADD 2022 Challenge task 1: Low-quality fake audio detection through frequency feature masking

    Authors: Il-Youp Kwak, Sunmook Choi, Jonghoon Yang, Yerin Lee, Seungsang Oh

    Abstract: This technical report describes Chung-Ang University and Korea University (CAU_KU) team's model participating in the Audio Deep Synthesis Detection (ADD) 2022 Challenge, track 1: Low-quality fake audio detection. For track 1, we propose a frequency feature masking (FFM) augmentation technique to deal with a low-quality audio environment. %detection that spectrogram-based models can be applied. We… ▽ More

    Submitted 9 February, 2022; originally announced February 2022.

  26. arXiv:2112.00931  [pdf, other

    cs.IT eess.SP

    Antenna Selection in Polarization Reconfigurable MIMO (PR-MIMO) Communication Systems

    Authors: Paul S. Oh, Sean S. Kwon, Andreas F. Molisch

    Abstract: Adaptation of a wireless system to the polarization state of the propagation channel can improve reliability and throughput. This paper in particular considers polarization reconfigurable multiple input multiple output (PR-MIMO) systems, where both transmitter and receiver can change the (linear) polarization orientation at each element of their antenna arrays. We first introduce joint polarizatio… ▽ More

    Submitted 2 April, 2024; v1 submitted 1 December, 2021; originally announced December 2021.

  27. arXiv:2106.04165  [pdf, other

    cs.LG cs.NE eess.SY math.DS

    Neural Hybrid Automata: Learning Dynamics with Multiple Modes and Stochastic Transitions

    Authors: Michael Poli, Stefano Massaroli, Luca Scimeca, Seong Joon Oh, Sanghyuk Chun, Atsushi Yamashita, Hajime Asama, **kyoo Park, Animesh Garg

    Abstract: Effective control and prediction of dynamical systems often require appropriate handling of continuous-time and discrete, event-triggered processes. Stochastic hybrid systems (SHSs), common across engineering domains, provide a formalism for dynamical systems subject to discrete, possibly stochastic, state jumps and multi-modal continuous-time flows. Despite the versatility and importance of SHSs… ▽ More

    Submitted 8 June, 2021; originally announced June 2021.

  28. arXiv:2104.10781  [pdf, other

    eess.IV cs.CV

    NTIRE 2021 Challenge on Quality Enhancement of Compressed Video: Methods and Results

    Authors: Ren Yang, Radu Timofte, **g Liu, Yi Xu, Xinjian Zhang, Minyi Zhao, Shuigeng Zhou, Kelvin C. K. Chan, Shangchen Zhou, Xiangyu Xu, Chen Change Loy, Xin Li, Fanglong Liu, He Zheng, Lielin Jiang, Qi Zhang, Dongliang He, Fu Li, Qingqing Dang, Yibin Huang, Matteo Maggioni, Zhongqian Fu, Shuai Xiao, Cheng li, Thomas Tanay , et al. (47 additional authors not shown)

    Abstract: This paper reviews the first NTIRE challenge on quality enhancement of compressed video, with a focus on the proposed methods and results. In this challenge, the new Large-scale Diverse Video (LDV) dataset is employed. The challenge has three tracks. Tracks 1 and 2 aim at enhancing the videos compressed by HEVC at a fixed QP, while Track 3 is designed for enhancing the videos compressed by x265 at… ▽ More

    Submitted 31 August, 2022; v1 submitted 21 April, 2021; originally announced April 2021.

    Comments: Corrected the MOS values in Table 2, and corrected some minor typos

  29. arXiv:2102.02463  [pdf

    eess.IV cs.AI physics.med-ph

    DIFFnet: Diffusion parameter map** network generalized for input diffusion gradient schemes and bvalues

    Authors: Juhung Park, Woo** Jung, Eun-Jung Choi, Se-Hong Oh, Dongmyung Shin, Hongjun An, Jongho Lee

    Abstract: In MRI, deep neural networks have been proposed to reconstruct diffusion model parameters. However, the inputs of the networks were designed for a specific diffusion gradient scheme (i.e., diffusion gradient directions and numbers) and a specific b-value that are the same as the training data. In this study, a new deep neural network, referred to as DIFFnet, is developed to function as a generaliz… ▽ More

    Submitted 4 February, 2021; originally announced February 2021.

  30. Channel Estimation via Successive Denoising in MIMO OFDM Systems: A Reinforcement Learning Approach

    Authors: Myeung Suk Oh, Seyyedali Hosseinalipour, Taejoon Kim, Christopher G. Brinton, David J. Love

    Abstract: In general, reliable communication via multiple-input multiple-output (MIMO) orthogonal frequency division multiplexing (OFDM) requires accurate channel estimation at the receiver. The existing literature largely focuses on denoising methods for channel estimation that depend on either (i) channel analysis in the time-domain with prior channel knowledge or (ii) supervised learning techniques which… ▽ More

    Submitted 27 March, 2024; v1 submitted 25 January, 2021; originally announced January 2021.

    Comments: This paper has been published in the proceedings of 2021 IEEE International Conference on Communications (ICC)

  31. arXiv:2012.02859  [pdf, other

    eess.SY

    Idle speed control with low-complexity offset-free explicit model predictive control in presence of system delay

    Authors: Sang Hwan Son, Se-Kyu Oh, Byung Jun Park, Min Jun Song, Jong Min Lee

    Abstract: The requirement for continual improvement of idle speed control (ISC) performance is increasing due to the stringent regulation on emission and fuel economy these days. In this regard, a low-complexity offset-free explicit model predictive control (EMPC) with constraint horizon is designed to regulate the idle speed under unmeasured disturbance in presence of system delay with rigorous formulation… ▽ More

    Submitted 13 December, 2020; v1 submitted 4 December, 2020; originally announced December 2020.

  32. arXiv:2011.08061  [pdf, other

    cs.CV eess.IV

    FRDet: Balanced and Lightweight Object Detector based on Fire-Residual Modules for Embedded Processor of Autonomous Driving

    Authors: Seontaek Oh, Ji-Hwan You, Young-Keun Kim

    Abstract: For deployment on an embedded processor for autonomous driving, the object detection network should satisfy all of the accuracy, real-time inference, and light model size requirements. Conventional deep CNN-based detectors aim for high accuracy, making their model size heavy for an embedded system with limited memory space. In contrast, lightweight object detectors are greatly compressed but at a… ▽ More

    Submitted 16 November, 2020; originally announced November 2020.

  33. arXiv:2011.07673  [pdf, other

    eess.SY cs.MA

    Spatiotemporal Characteristics of Ride-sourcing Operation in Urban Area

    Authors: Simon Oh, Daniel Kondor, Ravi Seshadri, Meng Zhou, Diem-Trinh Le, Moshe Ben-Akiva

    Abstract: The emergence of ride-sourcing platforms has brought an innovative alternative in transportation, radically changed travel behaviors, and suggested new directions for transportation planners and operators. This paper provides an exploratory analysis on the operations of a ride-sourcing service using large-scale data on service performance. Observations over multiple days in Singapore suggest repro… ▽ More

    Submitted 15 November, 2020; originally announced November 2020.

    Comments: 18 pages, 11 figures, 5 tables

  34. A simulation-based evaluation of a Cargo-Hitching service for E-commerce using mobility-on-demand vehicles

    Authors: Andre Alho, Takanori Sakai, Simon Oh, Cheng Cheng, Ravi Seshadri, Wen Han Chong, Yusuke Hara, Julia Caravias, Lynette Cheah, Moshe Ben-Akiva

    Abstract: Time-sensitive parcel deliveries, shipments requested for delivery in a day or less, are an increasingly important research subject. It is challenging to deal with these deliveries from a carrier perspective since it entails additional planning constraints, preventing an efficient consolidation of deliveries which is possible when demand is well known in advance. Furthermore, such time-sensitive d… ▽ More

    Submitted 22 October, 2020; originally announced October 2020.

    Comments: 19 pages, 4 tables, 7 figures. Submitted to Transportation (Springer)

    Journal ref: Future Transp. 2021, 1, 639-656

  35. arXiv:2010.10282  [pdf, other

    cs.IT eess.SY

    User-Number Threshold-based Base Station On/Off Control for Maximizing Coverage Probability

    Authors: Jung-Hoon Noh, Seong-Jun Oh

    Abstract: In this study, we investigate the operation of user-number threshold-based base station (BS) on/off control, in which the BS turns off when the number of active users is less than a specific threshold value. This paper presents a space-based analysis of the BS on/off control system to which a stochastic geometric approach is applied. In particular, we derive the approximated closed-form expression… ▽ More

    Submitted 20 October, 2020; originally announced October 2020.

  36. arXiv:2008.10584  [pdf, other

    eess.IV cs.CV

    Accurate Alignment Inspection System for Low-resolution Automotive and Mobility LiDAR

    Authors: Seontake Oh, Ji-Hwan You, Azim Eskandarian, Young-Keun Kim

    Abstract: A misalignment of LiDAR as low as a few degrees could cause a significant error in obstacle detection and map** that could cause safety and quality issues. In this paper, an accurate inspection system is proposed for estimating a LiDAR alignment error after sensor attachment on a mobility system such as a vehicle or robot. The proposed method uses only a single target board at the fixed position… ▽ More

    Submitted 24 August, 2020; originally announced August 2020.

  37. arXiv:2008.10542  [pdf, other

    eess.IV cs.CV

    Automatic LiDAR Extrinsic Calibration System using Photodetector and Planar Board for Large-scale Applications

    Authors: Ji-Hwan You, Seon Taek Oh, Jae-Eun Park, Azim Eskandarian, Young-Keun Kim

    Abstract: This paper presents a novel automatic calibration system to estimate the extrinsic parameters of LiDAR mounted on a mobile platform for sensor misalignment inspection in the large-scale production of highly automated vehicles. To obtain subdegree and subcentimeter accuracy levels of extrinsic calibration, this study proposed a new concept of a target board with embedded photodetector arrays, named… ▽ More

    Submitted 24 August, 2020; originally announced August 2020.

    Comments: prepost for IEEE journal

  38. DeepResp: Deep learning solution for respiration-induced B0 fluctuation artifacts in multi-slice GRE

    Authors: Hongjun An, Hyeong-Geol Shin, Sooyoen Ji, Woo** Jung, Sehong Oh, Dongmyung Shin, Juhyung Park, Jongho Lee

    Abstract: Respiration-induced B$_0$ fluctuation corrupts MRI images by inducing phase errors in k-space. A few approaches such as navigator have been proposed to correct for the artifacts at the expense of sequence modification. In this study, a new deep learning method, which is referred to as DeepResp, is proposed for reducing the respiration-artifacts in multi-slice gradient echo (GRE) images. DeepResp i… ▽ More

    Submitted 19 July, 2020; originally announced July 2020.

    Comments: 19 pages

  39. arXiv:2003.09171  [pdf, other

    cs.CV cs.LG eess.IV

    DMV: Visual Object Tracking via Part-level Dense Memory and Voting-based Retrieval

    Authors: Gunhee Nam, Seoung Wug Oh, Joon-Young Lee, Seon Joo Kim

    Abstract: We propose a novel memory-based tracker via part-level dense memory and voting-based retrieval, called DMV. Since deep learning techniques have been introduced to the tracking field, Siamese trackers have attracted many researchers due to the balance between speed and accuracy. However, most of them are based on a single template matching, which limits the performance as it restricts the accessibl… ▽ More

    Submitted 20 March, 2020; originally announced March 2020.

    Comments: 19 pages, 9 figures

  40. arXiv:2003.09124  [pdf, other

    eess.IV cs.CV

    Learning the Loss Functions in a Discriminative Space for Video Restoration

    Authors: Younghyun Jo, Jaeyeon Kang, Seoung Wug Oh, Seonghyeon Nam, Peter Vajda, Seon Joo Kim

    Abstract: With more advanced deep network architectures and learning schemes such as GANs, the performance of video restoration algorithms has greatly improved recently. Meanwhile, the loss functions for optimizing deep neural networks remain relatively unchanged. To this end, we propose a new framework for building effective loss functions by learning a discriminative space specific to a video restoration… ▽ More

    Submitted 20 March, 2020; originally announced March 2020.

    Comments: 24 pages

  41. arXiv:1912.09015  [pdf

    cs.LG cs.AI eess.IV eess.SP stat.ML

    Deep Reinforcement Learning Designed Shinnar-Le Roux RF Pulse using Root-Flip**: DeepRF_SLR

    Authors: Dongmyung Shin, Sooyeon Ji, Doohee Lee, Jieun Lee, Se-Hong Oh, Jongho Lee

    Abstract: A novel approach of applying deep reinforcement learning to an RF pulse design is introduced. This method, which is referred to as DeepRF_SLR, is designed to minimize the peak amplitude or, equivalently, minimize the pulse duration of a multiband refocusing pulse generated by the Shinar Le-Roux (SLR) algorithm. In the method, the root pattern of SLR polynomial, which determines the RF pulse shape,… ▽ More

    Submitted 1 September, 2020; v1 submitted 18 December, 2019; originally announced December 2019.

    Comments: Accepted at IEEE transactions on Medical Imaging (https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=9174664)

  42. arXiv:1911.04069  [pdf, other

    cs.LG cs.RO eess.AS stat.ML

    Generative Autoregressive Networks for 3D Dancing Move Synthesis from Music

    Authors: Hyemin Ahn, Jaehun Kim, Kihyun Kim, Songhwai Oh

    Abstract: This paper proposes a framework which is able to generate a sequence of three-dimensional human dance poses for a given music. The proposed framework consists of three components: a music feature encoder, a pose generator, and a music genre classifier. We focus on integrating these components for generating a realistic 3D human dancing move from music, which can be applied to artificial agents and… ▽ More

    Submitted 10 November, 2019; originally announced November 2019.

    Comments: 8 pages, 10 figures

  43. arXiv:1911.03038  [pdf, other

    cs.IT cs.LG eess.SP

    Turbo Autoencoder: Deep learning based channel codes for point-to-point communication channels

    Authors: Yihan Jiang, Hyeji Kim, Himanshu Asnani, Sreeram Kannan, Sewoong Oh, Pramod Viswanath

    Abstract: Designing codes that combat the noise in a communication medium has remained a significant area of research in information theory as well as wireless communications. Asymptotically optimal channel codes have been developed by mathematicians for communicating under canonical models after over 60 years of research. On the other hand, in many non-canonical channel settings, optimal codes do not exist… ▽ More

    Submitted 7 November, 2019; originally announced November 2019.

  44. arXiv:1911.01635  [pdf, other

    eess.AS cs.SD

    Emotional speech synthesis with rich and granularized control

    Authors: Se-Yun Um, Sangshin Oh, Kyungguen Byun, Inseon Jang, Chunghyun Ahn, Hong-Goo Kang

    Abstract: This paper proposes an effective emotion control method for an end-to-end text-to-speech (TTS) system. To flexibly control the distinct characteristic of a target emotion category, it is essential to determine embedding vectors representing the TTS input. We introduce an inter-to-intra emotional distance ratio algorithm to the embedding vectors that can minimize the distance to the target emotion… ▽ More

    Submitted 5 November, 2019; v1 submitted 5 November, 2019; originally announced November 2019.

    Comments: Submitted to ICASSP 2020

  45. arXiv:1908.05895  [pdf, other

    cs.IT cs.LG cs.NI eess.SP

    Distilling On-Device Intelligence at the Network Edge

    Authors: Jihong Park, Shiqiang Wang, Anis Elgabli, Seungeun Oh, Eunjeong Jeong, Han Cha, Hyesung Kim, Seong-Lyun Kim, Mehdi Bennis

    Abstract: Devices at the edge of wireless networks are the last mile data sources for machine learning (ML). As opposed to traditional ready-made public datasets, these user-generated private datasets reflect the freshest local environments in real time. They are thus indispensable for enabling mission-critical intelligent systems, ranging from fog radio access networks (RANs) to driverless cars and e-Healt… ▽ More

    Submitted 16 August, 2019; originally announced August 2019.

    Comments: 7 pages, 6 figures; This work has been submitted to the IEEE for possible publication

  46. arXiv:1907.09707  [pdf, other

    cs.CV eess.IV

    RRNet: Repetition-Reduction Network for Energy Efficient Decoder of Depth Estimation

    Authors: Sangyun Oh, Hye-** S. Kim, Jongeun Lee, Junmo Kim

    Abstract: We introduce Repetition-Reduction network (RRNet) for resource-constrained depth estimation, offering significantly improved efficiency in terms of computation, memory and energy consumption. The proposed method is based on repetition-reduction (RR) blocks. The RR blocks consist of the set of repeated convolutions and the residual connection layer that take place of the pointwise reduction layer w… ▽ More

    Submitted 31 July, 2019; v1 submitted 23 July, 2019; originally announced July 2019.

    Comments: 9 pages, 5 figures

  47. arXiv:1907.06426  [pdf, other

    cs.LG cs.NI eess.SP stat.ML

    Multi-hop Federated Private Data Augmentation with Sample Compression

    Authors: Eunjeong Jeong, Seungeun Oh, Jihong Park, Hyesung Kim, Mehdi Bennis, Seong-Lyun Kim

    Abstract: On-device machine learning (ML) has brought about the accessibility to a tremendous amount of data from the users while kee** their local data private instead of storing it in a central entity. However, for privacy guarantee, it is inevitable at each device to compensate for the quality of data or learning performance, especially when it has a non-IID training dataset. In this paper, we propose… ▽ More

    Submitted 15 July, 2019; originally announced July 2019.

    Comments: to be presented at the 28th International Joint Conference on Artificial Intelligence (IJCAI-19), 1st International Workshop on Federated Machine Learning for User Privacy and Data Confidentiality (FML'19), Macao, China

  48. arXiv:1903.02295  [pdf, other

    eess.SP cs.IT

    DeepTurbo: Deep Turbo Decoder

    Authors: Yihan Jiang, Hyeji Kim, Himanshu Asnani, Sreeram Kannan, Sewoong Oh, Pramod Viswanath

    Abstract: Present-day communication systems routinely use codes that approach the channel capacity when coupled with a computationally efficient decoder. However, the decoder is typically designed for the Gaussian noise channel and is known to be sub-optimal for non-Gaussian noise distribution. Deep learning methods offer a new approach for designing decoders that can be trained and tailored for arbitrary c… ▽ More

    Submitted 24 April, 2019; v1 submitted 6 March, 2019; originally announced March 2019.

  49. arXiv:1811.12707  [pdf, other

    eess.SP cs.AI

    LEARN Codes: Inventing Low-latency Codes via Recurrent Neural Networks

    Authors: Yihan Jiang, Hyeji Kim, Himanshu Asnani, Sreeram Kannan, Sewoong Oh, Pramod Viswanath

    Abstract: Designing channel codes under low-latency constraints is one of the most demanding requirements in 5G standards. However, a sharp characterization of the performance of traditional codes is available only in the large block-length limit. Guided by such asymptotic analysis, code designs require large block lengths as well as latency to achieve the desired error rate. Tail-biting convolutional codes… ▽ More

    Submitted 24 July, 2020; v1 submitted 30 November, 2018; originally announced November 2018.

  50. arXiv:1811.02182  [pdf, ps, other

    cs.CL cs.LG cs.SD eess.AS

    Unpaired Speech Enhancement by Acoustic and Adversarial Supervision for Speech Recognition

    Authors: Geonmin Kim, Hwaran Lee, Bo-Kyeong Kim, Sang-Hoon Oh, Soo-Young Lee

    Abstract: Many speech enhancement methods try to learn the relationship between noisy and clean speech, obtained using an acoustic room simulator. We point out several limitations of enhancement methods relying on clean speech targets; the goal of this work is proposing an alternative learning algorithm, called acoustic and adversarial supervision (AAS). AAS makes the enhanced output both maximizing the lik… ▽ More

    Submitted 6 November, 2018; originally announced November 2018.

    Comments: will be published in IEEE Signal Processing Letter