Skip to main content

Showing 1–50 of 75 results for author: Lee, W

Searching in archive eess. Search in all archives.
.
  1. arXiv:2406.15723  [pdf, other

    cs.CL cs.AI cs.SD eess.AS

    Acoustic Feature Mixup for Balanced Multi-aspect Pronunciation Assessment

    Authors: Hee** Do, Wonjun Lee, Gary Geunbae Lee

    Abstract: In automated pronunciation assessment, recent emphasis progressively lies on evaluating multiple aspects to provide enriched feedback. However, acquiring multi-aspect-score labeled data for non-native language learners' speech poses challenges; moreover, it often leads to score-imbalanced distributions. In this paper, we propose two Acoustic Feature Mixup strategies, linearly and non-linearly inte… ▽ More

    Submitted 21 June, 2024; originally announced June 2024.

    Comments: Interspeech 2024

  2. arXiv:2404.15533  [pdf, other

    eess.SY

    Designing, simulating, and performing the 100-AV field test for the CIRCLES consortium: Methodology and Implementation of the Largest mobile traffic control experiment to date

    Authors: Mostafa Ameli, Sean Mcquade, Jonathan W. Lee, Matthew Bunting, Matthew Nice, Han Wang, William Barbour, Ryan Weightman, Chris Denaro, Ryan Delorenzo, Sharon Hornstein, Jon F. Davis, Dan Timsit, Riley Wagner, Rita Xu, Malaika Mahmood, Mikail Mahmood, Maria Laura Delle Monache, Benjamin Seibold, Daniel B. Work, Jonathan Sprinkle, Benedetto Piccoli, Alexandre M. Bayen

    Abstract: Previous controlled experiments on single-lane ring roads have shown that a single partially autonomous vehicle (AV) can effectively mitigate traffic waves. This naturally prompts the question of how these findings can be generalized to field operational, high-density traffic conditions. To address this question, the Congestion Impacts Reduction via CAV-in-the-loop Lagrangian Energy Smoothing (CIR… ▽ More

    Submitted 23 April, 2024; originally announced April 2024.

  3. arXiv:2404.02135  [pdf

    cs.CV eess.IV

    Enhancing Ship Classification in Optical Satellite Imagery: Integrating Convolutional Block Attention Module with ResNet for Improved Performance

    Authors: Ryan Donghan Kwon, Gangjoo Robin Nam, Jisoo Tak, Junseob Shin, Hyerin Cha, Yeom Hyeok, Seung Won Lee

    Abstract: This study presents an advanced Convolutional Neural Network (CNN) architecture for ship classification from optical satellite imagery, significantly enhancing performance through the integration of the Convolutional Block Attention Module (CBAM) and additional architectural innovations. Building upon the foundational ResNet50 model, we first incorporated a standard CBAM to direct the model's focu… ▽ More

    Submitted 8 April, 2024; v1 submitted 2 April, 2024; originally announced April 2024.

  4. arXiv:2402.17050  [pdf, other

    eess.SY cs.RO

    Reinforcement Learning Based Oscillation Dampening: Scaling up Single-Agent RL algorithms to a 100 AV highway field operational test

    Authors: Kathy Jang, Nathan Lichtlé, Eugene Vinitsky, Adit Shah, Matthew Bunting, Matthew Nice, Benedetto Piccoli, Benjamin Seibold, Daniel B. Work, Maria Laura Delle Monache, Jonathan Sprinkle, Jonathan W. Lee, Alexandre M. Bayen

    Abstract: In this article, we explore the technical details of the reinforcement learning (RL) algorithms that were deployed in the largest field test of automated vehicles designed to smooth traffic flow in history as of 2023, uncovering the challenges and breakthroughs that come with develo** RL controllers for automated vehicles. We delve into the fundamental concepts behind RL algorithms and their app… ▽ More

    Submitted 14 May, 2024; v1 submitted 26 February, 2024; originally announced February 2024.

  5. arXiv:2402.17043  [pdf, other

    eess.SY

    Traffic Control via Connected and Automated Vehicles: An Open-Road Field Experiment with 100 CAVs

    Authors: Jonathan W. Lee, Han Wang, Kathy Jang, Amaury Hayat, Matthew Bunting, Arwa Alanqary, William Barbour, Zhe Fu, Xiaoqian Gong, George Gunter, Sharon Hornstein, Abdul Rahman Kreidieh, Nathan Lichtlé, Matthew W. Nice, William A. Richardson, Adit Shah, Eugene Vinitsky, Fangyu Wu, Shengquan Xiang, Sulaiman Almatrudi, Fahd Althukair, Rahul Bhadani, Joy Carpio, Raphael Chekroun, Eric Cheng , et al. (39 additional authors not shown)

    Abstract: The CIRCLES project aims to reduce instabilities in traffic flow, which are naturally occurring phenomena due to human driving behavior. These "phantom jams" or "stop-and-go waves,"are a significant source of wasted energy. Toward this goal, the CIRCLES project designed a control system referred to as the MegaController by the CIRCLES team, that could be deployed in real traffic. Our field experim… ▽ More

    Submitted 26 February, 2024; originally announced February 2024.

  6. arXiv:2401.11567  [pdf, other

    math.OC eess.SY

    Deterministic Multi-stage Constellation Reconfiguration Using Integer Linear Programming and Sequential Decision-Making Methods

    Authors: Hang Woon Lee, David O. Williams Rogers, Brycen D. Pearl, Hao Chen, Koki Ho

    Abstract: In this paper, we address the problem of reconfiguring Earth observation satellite constellation systems through multiple stages. The Multi-stage Constellation Reconfiguration Problem (MCRP) aims to maximize the total observation rewards obtained by covering a set of targets of interest through the active manipulation of the orbits and relative phasing of constituent satellites. In this paper, we… ▽ More

    Submitted 30 April, 2024; v1 submitted 21 January, 2024; originally announced January 2024.

    Comments: 39 pages, 13 figures, submitted to the Journal of Spacecraft and Rockets

  7. arXiv:2401.09666  [pdf, other

    eess.SY cs.AI cs.MA

    Traffic Smoothing Controllers for Autonomous Vehicles Using Deep Reinforcement Learning and Real-World Trajectory Data

    Authors: Nathan Lichtlé, Kathy Jang, Adit Shah, Eugene Vinitsky, Jonathan W. Lee, Alexandre M. Bayen

    Abstract: Designing traffic-smoothing cruise controllers that can be deployed onto autonomous vehicles is a key step towards improving traffic flow, reducing congestion, and enhancing fuel efficiency in mixed autonomy traffic. We bypass the common issue of having to carefully fine-tune a large traffic microsimulator by leveraging real-world trajectory data from the I-24 highway in Tennessee, replayed in a o… ▽ More

    Submitted 17 January, 2024; originally announced January 2024.

    Comments: Accepted to be published as part of the 26th IEEE International Conference on Intelligent Transportation Systems (ITSC) 2023, Bilbao, Spain, September 24-28, 2023

  8. arXiv:2401.03850  [pdf, other

    eess.AS cs.SD

    Inverse Nonlinearity Compensation of Hyperelastic Deformation in Dielectric Elastomer for Acoustic Actuation

    Authors: ** Woo Lee, Gwang Seok An, Jeong-Yun Sun, Kyogu Lee

    Abstract: This paper delves into the analysis of nonlinear deformation induced by dielectric actuation in pre-stressed ideal dielectric elastomers. It formulates a nonlinear ordinary differential equation governing this deformation based on the hyperelastic model under dielectric stress. Through numerical integration and neural network approximations, the relationship between voltage and stretch is establis… ▽ More

    Submitted 8 January, 2024; originally announced January 2024.

  9. arXiv:2312.17344  [pdf, other

    math.DS eess.SY q-bio.MN

    Recursive Self-Composite Approach Towards Structural Understanding of Boolean Network

    Authors: Jongrae Kim, Woojeong Lee, Kwang-Hyun Cho

    Abstract: Boolean networks have been widely used in many areas of science and engineering to represent various dynamical behaviour. In systems biology, they became useful tools to study the dynamical characteristics of large-scale biomolecular networks and there have been a number of studies to develop efficient ways of finding steady states or cycles of Boolean network models. On the other hand, there has… ▽ More

    Submitted 28 December, 2023; originally announced December 2023.

    Comments: 9 pages, 3 figures

  10. arXiv:2312.09576  [pdf, other

    eess.IV cs.CV

    SegRap2023: A Benchmark of Organs-at-Risk and Gross Tumor Volume Segmentation for Radiotherapy Planning of Nasopharyngeal Carcinoma

    Authors: Xiangde Luo, Jia Fu, Yunxin Zhong, Shuolin Liu, Bing Han, Mehdi Astaraki, Simone Bendazzoli, Iuliana Toma-Dasu, Yiwen Ye, Ziyang Chen, Yong Xia, Yanzhou Su, ** Ye, Junjun He, Zhaohu Xing, Hongqiu Wang, Lei Zhu, Kaixiang Yang, Xin Fang, Zhiwei Wang, Chan Woong Lee, Sang Joon Park, Jaehee Chun, Constantin Ulrich, Klaus H. Maier-Hein , et al. (17 additional authors not shown)

    Abstract: Radiation therapy is a primary and effective NasoPharyngeal Carcinoma (NPC) treatment strategy. The precise delineation of Gross Tumor Volumes (GTVs) and Organs-At-Risk (OARs) is crucial in radiation treatment, directly impacting patient prognosis. Previously, the delineation of GTVs and OARs was performed by experienced radiation oncologists. Recently, deep learning has achieved promising results… ▽ More

    Submitted 15 December, 2023; originally announced December 2023.

    Comments: A challenge report of SegRap2023 (organized in conjunction with MICCAI2023)

  11. arXiv:2312.03312  [pdf, other

    cs.CL cs.SD eess.AS

    Optimizing Two-Pass Cross-Lingual Transfer Learning: Phoneme Recognition and Phoneme to Grapheme Translation

    Authors: Wonjun Lee, Gary Geunbae Lee, Yunsu Kim

    Abstract: This research optimizes two-pass cross-lingual transfer learning in low-resource languages by enhancing phoneme recognition and phoneme-to-grapheme translation models. Our approach optimizes these two stages to improve speech recognition across languages. We optimize phoneme vocabulary coverage by merging phonemes based on shared articulatory characteristics, thus improving recognition accuracy. A… ▽ More

    Submitted 6 December, 2023; originally announced December 2023.

    Comments: 8 pages, ASRU 2023 Accepted

  12. arXiv:2312.01842  [pdf, other

    cs.SD cs.AI eess.AS

    Exploring the Viability of Synthetic Audio Data for Audio-Based Dialogue State Tracking

    Authors: Jihyun Lee, Ye** Jeon, Wonjun Lee, Yunsu Kim, Gary Geunbae Lee

    Abstract: Dialogue state tracking plays a crucial role in extracting information in task-oriented dialogue systems. However, preceding research are limited to textual modalities, primarily due to the shortage of authentic human audio datasets. We address this by investigating synthetic audio data for audio-based DST. To this end, we develop cascading and end-to-end models, train them with our synthetic audi… ▽ More

    Submitted 4 December, 2023; originally announced December 2023.

    Comments: Accepted in ASRU 2023

  13. arXiv:2311.18539  [pdf, other

    cs.CR eess.SY

    Bridging Both Worlds in Semantics and Time: Domain Knowledge Based Analysis and Correlation of Industrial Process Attacks

    Authors: Moses Ike, Kandy Phan, Anwesh Badapanda, Matthew Landen, Keaton Sadoski, Wanda Guo, Asfahan Shah, Saman Zonouz, Wenke Lee

    Abstract: Modern industrial control systems (ICS) attacks infect supervisory control and data acquisition (SCADA) hosts to stealthily alter industrial processes, causing damage. To detect attacks with low false alarms, recent work detects attacks in both SCADA and process data. Unfortunately, this led to the same problem - disjointed (false) alerts, due to the semantic and time gap in SCADA and process beha… ▽ More

    Submitted 3 December, 2023; v1 submitted 30 November, 2023; originally announced November 2023.

  14. arXiv:2311.18505  [pdf, other

    cs.SD eess.AS eess.SP

    String Sound Synthesizer on GPU-accelerated Finite Difference Scheme

    Authors: ** Woo Lee, Min Jun Choi, Kyogu Lee

    Abstract: This paper introduces a nonlinear string sound synthesizer, based on a finite difference simulation of the dynamic behavior of strings under various excitations. The presented synthesizer features a versatile string simulation engine capable of stochastic parameterization, encompassing fundamental frequency modulation, stiffness, tension, frequency-dependent loss, and excitation control. This open… ▽ More

    Submitted 8 January, 2024; v1 submitted 30 November, 2023; originally announced November 2023.

    Comments: To be appeared in ICASSP 2024

  15. arXiv:2311.10430  [pdf, other

    eess.IV cs.CV cs.LG

    Deep Residual CNN for Multi-Class Chest Infection Diagnosis

    Authors: Ryan Donghan Kwon, Dohyun Lim, Yoonha Lee, Seung Won Lee

    Abstract: The advent of deep learning has significantly propelled the capabilities of automated medical image diagnosis, providing valuable tools and resources in the realm of healthcare and medical diagnostics. This research delves into the development and evaluation of a Deep Residual Convolutional Neural Network (CNN) for the multi-class diagnosis of chest infections, utilizing chest X-ray images. The im… ▽ More

    Submitted 17 November, 2023; originally announced November 2023.

  16. arXiv:2310.18151  [pdf, other

    eess.SY math.OC

    Traffic smoothing using explicit local controllers

    Authors: Amaury Hayat, Arwa Alanqary, Rahul Bhadani, Christopher Denaro, Ryan J. Weightman, Shengquan Xiang, Jonathan W. Lee, Matthew Bunting, Anish Gollakota, Matthew W. Nice, Derek Gloudemans, Gergely Zachar, Jon F. Davis, Maria Laura Delle Monache, Benjamin Seibold, Alexandre M. Bayen, Jonathan Sprinkle, Daniel B. Work, Benedetto Piccoli

    Abstract: The dissipation of stop-and-go waves attracted recent attention as a traffic management problem, which can be efficiently addressed by automated driving. As part of the 100 automated vehicles experiment named MegaVanderTest, feedback controls were used to induce strong dissipation via velocity smoothing. More precisely, a single vehicle driving differently in one of the four lanes of I-24 in the N… ▽ More

    Submitted 27 October, 2023; originally announced October 2023.

    Comments: 21 pages, 1 Table , 9 figures

    MSC Class: 93D15; 93D21; 93-05; 34H05; ACM Class: H.2.2

  17. arXiv:2310.06297  [pdf, other

    eess.SY

    Reducing Detailed Vehicle Energy Dynamics to Physics-Like Models

    Authors: Nour Khoudari, Sulaiman Almatrudi, Rabie Ramadan, Joy Carpio, Mengsha Yao, Kenneth Butts, Alexandre M. Bayen, Jonathan W. Lee, Benjamin Seibold

    Abstract: The energy demand of vehicles, particularly in unsteady drive cycles, is affected by complex dynamics internal to the engine and other powertrain components. Yet, in many applications, particularly macroscopic traffic flow modeling and optimization, structurally simple approximations to the complex vehicle dynamics are needed that nevertheless reproduce the correct effective energy behavior. This… ▽ More

    Submitted 10 October, 2023; originally announced October 2023.

    Comments: 40 pages, 9 figures

  18. arXiv:2309.04755  [pdf, other

    cs.CE cs.AI eess.SP physics.flu-dyn

    Towards Real-time Training of Physics-informed Neural Networks: Applications in Ultrafast Ultrasound Blood Flow Imaging

    Authors: Haotian Guan, **** Dong, Wei-Ning Lee

    Abstract: Physics-informed Neural Network (PINN) is one of the most preeminent solvers of Navier-Stokes equations, which are widely used as the governing equation of blood flow. However, current approaches, relying on full Navier-Stokes equations, are impractical for ultrafast Doppler ultrasound, the state-of-the-art technique for depiction of complex blood flow dynamics \emph{in vivo} through acquired thou… ▽ More

    Submitted 9 September, 2023; originally announced September 2023.

  19. Constrained CycleGAN for Effective Generation of Ultrasound Sector Images of Improved Spatial Resolution

    Authors: Xiaofei Sun, He Li, Wei-Ning Lee

    Abstract: Objective. A phased or a curvilinear array produces ultrasound (US) images with a sector field of view (FOV), which inherently exhibits spatially-varying image resolution with inferior quality in the far zone and towards the two sides azimuthally. Sector US images with improved spatial resolutions are favorable for accurate quantitative analysis of large and dynamic organs, such as the heart. Ther… ▽ More

    Submitted 2 September, 2023; originally announced September 2023.

    Journal ref: Physics in Medicine & Biology 2023

  20. arXiv:2308.15791  [pdf, other

    cs.CV eess.IV

    Neural Video Compression with Temporal Layer-Adaptive Hierarchical B-frame Coding

    Authors: Yeongwoong Kim, Suyong Bahk, Seungeon Kim, Won Hee Lee, Dokwan Oh, Hui Yong Kim

    Abstract: Neural video compression (NVC) is a rapidly evolving video coding research area, with some models achieving superior coding efficiency compared to the latest video coding standard Versatile Video Coding (VVC). In conventional video coding standards, the hierarchical B-frame coding, which utilizes a bidirectional prediction structure for higher compression, had been well-studied and exploited. In N… ▽ More

    Submitted 5 September, 2023; v1 submitted 30 August, 2023; originally announced August 2023.

  21. arXiv:2305.00676  [pdf, other

    cs.RO cs.AI eess.SY

    Learning Terrain-Aware Kinodynamic Model for Autonomous Off-Road Rally Driving With Model Predictive Path Integral Control

    Authors: Ho** Lee, Taekyung Kim, Jungwi Mun, Wonsuk Lee

    Abstract: High-speed autonomous driving in off-road environments has immense potential for various applications, but it also presents challenges due to the complexity of vehicle-terrain interactions. In such environments, it is crucial for the vehicle to predict its motion and adjust its controls proactively in response to environmental changes, such as variations in terrain elevation. To this end, we propo… ▽ More

    Submitted 22 September, 2023; v1 submitted 1 May, 2023; originally announced May 2023.

    Comments: Accepted to IEEE Robotics and Automation Letters (and ICRA 2024). Our video can be found at https://youtu.be/VXf_prNQnJo Project page : https://sites.google.com/view/terrainawarekinodyn

    Journal ref: IEEE Robotics and Automation Letters, 2023

  22. arXiv:2303.03093  [pdf, other

    eess.SP

    A Miniaturised Camera-based Multi-Modal Tactile Sensor

    Authors: Kaspar Althoefer, Yonggen Ling, Wanlin Li, Xinyuan Qian, Wang Wei Lee, Peng Qi

    Abstract: In conjunction with huge recent progress in camera and computer vision technology, camera-based sensors have increasingly shown considerable promise in relation to tactile sensing. In comparison to competing technologies (be they resistive, capacitive or magnetic based), they offer super-high-resolution, while suffering from fewer wiring problems. The human tactile system is composed of various ty… ▽ More

    Submitted 6 March, 2023; originally announced March 2023.

  23. arXiv:2303.00882  [pdf, other

    eess.IV cs.CV cs.LG q-bio.QM

    X-Ray2EM: Uncertainty-Aware Cross-Modality Image Reconstruction from X-Ray to Electron Microscopy in Connectomics

    Authors: Yicong Li, Yaron Meirovitch, Aaron T. Kuan, Jasper S. Phelps, Alexandra Pacureanu, Wei-Chung Allen Lee, Nir Shavit, Lu Mi

    Abstract: Comprehensive, synapse-resolution imaging of the brain will be crucial for understanding neuronal computations and function. In connectomics, this has been the sole purview of volume electron microscopy (EM), which entails an excruciatingly difficult process because it requires cutting tissue into many thin, fragile slices that then need to be imaged, aligned, and reconstructed. Unlike EM, hard X-… ▽ More

    Submitted 1 March, 2023; originally announced March 2023.

    Comments: Accepted by ISBI 2023 conference. Supplementary material is available in this arXiv version

  24. arXiv:2302.07453  [pdf, other

    eess.SY

    Cooperative Driving for Speed Harmonization in Mixed-Traffic Environments

    Authors: Zhe Fu, Abdul Rahman Kreidieh, Han Wang, Jonathan W. Lee, Maria Laura Delle Monache, Alexandre M. Bayen

    Abstract: Autonomous driving systems present promising methods for congestion mitigation in mixed autonomy traffic control settings. In particular, when coupled with even modest traffic state estimates, such systems can plan and coordinate the behaviors of automated vehicles (AVs) in response to observed downstream events, thereby inhibiting the continued propagation of congestion. In this paper, we present… ▽ More

    Submitted 3 June, 2023; v1 submitted 14 February, 2023; originally announced February 2023.

    Comments: Accepted by IEEE IV 2023

  25. arXiv:2212.05662  [pdf, other

    cs.LG eess.SY

    Optimal Planning of Hybrid Energy Storage Systems using Curtailed Renewable Energy through Deep Reinforcement Learning

    Authors: Dongju Kang, Doeun Kang, Sumin Hwangbo, Haider Niaz, Won Bo Lee, J. Jay Liu, Jonggeol Na

    Abstract: Energy management systems (EMS) are becoming increasingly important in order to utilize the continuously growing curtailed renewable energy. Promising energy storage systems (ESS), such as batteries and green hydrogen should be employed to maximize the efficiency of energy stakeholders. However, optimal decision-making, i.e., planning the leveraging between different strategies, is confronted with… ▽ More

    Submitted 11 December, 2022; originally announced December 2022.

    Comments: 30 pages, 8 figures

  26. arXiv:2211.00878  [pdf, other

    eess.AS cs.AI cs.MM cs.SD eess.SP

    Neural Fourier Shift for Binaural Speech Rendering

    Authors: ** Woo Lee, Kyogu Lee

    Abstract: We present a neural network for rendering binaural speech from given monaural audio, position, and orientation of the source. Most of the previous works have focused on synthesizing binaural speeches by conditioning the positions and orientations in the feature space of convolutional neural networks. These synthesis approaches are powerful in estimating the target binaural speeches even for in-the… ▽ More

    Submitted 1 May, 2023; v1 submitted 2 November, 2022; originally announced November 2022.

    Comments: Accepted by ICASSP 2023

  27. arXiv:2206.06978  [pdf, ps, other

    cs.IT eess.SY

    Low-Latency MAC Design for Pairwise Random Networks

    Authors: Irshad A. Meer, Woong-Hee Lee, Mustafa Ozger, Cicek Cavdar, Ki Won Sung

    Abstract: Feasibility of using unlicensed spectrum for ultra reliable low latency communications (URLLC) is still a question for beyond 5G wireless networks. Low latency access to the channel and efficiently sharing spectrum among the multiple users are the main requirements for exploiting unlicensed spectrum for URLLC. Listen before talk and back-off procedures implemented to avoid the collisions in channe… ▽ More

    Submitted 22 May, 2022; originally announced June 2022.

    Comments: Accepted in IEEE VTC Spring 2022

  28. arXiv:2206.02910  [pdf, other

    math.OC eess.SY

    Regional Constellation Reconfiguration Problem: Integer Linear Programming Formulation and Lagrangian Heuristic Method

    Authors: Hang Woon Lee, Koki Ho

    Abstract: A group of satellites, with either homogeneous or heterogeneous orbital characteristics and/or hardware specifications, can undertake a reconfiguration process due to variations in operations pertaining to Earth observation missions. This paper investigates the problem of optimizing a satellite constellation reconfiguration process against two competing mission objectives: (i) the maximization of… ▽ More

    Submitted 2 July, 2023; v1 submitted 6 June, 2022; originally announced June 2022.

    Comments: 41 pages, 10 figures, accepted for publication at Journal of Spacecraft and Rockets; pre-print

  29. Representation Selective Self-distillation and wav2vec 2.0 Feature Exploration for Spoof-aware Speaker Verification

    Authors: ** Woo Lee, Eungbeom Kim, Junghyun Koo, Kyogu Lee

    Abstract: Text-to-speech and voice conversion studies are constantly improving to the extent where they can produce synthetic speech almost indistinguishable from bona fide human speech. In this regard, the importance of countermeasures (CM) against synthetic voice attacks of the automatic speaker verification (ASV) systems emerges. Nonetheless, most end-to-end spoofing detection networks are black-box syst… ▽ More

    Submitted 2 July, 2022; v1 submitted 6 April, 2022; originally announced April 2022.

    Comments: Accepted to be published in the Proceedings of Interspeech 2022

  30. arXiv:2204.02637  [pdf, other

    eess.AS cs.SD

    Global HRTF Interpolation via Learned Affine Transformation of Hyper-conditioned Features

    Authors: ** Woo Lee, Sungho Lee, Kyogu Lee

    Abstract: Estimating Head-Related Transfer Functions (HRTFs) of arbitrary source points is essential in immersive binaural audio rendering. Computing each individual's HRTFs is challenging, as traditional approaches require expensive time and computational resources, while modern data-driven approaches are data-hungry. Especially for the data-driven approaches, existing HRTF datasets differ in spatial sampl… ▽ More

    Submitted 3 November, 2022; v1 submitted 6 April, 2022; originally announced April 2022.

    Comments: Submitted to ICASSP 2023

  31. arXiv:2203.11799  [pdf, other

    cs.CV eess.IV

    AP-BSN: Self-Supervised Denoising for Real-World Images via Asymmetric PD and Blind-Spot Network

    Authors: Wooseok Lee, Sanghyun Son, Kyoung Mu Lee

    Abstract: Blind-spot network (BSN) and its variants have made significant advances in self-supervised denoising. Nevertheless, they are still bound to synthetic noisy inputs due to less practical assumptions like pixel-wise independent noise. Hence, it is challenging to deal with spatially correlated real-world noise using self-supervised BSN. Recently, pixel-shuffle downsampling (PD) has been proposed to r… ▽ More

    Submitted 24 March, 2022; v1 submitted 22 March, 2022; originally announced March 2022.

    Comments: Accepted to CVPR2022

  32. arXiv:2203.04294  [pdf, other

    eess.IV cs.AI cs.CV

    NaviAirway: a Bronchiole-sensitive Deep Learning-based Airway Segmentation Pipeline

    Authors: Andong Wang, Terence Chi Chun Tam, Ho Ming Poon, Kun-Chang Yu, Wei-Ning Lee

    Abstract: Airway segmentation is essential for chest CT image analysis. Different from natural image segmentation, which pursues high pixel-wise accuracy, airway segmentation focuses on topology. The task is challenging not only because of its complex tree-like structure but also the severe pixel imbalance among airway branches of different generations. To tackle the problems, we present a NaviAirway method… ▽ More

    Submitted 16 June, 2023; v1 submitted 7 March, 2022; originally announced March 2022.

  33. arXiv:2202.09533  [pdf, other

    cs.CV eess.IV

    C2N: Practical Generative Noise Modeling for Real-World Denoising

    Authors: Geonwoon Jang, Wooseok Lee, Sanghyun Son, Kyoung Mu Lee

    Abstract: Learning-based image denoising methods have been bounded to situations where well-aligned noisy and clean images are given, or samples are synthesized from predetermined noise models, e.g., Gaussian. While recent generative noise modeling methods aim to simulate the unknown distribution of real-world noise, several limitations still exist. In a practical scenario, a noise generator should learn to… ▽ More

    Submitted 19 February, 2022; originally announced February 2022.

    Journal ref: Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV), 2021, pp. 2350-2359

  34. arXiv:2202.01784  [pdf, other

    cs.SD cs.LG eess.AS

    Robust Audio Anomaly Detection

    Authors: Wo Jae Lee, Karim Helwani, Arvindh Krishnaswamy, Srikanth Tenneti

    Abstract: We propose an outlier robust multivariate time series model which can be used for detecting previously unseen anomalous sounds based on noisy training data. The presented approach doesn't assume the presence of labeled anomalies in the training dataset and uses a novel deep neural network architecture to learn the temporal dynamics of the multivariate time series at multiple resolutions while bein… ▽ More

    Submitted 3 February, 2022; originally announced February 2022.

    Comments: Accepted paper at RobustML Workshop@ICLR 2021

    Journal ref: RobustML Workshop - ICLR 2021

  35. TOAST: Trajectory Optimization and Simultaneous Tracking using Shared Neural Network Dynamics

    Authors: Taekyung Kim, Ho** Lee, Seongil Hong, Wonsuk Lee

    Abstract: Neural networks have been increasingly employed in Model Predictive Controller (MPC) to control nonlinear dynamic systems. However, MPC still poses a problem that an achievable update rate is insufficient to cope with model uncertainty and external disturbances. In this paper, we present a novel control scheme that can design an optimal tracking controller using the neural network dynamics of the… ▽ More

    Submitted 14 July, 2022; v1 submitted 20 January, 2022; originally announced January 2022.

    Comments: Accepted to IEEE Robotics and Automation Letters (and IROS 2022). Our video can be found at https://youtu.be/YQG0yHE5jWw

    Journal ref: IEEE Robotics and Automation Letters, 2022

  36. arXiv:2201.00229  [pdf, other

    eess.SP

    Understanding Energy Efficiency and Interference Tolerance in Millimeter Wave Receivers

    Authors: Panagiotis Skrimponis, Seongjoon Kang, Abbas Khalili, Wonho Lee, Navid Hosseinzadeh, Marco Mezzavilla, Elza Erkip, Mark J. W. Rodwell, James F. Buckwalter, Sundeep Rangan

    Abstract: Power consumption is a key challenge in millimeter wave (mmWave) receiver front-ends, due to the need to support high dimensional antenna arrays at wide bandwidths. Recently, there has been considerable work in develo** low-power front-ends, often based on low-resolution ADCs and low-power mixers. A critical but less studied consequence of such designs is the relatively low-dynamic range which i… ▽ More

    Submitted 1 January, 2022; originally announced January 2022.

    Comments: Appeared at the Asilomar Conference on Signals, Systems, and Computers 2021

  37. arXiv:2111.00187  [pdf

    eess.SP

    Echopype: A Python library for interoperable and scalable processing of water column sonar data for biological information

    Authors: Wu-Jung Lee, Emilio Mayorga, Landung Setiawan, Valentina Staneva

    Abstract: High-frequency sonar systems deployed on a wide array of ocean observing platforms are creating a deluge of water column sonar data at an unprecedented speed from all corners of the ocean. Efficient and integrative analysis of these data, either across different sonar instruments or with other oceanographic datasets, holds the key to understanding the response of marine ecosystems to the rapidly c… ▽ More

    Submitted 12 March, 2024; v1 submitted 30 October, 2021; originally announced November 2021.

    Comments: Fix erroneous annotations in use case example flowchart

  38. arXiv:2105.06887  [pdf

    eess.IV cs.CV cs.LG

    A Frequency Domain Constraint for Synthetic and Real X-ray Image Super Resolution

    Authors: Qing Ma, Jae Chul Koh, WonSook Lee

    Abstract: Synthetic X-ray images are simulated X-ray images projected from CT data. High-quality synthetic X-ray images can facilitate various applications such as surgical image guidance systems and VR training simulations. However, it is difficult to produce high-quality arbitrary view synthetic X-ray images in real-time due to different CT slice thickness, high computational cost, and the complexity of a… ▽ More

    Submitted 10 August, 2021; v1 submitted 14 May, 2021; originally announced May 2021.

  39. arXiv:2104.11421  [pdf, other

    cs.LG eess.IV eess.SP

    A Framework for Recognizing and Estimating Human Concentration Levels

    Authors: Woodo Lee, Jakyung Koo, Nokyung Park, Pilgu Kang, Jeakwon Shim

    Abstract: One of the major tasks in online education is to estimate the concentration levels of each student. Previous studies have a limitation of classifying the levels using discrete states only. The purpose of this paper is to estimate the subtle levels as specified states by using the minimum amount of body movement data. This is done by a framework composed of a Deep Neural Network and Kalman Filter.… ▽ More

    Submitted 23 April, 2021; originally announced April 2021.

  40. arXiv:2104.11267  [pdf, other

    eess.SY

    Integrated Framework of Vehicle Dynamics, Instabilities, Energy Models, and Sparse Flow Smoothing Controllers

    Authors: Jonathan W. Lee, George Gunter, Rabie Ramadan, Sulaiman Almatrudi, Paige Arnold, John Aquino, William Barbour, Rahul Bhadani, Joy Carpio, Fang-Chieh Chou, Marsalis Gibson, Xiaoqian Gong, Amaury Hayat, Nour Khoudari, Abdul Rahman Kreidieh, Maya Kumar, Nathan Lichtlé, Sean McQuade, Brian Nguyen, Megan Ross, Sydney Truong, Eugene Vinitsky, Yibo Zhao, Jonathan Sprinkle, Benedetto Piccoli , et al. (3 additional authors not shown)

    Abstract: This work presents an integrated framework of: vehicle dynamics models, with a particular attention to instabilities and traffic waves; vehicle energy models, with particular attention to accurate energy values for strongly unsteady driving profiles; and sparse Lagrangian controls via automated vehicles, with a focus on controls that can be executed via existing technology such as adaptive cruise… ▽ More

    Submitted 22 April, 2021; originally announced April 2021.

  41. Sparse Channel Estimation in Wideband Systems with Geometric Sequence Decomposition

    Authors: Woong-Hee Lee, Ki Won Sung

    Abstract: The sparsity of multipaths in the wideband channel has motivated the use of compressed sensing for channel estimation. In this letter, we propose a different approach to sparse channel estimation. We exploit the fact that $L$ taps of channel impulse response in time domain constitute a non-orthogonal superposition of $L$ geometric sequences in frequency domain. This converts the channel estimation… ▽ More

    Submitted 24 October, 2021; v1 submitted 9 April, 2021; originally announced April 2021.

  42. arXiv:2101.07937  [pdf, other

    cs.LG cs.AI eess.SP

    Noise Learning Based Denoising Autoencoder

    Authors: Woong-Hee Lee, Mustafa Ozger, Ursula Challita, Ki Won Sung

    Abstract: This letter introduces a new denoiser that modifies the structure of denoising autoencoder (DAE), namely noise learning based DAE (nlDAE). The proposed nlDAE learns the noise of the input data. Then, the denoising is performed by subtracting the regenerated noise from the noisy input. Hence, nlDAE is more effective than DAE when the noise is simpler to regenerate than the original data. To validat… ▽ More

    Submitted 21 June, 2021; v1 submitted 19 January, 2021; originally announced January 2021.

  43. arXiv:2010.01799  [pdf, other

    cs.LG eess.IV stat.ML

    Understanding Catastrophic Overfitting in Single-step Adversarial Training

    Authors: Hoki Kim, Woo** Lee, Jaewook Lee

    Abstract: Although fast adversarial training has demonstrated both robustness and efficiency, the problem of "catastrophic overfitting" has been observed. This is a phenomenon in which, during single-step adversarial training, the robust accuracy against projected gradient descent (PGD) suddenly decreases to 0% after a few epochs, whereas the robust accuracy against fast gradient sign method (FGSM) increase… ▽ More

    Submitted 15 December, 2020; v1 submitted 5 October, 2020; originally announced October 2020.

    Comments: Accepted to AAAI 2021. Preprint

  44. arXiv:2008.06146  [pdf

    eess.AS cs.CL cs.SD

    End-to-End Trainable Self-Attentive Shallow Network for Text-Independent Speaker Verification

    Authors: Hyeonmook Park, Jungbae Park, Sang Wan Lee

    Abstract: Generalized end-to-end (GE2E) model is widely used in speaker verification (SV) fields due to its expandability and generality regardless of specific languages. However, the long-short term memory (LSTM) based on GE2E has two limitations: First, the embedding of GE2E suffers from vanishing gradient, which leads to performance degradation for very long input sequences. Secondly, utterances are not… ▽ More

    Submitted 13 August, 2020; originally announced August 2020.

    Comments: 5 pages, 3 figures, 3 tables

  45. arXiv:2007.11653  [pdf

    eess.IV cs.CV cs.LG

    Darwin's Neural Network: AI-based Strategies for Rapid and Scalable Cell and Coronavirus Screening

    Authors: Sang Won Lee, Yueh-Ting Chiu, Philip Brudnicki, Audrey M. Bischoff, Angus Jelinek, Jenny Zijun Wang, Danielle R. Bogdanowicz, Andrew F. Laine, Jia Guo, Helen H. Lu

    Abstract: Recent advances in the interdisciplinary scientific field of machine perception, computer vision, and biomedical engineering underpin a collection of machine learning algorithms with a remarkable ability to decipher the contents of microscope and nanoscope images. Machine learning algorithms are transforming the interpretation and analysis of microscope and nanoscope imaging data through use in co… ▽ More

    Submitted 22 July, 2020; originally announced July 2020.

    Comments: 19 pages, 7 figures

    ACM Class: I.5.0

  46. arXiv:2007.02906  [pdf, other

    eess.SP cs.LG

    Compact representation of temporal processes in echosounder time series via matrix decomposition

    Authors: Wu-Jung Lee, Valentina Staneva

    Abstract: The recent explosion in the availability of echosounder data from diverse ocean platforms has created unprecedented opportunities to observe the marine ecosystems at broad scales. However, the critical lack of methods capable of automatically discovering and summarizing prominent spatio-temporal echogram structures has limited the effective and wider use of these rich datasets. To address this cha… ▽ More

    Submitted 30 November, 2020; v1 submitted 6 July, 2020; originally announced July 2020.

  47. arXiv:2005.08701  [pdf, other

    q-bio.QM cs.LG eess.SP stat.ML

    Machine learning for the diagnosis of early stage diabetes using temporal glucose profiles

    Authors: Woo Seok Lee, Junghyo Jo, Taegeun Song

    Abstract: Machine learning shows remarkable success for recognizing patterns in data. Here we apply the machine learning (ML) for the diagnosis of early stage diabetes, which is known as a challenging task in medicine. Blood glucose levels are tightly regulated by two counter-regulatory hormones, insulin and glucagon, and the failure of the glucose homeostasis leads to the common metabolic disease, diabetes… ▽ More

    Submitted 18 May, 2020; originally announced May 2020.

    Comments: 4 pages, 2 figure

  48. arXiv:2005.02144  [pdf, other

    eess.SP

    Potential, Challenges and Future Directions for Deep Learning in Prognostics and Health Management Applications

    Authors: Olga Fink, Qin Wang, Markus Svensén, Pierre Dersin, Wan-Jui Lee, Melanie Ducoffe

    Abstract: Deep learning applications have been thriving over the last decade in many different domains, including computer vision and natural language understanding. The drivers for the vibrant development of deep learning have been the availability of abundant data, breakthroughs of algorithms and the advancements in hardware. Despite the fact that complex industrial assets have been extensively monitored… ▽ More

    Submitted 5 May, 2020; originally announced May 2020.

    Comments: To appear in Engineering Applications of Artificial Intelligence

  49. arXiv:2004.12786  [pdf, other

    eess.IV cs.CV cs.LG

    A Cascaded Learning Strategy for Robust COVID-19 Pneumonia Chest X-Ray Screening

    Authors: Chun-Fu Yeh, Hsien-Tzu Cheng, Andy Wei, Hsin-Ming Chen, Po-Chen Kuo, Keng-Chi Liu, Mong-Chi Ko, Ray-Jade Chen, Po-Chang Lee, Jen-Hsiang Chuang, Chi-Mai Chen, Yi-Chang Chen, Wen-Jeng Lee, Ning Chien, Jo-Yu Chen, Yu-Sen Huang, Yu-Chien Chang, Yu-Cheng Huang, Nai-Kuan Chou, Kuan-Hua Chao, Yi-Chin Tu, Yeun-Chung Chang, Tyng-Luh Liu

    Abstract: We introduce a comprehensive screening platform for the COVID-19 (a.k.a., SARS-CoV-2) pneumonia. The proposed AI-based system works on chest x-ray (CXR) images to predict whether a patient is infected with the COVID-19 disease. Although the recent international joint effort on making the availability of all sorts of open data, the public collection of CXR images is still relatively small for relia… ▽ More

    Submitted 30 April, 2020; v1 submitted 24 April, 2020; originally announced April 2020.

    Comments: 14 pages, 6 figures

  50. arXiv:2004.11138  [pdf, other

    cs.CV cs.LG eess.IV

    The Creation and Detection of Deepfakes: A Survey

    Authors: Yisroel Mirsky, Wenke Lee

    Abstract: Generative deep learning algorithms have progressed to a point where it is difficult to tell the difference between what is real and what is fake. In 2018, it was discovered how easy it is to use this technology for unethical and malicious applications, such as the spread of misinformation, impersonation of political leaders, and the defamation of innocent individuals. Since then, these `deepfakes… ▽ More

    Submitted 13 September, 2020; v1 submitted 23 April, 2020; originally announced April 2020.

    Journal ref: ACM Computing Surveys (CSUR), 2020, preprint