Skip to main content

Showing 1–47 of 47 results for author: Cho, S

Searching in archive eess. Search in all archives.
.
  1. arXiv:2405.01591  [pdf, other

    cs.CL cs.AI eess.IV

    Simplifying Multimodality: Unimodal Approach to Multimodal Challenges in Radiology with General-Domain Large Language Model

    Authors: Seonhee Cho, Choonghan Kim, Jiho Lee, Chetan Chilkunda, Su** Choi, Joo Heung Yoon

    Abstract: Recent advancements in Large Multimodal Models (LMMs) have attracted interest in their generalization capability with only a few samples in the prompt. This progress is particularly relevant to the medical domain, where the quality and sensitivity of data pose unique challenges for model training and application. However, the dependency on high-quality data for effective in-context learning raises… ▽ More

    Submitted 29 April, 2024; originally announced May 2024.

    Comments: Under review

  2. arXiv:2404.01123  [pdf, other

    cs.CV cs.GR eess.IV

    CLIPtone: Unsupervised Learning for Text-based Image Tone Adjustment

    Authors: Hyeongmin Lee, Kyoungkook Kang, Jungseul Ok, Sunghyun Cho

    Abstract: Recent image tone adjustment (or enhancement) approaches have predominantly adopted supervised learning for learning human-centric perceptual assessment. However, these approaches are constrained by intrinsic challenges of supervised learning. Primarily, the requirement for expertly-curated or retouched images escalates the data acquisition expenses. Moreover, their coverage of target style is con… ▽ More

    Submitted 1 April, 2024; originally announced April 2024.

  3. arXiv:2402.16732  [pdf

    eess.SP

    C-Band Lithium Niobate on Silicon Carbide SAW Resonator with Figure-of-Merit of 124 at 6.5 GHz

    Authors: Tzu-Hsuan Hsu, Joshua Campbell, Jack Kramer, Sinwoo Cho, Ming-Huang Li, Ruochen Lu

    Abstract: In this work, we demonstrate a C-band shear-horizontal surface acoustic wave (SH-SAW) resonator with high electromechanical coupling (kt2) of 22% and a quality factor (Q) of 565 based on a thin-film lithium niobate (LN) on silicon carbide (SiC) platform, featuring an excellent figure-of-merit (FoM = kt2*Q ) of 124 at 6.5 GHz, the highest FoM reported in this frequency range. The resonator frequenc… ▽ More

    Submitted 29 June, 2024; v1 submitted 26 February, 2024; originally announced February 2024.

    Comments: 4 pages, 5 figures, 1 table

  4. arXiv:2402.12194  [pdf

    physics.app-ph eess.SP

    23.8-GHz Acoustic Filter in Periodically Poled Piezoelectric Film Lithium Niobate With 1.52-dB IL and 19.4% FBW

    Authors: Sinwoo Cho, Omar Barrera, Jack Kramer, Vakhtang Chulukhadze, Tzu-Hsuan Hsu, Joshua Campbell, Ian Anderson, Ruochen Lu

    Abstract: This paper reports the first piezoelectric acoustic filter in periodically poled piezoelectric film (P3F) lithium niobate (LiNbO3) at 23.8 GHz with low insertion loss (IL) of 1.52 dB and 3-dB fractional bandwidth (FBW) of 19.4%. The filter features a compact footprint of 0.64 mm2. The third-order ladder filter is implemented with electrically coupled resonators in 150 nm bi-layer P3F 128 rotated Y… ▽ More

    Submitted 28 June, 2024; v1 submitted 19 February, 2024; originally announced February 2024.

    Comments: 4 pages, 7 figures, IEEE Microwave and Wireless Technology Letters

    Journal ref: IEEE Microwave and Wireless Technology Letters, vol. 34, no. 4, pp. 391-394, April 2024

  5. Thin-film Lithium Niobate on Insulator Surface Acoustic Wave Devices for 6G Centimeter Bands

    Authors: Tzu-Hsuan Hsu, Joshua Campbell, Jack Kramer, Sinwoo Cho, Zhi-Qiang Lee, Ming-Huang Li, Ruochen Lu

    Abstract: In this work, we investigate the frequency scaling of shear-horizontal (S.H.) surface acoustic wave (SAW) resonators based on a lithium niobate on insulator (LNOI) substrate into the centimeter bands for 6G wireless systems. Prototyped resonators with wavelengths ranging between 240 nm and 400 nm were fabricated, and the experimental results exhibit a successful frequency scaling between 9.05 and… ▽ More

    Submitted 25 February, 2024; v1 submitted 18 February, 2024; originally announced February 2024.

    Comments: 4 pages, 7 figures, 1 table, submitted to IEEE IC-MAM 2024

    Journal ref: 2024 IEEE MTT-S International Conference on Microwave Acoustics & Mechanics (IC-MAM), Chengdu, China, 2024, pp. 117-120

  6. arXiv:2401.00370  [pdf, other

    cs.CV eess.IV

    UGPNet: Universal Generative Prior for Image Restoration

    Authors: Hwayoon Lee, Kyoungkook Kang, Hyeongmin Lee, Seung-Hwan Baek, Sunghyun Cho

    Abstract: Recent image restoration methods can be broadly categorized into two classes: (1) regression methods that recover the rough structure of the original image without synthesizing high-frequency details and (2) generative methods that synthesize perceptually-realistic high-frequency details even though the resulting image deviates from the original structure of the input. While both directions have b… ▽ More

    Submitted 30 December, 2023; originally announced January 2024.

    Comments: Accepted to WACV 2024

  7. arXiv:2312.13947  [pdf, other

    eess.IV cs.LG math.NA physics.med-ph

    PhysRFANet: Physics-Guided Neural Network for Real-Time Prediction of Thermal Effect During Radiofrequency Ablation Treatment

    Authors: Minwoo Shin, Minjee Seo, Seonaeng Cho, Juil Park, Joon Ho Kwon, Deukhee Lee, Kyungho Yoon

    Abstract: Radiofrequency ablation (RFA) is a widely used minimally invasive technique for ablating solid tumors. Achieving precise personalized treatment necessitates feedback information on in situ thermal effects induced by the RFA procedure. While computer simulation facilitates the prediction of electrical and thermal phenomena associated with RFA, its practical implementation in clinical settings is hi… ▽ More

    Submitted 21 December, 2023; originally announced December 2023.

  8. arXiv:2312.13313  [pdf, other

    eess.IV cs.CV

    ParamISP: Learned Forward and Inverse ISPs using Camera Parameters

    Authors: Woohyeok Kim, Geonu Kim, Junyong Lee, Seungyong Lee, Seung-Hwan Baek, Sunghyun Cho

    Abstract: RAW images are rarely shared mainly due to its excessive data size compared to their sRGB counterparts obtained by camera ISPs. Learning the forward and inverse processes of camera ISPs has been recently demonstrated, enabling physically-meaningful RAW-level image processing on input sRGB images. However, existing learning-based ISP methods fail to handle the large variations in the ISP processes… ▽ More

    Submitted 14 April, 2024; v1 submitted 20 December, 2023; originally announced December 2023.

  9. arXiv:2312.01689  [pdf, other

    eess.IV cs.CV

    Fast and accurate sparse-view CBCT reconstruction using meta-learned neural attenuation field and hash-encoding regularization

    Authors: Heejun Shin, Taehee Kim, Jongho Lee, Se Young Chun, Seungryung Cho, Dongmyung Shin

    Abstract: Cone beam computed tomography (CBCT) is an emerging medical imaging technique to visualize the internal anatomical structures of patients. During a CBCT scan, several projection images of different angles or views are collectively utilized to reconstruct a tomographic image. However, reducing the number of projections in a CBCT scan while preserving the quality of a reconstructed image is challeng… ▽ More

    Submitted 16 January, 2024; v1 submitted 4 December, 2023; originally announced December 2023.

  10. Millimeter Wave Thin-Film Bulk Acoustic Resonator in Sputtered Scandium Aluminum Nitride Using Platinum Electrodes

    Authors: Sinwoo Cho, Omar Barrera, Pietro Simeoni, Ellie Y. Wang, Jack Kramer, Vakhtang Chulukhadze, Joshua Campbell, Matteo Rinaldi, Ruochen Lu

    Abstract: This work describes sputtered scandium aluminum nitride (ScAlN) thin-film bulk acoustic resonators (FBAR) at millimeter wave (mmWave) with high quality factor (Q) using platinum (Pt) electrodes. FBARs with combinations of Pt and aluminum (Al) electrodes, i.e., Al top Al bottom, Pt top Al bottom, Al top Pt bottom, and Pt top Pt bottom, are built to study the impact of electrodes on mmWave FBARs. Th… ▽ More

    Submitted 22 November, 2023; originally announced November 2023.

    Comments: 4 pages, 7 figures, accepted by IEEE MEMS 2024

  11. Transferred Thin Film Lithium Niobate as Millimeter Wave Acoustic Filter Platforms

    Authors: Omar Barrera, Sinwoo Cho, Kenny Hyunh, Jack Kramer, Michael Liao, Vakhtang Chulukhadze, Lezli Matto, Mark S. Goorsky, Ruochen Lu

    Abstract: This paper reports the first high-performance acoustic filters toward millimeter wave (mmWave) bands using transferred single-crystal thin film lithium niobate (LiNbO3). By transferring LiNbO3 on the top of silicon (Si) and sapphire (Al2O3) substrates with an intermediate amorphous Si (aSi) bonding and sacrificial layer, we demonstrate compact acoustic filters with record-breaking performance beyo… ▽ More

    Submitted 21 November, 2023; originally announced November 2023.

    Comments: 4 pages, 8 figures, accepted by IEEE MEMS 2024

  12. arXiv:2311.05712  [pdf

    eess.SP

    38.7 GHz Thin Film Lithium Niobate Acoustic Filter

    Authors: Omar Barrera, Sinwoo Cho, Jack Kramer, Vakhtang Chulukhadze, Joshua Campbell, Ruochen Lu

    Abstract: In this work, a 38.7 GHz acoustic wave ladder filter exhibiting insertion loss (IL) of 5.63 dB and 3-dB fractional bandwidth (FBW) of 17.6% is demonstrated, pushing the frequency limits of thin-film piezoelectric acoustic filter technology. The filter achieves operating frequency up to 5G millimeter wave (mmWave) frequency range 2 (FR2) bands, by thinning thin-film LiNbO3 resonators to sub-50 nm t… ▽ More

    Submitted 9 November, 2023; originally announced November 2023.

    Comments: 4 pages, 6 figures, accepted by IEEE MTT-S International Microwave Filter Workshop (IMFW) 2024

  13. arXiv:2310.17659  [pdf, other

    eess.SP cs.AI

    RTNH+: Enhanced 4D Radar Object Detection Network using Combined CFAR-based Two-level Preprocessing and Vertical Encoding

    Authors: Seung-Hyun Kong, Dong-Hee Paek, Sangjae Cho

    Abstract: Four-dimensional (4D) Radar is a useful sensor for 3D object detection and the relative radial speed estimation of surrounding objects under various weather conditions. However, since Radar measurements are corrupted with invalid components such as noise, interference, and clutter, it is necessary to employ a preprocessing algorithm before the 3D object detection with neural networks. In this pape… ▽ More

    Submitted 19 October, 2023; originally announced October 2023.

    Comments: Arxiv preprint

  14. arXiv:2309.07132  [pdf

    physics.app-ph eess.SP

    Fundamental Antisymmetric Mode Acoustic Resonator in Periodically Poled Piezoelectric Film Lithium Niobate

    Authors: Omar Barrera, Jack Kramer, Ryan Tetro, Sinwoo Cho, Vakhtang Chulukhadze, Luca Colombo, Ruochen Lu

    Abstract: Radio frequency (RF) acoustic resonators have long been used for signal processing and sensing. Devices that integrate acoustic resonators benefit from their slow phase velocity (vp), in the order of 3 to 10 km/s, which allows miniaturization of the device. Regarding the subject of small form factor, acoustic resonators that operate at the so-called fundamental antisymmetric mode (A0), feature eve… ▽ More

    Submitted 27 August, 2023; originally announced September 2023.

    Comments: 4 pages, 6 figures, accepted by IEEE IUS 2023

  15. Millimeter Wave Thin-Film Bulk Acoustic Resonator in Sputtered Scandium Aluminum Nitride

    Authors: Sinwoo Cho, Omar Barrera, Pietro Simeoni, Emily N. Marshall, Jack Kramer, Keisuke Motoki, Tzu-Hsuan Hsu, Vakhtang Chulukhadze, Matteo Rinaldi, W. Alan Doolittle, Ruochen Lu

    Abstract: This work reports a millimeter wave (mmWave) thin-film bulk acoustic resonator (FBAR) in sputtered scandium aluminum nitride (ScAlN). This paper identifies challenges of frequency scaling sputtered ScAlN into mmWave and proposes a stack and new fabrication procedure with a sputtered Sc0.3Al0.7N on Al on Si carrier wafer. The resonator achieves electromechanical coupling (k2) of 7.0% and quality fa… ▽ More

    Submitted 6 September, 2023; originally announced September 2023.

    Comments: 3 pages, 7 figures, submitted to JMEMS

  16. Thin-Film Lithium Niobate Acoustic Resonator with High Q of 237 and k2 of 5.1% at 50.74 GHz

    Authors: Jack Kramer, Vakhtang Chulukhadze, Kenny Huynh, Omar Barrera, Michael Liao, Sinwoo Cho, Lezli Matto, Mark S. Goorsky, Ruochen Lu

    Abstract: This work reports a 50.74 GHz lithium niobate (LiNbO3) acoustic resonator with a high quality factor (Q) of 237 and an electromechanical coupling (k2) of 5.17% resulting in a figure of merit (FoM, Q x k2) of 12.2. The LiNbO3 resonator employs a novel bilayer periodically poled piezoelectric film (P3F) 128 Y-cut LiNbO3 on amorphous silicon (a-Si) on sapphire stack to achieve low losses and high cou… ▽ More

    Submitted 11 July, 2023; originally announced July 2023.

    Comments: 4 pages, 5 figures, published in 2023 Joint Conference of the IEEE International Frequency Control Symposium & European Frequency and Time Forum (IEEE IFCS 2023)

    Journal ref: 2023 Joint Conference of the European Frequency and Time Forum and IEEE International Frequency Control Symposium (EFTF/IFCS), Toyama, Japan, 2023

  17. Thin-Film Lithium Niobate Acoustic Filter at 23.5 GHz with 2.38 dB IL and 18.2% FBW

    Authors: Omar Barrera, Sinwoo Cho, Lezli Matto, Jack Kramer, Kenny Huynh, Vakhtang Chulukhadze, Yen-Wei Chang, Mark S. Goorsky, Ruochen Lu

    Abstract: This work reports an acoustic filter at 23.5 GHz with a low insertion loss (IL) of 2.38 dB and a 3-dB fractional bandwidth (FBW) of 18.2%, significantly surpassing the state-of-the-art. The device leverages electrically coupled acoustic resonators in 100 nm 128° Y-cut lithium niobate (LiNbO3) piezoelectric thin film, operating in the first-order antisymmetric (A1) mode. A new film stack, namely tr… ▽ More

    Submitted 10 July, 2023; originally announced July 2023.

    Comments: 4 pages, 8 figures, submitted to IEEE JMEMS

  18. arXiv:2307.02784  [pdf, other

    cs.IT cs.NI eess.SP

    On the Spatial-Wideband Effects in Millimeter-Wave Cell-Free Massive MIMO

    Authors: Seyoung Ahn, Soohyeong Kim, Yongseok Kwon, Joohan Park, Jiseung Youn, Sunghyun Cho

    Abstract: In this paper, we investigate the spatial-wideband effects in cell-free massive MIMO (CF-mMIMO) systems in mmWave bands. The utilization of mmWave frequencies brings challenges such as signal attenuation and the need for denser networks like ultra-dense networks (UDN) to maintain communication performance. CF-mMIMO is introduced as a solution, where distributed access points (APs) transmit signals… ▽ More

    Submitted 6 July, 2023; originally announced July 2023.

  19. arXiv:2306.12562  [pdf, other

    cs.CV eess.IV

    Neural Spectro-polarimetric Fields

    Authors: Youngchan Kim, Wonjoon **, Sunghyun Cho, Seung-Hwan Baek

    Abstract: Modeling the spatial radiance distribution of light rays in a scene has been extensively explored for applications, including view synthesis. Spectrum and polarization, the wave properties of light, are often neglected due to their integration into three RGB spectral bands and their non-perceptibility to human vision. However, these properties are known to encompass substantial material and geomet… ▽ More

    Submitted 10 December, 2023; v1 submitted 21 June, 2023; originally announced June 2023.

  20. arXiv:2306.05682  [pdf, other

    cs.CV cs.AI cs.LG cs.RO eess.IV

    Lightweight Monocular Depth Estimation via Token-Sharing Transformer

    Authors: Dong-Jae Lee, Jae Young Lee, Hyounguk Shon, Eo**dl Yi, Yeong-Hun Park, Sung-Sik Cho, Junmo Kim

    Abstract: Depth estimation is an important task in various robotics systems and applications. In mobile robotics systems, monocular depth estimation is desirable since a single RGB camera can be deployable at a low cost and compact size. Due to its significant and growing needs, many lightweight monocular depth estimation networks have been proposed for mobile robotics systems. While most lightweight monocu… ▽ More

    Submitted 9 June, 2023; originally announced June 2023.

    Comments: ICRA 2023

  21. An empirical study on speech restoration guided by self supervised speech representation

    Authors: Jaeuk Byun, Youna Ji, Soo Whan Chung, Soyeon Choe, Min Seok Choi

    Abstract: Enhancing speech quality is an indispensable yet difficult task as it is often complicated by a range of degradation factors. In addition to additive noise, reverberation, clip**, and speech attenuation can all adversely affect speech quality. Speech restoration aims to recover speech components from these distortions. This paper focuses on exploring the impact of self-supervised speech represen… ▽ More

    Submitted 30 May, 2023; originally announced May 2023.

    Comments: To be presented at ICASSP 2023

  22. arXiv:2303.13110  [pdf, other

    eess.IV cs.CV

    OCELOT: Overlapped Cell on Tissue Dataset for Histopathology

    Authors: Jeongun Ryu, Aaron Valero Puche, JaeWoong Shin, Seonwook Park, Biagio Brattoli, **hee Lee, Wonkyung Jung, Soo Ick Cho, Kyunghyun Paeng, Chan-Young Ock, Donggeun Yoo, Sérgio Pereira

    Abstract: Cell detection is a fundamental task in computational pathology that can be used for extracting high-level medical information from whole-slide images. For accurate cell detection, pathologists often zoom out to understand the tissue-level structures and zoom in to classify cells based on their morphology and the surrounding context. However, there is a lack of efforts to reflect such behaviors by… ▽ More

    Submitted 23 March, 2023; v1 submitted 23 March, 2023; originally announced March 2023.

    Comments: Accepted for publication at CVPR'23

  23. arXiv:2303.06298  [pdf, other

    cs.CV cs.LG eess.IV

    MLP-SRGAN: A Single-Dimension Super Resolution GAN using MLP-Mixer

    Authors: Samir Mitha, Seungho Choe, Pejman Jahbedar Maralani, Alan R. Moody, April Khademi

    Abstract: We propose a novel architecture called MLP-SRGAN, which is a single-dimension Super Resolution Generative Adversarial Network (SRGAN) that utilizes Multi-Layer Perceptron Mixers (MLP-Mixers) along with convolutional layers to upsample in the slice direction. MLP-SRGAN is trained and validated using high resolution (HR) FLAIR MRI from the MSSEG2 challenge dataset. The method was applied to three mu… ▽ More

    Submitted 10 March, 2023; originally announced March 2023.

    Comments: 14 pages, 10 figures

  24. arXiv:2302.05494  [pdf

    eess.SP

    Data-Driven Web-Based Patching Management Tool Using Multi-Sensor Pavement Structure Measurements

    Authors: Sneha Jha, Yaguang Zhang, Bongsuk Park, Seonghwan Cho, James V. Krogmeier, Tandra Bagchi, John E. Haddock

    Abstract: Automating pavement maintenance suggestions is challenging,especially for actionable recommendations such as patching location,depth and priority.It is common practice among State agencies to manually inspect road segments of interest and decide maintenance requirements based on the pavement condition index (PCI).However,standalone PCI only evaluates the pavement surface condition and coupled with… ▽ More

    Submitted 10 February, 2023; originally announced February 2023.

    Comments: Presented at: Transportation Research Board Annual Meeting 2023

    Report number: PaperID-TRBAM-23-04483

  25. arXiv:2210.17327  [pdf, other

    eess.AS cs.LG cs.SD

    Diffusion-based Generative Speech Source Separation

    Authors: Robin Scheibler, Youna Ji, Soo-Whan Chung, Jaeuk Byun, Soyeon Choe, Min-Seok Choi

    Abstract: We propose DiffSep, a new single channel source separation method based on score-matching of a stochastic differential equation (SDE). We craft a tailored continuous time diffusion-mixing process starting from the separated sources and converging to a Gaussian distribution centered on their mixture. This formulation lets us apply the machinery of score-based generative modelling. First, we train a… ▽ More

    Submitted 2 November, 2022; v1 submitted 31 October, 2022; originally announced October 2022.

    Comments: 5 pages, 3 figures, 2 tables. Submitted to ICASSP 2023

  26. Iterative Filter Adaptive Network for Single Image Defocus Deblurring

    Authors: Junyong Lee, Hyeongseok Son, Jaesung Rim, Sunghyun Cho, Seungyong Lee

    Abstract: We propose a novel end-to-end learning-based approach for single image defocus deblurring. The proposed approach is equipped with a novel Iterative Filter Adaptive Network (IFAN) that is specifically designed to handle spatially-varying and large defocus blur. For adaptively handling spatially-varying blur, IFAN predicts pixel-wise deblurring filters, which are applied to defocused features of an… ▽ More

    Submitted 28 March, 2022; v1 submitted 31 August, 2021; originally announced August 2021.

    Comments: CVPR 2021

    Journal ref: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2021, pp. 2034-2042

  27. arXiv:2108.07640  [pdf, other

    cs.CV cs.SD eess.AS eess.IV

    Look Who's Talking: Active Speaker Detection in the Wild

    Authors: You ** Kim, Hee-Soo Heo, Soyeon Choe, Soo-Whan Chung, Yoohwan Kwon, Bong-** Lee, Youngki Kwon, Joon Son Chung

    Abstract: In this work, we present a novel audio-visual dataset for active speaker detection in the wild. A speaker is considered active when his or her face is visible and the voice is audible simultaneously. Although active speaker detection is a crucial pre-processing step for many audio-visual tasks, there is no existing dataset of natural human speech to evaluate the performance of active speaker detec… ▽ More

    Submitted 17 August, 2021; originally announced August 2021.

    Comments: To appear in Interspeech 2021. Data will be available from https://github.com/clovaai/lookwhostalking

  28. arXiv:2102.07087  [pdf, other

    cs.NI eess.SP

    Survey on Aerial Radio Access Networks: Toward a Comprehensive 6G Access Infrastructure

    Authors: Nhu-Ngoc Dao, Quoc-Viet Pham, Ngo Hoang Tu, Tran Thien Thanh, Vo Nguyen Quoc Bao, Demeke Shumeye Lakew, Sungrae Cho

    Abstract: Current network access infrastructures are characterized by heterogeneity, low latency, high throughput, and high computational capability, enabling massive concurrent connections and various services. Unfortunately, this design does not pay significant attention to mobile services in underserved areas. In this context, the use of aerial radio access networks (ARANs) is a promising strategy to com… ▽ More

    Submitted 27 February, 2021; v1 submitted 14 February, 2021; originally announced February 2021.

    Comments: Accepted by the IEEE Communications Surveys and Tutorials

  29. arXiv:2101.09566  [pdf, other

    physics.app-ph eess.SY

    Isogeometric Configuration Design Optimization of Three-dimensional Curved Beam Structures for Maximal Fundamental Frequency

    Authors: Myung-** Choi, Jae-Hyun Kim, Bonyong Koo, Seonho Cho

    Abstract: This paper presents a configuration design optimization method for three-dimensional curved beam built-up structures having maximized fundamental eigenfrequency. We develop the method of computation of design velocity field and optimal design of beam structures constrained on a curved surface, where both designs of the embedded beams and the curved surface are simultaneously varied during the opti… ▽ More

    Submitted 23 January, 2021; originally announced January 2021.

    Comments: This document is the personal version of an article whose final publication is available at https://doi.org/10.1007/s00158-020-02803-0

    Journal ref: Structural and Multidisciplinary Optimization, 2021

  30. arXiv:2010.16278  [pdf, other

    physics.ins-det eess.SY hep-ex

    Archiver System Management for Belle II Detector Operation

    Authors: Y. -K. Kim, S. -J. Cho, S. -H. Park, M. Nakao, T. Konno

    Abstract: The Belle II experiment is a high-energy physics experiment at the SuperKEKB electron-positron collider. Using Belle II data, high precision measurement of rare decays and CP-violation in heavy quarks and leptons can be performed to probe New Physics. In this paper, we present the archiver system used to store the monitoring data of the Belle II detector and discuss in particular how we maintain t… ▽ More

    Submitted 30 October, 2020; originally announced October 2020.

    Comments: This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible

  31. arXiv:2009.10126  [pdf, other

    eess.SP cs.LG stat.AP stat.ME

    Evaluating phase synchronization methods in fMRI: a comparison study and new approaches

    Authors: Hamed Honari, Ann S. Choe, Martin A. Lindquist

    Abstract: In recent years there has been growing interest in measuring time-varying functional connectivity between different brain regions using resting-state functional magnetic resonance imaging (rs-fMRI) data. One way to assess the relationship between signals from different brain regions is to measure their phase synchronization (PS) across time. There are several ways to perform such analyses, and her… ▽ More

    Submitted 21 September, 2020; originally announced September 2020.

  32. arXiv:2009.08501  [pdf

    cond-mat.mtrl-sci cond-mat.dis-nn eess.IV

    Separating physically distinct mechanisms in complex infrared plasmonic nanostructures via machine learning enhanced electron energy loss spectroscopy

    Authors: Sergei V. Kalinin, Kevin M. Roccapriore, Shin Hum Cho, Delia J. Milliron, Rama Vasudevan, Maxim Ziatdinov, Jordan A. Hachtel

    Abstract: Low-loss electron energy loss spectroscopy (EELS) has emerged as a technique of choice for exploring the localization of plasmonic phenomena at the nanometer level, necessitating analysis of physical behaviors from 3D spectral data sets. For systems with high localization, linear unmixing methods provide an excellent basis for exploratory analysis, while in more complex systems large numbers of co… ▽ More

    Submitted 17 September, 2020; originally announced September 2020.

  33. arXiv:2008.01482  [pdf, other

    eess.SP cs.IT

    Terahertz Line-Of-Sight MIMO Communication: Theory and Practical Challenges

    Authors: Heedong Do, Sungmin Cho, Jeonghun Park, Ho-** Song, Namyoon Lee, Angel Lozano

    Abstract: A relentless trend in wireless communications is the hunger for bandwidth, and fresh bandwidth is only to be found at ever-higher frequencies. While 5G systems are seizing the mmWave band, the attention of researchers is shifting already to the terahertz range. In that distant land of tiny wavelengths, antenna arrays can serve for more than power-enhancing beamforming. Defying lower-frequency wisd… ▽ More

    Submitted 4 August, 2020; originally announced August 2020.

    Comments: 6 figures

  34. arXiv:2007.05191  [pdf, other

    cs.SD eess.AS

    Overcoming label noise in audio event detection using sequential labeling

    Authors: Jae-Bin Kim, Seongkyu Mun, Myungwoo Oh, Soyeon Choe, Yong-Hyeok Lee, Hyung-Min Park

    Abstract: This paper addresses the noisy label issue in audio event detection (AED) by refining strong labels as sequential labels with inaccurate timestamps removed. In AED, strong labels contain the occurrence of a specific event and its timestamps corresponding to the start and end of the event in an audio clip. The timestamps depend on subjectivity of each annotator, and their label noise is inevitable.… ▽ More

    Submitted 10 July, 2020; originally announced July 2020.

  35. arXiv:2005.07074  [pdf, other

    cs.SD cs.CV cs.MM eess.AS

    FaceFilter: Audio-visual speech separation using still images

    Authors: Soo-Whan Chung, Soyeon Choe, Joon Son Chung, Hong-Goo Kang

    Abstract: The objective of this paper is to separate a target speaker's speech from a mixture of two speakers using a deep audio-visual speech separation network. Unlike previous works that used lip movement on video clips or pre-enrolled speaker information as an auxiliary conditional feature, we use a single face image of the target speaker. In this task, the conditional feature is obtained from facial ap… ▽ More

    Submitted 14 May, 2020; originally announced May 2020.

    Comments: Under submission as a conference paper. Video examples: https://youtu.be/ku9xoLh62E

  36. Quality Prediction on Deep Generative Images

    Authors: Hyunsuk Ko, Dae Yeol Lee, Seunghyun Cho, Alan C. Bovik

    Abstract: In recent years, deep neural networks have been utilized in a wide variety of applications including image generation. In particular, generative adversarial networks (GANs) are able to produce highly realistic pictures as part of tasks such as image compression. As with standard compression, it is desirable to be able to automatically assess the perceptual quality of generative images to monitor a… ▽ More

    Submitted 17 April, 2020; originally announced April 2020.

    Comments: Accepted for publication in IEEE Transactions on Image Processing

  37. In defence of metric learning for speaker recognition

    Authors: Joon Son Chung, Jaesung Huh, Seongkyu Mun, Minjae Lee, Hee Soo Heo, Soyeon Choe, Chiheon Ham, Sunghwan Jung, Bong-** Lee, Icksang Han

    Abstract: The objective of this paper is 'open-set' speaker recognition of unseen speakers, where ideal embeddings should be able to condense information into a compact utterance-level representation that has small intra-speaker and large inter-speaker distance. A popular belief in speaker recognition is that networks trained with classification objectives outperform metric learning methods. In this paper… ▽ More

    Submitted 24 April, 2020; v1 submitted 26 March, 2020; originally announced March 2020.

    Comments: The code can be found at https://github.com/clovaai/voxceleb_trainer

  38. arXiv:1912.12817  [pdf, other

    eess.IV cs.CV

    An End-to-End Joint Learning Scheme of Image Compression and Quality Enhancement with Improved Entropy Minimization

    Authors: Jooyoung Lee, Seunghyun Cho, Munchurl Kim

    Abstract: Recently, learned image compression methods have been actively studied. Among them, entropy-minimization based approaches have achieved superior results compared to conventional image codecs such as BPG and JPEG2000. However, the quality enhancement and rate-minimization are conflictively coupled in the process of image compression. That is, maintaining high image quality entails less compression… ▽ More

    Submitted 13 March, 2020; v1 submitted 30 December, 2019; originally announced December 2019.

    Comments: 25 pages, 14 figures

  39. W-Net: Two-stage U-Net with misaligned data for raw-to-RGB map**

    Authors: Kwang-Hyun Uhm, Seung-Wook Kim, Seo-Won Ji, Sung-** Cho, Jun-Pyo Hong, Sung-Jea Ko

    Abstract: Recent research on learning a map** between raw Bayer images and RGB images has progressed with the development of deep convolutional neural networks. A challenging data set namely the Zurich Raw-to-RGB data set (ZRR) has been released in the AIM 2019 raw-to-RGB map** challenge. In ZRR, input raw and target RGB images are captured by two different cameras and thus not perfectly aligned. Moreov… ▽ More

    Submitted 21 November, 2019; v1 submitted 19 November, 2019; originally announced November 2019.

    Comments: Accepted by ICCVW 2019

  40. arXiv:1911.06149  [pdf, other

    eess.AS cs.CL cs.SD

    Emotional Voice Conversion using Multitask Learning with Text-to-speech

    Authors: Tae-Ho Kim, Sungjae Cho, Shinkook Choi, Sejik Park, Soo-Young Lee

    Abstract: Voice conversion (VC) is a task to transform a person's voice to different style while conserving linguistic contents. Previous state-of-the-art on VC is based on sequence-to-sequence (seq2seq) model, which could mislead linguistic information. There was an attempt to overcome it by using textual supervision, it requires explicit alignment which loses the benefit of using seq2seq model. In this pa… ▽ More

    Submitted 27 November, 2019; v1 submitted 11 November, 2019; originally announced November 2019.

    Comments: 4 pages, 3 figures, submitted to ICASSP2020

  41. arXiv:1911.02411  [pdf, other

    cs.SD eess.AS

    The sound of my voice: speaker representation loss for target voice separation

    Authors: Seongkyu Mun, Soyeon Choe, Jaesung Huh, Joon Son Chung

    Abstract: Content and style representations have been widely studied in the field of style transfer. In this paper, we propose a new loss function using speaker content representation for audio source separation, and we call it speaker representation loss. The objective is to extract the target speaker voice from the noisy input and also remove it from the residual components. Compared to the conventional s… ▽ More

    Submitted 27 February, 2020; v1 submitted 6 November, 2019; originally announced November 2019.

    Comments: To appear in ICASSP 2020. The first two authors contributed equally to this work

  42. arXiv:1909.13265  [pdf, ps, other

    cs.NE eess.SY

    Adaptive Control for Marine Vessels Against Harsh Environmental Variation

    Authors: Fangwen Tu, Shuzhi Sam Ge, Yoo Sang Choo, Chang Chieh Hang

    Abstract: In this paper, robust control with sea state observer and dynamic thrust allocation is proposed for the Dynamic Positioning (DP) of an accommodation vessel in the presence of unknown hydrodynamic force variation and the input time delay. In order to overcome the huge force variation due to the adjoining Floating Production Storage and Offloading (FPSO) and accommodation vessel, a novel sea state o… ▽ More

    Submitted 29 September, 2019; originally announced September 2019.

  43. arXiv:1901.04690  [pdf, other

    eess.AS cs.SD

    Orthonormal Embedding-based Deep Clustering for Single-channel Speech Separation

    Authors: Soyeon Choe, Soo-Whan Chung, Youna Ji, Hong-Goo Kang

    Abstract: Deep clustering is a deep neural network-based speech separation algorithm that first trains the mixed component of signals with high-dimensional embeddings, and then uses a clustering algorithm to separate each mixture of sources. In this paper, we extend the baseline criterion of deep clustering with an additional regularization term to further improve the overall performance. This term plays a… ▽ More

    Submitted 15 January, 2019; originally announced January 2019.

  44. arXiv:1809.10452  [pdf, other

    eess.IV

    Context-adaptive Entropy Model for End-to-end Optimized Image Compression

    Authors: Jooyoung Lee, Seunghyun Cho, Seung-Kwon Beack

    Abstract: We propose a context-adaptive entropy model for use in end-to-end optimized image compression. Our model exploits two types of contexts, bit-consuming contexts and bit-free contexts, distinguished based upon whether additional bit allocation is required. Based on these contexts, we allow the model to more accurately estimate the distribution of each latent representation with a more generalized fo… ▽ More

    Submitted 6 May, 2019; v1 submitted 27 September, 2018; originally announced September 2018.

    Comments: Published as a conference paper at ICLR 2019. The test code, evaluation results and reconstructed images are publicly available at https://github.com/JooyoungLeeETRI/CA_Entropy_Model

  45. arXiv:1803.00694  [pdf

    physics.med-ph cs.CV eess.IV

    Deep-neural-network based sinogram synthesis for sparse-view CT image reconstruction

    Authors: Hoyeon Lee, Jongha Lee, Hyeongseok Kim, Byungchul Cho, Seungryong Cho

    Abstract: Recently, a number of approaches to low-dose computed tomography (CT) have been developed and deployed in commercialized CT scanners. Tube current reduction is perhaps the most actively explored technology with advanced image reconstruction algorithms. Sparse data sampling is another viable option to the low-dose CT, and sparse-view CT has been particularly of interest among the researchers in CT… ▽ More

    Submitted 5 March, 2018; v1 submitted 1 March, 2018; originally announced March 2018.

  46. Autonomous Power Allocation based on Distributed Deep Learning for Device-to-Device Communication Underlaying Cellular Network

    Authors: Jeehyeong Kim, Joohan Park, Jaewon Noh, Sunghyun Cho

    Abstract: For Device-to-device (D2D) communication of Internet-of-Things (IoT) enabled 5G system, there is a limit to allocating resources considering a complicated interference between different links in a centralized manner. If D2D link is controlled by an enhanced node base station (eNB), and thus, remains a burden on the eNB and it causes delayed latency. This paper proposes a fully autonomous power all… ▽ More

    Submitted 8 June, 2020; v1 submitted 8 February, 2018; originally announced February 2018.

    Comments: accepted in IEEE Access, 2169-3536

  47. arXiv:1712.00166  [pdf, other

    cs.SD cs.AI cs.LG eess.AS

    Audio Cover Song Identification using Convolutional Neural Network

    Authors: Sungkyun Chang, Juheon Lee, Sang Keun Choe, Kyogu Lee

    Abstract: In this paper, we propose a new approach to cover song identification using a CNN (convolutional neural network). Most previous studies extract the feature vectors that characterize the cover song relation from a pair of songs and used it to compute the (dis)similarity between the two songs. Based on the observation that there is a meaningful pattern between cover songs and that this can be learne… ▽ More

    Submitted 26 October, 2020; v1 submitted 30 November, 2017; originally announced December 2017.

    Comments: NIPS 2017 Workshop on Machine Learning for Audio (ML4A), Long Beach, CA, USA