Skip to main content

Showing 1–33 of 33 results for author: Lim, S

Searching in archive eess. Search in all archives.
.
  1. arXiv:2401.12499  [pdf, ps, other

    cs.IT eess.SP

    On the Fundamental Tradeoff of Joint Communication and Quickest Change Detection

    Authors: Daewon Seo, Sung Hoon Lim

    Abstract: In this work, we take the initiative in studying the fundamental tradeoff between communication and quickest change detection (QCD) under an integrated sensing and communication setting. We formally establish a joint communication and sensing problem for quickest change detection. Then, by utilizing constant subblock-composition codes and a modified QuSum detection rule, which we call subblock QuS… ▽ More

    Submitted 23 January, 2024; originally announced January 2024.

  2. arXiv:2312.07826  [pdf

    cs.RO eess.SY

    Integrated Path Tracking with DYC and MPC using LSTM Based Tire Force Estimator for Four-wheel Independent Steering and Driving Vehicle

    Authors: Sung** Lim, Bilal Sadiq, Yongsik **, Sangho Lee, Gyeungho Choi, Kanghyun Nam, Yongseob Lim

    Abstract: Active collision avoidance system plays a crucial role in ensuring the lateral safety of autonomous vehicles, and it is primarily related to path planning and tracking control algorithms. In particular, the direct yaw-moment control (DYC) system can significantly improve the lateral stability of a vehicle in environments with sudden changes in road conditions. In order to apply the DYC algorithm,… ▽ More

    Submitted 12 December, 2023; originally announced December 2023.

  3. arXiv:2312.05528  [pdf, other

    eess.IV cs.CV

    Exploring 3D U-Net Training Configurations and Post-Processing Strategies for the MICCAI 2023 Kidney and Tumor Segmentation Challenge

    Authors: Kwang-Hyun Uhm, Hyunjun Cho, Zhixin Xu, Seohoon Lim, Seung-Won Jung, Sung-Hoo Hong, Sung-Jea Ko

    Abstract: In 2023, it is estimated that 81,800 kidney cancer cases will be newly diagnosed, and 14,890 people will die from this cancer in the United States. Preoperative dynamic contrast-enhanced abdominal computed tomography (CT) is often used for detecting lesions. However, there exists inter-observer variability due to subtle differences in the imaging features of kidney and kidney tumors. In this paper… ▽ More

    Submitted 9 December, 2023; originally announced December 2023.

    Comments: MICCAI 2023, KITS 2023 challenge 2nd place

  4. arXiv:2304.00471  [pdf, other

    cs.SD cs.CV cs.GR cs.LG eess.AS

    A Unified Compression Framework for Efficient Speech-Driven Talking-Face Generation

    Authors: Bo-Kyeong Kim, Jaemin Kang, Daeun Seo, Hancheol Park, Shinkook Choi, Hyoung-Kyu Song, Hyungshin Kim, Sungsu Lim

    Abstract: Virtual humans have gained considerable attention in numerous industries, e.g., entertainment and e-commerce. As a core technology, synthesizing photorealistic face frames from target speech and facial identity has been actively studied with generative adversarial networks. Despite remarkable results of modern talking-face generation models, they often entail high computational burdens, which limi… ▽ More

    Submitted 28 April, 2023; v1 submitted 2 April, 2023; originally announced April 2023.

    Comments: MLSys Workshop on On-Device Intelligence, 2023; Demo: https://huggingface.co/spaces/nota-ai/compressed_wav2lip

  5. arXiv:2303.00795  [pdf, other

    eess.IV cs.CV

    Improved Segmentation of Deep Sulci in Cortical Gray Matter Using a Deep Learning Framework Incorporating Laplace's Equation

    Authors: Sadhana Ravikumar, Ranjit Ittyerah, Sydney Lim, Long Xie, Sandhitsu Das, Pulkit Khandelwal, Laura E. M. Wisse, Madigan L. Bedard, John L. Robinson, Terry Schuck, Murray Grossman, John Q. Trojanowski, Edward B. Lee, M. Dylan Tisdall, Karthik Prabhakaran, John A. Detre, David J. Irwin, Winifred Trotman, Gabor Mizsei, Emilio Artacho-Pérula, Maria Mercedes Iñiguez de Onzono Martin, Maria del Mar Arroyo Jiménez, Monica Muñoz, Francisco Javier Molina Romero, Maria del Pilar Marcos Rabal , et al. (7 additional authors not shown)

    Abstract: When develo** tools for automated cortical segmentation, the ability to produce topologically correct segmentations is important in order to compute geometrically valid morphometry measures. In practice, accurate cortical segmentation is challenged by image artifacts and the highly convoluted anatomy of the cortex itself. To address this, we propose a novel deep learning-based cortical segmentat… ▽ More

    Submitted 3 March, 2023; v1 submitted 1 March, 2023; originally announced March 2023.

    Comments: Accepted at the 28th biennial international conference on Information Processing in Medical Imaging (IPMI 2023)

  6. arXiv:2206.07651  [pdf

    eess.SP

    Fault Diagnosis of Inter-turn Short Circuit in Permanent Magnet Synchronous Motors with Current Signal Imaging and Unsupervised Learning

    Authors: W. Jung, S. H. Yun, Y. S. Lim, S. Cheong, J. Bae, Y. H. Park

    Abstract: This paper proposes machine-independent feature engineering for winding inter-turn short circuit fault that uses electrical current signals. Electrical current signal collected from permanent magnet synchronous motor (PMSM) is subjected to different environmental and operational conditions. To solve these problems, robust current signal imaging method and deep learning-based feature extraction met… ▽ More

    Submitted 9 June, 2022; originally announced June 2022.

    Comments: submitted to IECON 2022

  7. arXiv:2206.07515  [pdf

    eess.SP cs.AI cs.LG

    A Deep Learning Network for the Classification of Intracardiac Electrograms in Atrial Tachycardia

    Authors: Zerui Chen, Sonia Xhyn Teo, Andrie Ochtman, Shier Nee Saw, Nicholas Cheng, Eric Tien Siang Lim, Murphy Lyu, Hwee Kuan Lee

    Abstract: A key technology enabling the success of catheter ablation treatment for atrial tachycardia is activation map**, which relies on manual local activation time (LAT) annotation of all acquired intracardiac electrogram (EGM) signals. This is a time-consuming and error-prone procedure, due to the difficulty in identifying the signal activation peaks for fractionated signals. This work presents a Dee… ▽ More

    Submitted 2 June, 2022; originally announced June 2022.

    Comments: 34 pages, 10 figures

    ACM Class: J.3

  8. arXiv:2203.13072  [pdf, other

    cs.CV eess.IV

    Multitask Emotion Recognition Model with Knowledge Distillation and Task Discriminator

    Authors: Euiseok Jeong, Geesung Oh, Sejoon Lim

    Abstract: Due to the collection of big data and the development of deep learning, research to predict human emotions in the wild is being actively conducted. We designed a multi-task model using ABAW dataset to predict valence-arousal, expression, and action unit through audio data and face images at in real world. We trained model from the incomplete label by applying the knowledge distillation technique.… ▽ More

    Submitted 24 March, 2022; originally announced March 2022.

  9. arXiv:2112.02164  [pdf, other

    eess.IV cs.CV

    Bridging the gap between prostate radiology and pathology through machine learning

    Authors: Indrani Bhattacharya, David S. Lim, Han Lin Aung, Xingchen Liu, Arun Seetharaman, Christian A. Kunder, Wei Shao, Simon J. C. Soerensen, Richard E. Fan, Pejman Ghanouni, Katherine J. To'o, James D. Brooks, Geoffrey A. Sonn, Mirabela Rusu

    Abstract: Prostate cancer is the second deadliest cancer for American men. While Magnetic Resonance Imaging (MRI) is increasingly used to guide targeted biopsies for prostate cancer diagnosis, its utility remains limited due to high rates of false positives and false negatives as well as low inter-reader agreements. Machine learning methods to detect and localize cancer on prostate MRI can help standardize… ▽ More

    Submitted 3 December, 2021; originally announced December 2021.

    Comments: Indrani Bhattacharya and David S. Lim contributed equally as first authors. Geoffrey A. Sonn and Mirabela Rusu contributed equally as senior authors

  10. arXiv:2110.13903  [pdf, other

    cs.CV eess.IV

    NeRV: Neural Representations for Videos

    Authors: Hao Chen, Bo He, Hanyu Wang, Yixuan Ren, Ser-Nam Lim, Abhinav Shrivastava

    Abstract: We propose a novel neural representation for videos (NeRV) which encodes videos in neural networks. Unlike conventional representations that treat videos as frame sequences, we represent videos as neural networks taking frame index as input. Given a frame index, NeRV outputs the corresponding RGB image. Video encoding in NeRV is simply fitting a neural network to video frames and decoding process… ▽ More

    Submitted 26 October, 2021; originally announced October 2021.

    Comments: To appear at NeurIPS 2021

  11. arXiv:2110.07711  [pdf, other

    eess.IV cs.CV

    Gray Matter Segmentation in Ultra High Resolution 7 Tesla ex vivo T2w MRI of Human Brain Hemispheres

    Authors: Pulkit Khandelwal, Shokufeh Sadaghiani, Michael Tran Duong, Sadhana Ravikumar, Sydney Lim, Sanaz Arezoumandan, Claire Peterson, Eunice Chung, Madigan Bedard, Noah Capp, Ranjit Ittyerah, Elyse Migdal, Grace Choi, Emily Kopp, Bridget Loja, Eusha Hasan, Jiacheng Li, Karthik Prabhakaran, Gabor Mizsei, Marianna Gabrielyan, Theresa Schuck, John Robinson, Daniel Ohm, Edward Lee, John Q. Trojanowski , et al. (8 additional authors not shown)

    Abstract: Ex vivo MRI of the brain provides remarkable advantages over in vivo MRI for visualizing and characterizing detailed neuroanatomy. However, automated cortical segmentation methods in ex vivo MRI are not well developed, primarily due to limited availability of labeled datasets, and heterogeneity in scanner hardware and acquisition protocols. In this work, we present a high resolution 7 Tesla datase… ▽ More

    Submitted 3 March, 2022; v1 submitted 14 October, 2021; originally announced October 2021.

    Comments: Ex vivo analysis framework (work in progress 2022 at the University of Pennsylvania)

  12. Neural Network Facial Authentication for Public Electric Vehicle Charging Station

    Authors: Muhamad Amin Husni Abdul Haris, Sin Liang Lim

    Abstract: This study is to investigate and compare the facial recognition accuracy performance of Dlib ResNet against a K-Nearest Neighbour (KNN) classifier. Particularly when used against a dataset from an Asian ethnicity as Dlib ResNet was reported to have an accuracy deficiency when it comes to Asian faces. The comparisons are both implemented on the facial vectors extracted using the Histogram of Orient… ▽ More

    Submitted 19 June, 2021; originally announced June 2021.

    Journal ref: JETAP Vol.3 No.1 (2021) 17-21

  13. arXiv:2103.10892  [pdf

    eess.IV cs.CV cs.LG

    Deep Label Fusion: A 3D End-to-End Hybrid Multi-Atlas Segmentation and Deep Learning Pipeline

    Authors: Long Xie, Laura E. M. Wisse, Jiancong Wang, Sadhana Ravikumar, Trevor Glenn, Anica Luther, Sydney Lim, David A. Wolk, Paul A. Yushkevich

    Abstract: Deep learning (DL) is the state-of-the-art methodology in various medical image segmentation tasks. However, it requires relatively large amounts of manually labeled training data, which may be infeasible to generate in some applications. In addition, DL methods have relatively poor generalizability to out-of-sample data. Multi-atlas segmentation (MAS), on the other hand, has promising performance… ▽ More

    Submitted 19 March, 2021; originally announced March 2021.

    Comments: 12 pages paper accepted by the international conference of Information Processing in Medical Imaging (IPMI) 2021

  14. An Open-Source Low-Cost Mobile Robot System with an RGB-D Camera and Efficient Real-Time Navigation Algorithm

    Authors: Taekyung Kim, Seunghyun Lim, Gwanjun Shin, Geonhee Sim, Dongwon Yun

    Abstract: Currently, mobile robots are develo** rapidly and are finding numerous applications in the industry. However, several problems remain related to their practical use, such as the need for expensive hardware and high power consumption levels. In this study, we build a low-cost indoor mobile robot platform that does not include a LiDAR or a GPU. Then, we design an autonomous navigation architecture… ▽ More

    Submitted 13 December, 2022; v1 submitted 4 March, 2021; originally announced March 2021.

    Comments: Accepted to IEEE Access 2022. Project Github: https://github.com/shinkansan/2019-UGRP-DPoom Video: https://youtu.be/Li3-RlO28lk

    Journal ref: IEEE Access, vol. 10, pp. 127871-127881, 2022

  15. arXiv:2102.11906  [pdf, other

    eess.AS cs.SD

    Handling Background Noise in Neural Speech Generation

    Authors: Tom Denton, Alejandro Luebs, Felicia S. C. Lim, Andrew Storus, Hengchin Yeh, W. Bastiaan Kleijn, Jan Skoglund

    Abstract: Recent advances in neural-network based generative modeling of speech has shown great potential for speech coding. However, the performance of such models drops when the input is not clean speech, e.g., in the presence of background noise, preventing its use in practical applications. In this paper we examine the reason and discuss methods to overcome this issue. Placing a denoising preprocessing… ▽ More

    Submitted 23 February, 2021; originally announced February 2021.

    Comments: 5 pages, 3 figures, presented at the Asilomar Conference on Signals, Systems, and Computers 2020

  16. arXiv:2102.09785  [pdf, ps, other

    eess.SP cs.IT cs.LG

    Deep Learning-based Beam Tracking for Millimeter-wave Communications under Mobility

    Authors: Sun Hong Lim, Sunwoo Kim, Byonghyo Shim, Jun Won Choi

    Abstract: In this paper, we propose a deep learning-based beam tracking method for millimeter-wave (mmWave)communications. Beam tracking is employed for transmitting the known symbols using the sounding beams and tracking time-varying channels to maintain a reliable communication link. When the pose of a user equipment (UE) device varies rapidly, the mmWave channels also tend to vary fast, which hinders sea… ▽ More

    Submitted 1 December, 2022; v1 submitted 19 February, 2021; originally announced February 2021.

    Comments: 23 pages, 8 figures

  17. arXiv:2102.09660  [pdf, other

    eess.AS cs.SD

    Generative Speech Coding with Predictive Variance Regularization

    Authors: W. Bastiaan Kleijn, Andrew Storus, Michael Chinen, Tom Denton, Felicia S. C. Lim, Alejandro Luebs, Jan Skoglund, Hengchin Yeh

    Abstract: The recent emergence of machine-learning based generative models for speech suggests a significant reduction in bit rate for speech codecs is possible. However, the performance of generative models deteriorates significantly with the distortions present in real-world input signals. We argue that this deterioration is due to the sensitivity of the maximum likelihood criterion to outliers and the in… ▽ More

    Submitted 18 February, 2021; originally announced February 2021.

    MSC Class: 94 ACM Class: I.m

  18. arXiv:2102.00201  [pdf, other

    cs.SD cs.IR cs.LG cs.MM eess.AS

    Melon Playlist Dataset: a public dataset for audio-based playlist generation and music tagging

    Authors: Andres Ferraro, Yuntae Kim, Soohyeon Lee, Biho Kim, Namjun Jo, Semi Lim, Suyon Lim, Jungtaek Jang, Sehwan Kim, Xavier Serra, Dmitry Bogdanov

    Abstract: One of the main limitations in the field of audio signal processing is the lack of large public datasets with audio representations and high-quality annotations due to restrictions of copyrighted commercial music. We present Melon Playlist Dataset, a public dataset of mel-spectrograms for 649,091tracks and 148,826 associated playlists annotated by 30,652 different tags. All the data is gathered fr… ▽ More

    Submitted 30 January, 2021; originally announced February 2021.

    Comments: 2021 IEEE International Conference on Acoustics, Speech and Signal Processing

  19. arXiv:2008.07742  [pdf, other

    eess.IV cs.CV

    UDC 2020 Challenge on Image Restoration of Under-Display Camera: Methods and Results

    Authors: Yuqian Zhou, Michael Kwan, Kyle Tolentino, Neil Emerton, Sehoon Lim, Tim Large, Lijiang Fu, Zhihong Pan, Baopu Li, Qirui Yang, Yihao Liu, Jigang Tang, Tao Ku, Shibin Ma, Bingnan Hu, Jiarong Wang, Densen Puthussery, Hrishikesh P S, Melvin Kuriakose, Jiji C V, Varun Sundar, Sumanth Hegde, Divya Kothandaraman, Kaushik Mitra, Akashdeep Jassal , et al. (20 additional authors not shown)

    Abstract: This paper is the report of the first Under-Display Camera (UDC) image restoration challenge in conjunction with the RLQ workshop at ECCV 2020. The challenge is based on a newly-collected database of Under-Display Camera. The challenge tracks correspond to two types of display: a 4k Transparent OLED (T-OLED) and a phone Pentile OLED (P-OLED). Along with about 150 teams registered the challenge, ei… ▽ More

    Submitted 18 August, 2020; originally announced August 2020.

    Comments: 15 pages

  20. arXiv:2007.01261  [pdf, other

    cs.CV cs.LG eess.IV

    Curriculum Manager for Source Selection in Multi-Source Domain Adaptation

    Authors: Luyu Yang, Yogesh Balaji, Ser-Nam Lim, Abhinav Shrivastava

    Abstract: The performance of Multi-Source Unsupervised Domain Adaptation depends significantly on the effectiveness of transfer from labeled source domain samples. In this paper, we proposed an adversarial agent that learns a dynamic curriculum for source samples, called Curriculum Manager for Source Selection (CMSS). The Curriculum Manager, an independent network module, constantly updates the curriculum d… ▽ More

    Submitted 2 July, 2020; originally announced July 2020.

  21. arXiv:2004.14491  [pdf, other

    cs.CV cs.LG cs.MM eess.IV

    Detecting Deep-Fake Videos from Appearance and Behavior

    Authors: Shruti Agarwal, Tarek El-Gaaly, Hany Farid, Ser-Nam Lim

    Abstract: Synthetically-generated audios and videos -- so-called deep fakes -- continue to capture the imagination of the computer-graphics and computer-vision communities. At the same time, the democratization of access to technology that can create sophisticated manipulated video of anybody saying anything continues to be of concern because of its power to disrupt democratic elections, commit small to lar… ▽ More

    Submitted 29 April, 2020; originally announced April 2020.

    Journal ref: IEEE Workshop on Image Forensics and Security, 2020

  22. arXiv:2004.09584  [pdf, other

    eess.AS cs.SD eess.SP

    ViSQOL v3: An Open Source Production Ready Objective Speech and Audio Metric

    Authors: Michael Chinen, Felicia S. C. Lim, Jan Skoglund, Nikita Gureev, Feargus O'Gorman, Andrew Hines

    Abstract: Estimation of perceptual quality in audio and speech is possible using a variety of methods. The combined v3 release of ViSQOL and ViSQOLAudio (for speech and audio, respectively,) provides improvements upon previous versions, in terms of both design and usage. As an open source C++ library or binary with permissive licensing, ViSQOL can now be deployed beyond the research context into production… ▽ More

    Submitted 20 April, 2020; originally announced April 2020.

    Comments: 2020 Twelfth International Conference on Quality of Multimedia Experience (QoMEX)

  23. arXiv:2004.09320  [pdf, other

    eess.IV cs.CV cs.LG stat.ML

    Quantization Guided JPEG Artifact Correction

    Authors: Max Ehrlich, Larry Davis, Ser-Nam Lim, Abhinav Shrivastava

    Abstract: The JPEG image compression algorithm is the most popular method of image compression because of its ability for large compression ratios. However, to achieve such high compression, information is lost. For aggressive quantization settings, this leads to a noticeable reduction in image quality. Artifact correction has been studied in the context of deep neural networks for some time, but the curren… ▽ More

    Submitted 16 July, 2020; v1 submitted 16 April, 2020; originally announced April 2020.

    Comments: Published in the proceedings of ECCV 2020, please see our released code and models at https://gitlab.com/Queuecumber/quantization-guided-ac

  24. arXiv:2003.12015  [pdf, other

    eess.SP physics.optics

    Photonic convolutional neural networks using integrated diffractive optics

    Authors: Jun Rong Ong, Chin Chun Ooi, Thomas Y. L. Ang, Soon Thor Lim, Ching Eng Png

    Abstract: With recent rapid advances in photonic integrated circuits, it has been demonstrated that programmable photonic chips can be used to implement artificial neural networks. Convolutional neural networks (CNN) are a class of deep learning methods that have been highly successful in applications such as image classification and speech processing. We present an architecture to implement a photonic CNN… ▽ More

    Submitted 17 February, 2020; originally announced March 2020.

    Comments: 9 pages, 6 figures

  25. arXiv:2003.06464  [pdf, other

    eess.SP cs.LG

    LCP: A Low-Communication Parallelization Method for Fast Neural Network Inference in Image Recognition

    Authors: Ramyad Hadidi, Bahar Asgari, Jiashen Cao, Younmin Bae, Da Eun Shim, Hyojong Kim, Sung-Kyu Lim, Michael S. Ryoo, Hyesoon Kim

    Abstract: Deep neural networks (DNNs) have inspired new studies in myriad edge applications with robots, autonomous agents, and Internet-of-things (IoT) devices. However, performing inference of DNNs in the edge is still a severe challenge, mainly because of the contradiction between the intensive resource requirements of DNNs and the tight resource availability in several edge domains. Further, as communic… ▽ More

    Submitted 17 November, 2020; v1 submitted 13 March, 2020; originally announced March 2020.

  26. arXiv:1912.11781  [pdf, ps, other

    eess.AS

    Multi-Source Direction-of-Arrival Estimation Using Improved Estimation Consistency Method

    Authors: Rohith Mars, Hiroyuki Ehara, Srikanth Nagisetty, Chong Soon Lim

    Abstract: We address the problem of estimating direction-of-arrivals (DOAs) for multiple acoustic sources in a reverberant environment using a spherical microphone array. It is well-known that multi-source DOA estimation is challenging in the presence of room reverberation, environmental noise and overlap** sources. In this work, we introduce multiple schemes to improve the robustness of estimation consis… ▽ More

    Submitted 26 December, 2019; originally announced December 2019.

  27. arXiv:1910.13122  [pdf

    cs.CY cs.AI cs.HC cs.LG eess.SY

    Algorithmic decision-making in AVs: Understanding ethical and technical concerns for smart cities

    Authors: Hazel Si Min Lim, Araz Taeihagh

    Abstract: Autonomous Vehicles (AVs) are increasingly embraced around the world to advance smart mobility and more broadly, smart, and sustainable cities. Algorithms form the basis of decision-making in AVs, allowing them to perform driving tasks autonomously, efficiently, and more safely than human drivers and offering various economic, social, and environmental benefits. However, algorithmic decision-makin… ▽ More

    Submitted 29 October, 2019; originally announced October 2019.

    Journal ref: Sustainability, 2019, 11(20), 5791

  28. arXiv:1910.06464  [pdf, other

    cs.LG cs.SD eess.AS stat.ML

    Low Bit-Rate Speech Coding with VQ-VAE and a WaveNet Decoder

    Authors: Cristina Gârbacea, Aäron van den Oord, Yazhe Li, Felicia S C Lim, Alejandro Luebs, Oriol Vinyals, Thomas C Walters

    Abstract: In order to efficiently transmit and store speech signals, speech codecs create a minimally redundant representation of the input signal which is then decoded at the receiver with the best possible perceptual quality. In this work we demonstrate that a neural network architecture based on VQ-VAE with a WaveNet decoder can be used to perform very low bit-rate speech coding with high reconstruction… ▽ More

    Submitted 14 October, 2019; originally announced October 2019.

    Comments: ICASSP 2019

    Journal ref: ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 735-739. IEEE, 2019

  29. arXiv:1909.04776  [pdf, other

    eess.AS cs.SD

    Generative Speech Enhancement Based on Cloned Networks

    Authors: Michael Chinen, W. Bastiaan Kleijn, Felicia S. C. Lim, Jan Skoglund

    Abstract: We propose to implement speech enhancement by the regeneration of clean speech from a salient representation extracted from the noisy signal. The network that extracts salient features is trained using a set of weight-sharing clones of the extractor network. The clones receive mel-frequency spectra of different noisy versions of the same speech signal as input. By encouraging the outputs of the cl… ▽ More

    Submitted 10 September, 2019; originally announced September 2019.

    Comments: Accepted WASPAA 2019

  30. arXiv:1908.09414  [pdf, other

    eess.IV cs.CV cs.LG

    CycleGAN with a Blur Kernel for Deconvolution Microscopy: Optimal Transport Geometry

    Authors: Sungjun Lim, Hyoungjun Park, Sang-Eun Lee, Sunghoe Chang, Jong Chul Ye

    Abstract: Deconvolution microscopy has been extensively used to improve the resolution of the wide-field fluorescent microscopy, but the performance of classical approaches critically depends on the accuracy of a model and optimization algorithms. Recently, the convolutional neural network (CNN) approaches have been studied as a fast and high performance alternative. Unfortunately, the CNN approaches usuall… ▽ More

    Submitted 8 July, 2020; v1 submitted 25 August, 2019; originally announced August 2019.

    Comments: This paper is accepted for IEEE Trans. Computational Imaging

  31. arXiv:1908.07045  [pdf, other

    eess.AS cs.SD

    Salient Speech Representations Based on Cloned Networks

    Authors: W. Bastiaan Kleijn, Felicia S. C. Lim, Michael Chinen, Jan Skoglund

    Abstract: We define salient features as features that are shared by signals that are defined as being equivalent by a system designer. The definition allows the designer to contribute qualitative information. We aim to find salient features that are useful as conditioning for generative networks. We extract salient features by jointly training a set of clones of an encoder network. Each network clone receiv… ▽ More

    Submitted 19 August, 2019; originally announced August 2019.

    Comments: Interspeech 2019

  32. arXiv:1906.05956  [pdf, other

    cs.LG cs.CV eess.IV stat.ML

    Scalable Neural Architecture Search for 3D Medical Image Segmentation

    Authors: Sungwoong Kim, Ildoo Kim, Sungbin Lim, Woonhyuk Baek, Chiheon Kim, Hyungjoo Cho, Boogeon Yoon, Taesup Kim

    Abstract: In this paper, a neural architecture search (NAS) framework is proposed for 3D medical image segmentation, to automatically optimize a neural architecture from a large design space. Our NAS framework searches the structure of each layer including neural connectivities and operation types in both of the encoder and decoder. Since optimizing over a large discrete architecture space is difficult due… ▽ More

    Submitted 13 June, 2019; originally announced June 2019.

    Comments: 9 pages, 3 figures

  33. arXiv:1712.01120  [pdf, other

    eess.AS cs.SD eess.SP

    Wavenet based low rate speech coding

    Authors: W. Bastiaan Kleijn, Felicia S. C. Lim, Alejandro Luebs, Jan Skoglund, Florian Stimberg, Quan Wang, Thomas C. Walters

    Abstract: Traditional parametric coding of speech facilitates low rate but provides poor reconstruction quality because of the inadequacy of the model used. We describe how a WaveNet generative speech model can be used to generate high quality speech from the bit stream of a standard parametric coder operating at 2.4 kb/s. We compare this parametric coder with a waveform coder based on the same generative m… ▽ More

    Submitted 1 December, 2017; originally announced December 2017.

    Comments: 5 pages, 2 figures