Skip to main content

Showing 1–39 of 39 results for author: Woo, J

Searching in archive eess. Search in all archives.
.
  1. arXiv:2406.14954  [pdf, other

    eess.IV cs.CV

    A Unified Framework for Synthesizing Multisequence Brain MRI via Hybrid Fusion

    Authors: Jihoon Cho, Jonghye Woo, **ah Park

    Abstract: Multisequence Magnetic Resonance Imaging (MRI) provides a reliable diagnosis in clinical applications through complementary information within sequences. However, in practice, the absence of certain MR sequences is a common problem that can lead to inconsistent analysis results. In this work, we propose a novel unified framework for synthesizing multisequence MR images, called Hybrid Fusion GAN (H… ▽ More

    Submitted 21 June, 2024; originally announced June 2024.

    Comments: 11 pages, 7 figures

  2. arXiv:2406.08714  [pdf, other

    eess.SP

    Real-time Digital RF Emulation -- II: A Near Memory Custom Accelerator

    Authors: Mandovi Mukherjee, Xiangyu Mao, Nael Rahman, Coleman DeLude, Joe Driscoll, Sudarshan Sharma, Payman Behnam, Uday Kamal, Jongseok Woo, Daehyun Kim, Sharjeel Khan, Jianming Tong, Jamin Seo, Prachi Sinha, Madhavan Swaminathan, Tushar Krishna, Santosh Pande, Justin Romberg, Saibal Mukhopadhyay

    Abstract: A near memory hardware accelerator, based on a novel direct path computational model, for real-time emulation of radio frequency systems is demonstrated. Our evaluation of hardware performance uses both application-specific integrated circuits (ASIC) and field programmable gate arrays (FPGA) methodologies: 1). The ASIC testchip implementation, using TSMC 28nm CMOS, leverages distributed autonomous… ▽ More

    Submitted 12 June, 2024; originally announced June 2024.

  3. arXiv:2405.05426  [pdf, other

    eess.SY

    ATLS: Automated Trailer Loading for Surface Vessels

    Authors: Amer Abughaida, Meet Gandhi, Jun Heo, Vaishnav Tadiparthi, Yosuke Sakamoto, Joohyun Woo, Sangjae Bae

    Abstract: Automated docking technologies of marine boats have been enlightened by an increasing number of literature. This paper contributes to the literature by proposing a mathematical framework that automates "trailer loading" in the presence of wind disturbances, which is unexplored despite its importance to boat owners. The comprehensive pipeline of localization, system identification, and trajectory o… ▽ More

    Submitted 8 May, 2024; originally announced May 2024.

    Comments: To be presented at IEEE Intelligent Vehicles Symposium (IV 2024)

  4. arXiv:2405.05107  [pdf, other

    cs.ET cs.AR eess.SY

    Leveraging AES Padding: dBs for Nothing and FEC for Free in IoT Systems

    Authors: Jongchan Woo, Vipindev Adat Vasudevan, Benjamin D. Kim, Rafael G. L. D'Oliveira, Alejandro Cohen, Thomas Stahlbuhk, Ken R. Duffy, Muriel Médard

    Abstract: The Internet of Things (IoT) represents a significant advancement in digital technology, with its rapidly growing network of interconnected devices. This expansion, however, brings forth critical challenges in data security and reliability, especially under the threat of increasing cyber vulnerabilities. Addressing the security concerns, the Advanced Encryption Standard (AES) is commonly employed… ▽ More

    Submitted 8 May, 2024; originally announced May 2024.

  5. arXiv:2405.04752  [pdf, other

    eess.AS cs.SD

    HILCodec: High Fidelity and Lightweight Neural Audio Codec

    Authors: Sunghwan Ahn, Beom Jun Woo, Min Hyun Han, Chanyeong Moon, Nam Soo Kim

    Abstract: The recent advancement of end-to-end neural audio codecs enables compressing audio at very low bitrates while reconstructing the output audio with high fidelity. Nonetheless, such improvements often come at the cost of increased model complexity. In this paper, we identify and address the problems of existing neural audio codecs. We show that the performance of Wave-U-Net does not increase consist… ▽ More

    Submitted 7 May, 2024; originally announced May 2024.

  6. arXiv:2404.07217  [pdf, other

    eess.SP cs.AI cs.CV cs.LG

    Attention-aware Semantic Communications for Collaborative Inference

    Authors: Jiwoong Im, Nayoung Kwon, Taewoo Park, Jiheon Woo, Jaeho Lee, Yongjune Kim

    Abstract: We propose a communication-efficient collaborative inference framework in the domain of edge inference, focusing on the efficient use of vision transformer (ViT) models. The partitioning strategy of conventional collaborative inference fails to reduce communication cost because of the inherent architecture of ViTs maintaining consistent layer dimensions across the entire transformer encoder. There… ▽ More

    Submitted 31 May, 2024; v1 submitted 23 February, 2024; originally announced April 2024.

  7. arXiv:2403.01752  [pdf

    eess.SY

    Cooperative and Interaction-aware Driver Model for Lane Change Maneuver

    Authors: Jemin Woo, Changsun Ahn

    Abstract: To achieve complete autonomous vehicles, it is crucial for autonomous vehicles to communicate and interact with their surrounding vehicles. Especially, since the lane change scenarios do not have traffic signals and traffic rules, the interactions between vehicles need to be considered for the autonomous vehicles. To address this issue, we propose a cooperative and interaction-aware decision-makin… ▽ More

    Submitted 4 March, 2024; originally announced March 2024.

  8. arXiv:2402.18775   

    cs.RO eess.SY

    How to Evaluate Human-likeness of Interaction-aware Driver Models

    Authors: Jemin Woo, Changsun Ahn

    Abstract: This study proposes a method for qualitatively evaluating and designing human-like driver models for autonomous vehicles. While most existing research on human-likeness has been focused on quantitative evaluation, it is crucial to consider qualitative measures to accurately capture human perception. To this end, we conducted surveys utilizing both video study and human experience-based study. The… ▽ More

    Submitted 3 March, 2024; v1 submitted 28 February, 2024; originally announced February 2024.

    Comments: This paper could benefit from further refinement to enhance the significance of its results

  9. arXiv:2402.06984  [pdf, other

    cs.SD cs.CV cs.MM eess.AS eess.IV

    Speech motion anomaly detection via cross-modal translation of 4D motion fields from tagged MRI

    Authors: Xiaofeng Liu, Fangxu Xing, Jiachen Zhuo, Maureen Stone, Jerry L. Prince, Georges El Fakhri, Jonghye Woo

    Abstract: Understanding the relationship between tongue motion patterns during speech and their resulting speech acoustic outcomes -- i.e., articulatory-acoustic relation -- is of great importance in assessing speech quality and develo** innovative treatment and rehabilitative strategies. This is especially important when evaluating and detecting abnormal articulatory features in patients with speech-rela… ▽ More

    Submitted 10 February, 2024; originally announced February 2024.

    Comments: SPIE Medical Imaging 2024: Image Processing

  10. arXiv:2402.00375  [pdf, other

    eess.IV cs.CV

    Disentangled Multimodal Brain MR Image Translation via Transformer-based Modality Infuser

    Authors: Jihoon Cho, Xiaofeng Liu, Fangxu Xing, **song Ouyang, Georges El Fakhri, **ah Park, Jonghye Woo

    Abstract: Multimodal Magnetic Resonance (MR) Imaging plays a crucial role in disease diagnosis due to its ability to provide complementary information by analyzing a relationship between multimodal images on the same subject. Acquiring all MR modalities, however, can be expensive, and, during a scanning session, certain MR images may be missed depending on the study protocol. The typical solution would be t… ▽ More

    Submitted 1 February, 2024; originally announced February 2024.

    Comments: 6 pages

  11. arXiv:2401.17571  [pdf, other

    eess.IV cs.CV

    Is Registering Raw Tagged-MR Enough for Strain Estimation in the Era of Deep Learning?

    Authors: Zhangxing Bian, Ahmed Alshareef, Shuwen Wei, Junyu Chen, Yuli Wang, Jonghye Woo, Dzung L. Pham, Jiachen Zhuo, Aaron Carass, Jerry L. Prince

    Abstract: Magnetic Resonance Imaging with tagging (tMRI) has long been utilized for quantifying tissue motion and strain during deformation. However, a phenomenon known as tag fading, a gradual decrease in tag visibility over time, often complicates post-processing. The first contribution of this study is to model tag fading by considering the interplay between $T_1$ relaxation and the repeated application… ▽ More

    Submitted 30 January, 2024; originally announced January 2024.

    Comments: Accepted to SPIE Medical Imaging 2024 (oral)

  12. arXiv:2311.11339  [pdf

    eess.SY

    Assessment of Transmission-level Fault Impacts on 3-phase and 1-phase Distribution IBR Operation

    Authors: Qi Xiao, Jongha Woo, Lidong Song, Bei Xu, David Lubkeman, Ning Lu, Abdul Shafae Mohammed, Johan Enslin, Cara De Coste Chacko, Kat Sico, Steven G. Whisenant

    Abstract: The widespread deployment of inverter-based resources (IBRs) renders distribution systems susceptible to transmission-level faults. This paper presents a comprehensive analysis of the impact of transmission-level faults on 3-phase and 1-phase distribution IBR operation. To evaluate distributed IBR trip** across various phases and locations on a distribution feeder, we conduct simulations of both… ▽ More

    Submitted 1 April, 2024; v1 submitted 19 November, 2023; originally announced November 2023.

  13. arXiv:2311.05802  [pdf, other

    eess.SY

    Generative Modeling of Residuals for Real-Time Risk-Sensitive Safety with Discrete-Time Control Barrier Functions

    Authors: Ryan K. Cosner, Igor Sadalski, Jana K. Woo, Preston Culbertson, Aaron D. Ames

    Abstract: A key source of brittleness for robotic systems is the presence of model uncertainty and external disturbances. Most existing approaches to robust control either seek to bound the worst-case disturbance (which results in conservative behavior), or to learn a deterministic dynamics model (which is unable to capture uncertain dynamics or disturbances). This work proposes a different approach: traini… ▽ More

    Submitted 13 November, 2023; v1 submitted 9 November, 2023; originally announced November 2023.

    Comments: 9 pages, 6 figures, submitted to the 2024 IEEE International Conference on Robotics and Automation (ICRA 2024)

  14. arXiv:2310.15850  [pdf, other

    physics.med-ph cs.AI eess.SP

    Posterior Estimation for Dynamic PET imaging using Conditional Variational Inference

    Authors: Xiaofeng Liu, Thibault Marin, Tiss Amal, Jonghye Woo, Georges El Fakhri, **song Ouyang

    Abstract: This work aims efficiently estimating the posterior distribution of kinetic parameters for dynamic positron emission tomography (PET) imaging given a measurement of time of activity curve. Considering the inherent information loss from parametric imaging to measurement space with the forward kinetic model, the inverse map** is ambiguous. The conventional (but expensive) solution can be the Marko… ▽ More

    Submitted 24 October, 2023; originally announced October 2023.

    Comments: Published on IEEE NSS&MIC

  15. arXiv:2309.14586  [pdf, other

    cs.SD cs.AI cs.CV eess.AS eess.SP

    Speech Audio Synthesis from Tagged MRI and Non-Negative Matrix Factorization via Plastic Transformer

    Authors: Xiaofeng Liu, Fangxu Xing, Maureen Stone, Jiachen Zhuo, Sidney Fels, Jerry L. Prince, Georges El Fakhri, Jonghye Woo

    Abstract: The tongue's intricate 3D structure, comprising localized functional units, plays a crucial role in the production of speech. When measured using tagged MRI, these functional units exhibit cohesive displacements and derived quantities that facilitate the complex process of speech production. Non-negative matrix factorization-based approaches have been shown to estimate the functional units through… ▽ More

    Submitted 25 September, 2023; originally announced September 2023.

    Comments: MICCAI 2023 (Oral presentation)

  16. arXiv:2308.05063  [pdf, other

    cs.CR cs.AR cs.IT eess.SY

    CERMET: Coding for Energy Reduction with Multiple Encryption Techniques -- $It's\ easy\ being\ green$

    Authors: Jongchan Woo, Vipindev Adat Vasudevan, Benjamin Kim, Alejandro Cohen, Rafael G. L. D'Oliveira, Thomas Stahlbuhk, Muriel Médard

    Abstract: This paper presents CERMET, an energy-efficient hardware architecture designed for hardware-constrained cryptosystems. CERMET employs a base cryptosystem in conjunction with network coding to provide both information-theoretic and computational security while reducing energy consumption per bit. This paper introduces the hardware architecture for the system and explores various optimizations to en… ▽ More

    Submitted 9 August, 2023; originally announced August 2023.

  17. arXiv:2308.02949  [pdf, other

    eess.IV cs.CV physics.med-ph

    MomentaMorph: Unsupervised Spatial-Temporal Registration with Momenta, Shooting, and Correction

    Authors: Zhangxing Bian, Shuwen Wei, Yihao Liu, Junyu Chen, Jiachen Zhuo, Fangxu Xing, Jonghye Woo, Aaron Carass, Jerry L. Prince

    Abstract: Tagged magnetic resonance imaging (tMRI) has been employed for decades to measure the motion of tissue undergoing deformation. However, registration-based motion estimation from tMRI is difficult due to the periodic patterns in these images, particularly when the motion is large. With a larger motion the registration approach gets trapped in a local optima, leading to motion estimation errors. We… ▽ More

    Submitted 5 August, 2023; originally announced August 2023.

    Comments: Accepted by MICCAI Workshop 2023: Time-Series Data Analytics and Learning (MTSAIL)

  18. arXiv:2305.14589  [pdf, other

    eess.IV cs.CV cs.LG physics.med-ph

    Attentive Continuous Generative Self-training for Unsupervised Domain Adaptive Medical Image Translation

    Authors: Xiaofeng Liu, Jerry L. Prince, Fangxu Xing, Jiachen Zhuo, Reese Timothy, Maureen Stone, Georges El Fakhri, Jonghye Woo

    Abstract: Self-training is an important class of unsupervised domain adaptation (UDA) approaches that are used to mitigate the problem of domain shift, when applying knowledge learned from a labeled source domain to unlabeled and heterogeneous target domains. While self-training-based UDA has shown considerable promise on discriminative tasks, including classification and segmentation, through reliable pseu… ▽ More

    Submitted 23 May, 2023; originally announced May 2023.

    Comments: Accepted to Medical Image Analysis

  19. arXiv:2305.11310  [pdf, other

    cs.HC cs.LG cs.SD eess.AS

    AMII: Adaptive Multimodal Inter-personal and Intra-personal Model for Adapted Behavior Synthesis

    Authors: Jieyeon Woo, Mireille Fares, Catherine Pelachaud, Catherine Achard

    Abstract: Socially Interactive Agents (SIAs) are physical or virtual embodied agents that display similar behavior as human multimodal behavior. Modeling SIAs' non-verbal behavior, such as speech and facial gestures, has always been a challenging task, given that a SIA can take the role of a speaker or a listener. A SIA must emit appropriate behavior adapted to its own speech, its previous behaviors (intra-… ▽ More

    Submitted 18 May, 2023; originally announced May 2023.

    Comments: 8 pages, 1 figure

    MSC Class: 68T07 ACM Class: I.2.11

  20. arXiv:2303.10057  [pdf, other

    eess.IV cs.LG physics.med-ph

    Posterior Estimation Using Deep Learning: A Simulation Study of Compartmental Modeling in Dynamic PET

    Authors: Xiaofeng Liu, Thibault Marin, Tiss Amal, Jonghye Woo, Georges El Fakhri, **song Ouyang

    Abstract: Background: In medical imaging, images are usually treated as deterministic, while their uncertainties are largely underexplored. Purpose: This work aims at using deep learning to efficiently estimate posterior distributions of imaging parameters, which in turn can be used to derive the most probable parameters as well as their uncertainties. Methods: Our deep learning-based approaches are based o… ▽ More

    Submitted 17 March, 2023; originally announced March 2023.

    Comments: Published in Medical Physics

  21. arXiv:2302.07203  [pdf, other

    eess.IV cs.CV cs.SD eess.AS eess.SP

    Synthesizing audio from tongue motion during speech using tagged MRI via transformer

    Authors: Xiaofeng Liu, Fangxu Xing, Jerry L. Prince, Maureen Stone, Georges El Fakhri, Jonghye Woo

    Abstract: Investigating the relationship between internal tissue point motion of the tongue and oropharyngeal muscle deformation measured from tagged MRI and intelligible speech can aid in advancing speech motor control theories and develo** novel treatment methods for speech related-disorders. However, elucidating the relationship between these two sources of information is challenging, due in part to th… ▽ More

    Submitted 14 February, 2023; originally announced February 2023.

    Comments: SPIE Medical Imaging: Deep Dive Oral

  22. arXiv:2301.08959  [pdf, other

    eess.IV cs.CV

    Successive Subspace Learning for Cardiac Disease Classification with Two-phase Deformation Fields from Cine MRI

    Authors: Xiaofeng Liu, Fangxu Xing, Hanna K. Gaggin, C. -C. Jay Kuo, Georges El Fakhri, Jonghye Woo

    Abstract: Cardiac cine magnetic resonance imaging (MRI) has been used to characterize cardiovascular diseases (CVD), often providing a noninvasive phenoty** tool.~While recently flourished deep learning based approaches using cine MRI yield accurate characterization results, the performance is often degraded by small training samples. In addition, many deep learning models are deemed a ``black box," for w… ▽ More

    Submitted 21 January, 2023; originally announced January 2023.

    Comments: ISBI 2023

  23. arXiv:2301.07234  [pdf, other

    eess.IV cs.CV

    DRIMET: Deep Registration for 3D Incompressible Motion Estimation in Tagged-MRI with Application to the Tongue

    Authors: Zhangxing Bian, Fangxu Xing, **glun Yu, Muhan Shao, Yihao Liu, Aaron Carass, Jiachen Zhuo, Jonghye Woo, Jerry L. Prince

    Abstract: Tagged magnetic resonance imaging~(MRI) has been used for decades to observe and quantify the detailed motion of deforming tissue. However, this technique faces several challenges such as tag fading, large motion, long computation times, and difficulties in obtaining diffeomorphic incompressible flow fields. To address these issues, this paper presents a novel unsupervised phase-based 3D motion es… ▽ More

    Submitted 30 April, 2023; v1 submitted 17 January, 2023; originally announced January 2023.

    Comments: Accepted to MIDL 2023 (oral)

  24. arXiv:2211.15075  [pdf, other

    eess.AS cs.SD

    Inter-KD: Intermediate Knowledge Distillation for CTC-Based Automatic Speech Recognition

    Authors: Ji Won Yoon, Beom Jun Woo, Sunghwan Ahn, Hyeonseung Lee, Nam Soo Kim

    Abstract: Recently, the advance in deep learning has brought a considerable improvement in the end-to-end speech recognition field, simplifying the traditional pipeline while producing promising results. Among the end-to-end models, the connectionist temporal classification (CTC)-based model has attracted research interest due to its non-autoregressive nature. However, such CTC models require a heavy comput… ▽ More

    Submitted 28 November, 2022; originally announced November 2022.

    Comments: Accepted by 2022 SLT Workshop

  25. arXiv:2209.07910  [pdf, other

    cs.CV cs.AI cs.LG eess.IV

    Memory Consistent Unsupervised Off-the-Shelf Model Adaptation for Source-Relaxed Medical Image Segmentation

    Authors: Xiaofeng Liu, Fangxu Xing, Georges El Fakhri, Jonghye Woo

    Abstract: Unsupervised domain adaptation (UDA) has been a vital protocol for migrating information learned from a labeled source domain to facilitate the implementation in an unlabeled heterogeneous target domain. Although UDA is typically jointly trained on data from both domains, accessing the labeled source domain data is often restricted, due to concerns over patient data privacy or intellectual propert… ▽ More

    Submitted 16 September, 2022; originally announced September 2022.

    Comments: Published in Medical Image Analysis (extension of MICCAI paper)

  26. arXiv:2208.07769  [pdf, other

    cs.CV cs.AI cs.LG eess.IV eess.SP

    Unsupervised Domain Adaptation for Segmentation with Black-box Source Model

    Authors: Xiaofeng Liu, Chaehwa Yoo, Fangxu Xing, C. -C. Jay Kuo, Georges El Fakhri, Jonghye Woo

    Abstract: Unsupervised domain adaptation (UDA) has been widely used to transfer knowledge from a labeled source domain to an unlabeled target domain to counter the difficulty of labeling in a new domain. The training of conventional solutions usually relies on the existence of both source and target domain data. However, privacy of the large-scale and well-labeled data in the source domain and trained model… ▽ More

    Submitted 16 August, 2022; originally announced August 2022.

    Comments: SPIE Medical Imaging 2022: Image Processing

  27. arXiv:2208.07754  [pdf, other

    cs.CV cs.AI cs.LG eess.IV eess.SP

    Subtype-Aware Dynamic Unsupervised Domain Adaptation

    Authors: Xiaofeng Liu, Fangxu Xing, Jia You, Jun Lu, C. -C. Jay Kuo, Georges El Fakhri, Jonghye Woo

    Abstract: Unsupervised domain adaptation (UDA) has been successfully applied to transfer knowledge from a labeled source domain to target domains without their labels. Recently introduced transferable prototypical networks (TPN) further addresses class-wise conditional alignment. In TPN, while the closeness of class centers between source and target domains is explicitly enforced in a latent space, the unde… ▽ More

    Submitted 16 August, 2022; originally announced August 2022.

    Comments: IEEE Transactions on Neural Networks and Learning Systems (TNNLS)

  28. arXiv:2208.07422  [pdf, other

    cs.CV cs.AI cs.LG eess.IV eess.SP

    Deep Unsupervised Domain Adaptation: A Review of Recent Advances and Perspectives

    Authors: Xiaofeng Liu, Chaehwa Yoo, Fangxu Xing, Hye** Oh, Georges El Fakhri, Je-Won Kang, Jonghye Woo

    Abstract: Deep learning has become the method of choice to tackle real-world problems in different domains, partly because of its ability to learn from data and achieve impressive performance on a wide range of applications. However, its success usually relies on two assumptions: (i) vast troves of labeled datasets are required for accurate model fitting, and (ii) training and testing data are independent a… ▽ More

    Submitted 15 August, 2022; originally announced August 2022.

    Comments: APSIPA Transactions on Signal and Information Processing

  29. arXiv:2206.02284  [pdf, other

    cs.SD cs.CV cs.MM eess.AS

    Tagged-MRI Sequence to Audio Synthesis via Self Residual Attention Guided Heterogeneous Translator

    Authors: Xiaofeng Liu, Fangxu Xing, Jerry L. Prince, Jiachen Zhuo, Maureen Stone, Georges El Fakhri, Jonghye Woo

    Abstract: Understanding the underlying relationship between tongue and oropharyngeal muscle deformation seen in tagged-MRI and intelligible speech plays an important role in advancing speech motor control theories and treatment of speech related-disorders. Because of their heterogeneous representations, however, direct map** between the two modalities -- i.e., two-dimensional (mid-sagittal slice) plus tim… ▽ More

    Submitted 25 September, 2022; v1 submitted 5 June, 2022; originally announced June 2022.

    Comments: MICCAI 2022 (early accept, Oral Presentation ~3%)

  30. arXiv:2204.06328  [pdf, other

    cs.CL cs.SD eess.AS

    HuBERT-EE: Early Exiting HuBERT for Efficient Speech Recognition

    Authors: Ji Won Yoon, Beom Jun Woo, Nam Soo Kim

    Abstract: Pre-training with self-supervised models, such as Hidden-unit BERT (HuBERT) and wav2vec 2.0, has brought significant improvements in automatic speech recognition (ASR). However, these models usually require an expensive computational cost to achieve outstanding performance, slowing down the inference speed. To improve the model efficiency, we introduce an early exit scheme for ASR, namely HuBERT-E… ▽ More

    Submitted 19 June, 2024; v1 submitted 13 April, 2022; originally announced April 2022.

    Comments: Accepted by INTERSPEECH 2024

  31. arXiv:2202.12474  [pdf, other

    eess.IV cs.CV cs.LG

    Structure-aware Unsupervised Tagged-to-Cine MRI Synthesis with Self Disentanglement

    Authors: Xiaofeng Liu, Fangxu Xing, Jerry L. Prince, Maureen Stone, Georges El Fakhri, Jonghye Woo

    Abstract: Cycle reconstruction regularized adversarial training -- e.g., CycleGAN, DiscoGAN, and DualGAN -- has been widely used for image style transfer with unpaired training data. Several recent works, however, have shown that local distortions are frequent, and structural consistency cannot be guaranteed. Targeting this issue, prior works usually relied on additional segmentation or consistent feature e… ▽ More

    Submitted 24 February, 2022; originally announced February 2022.

    Comments: SPIE Medical Imaging: Image Processing (Oral presentation)

  32. arXiv:2107.10718  [pdf, other

    eess.IV cs.CV cs.LG

    Segmentation of Cardiac Structures via Successive Subspace Learning with Saab Transform from Cine MRI

    Authors: Xiaofeng Liu, Fangxu Xing, Hanna K. Gaggin, Weichung Wang, C. -C. Jay Kuo, Georges El Fakhri, Jonghye Woo

    Abstract: Assessment of cardiovascular disease (CVD) with cine magnetic resonance imaging (MRI) has been used to non-invasively evaluate detailed cardiac structure and function. Accurate segmentation of cardiac structures from cine MRI is a crucial step for early diagnosis and prognosis of CVD, and has been greatly improved with convolutional neural networks (CNN). There, however, are a number of limitation… ▽ More

    Submitted 22 July, 2021; originally announced July 2021.

    Comments: 43rd Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC 2021)

  33. arXiv:2103.05109  [pdf, other

    cs.CV cs.LG eess.IV

    Highly Efficient Representation and Active Learning Framework and Its Application to Imbalanced Medical Image Classification

    Authors: Heng Hao, Hankyu Moon, Sima Didari, Jae Oh Woo, Patrick Bangert

    Abstract: We propose a highly data-efficient active learning framework for image classification. Our novel framework combines: (1) unsupervised representation learning of a Convolutional Neural Network and (2) the Gaussian Process (GP) method, in sequence to achieve highly data and label efficient classifications. Moreover, both elements are less sensitive to the prevalent and challenging class imbalance is… ▽ More

    Submitted 20 June, 2022; v1 submitted 24 February, 2021; originally announced March 2021.

    Comments: Published in NeurIPs Data-Centric AI workshop

  34. arXiv:2101.06775  [pdf, other

    eess.IV cs.CV

    Symmetric-Constrained Irregular Structure Inpainting for Brain MRI Registration with Tumor Pathology

    Authors: Xiaofeng Liu, Fangxu Xing, Chao Yang, C. -C. Jay Kuo, Georges ElFakhri, Jonghye Woo

    Abstract: Deformable registration of magnetic resonance images between patients with brain tumors and healthy subjects has been an important tool to specify tumor geometry through location alignment and facilitate pathological analysis. Since tumor region does not match with any ordinary brain tissue, it has been difficult to deformably register a patients brain to a normal one. Many patient images are asso… ▽ More

    Submitted 17 January, 2021; originally announced January 2021.

    Comments: Published at MICCAI Brainles 2020

  35. Dual-cycle Constrained Bijective VAE-GAN For Tagged-to-Cine Magnetic Resonance Image Synthesis

    Authors: Xiaofeng Liu, Fangxu Xing, Jerry L. Prince, Aaron Carass, Maureen Stone, Georges El Fakhri, Jonghye Woo

    Abstract: Tagged magnetic resonance imaging (MRI) is a widely used imaging technique for measuring tissue deformation in moving organs. Due to tagged MRI's intrinsic low anatomical resolution, another matching set of cine MRI with higher resolution is sometimes acquired in the same scanning session to facilitate tissue segmentation, thus adding extra time and cost. To mitigate this, in this work, we propose… ▽ More

    Submitted 13 January, 2021; originally announced January 2021.

    Comments: Accepted to IEEE International Symposium on Biomedical Imaging (ISBI) 2021

    Journal ref: 2021 IEEE 18th International Symposium on Biomedical Imaging (ISBI)

  36. arXiv:2101.05434  [pdf, other

    eess.IV cs.CV

    A Unified Conditional Disentanglement Framework for Multimodal Brain MR Image Translation

    Authors: Xiaofeng Liu, Fangxu Xing, Georges El Fakhri, Jonghye Woo

    Abstract: Multimodal MRI provides complementary and clinically relevant information to probe tissue condition and to characterize various diseases. However, it is often difficult to acquire sufficiently many modalities from the same subject due to limitations in study plans, while quantitative analysis is still demanded. In this work, we propose a unified conditional disentanglement framework to synthesize… ▽ More

    Submitted 5 June, 2021; v1 submitted 13 January, 2021; originally announced January 2021.

    Comments: Published in IEEE International Symposium on Biomedical Imaging (ISBI) 2021 for Oral presentation

  37. arXiv:2101.05131  [pdf, other

    eess.IV cs.CV

    VoxelHop: Successive Subspace Learning for ALS Disease Classification Using Structural MRI

    Authors: Xiaofeng Liu, Fangxu Xing, Chao Yang, C. -C. Jay Kuo, Suma Babu, Georges El Fakhri, Thomas Jenkins, Jonghye Woo

    Abstract: Deep learning has great potential for accurate detection and classification of diseases with medical imaging data, but the performance is often limited by the number of training datasets and memory requirements. In addition, many deep learning models are considered a "black-box," thereby often limiting their adoption in clinical applications. To address this, we present a successive subspace learn… ▽ More

    Submitted 13 January, 2021; originally announced January 2021.

  38. arXiv:2008.12048  [pdf, ps, other

    eess.AS

    End-to-end Music-mixed Speech Recognition

    Authors: Jeongwoo Woo, Masato Mimura, Kazuyoshi Yoshii, Tatsuya Kawahara

    Abstract: Automatic speech recognition (ASR) in multimedia content is one of the promising applications, but speech data in this kind of content are frequently mixed with background music, which is harmful for the performance of ASR. In this study, we propose a method for improving ASR with background music based on time-domain source separation. We utilize Conv-TasNet as a separation network, which has ach… ▽ More

    Submitted 27 August, 2020; originally announced August 2020.

    Comments: Submitted to APSIPA 2020

  39. arXiv:2007.04865  [pdf, other

    cs.CV eess.IV

    A Deep Joint Sparse Non-negative Matrix Factorization Framework for Identifying the Common and Subject-specific Functional Units of Tongue Motion During Speech

    Authors: Jonghye Woo, Fangxu Xing, Jerry L. Prince, Maureen Stone, Arnold Gomez, Timothy G. Reese, Van J. Wedeen, Georges El Fakhri

    Abstract: Intelligible speech is produced by creating varying internal local muscle grou**s -- i.e., functional units -- that are generated in a systematic and coordinated manner. There are two major challenges in characterizing and analyzing functional units.~First, due to the complex and convoluted nature of tongue structure and function, it is of great importance to develop a method that can accurately… ▽ More

    Submitted 6 June, 2021; v1 submitted 9 July, 2020; originally announced July 2020.

    Comments: Accepted by Medical Image Analysis