Skip to main content

Showing 1–27 of 27 results for author: Oh, K

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.14308  [pdf, other

    eess.IV cs.CV cs.LG

    FIESTA: Fourier-Based Semantic Augmentation with Uncertainty Guidance for Enhanced Domain Generalizability in Medical Image Segmentation

    Authors: Kwanseok Oh, Eun** Jeon, Da-Woon Heo, Yooseung Shin, Heung-Il Suk

    Abstract: Single-source domain generalization (SDG) in medical image segmentation (MIS) aims to generalize a model using data from only one source domain to segment data from an unseen target domain. Despite substantial advances in SDG with data augmentation, existing methods often fail to fully consider the details and uncertain areas prevalent in MIS, leading to mis-segmentation. This paper proposes a Fou… ▽ More

    Submitted 20 June, 2024; originally announced June 2024.

    Comments: 40 pages, 7 figures, 5 tables

  2. arXiv:2403.13941  [pdf, ps, other

    cs.RO eess.SY

    Sensory Glove-Based Surgical Robot User Interface

    Authors: Leonardo Borgioli, Ki-Hwan Oh, Alberto Mangano, Alvaro Ducas, Luciano Ambrosini, Federico Pinto, Paula A Lopez, Jessica Cassiani, Milos Zefran, Liaohai Chen, Pier Cristoforo Giulianotti

    Abstract: Robotic surgery has reached a high level of maturity and has become an integral part of standard surgical care. However, existing surgeon consoles are bulky and take up valuable space in the operating room, present challenges for surgical team coordination, and their proprietary nature makes it difficult to take advantage of recent technological advances, especially in virtual and augmented realit… ▽ More

    Submitted 20 March, 2024; originally announced March 2024.

    Comments: 6 pages, 5 figures, 7 tables, submitted to International Conference on Intelligent Robots and Systems (IROS)2024

  3. arXiv:2402.19237  [pdf, ps, other

    cs.CV cs.AI

    Context-based Interpretable Spatio-Temporal Graph Convolutional Network for Human Motion Forecasting

    Authors: Edgar Medina, Leyong Loh, Namrata Gurung, Kyung Hun Oh, Niels Heller

    Abstract: Human motion prediction is still an open problem extremely important for autonomous driving and safety applications. Due to the complex spatiotemporal relation of motion sequences, this remains a challenging problem not only for movement prediction but also to perform a preliminary interpretation of the joint connections. In this work, we present a Context-based Interpretable Spatio-Temporal Graph… ▽ More

    Submitted 21 February, 2024; originally announced February 2024.

    Comments: 10 pages, 6 figures

  4. arXiv:2402.09025  [pdf, other

    cs.CL cs.LG

    SLEB: Streamlining LLMs through Redundancy Verification and Elimination of Transformer Blocks

    Authors: Jiwon Song, Kyungseok Oh, Taesu Kim, Hyungjun Kim, Yulhwa Kim, Jae-Joon Kim

    Abstract: Large language models (LLMs) have proven to be highly effective across various natural language processing tasks. However, their large number of parameters poses significant challenges for practical deployment. Pruning, a technique aimed at reducing the size and complexity of LLMs, offers a potential solution by removing redundant components from the network. Despite the promise of pruning, existi… ▽ More

    Submitted 11 June, 2024; v1 submitted 14 February, 2024; originally announced February 2024.

  5. arXiv:2402.08409  [pdf, other

    cs.CV cs.AI

    Transferring Ultrahigh-Field Representations for Intensity-Guided Brain Segmentation of Low-Field Magnetic Resonance Imaging

    Authors: Kwanseok Oh, Jieun Lee, Da-Woon Heo, Dinggang Shen, Heung-Il Suk

    Abstract: Ultrahigh-field (UHF) magnetic resonance imaging (MRI), i.e., 7T MRI, provides superior anatomical details of internal brain structures owing to its enhanced signal-to-noise ratio and susceptibility-induced contrast. However, the widespread use of 7T MRI is limited by its high cost and lower accessibility compared to low-field (LF) MRI. This study proposes a deep-learning framework that systematic… ▽ More

    Submitted 13 February, 2024; originally announced February 2024.

    Comments: 32 pages, 9 figures, and 5 tables

  6. arXiv:2312.01183  [pdf, other

    cs.RO

    Comprehensive Robotic Cholecystectomy Dataset (CRCD): Integrating Kinematics, Pedal Signals, and Endoscopic Videos

    Authors: Ki-Hwan Oh, Leonardo Borgioli, Alberto Mangano, Valentina Valle, Marco Di Pangrazio, Francesco Toti, Gioia Pozza, Luciano Ambrosini, Alvaro Ducas, Milos Zefran, Liaohai Chen, Pier Cristoforo Giulianotti

    Abstract: In recent years, the potential applications of machine learning to Minimally Invasive Surgery (MIS) have spurred interest in data sets that can be used to develop data-driven tools. This paper introduces a novel dataset recorded during ex vivo pseudo-cholecystectomy procedures on pig livers, utilizing the da Vinci Research Kit (dVRK). Unlike current datasets, ours bridges a critical gap by offerin… ▽ More

    Submitted 6 April, 2024; v1 submitted 2 December, 2023; originally announced December 2023.

    Comments: 6 pages, 8 figures, 5 tables. Accepted for presentation at the 2024 International Symposium on Medical Robotics

  7. arXiv:2311.04250  [pdf, other

    cs.AI cs.CL cs.LG

    Unifying Structure and Language Semantic for Efficient Contrastive Knowledge Graph Completion with Structured Entity Anchors

    Authors: Sang-Hyun Je, Wontae Choi, Kwang** Oh

    Abstract: The goal of knowledge graph completion (KGC) is to predict missing links in a KG using trained facts that are already known. In recent, pre-trained language model (PLM) based methods that utilize both textual and structural information are emerging, but their performances lag behind state-of-the-art (SOTA) structure-based methods or some methods lose their inductive inference capabilities in the p… ▽ More

    Submitted 7 November, 2023; originally announced November 2023.

  8. arXiv:2310.09669  [pdf, other

    cs.RO

    A Framework For Automated Dissection Along Tissue Boundary

    Authors: Ki-Hwan Oh, Leonardo Borgioli, Milos Zefran, Liaohai Chen, Pier Cristoforo Giulianotti

    Abstract: Robotic surgery promises enhanced precision and adaptability over traditional surgical methods. It also offers the possibility of automating surgical interventions, resulting in reduced stress on the surgeon, better surgical outcomes, and lower costs. Cholecystectomy, the removal of the gallbladder, serves as an ideal model procedure for automation due to its distinct and well-contrasted anatomica… ▽ More

    Submitted 27 February, 2024; v1 submitted 14 October, 2023; originally announced October 2023.

    Comments: 7 pages, 7 figures, 7 tables, submitted to 2024 International Conference on Biomedical Robotics and Biomechatronics

  9. arXiv:2310.08598  [pdf, other

    eess.IV cs.AI cs.CV

    Domain Generalization for Medical Image Analysis: A Survey

    Authors: Jee Seok Yoon, Kwanseok Oh, Yooseung Shin, Maciej A. Mazurowski, Heung-Il Suk

    Abstract: Medical image analysis (MedIA) has become an essential tool in medicine and healthcare, aiding in disease diagnosis, prognosis, and treatment planning, and recent successes in deep learning (DL) have made significant contributions to its advances. However, deploying DL models for MedIA in real-world situations remains challenging due to their failure to generalize across the distributional gap bet… ▽ More

    Submitted 15 February, 2024; v1 submitted 5 October, 2023; originally announced October 2023.

  10. arXiv:2310.03457  [pdf, other

    cs.AI eess.IV

    A Quantitatively Interpretable Model for Alzheimer's Disease Prediction Using Deep Counterfactuals

    Authors: Kwanseok Oh, Da-Woon Heo, Ahmad Wisnu Mulyadi, Wonsik Jung, Eunsong Kang, Kun Ho Lee, Heung-Il Suk

    Abstract: Deep learning (DL) for predicting Alzheimer's disease (AD) has provided timely intervention in disease progression yet still demands attentive interpretability to explain how their DL models make definitive decisions. Recently, counterfactual reasoning has gained increasing attention in medical research because of its ability to provide a refined visual explanatory map. However, such visual explan… ▽ More

    Submitted 5 October, 2023; originally announced October 2023.

    Comments: 15 pages, 5 figures, 4 tables

  11. Recognizing Intent in Collaborative Manipulation

    Authors: Zhanibek Rysbek, Ki Hwan Oh, Milos Zefran

    Abstract: Collaborative manipulation is inherently multimodal, with haptic communication playing a central role. When performed by humans, it involves back-and-forth force exchanges between the participants through which they resolve possible conflicts and determine their roles. Much of the existing work on collaborative human-robot manipulation assumes that the robot follows the human. But for a robot to m… ▽ More

    Submitted 17 August, 2023; originally announced August 2023.

  12. Smartpick: Workload Prediction for Serverless-enabled Scalable Data Analytics Systems

    Authors: Anshuman Das Mohapatra, Kwangsung Oh

    Abstract: Many data analytic systems have adopted a newly emerging compute resource, serverless (SL), to handle data analytics queries in a timely and cost-efficient manner, i.e., serverless data analytics. While these systems can start processing queries quickly thanks to the agility and scalability of SL, they may encounter performance- and cost-bottlenecks based on workloads due to SL's worse performance… ▽ More

    Submitted 25 July, 2023; originally announced July 2023.

    Comments: This paper is accepted for publication at the 24th ACM/IFIP International Middleware Conference (Middleware '23)

  13. arXiv:2307.10204  [pdf, ps, other

    cs.IR cs.LG stat.ML

    An IPW-based Unbiased Ranking Metric in Two-sided Markets

    Authors: Keisho Oh, Naoki Nishimura, Minje Sung, Ken Kobayashi, Kazuhide Nakata

    Abstract: In modern recommendation systems, unbiased learning-to-rank (LTR) is crucial for prioritizing items from biased implicit user feedback, such as click data. Several techniques, such as Inverse Propensity Weighting (IPW), have been proposed for single-sided markets. However, less attention has been paid to two-sided markets, such as job platforms or dating services, where successful conversions requ… ▽ More

    Submitted 13 July, 2023; originally announced July 2023.

  14. arXiv:2304.12288  [pdf, other

    cs.RO

    Robots Taking Initiative in Collaborative Object Manipulation: Lessons from Physical Human-Human Interaction

    Authors: Zhanibek Rysbek, Ki Hwan Oh, Afagh Mehri Shervedani, Timotej Klemencic, Milos Zefran, Barbara Di Eugenio

    Abstract: Physical Human-Human Interaction (pHHI) involves the use of multiple sensory modalities. Studies of communication through spoken utterances and gestures are well established, but communication through force signals is not well understood. In this paper, we focus on investigating the mechanisms employed by humans during the negotiation through force signals, and how the robot can communicate task g… ▽ More

    Submitted 29 July, 2023; v1 submitted 24 April, 2023; originally announced April 2023.

  15. arXiv:2304.08878  [pdf, other

    cs.CV

    Deep Collective Knowledge Distillation

    Authors: Jihyeon Seo, Kyusam Oh, Chanho Min, Yongkeun Yun, Sungwoo Cho

    Abstract: Many existing studies on knowledge distillation have focused on methods in which a student model mimics a teacher model well. Simply imitating the teacher's knowledge, however, is not sufficient for the student to surpass that of the teacher. We explore a method to harness the knowledge of other students to complement the knowledge of the teacher. We propose deep collective knowledge distill… ▽ More

    Submitted 18 April, 2023; originally announced April 2023.

  16. arXiv:2212.10425  [pdf, other

    cs.RO

    Evaluating Multimodal Interaction of Robots Assisting Older Adults

    Authors: Afagh Mehri Shervedani, Ki-Hwan Oh, Bahareh Abbasi, Natawut Monaikul, Zhanibek Rysbek, Barbara Di Eugenio, Milos Zefran

    Abstract: We outline our work on evaluating robots that assist older adults by engaging with them through multiple modalities that include physical interaction. Our thesis is that to increase the effectiveness of assistive robots: 1) robots need to understand and effect multimodal actions, 2) robots should not only react to the human, they need to take the initiative and lead the task when it is necessary.… ▽ More

    Submitted 20 December, 2022; originally announced December 2022.

  17. arXiv:2207.13223  [pdf, other

    cs.LG eess.IV

    XADLiME: eXplainable Alzheimer's Disease Likelihood Map Estimation via Clinically-guided Prototype Learning

    Authors: Ahmad Wisnu Mulyadi, Wonsik Jung, Kwanseok Oh, Jee Seok Yoon, Heung-Il Suk

    Abstract: Diagnosing Alzheimer's disease (AD) involves a deliberate diagnostic process owing to its innate traits of irreversibility with subtle and gradual progression. These characteristics make AD biomarker identification from structural brain imaging (e.g., structural MRI) scans quite challenging. Furthermore, there is a high possibility of getting entangled with normal aging. We propose a novel deep-le… ▽ More

    Submitted 26 July, 2022; originally announced July 2022.

  18. e-CLIP: Large-Scale Vision-Language Representation Learning in E-commerce

    Authors: Wonyoung Shin, Jonghun Park, Taekang Woo, Yongwoo Cho, Kwang** Oh, Hwanjun Song

    Abstract: Understanding vision and language representations of product content is vital for search and recommendation applications in e-commerce. As a backbone for online shop** platforms and inspired by the recent success in representation learning research, we propose a contrastive learning framework that aligns language and visual models using unlabeled raw product text and images. We present technique… ▽ More

    Submitted 22 August, 2022; v1 submitted 1 July, 2022; originally announced July 2022.

    Comments: Accepted to CIKM 2022

  19. arXiv:2108.09451  [pdf, other

    cs.CV cs.AI

    Learn-Explain-Reinforce: Counterfactual Reasoning and Its Guidance to Reinforce an Alzheimer's Disease Diagnosis Model

    Authors: Kwanseok Oh, Jee Seok Yoon, Heung-Il Suk

    Abstract: Existing studies on disease diagnostic models focus either on diagnostic model learning for performance improvement or on the visual explanation of a trained diagnostic model. We propose a novel learn-explain-reinforce (LEAR) framework that unifies diagnostic model learning, visual explanation generation (explanation unit), and trained diagnostic model reinforcement (reinforcement unit) guided by… ▽ More

    Submitted 21 August, 2021; originally announced August 2021.

    Comments: 14 pages, 9 figures

  20. arXiv:2106.11756  [pdf, other

    cs.SE cs.AI cs.CV

    Trinity: A No-Code AI platform for complex spatial datasets

    Authors: C. V. Krishnakumar Iyer, Feili Hou, Henry Wang, Yonghong Wang, Kay Oh, Swetava Ganguli, Vipul Pandey

    Abstract: We present a no-code Artificial Intelligence (AI) platform called Trinity with the main design goal of enabling both machine learning researchers and non-technical geospatial domain experts to experiment with domain-specific signals and datasets for solving a variety of complex problems on their own. This versatility to solve diverse problems is achieved by transforming complex Spatio-temporal dat… ▽ More

    Submitted 1 July, 2021; v1 submitted 21 June, 2021; originally announced June 2021.

    Comments: 12 pages

  21. DUET: Detection Utilizing Enhancement for Text in Scanned or Captured Documents

    Authors: Eun-Soo Jung, HyeongGwan Son, Kyusam Oh, Yongkeun Yun, Soonhwan Kwon, Min Soo Kim

    Abstract: We present a novel deep neural model for text detection in document images. For robust text detection in noisy scanned documents, the advantages of multi-task learning are adopted by adding an auxiliary task of text enhancement. Namely, our proposed model is designed to perform noise reduction and text region enhancement as well as text detection. Moreover, we enrich the training data for the mode… ▽ More

    Submitted 10 June, 2021; originally announced June 2021.

    Journal ref: 2020 25th International Conference on Pattern Recognition (ICPR)

  22. arXiv:2105.03386  [pdf, other

    cs.DM cs.CG

    Topology and Routing Problems: The Circular Frame

    Authors: Rak-Kyeong Seong, Chanho Min, Sang-Hoon Han, Jaeho Yang, Seungwoo Nam, Kyusam Oh

    Abstract: In this work, we solve the problem of finding non-intersecting paths between points on a plane with a new approach by borrowing ideas from geometric topology, in particular, from the study of polygonal schema in mathematics. We use a topological transformation on the 2-dimensional planar routing environment that simplifies the routing problem into a problem of connecting points on a circle with st… ▽ More

    Submitted 7 May, 2021; originally announced May 2021.

    Comments: 15 pages, 10 figures

  23. arXiv:2011.12429  [pdf

    eess.IV cs.CV cs.LG

    Fully Automated Mitral Inflow Doppler Analysis Using Deep Learning

    Authors: Mohamed Y. Elwazir, Zeynettin Akkus, Didem Oguz, Jae K. Oh

    Abstract: Echocardiography (echo) is an indispensable tool in a cardiologist's diagnostic armamentarium. To date, almost all echocardiographic parameters require time-consuming manual labeling and measurements by an experienced echocardiographer and exhibit significant variability, owing to the noisy and artifact-laden nature of echo images. For example, mitral inflow (MI) Doppler is used to assess left ven… ▽ More

    Submitted 24 November, 2020; originally announced November 2020.

    Journal ref: IEEE BIBE 2020 Proceedings

  24. arXiv:2011.10381  [pdf, other

    cs.CV cs.AI cs.LG

    Born Identity Network: Multi-way Counterfactual Map Generation to Explain a Classifier's Decision

    Authors: Kwanseok Oh, Jee Seok Yoon, Heung-Il Suk

    Abstract: There exists an apparent negative correlation between performance and interpretability of deep learning models. In an effort to reduce this negative correlation, we propose a Born Identity Network (BIN), which is a post-hoc approach for producing multi-way counterfactual maps. A counterfactual map transforms an input sample to be conditioned and classified as a target label, which is similar to ho… ▽ More

    Submitted 8 April, 2021; v1 submitted 20 November, 2020; originally announced November 2020.

    Comments: 17 pages, 10 figures

  25. arXiv:2011.06769  [pdf

    cs.LG nlin.CD

    Toward the Fully Physics-Informed Echo State Network -- an ODE Approximator Based on Recurrent Artificial Neurons

    Authors: Dong Keun Oh

    Abstract: Inspired by recent theoretical arguments, physics-informed echo state network (ESN) is discussed on the attempt to train a reservoir model absolutely in physics-informed manner. As the plainest work on such a purpose, an ODE (ordinary differential equation) approximator is designed to replicate the solution in sequence with respect to the recurrent evaluations. On the principal invariance of diffe… ▽ More

    Submitted 13 November, 2020; originally announced November 2020.

    Comments: 30 pages, 12 figures, research paper

    MSC Class: 68T27; 65L99 ACM Class: I.2.8; J.2

  26. Regional Multi-scale Approach for Visually Pleasing Explanations of Deep Neural Networks

    Authors: Dasom Seo, Kanghan Oh, Il-Seok Oh

    Abstract: Recently, many methods to interpret and visualize deep neural network predictions have been proposed and significant progress has been made. However, a more class-discriminative and visually pleasing explanation is required. Thus, this paper proposes a region-based approach that estimates feature importance in terms of appropriately segmented regions. By fusing the saliency maps generated from mul… ▽ More

    Submitted 1 August, 2018; v1 submitted 31 July, 2018; originally announced July 2018.

    Comments: 9 pages, 5 figures, submitted on NIPS 2018

  27. arXiv:1501.01725  [pdf

    cs.IT

    Load-Modulated Single-RF MIMO Transmission for Spatially Multiplexed QAM Signals

    Authors: Seung-Eun Hong, Kyoung-Sub Oh

    Abstract: Today, MIMO has become an indispensable scheme for providing significant spectral efficiency in wireless communication and for future wireless system, recently, it goes to two extremes: massive MIMO and single-RF MIMO. This paper, which is put in the latter, utilizes load-modulated arrays with only reactance loads for single-RF transmission of spatially multiplexed QAM signals. To alleviate the ne… ▽ More

    Submitted 7 January, 2015; originally announced January 2015.

    Comments: 5 pages with 2-column format