
Showing 1–50 of 147 results for author: Bak, S

  1. arXiv:2406.19634  [pdf, other

    cs.RO

    CLOi-Mapper: Consistent, Lightweight, Robust, and Incremental Mapper With Embedded Systems for Commercial Robot Services

    Authors: DongKi Noh, Hyungtae Lim, Gyuho Eoh, Duckyu Choi, Jeongsik Choi, Hyunjun Lim, SeungMin Baek, Hyun Myung

    Abstract: In commercial autonomous service robots with several form factors, simultaneous localization and mapping (SLAM) is an essential technology for providing proper services such as cleaning and guidance. Such robots require SLAM algorithms suitable for specific applications and environments. Hence, several SLAM frameworks have been proposed to address various requirements in the past decade. However,… ▽ More

    Submitted 27 June, 2024; originally announced June 2024.

    Journal ref: IEEE Robotics and Automation Letters, 2024

  2. arXiv:2406.14277  [pdf, other

    cs.CL cs.AI

    Augmenting Query and Passage for Retrieval-Augmented Generation using LLMs for Open-Domain Question Answering

    Authors: Minsang Kim, Cheoneum Park, Seungjun Baek

    Abstract: Retrieval-augmented generation (RAG) has received much attention for Open-domain question-answering (ODQA) tasks as a means to compensate for the parametric knowledge of large language models (LLMs). While previous approaches focused on processing retrieved passages to remove irrelevant context, they still rely heavily on the quality of retrieved passages which can degrade if the question is ambig… ▽ More

    Submitted 20 June, 2024; originally announced June 2024.

  3. arXiv:2406.14124  [pdf, other

    cs.AI cs.LG

    Measuring Sample Importance in Data Pruning for Training LLMs from a Data Compression Perspective

    Authors: Minsang Kim, Seungjun Baek

    Abstract: Compute-efficient training of large language models (LLMs) has become an important research problem. In this work, we consider data pruning as a method of data-efficient training of LLMs, where we take a data compression view on data pruning. We argue that the amount of information of a sample, or the achievable compression on its description length, represents its sample importance. The key idea… ▽ More

    Submitted 20 June, 2024; v1 submitted 20 June, 2024; originally announced June 2024.

  4. arXiv:2406.03461  [pdf, other

    cs.CV eess.IV

    Polarization Wavefront Lidar: Learning Large Scene Reconstruction from Polarized Wavefronts

    Authors: Dominik Scheuble, Chenyang Lei, Seung-Hwan Baek, Mario Bijelic, Felix Heide

    Abstract: Lidar has become a cornerstone sensing modality for 3D vision, especially for large outdoor scenarios and autonomous driving. Conventional lidar sensors are capable of providing centimeter-accurate distance information by emitting laser pulses into a scene and measuring the time-of-flight (ToF) of the reflection. However, the polarization of the received light that depends on the surface orientati… ▽ More

    Submitted 11 June, 2024; v1 submitted 5 June, 2024; originally announced June 2024.

    Comments: Accepted at CVPR 2024; Project Website: https://light.princeton.edu/publication/pollidar

  5. arXiv:2406.00157  [pdf, other

    eess.SY cs.AI

    Verification of Neural Network Control Systems in Continuous Time

    Authors: Ali ArjomandBigdeli, Andrew Mata, Stanley Bak

    Abstract: Neural network controllers are currently being proposed for use in many safety-critical tasks. Most analysis methods for neural network control systems assume a fixed control period. In control theory, higher frequency usually improves performance. However, for current analysis methods, increasing the frequency complicates verification. In the limit, when actuation is performed continuously, no ex… ▽ More

    Submitted 31 May, 2024; originally announced June 2024.

    Comments: 17 pages, 7 figures, Proceedings of the 7th International Symposium on AI Verification (SAIV)

  6. arXiv:2405.18554  [pdf, other

    cs.LG cs.RO eess.SY

    Scalable Surrogate Verification of Image-based Neural Network Control Systems using Composition and Unrolling

    Authors: Feiyang Cai, Chuchu Fan, Stanley Bak

    Abstract: Verifying safety of neural network control systems that use images as input is a difficult problem because, from a given system state, there is no known way to mathematically model what images are possible in the real-world. We build on recent work that considers a surrogate verification approach, training a conditional generative adversarial network (cGAN) as an image generator in place of the re… ▽ More

    Submitted 28 May, 2024; originally announced May 2024.

  7. arXiv:2405.02499  [pdf, other

    cs.CR cs.AR

    DRAMScope: Uncovering DRAM Microarchitecture and Characteristics by Issuing Memory Commands

    Authors: Hwayong Nam, Seungmin Baek, Minbok Wi, Michael Jaemin Kim, Jaehyun Park, Chihun Song, Nam Sung Kim, Jung Ho Ahn

    Abstract: The demand for precise information on DRAM microarchitectures and error characteristics has surged, driven by the need to explore processing in memory, enhance reliability, and mitigate security vulnerability. Nonetheless, DRAM manufacturers have disclosed only a limited amount of information, making it difficult to find specific information on their DRAM microarchitectures. This paper addresses t… ▽ More

    Submitted 3 May, 2024; originally announced May 2024.

    Comments: To appear at the 51st IEEE/ACM International Symposium on Computer Architecture (ISCA)

  8. arXiv:2404.13541  [pdf, other

    cs.CV

    Generalizable Novel-View Synthesis using a Stereo Camera

    Authors: Haechan Lee, Wonjoon Jin, Seung-Hwan Baek, Sunghyun Cho

    Abstract: In this paper, we propose the first generalizable view synthesis approach that specifically targets multi-view stereo-camera images. Since recent stereo matching has demonstrated accurate geometry prediction, we introduce stereo matching into novel-view synthesis for high-quality geometry reconstruction. To this end, this paper proposes a novel framework, dubbed StereoNeRF, which integrates stereo… ▽ More

    Submitted 21 April, 2024; originally announced April 2024.

    Comments: Accepted to CVPR 2024. Project page URL: https://**wonjoon.github.io/stereonerf/

  9. arXiv:2404.01954  [pdf, other

    cs.CL cs.AI

    HyperCLOVA X Technical Report

    Authors: Kang Min Yoo, Jaegeun Han, Sookyo In, Heewon Jeon, Jisu Jeong, Jaewook Kang, Hyunwook Kim, Kyung-Min Kim, Munhyong Kim, Sungju Kim, Donghyun Kwak, Hanock Kwak, Se Jung Kwon, Bado Lee, Dongsoo Lee, Gichang Lee, Jooho Lee, Baeseong Park, Seong** Shin, Joonsang Yu, Seolki Baek, Sumin Byeon, Eungsup Cho, Dooseok Choe, Jeesung Han , et al. (371 additional authors not shown)

    Abstract: We introduce HyperCLOVA X, a family of large language models (LLMs) tailored to the Korean language and culture, along with competitive capabilities in English, math, and coding. HyperCLOVA X was trained on a balanced mix of Korean, English, and code data, followed by instruction-tuning with high-quality human-annotated datasets while abiding by strict safety guidelines reflecting our commitment t… ▽ More

    Submitted 13 April, 2024; v1 submitted 2 April, 2024; originally announced April 2024.

    Comments: 44 pages; updated authors list and fixed author names

  10. arXiv:2404.00916  [pdf, other

    cs.CV

    Gyro-based Neural Single Image Deblurring

    Authors: Heemin Yang, Jaesung Rim, Seungyong Lee, Seung-Hwan Baek, Sunghyun Cho

    Abstract: In this paper, we present GyroDeblurNet, a novel single image deblurring method that utilizes a gyro sensor to effectively resolve the ill-posedness of image deblurring. The gyro sensor provides valuable information about camera motion during exposure time that can significantly improve deblurring quality. However, effectively exploiting real-world gyro data is challenging due to significant error… ▽ More

    Submitted 8 April, 2024; v1 submitted 1 April, 2024; originally announced April 2024.

    Comments: 14 pages, 11 figures

  11. arXiv:2404.00562  [pdf, other

    cs.CV

    Text2HOI: Text-guided 3D Motion Generation for Hand-Object Interaction

    Authors: Junuk Cha, Jihyeon Kim, Jae Shin Yoon, Seungryul Baek

    Abstract: This paper introduces the first text-guided work for generating the sequence of hand-object interaction in 3D. The main challenge arises from the lack of labeled data where existing ground-truth datasets are nowhere near generalizable in interaction type and object category, which inhibits the modeling of diverse 3D hand-object interaction with the correct physical implication (e.g., contacts and… ▽ More

    Submitted 1 April, 2024; v1 submitted 31 March, 2024; originally announced April 2024.

    Comments: Accepted to CVPR 2024

  12. arXiv:2403.16428  [pdf, other

    cs.CV

    Benchmarks and Challenges in Pose Estimation for Egocentric Hand Interactions with Objects

    Authors: Zicong Fan, Takehiko Ohkawa, Linlin Yang, Nie Lin, Zhishan Zhou, Shihao Zhou, Jiajun Liang, Zhong Gao, Xuanyang Zhang, Xue Zhang, Fei Li, Liu Zheng, Feng Lu, Karim Abou Zeid, Bastian Leibe, Jeongwan On, Seungryul Baek, Aditya Prakash, Saurabh Gupta, Kun He, Yoichi Sato, Otmar Hilliges, Hyung Jin Chang, Angela Yao

    Abstract: We interact with the world with our hands and see it through our own (egocentric) perspective. A holistic 3D understanding of such interactions from egocentric views is important for tasks in robotics, AR/VR, action recognition and motion generation. Accurately reconstructing such interactions in 3D is challenging due to heavy occlusion, viewpoint bias, camera distortion, and motion blur from the… ▽ More

    Submitted 25 March, 2024; originally announced March 2024.

  13. arXiv:2403.06592  [pdf, other

    cs.CV cs.AI

    Exploiting Style Latent Flows for Generalizing Deepfake Video Detection

    Authors: Jongwook Choi, Taehoon Kim, Yonghyun Jeong, Seungryul Baek, Jongwon Choi

    Abstract: This paper presents a new approach for the detection of fake videos, based on the analysis of style latent vectors and their abnormal behavior in temporal changes in the generated videos. We discovered that the generated facial videos suffer from the temporal distinctiveness in the temporal changes of style latent vectors, which are inevitable during the generation of temporally stable videos with… ▽ More

    Submitted 20 May, 2024; v1 submitted 11 March, 2024; originally announced March 2024.

    Comments: Preprint version, final version will be available at https://openaccess.thecvf.com The IEEE / CVF Computer Vision and Pattern Recognition Conference (CVPR) (2024) Published by: IEEE & CVF

  14. arXiv:2403.05346  [pdf, other

    cs.CV

    VLM-PL: Advanced Pseudo Labeling Approach for Class Incremental Object Detection via Vision-Language Model

    Authors: Junsu Kim, Yunhoe Ku, Jihyeon Kim, Junuk Cha, Seungryul Baek

    Abstract: In the field of Class Incremental Object Detection (CIOD), creating models that can continuously learn like humans is a major challenge. Pseudo-labeling methods, although initially powerful, struggle with multi-scenario incremental learning due to their tendency to forget past knowledge. To overcome this, we introduce a new approach called Vision-Language Model assisted Pseudo-Labeling (VLM-PL). T… ▽ More

    Submitted 8 May, 2024; v1 submitted 8 March, 2024; originally announced March 2024.

    Comments: Accepted to CVPRW 2024 (CLvision). The camera-ready version of the manuscript

  15. arXiv:2402.17323  [pdf, other

    cs.CV

    SDDGR: Stable Diffusion-based Deep Generative Replay for Class Incremental Object Detection

    Authors: Junsu Kim, Hoseong Cho, Jihyeon Kim, Yihalem Yimolal Tiruneh, Seungryul Baek

    Abstract: In the field of class incremental learning (CIL), generative replay has become increasingly prominent as a method to mitigate the catastrophic forgetting, alongside the continuous improvements in generative models. However, its application in class incremental object detection (CIOD) has been significantly limited, primarily due to the complexities of scenes involving multiple labels. In this pape… ▽ More

    Submitted 7 May, 2024; v1 submitted 27 February, 2024; originally announced February 2024.

    Comments: Accepted to CVPR 2024. The camera-ready version

  16. arXiv:2402.12503  [pdf, other

    cs.LG

    PARCv2: Physics-aware Recurrent Convolutional Neural Networks for Spatiotemporal Dynamics Modeling

    Authors: Phong C. H. Nguyen, Xinlun Cheng, Shahab Azarfar, Pradeep Seshadri, Yen T. Nguyen, Munho Kim, Sanghun Choi, H. S. Udaykumar, Stephen Baek

    Abstract: Modeling unsteady, fast transient, and advection-dominated physics problems is a pressing challenge for physics-aware deep learning (PADL). The physics of complex systems is governed by large systems of partial differential equations (PDEs) and ancillary constitutive models with nonlinear structures, as well as evolving state fields exhibiting sharp gradients and rapidly deforming material interfa… ▽ More

    Submitted 24 May, 2024; v1 submitted 19 February, 2024; originally announced February 2024.

  17. arXiv:2402.11597  [pdf, other

    cs.CL

    Multi-Task Inference: Can Large Language Models Follow Multiple Instructions at Once?

    Authors: Guijin Son, Sangwon Baek, Sangdae Nam, Ilgyun Jeong, Seungone Kim

    Abstract: Large language models (LLMs) are typically prompted to follow a single instruction per inference call. In this work, we analyze whether LLMs also hold the capability to handle multiple instructions simultaneously, denoted as Multi-Task Inference. For this purpose, we introduce the MTI Bench (Multi-Task Inference Benchmark), a comprehensive evaluation benchmark encompassing 5,000 instances across 25… ▽ More

    Submitted 6 June, 2024; v1 submitted 18 February, 2024; originally announced February 2024.

    Comments: acl 2024 (main)

  18. Hierarchical Position Embedding of Graphs with Landmarks and Clustering for Link Prediction

    Authors: Minsang Kim, Seungjun Baek

    Abstract: Learning positional information of nodes in a graph is important for link prediction tasks. We propose a representation of positional information using representative nodes called landmarks. A small number of nodes with high degree centrality are selected as landmarks, which serve as reference points for the nodes' positions. We justify this selection strategy for well-known random graph models an… ▽ More

    Submitted 19 April, 2024; v1 submitted 12 February, 2024; originally announced February 2024.

    Comments: The International World Wide Web Conference (WWW) 2024, Accepted paper

  19. arXiv:2402.06440  [pdf, other

    cs.CR

    A Method for Decrypting Data Infected with Rhysida Ransomware

    Authors: Giyoon Kim, Soojin Kang, Seungjun Baek, Kimoon Kim, Jongsung Kim

    Abstract: Ransomware is malicious software that is a prominent global cybersecurity threat. Typically, ransomware encrypts data on a system, rendering the victim unable to decrypt it without the attacker's private key. Subsequently, victims often pay a substantial ransom to recover their data, yet some may still incur damage or loss. This study examines Rhysida ransomware, which caused significant damage in… ▽ More

    Submitted 9 February, 2024; originally announced February 2024.

  20. arXiv:2402.03559  [pdf, other

    cs.LG cs.AI

    Constrained Synthesis with Projected Diffusion Models

    Authors: Jacob K Christopher, Stephen Baek, Ferdinando Fioretto

    Abstract: This paper introduces an approach to endow generative diffusion processes with the ability to satisfy and certify compliance with constraints and physical principles. The proposed method recasts the traditional sampling process of generative diffusion models as a constrained optimization problem, steering the generated data distribution to remain within a specified region to ensure adherence to the give… ▽ More

    Submitted 23 May, 2024; v1 submitted 5 February, 2024; originally announced February 2024.

  21. arXiv:2401.16771  [pdf

    cs.LG

    MolPLA: A Molecular Pretraining Framework for Learning Cores, R-Groups and their Linker Joints

    Authors: Mogan Gim, Jueon Park, Soyon Park, Sanghoon Lee, Seungheun Baek, Junhyun Lee, Ngoc-Quang Nguyen, Jaewoo Kang

    Abstract: Molecular core structures and R-groups are essential concepts in drug development. Integration of these concepts with conventional graph pre-training approaches can promote deeper understanding in molecules. We propose MolPLA, a novel pre-training framework that employs masked graph contrastive learning in understanding the underlying decomposable parts in molecules that implicate their core struct… ▽ More

    Submitted 30 January, 2024; originally announced January 2024.

  22. arXiv:2401.07331  [pdf, other

    cs.CE

    Rapid Estimation of Left Ventricular Contractility with a Physics-Informed Neural Network Inverse Modeling Approach

    Authors: Ehsan Naghavi, Haifeng Wang, Lei Fan, Jenny S. Choy, Ghassan Kassab, Seungik Baek, Lik-Chuan Lee

    Abstract: Physics-based computer models based on numerical solution of the governing equations generally cannot make rapid predictions, which in turn, limits their applications in the clinic. To address this issue, we developed a physics-informed neural network (PINN) model that encodes the physics of a closed-loop blood circulation system embedding a left ventricle (LV). The PINN model is trained to satisf… ▽ More

    Submitted 14 January, 2024; originally announced January 2024.

  23. arXiv:2401.06415  [pdf, other

    cs.CV

    3D Reconstruction of Interacting Multi-Person in Clothing from a Single Image

    Authors: Junuk Cha, Hansol Lee, Jaewon Kim, Nhat Nguyen Bao Truong, Jae Shin Yoon, Seungryul Baek

    Abstract: This paper introduces a novel pipeline to reconstruct the geometry of interacting multi-person in clothing on a globally coherent scene space from a single image. The main challenge arises from the occlusion: a part of a human body is not visible from a single view due to the occlusion by others or the self, which introduces missing geometry and physical implausibility (e.g., penetration). We over… ▽ More

    Submitted 2 April, 2024; v1 submitted 12 January, 2024; originally announced January 2024.

    Comments: Accepted to WACV 2024

  24. arXiv:2401.03835  [pdf, other

    cs.CV eess.IV

    Limitations of Data-Driven Spectral Reconstruction -- Optics-Aware Analysis and Mitigation

    Authors: Qiang Fu, Matheus Souza, Eunsue Choi, Suhyun Shin, Seung-Hwan Baek, Wolfgang Heidrich

    Abstract: Hyperspectral imaging empowers machine vision systems with the distinct capability of identifying materials through recording their spectral signatures. Recent efforts in data-driven spectral reconstruction aim at extracting spectral information from RGB images captured by cost-effective RGB cameras, instead of dedicated hardware. In this paper we systematically analyze the performance of such m… ▽ More

    Submitted 2 April, 2024; v1 submitted 8 January, 2024; originally announced January 2024.

    Comments: 12 pages, 7 figures, 8 tables

  25. arXiv:2401.00370  [pdf, other

    cs.CV eess.IV

    UGPNet: Universal Generative Prior for Image Restoration

    Authors: Hwayoon Lee, Kyoungkook Kang, Hyeongmin Lee, Seung-Hwan Baek, Sunghyun Cho

    Abstract: Recent image restoration methods can be broadly categorized into two classes: (1) regression methods that recover the rough structure of the original image without synthesizing high-frequency details and (2) generative methods that synthesize perceptually-realistic high-frequency details even though the resulting image deviates from the original structure of the input. While both directions have b… ▽ More

    Submitted 30 December, 2023; originally announced January 2024.

    Comments: Accepted to WACV 2024

  26. arXiv:2312.16842  [pdf, other

    cs.CV

    Dynamic Appearance Modeling of Clothed 3D Human Avatars using a Single Camera

    Authors: Hansol Lee, Junuk Cha, Yunhoe Ku, Jae Shin Yoon, Seungryul Baek

    Abstract: The appearance of a human in clothing is driven not only by the pose but also by its temporal context, i.e., motion. However, such context has been largely neglected by existing monocular human modeling methods whose neural networks often struggle to learn a video of a person with large dynamics due to the motion ambiguity, i.e., there exist numerous geometric configurations of clothes that are de… ▽ More

    Submitted 28 December, 2023; originally announced December 2023.

  27. arXiv:2312.16760  [pdf, other

    cs.LG cs.AI cs.SE

    The Fourth International Verification of Neural Networks Competition (VNN-COMP 2023): Summary and Results

    Authors: Christopher Brix, Stanley Bak, Changliu Liu, Taylor T. Johnson

    Abstract: This report summarizes the 4th International Verification of Neural Networks Competition (VNN-COMP 2023), held as a part of the 6th Workshop on Formal Methods for ML-Enabled Autonomous Systems (FoMLAS), that was collocated with the 35th International Conference on Computer-Aided Verification (CAV). VNN-COMP is held annually to facilitate the fair and objective comparison of state-of-the-art neural… ▽ More

    Submitted 27 December, 2023; originally announced December 2023.

    Comments: arXiv admin note: text overlap with arXiv:2212.10376

  28. arXiv:2312.13313  [pdf, other

    eess.IV cs.CV

    ParamISP: Learned Forward and Inverse ISPs using Camera Parameters

    Authors: Woohyeok Kim, Geonu Kim, Junyong Lee, Seungyong Lee, Seung-Hwan Baek, Sunghyun Cho

    Abstract: RAW images are rarely shared mainly due to their excessive data size compared to their sRGB counterparts obtained by camera ISPs. Learning the forward and inverse processes of camera ISPs has been recently demonstrated, enabling physically-meaningful RAW-level image processing on input sRGB images. However, existing learning-based ISP methods fail to handle the large variations in the ISP processes… ▽ More

    Submitted 14 April, 2024; v1 submitted 20 December, 2023; originally announced December 2023.

  29. arXiv:2312.09139  [pdf, other

    cs.CV

    Class-Wise Buffer Management for Incremental Object Detection: An Effective Buffer Training Strategy

    Authors: Junsu Kim, Sumin Hong, Chanwoo Kim, Jihyeon Kim, Yihalem Yimolal Tiruneh, Jeongwan On, Jihyun Song, Sunhwa Choi, Seungryul Baek

    Abstract: Class incremental learning aims to solve a problem that arises when continuously adding unseen class instances to an existing model. This approach has been extensively studied in the context of image classification; however, its applicability to object detection is not well established yet. Existing frameworks using replay methods mainly collect replay data without considering the model being traine… ▽ More

    Submitted 14 December, 2023; originally announced December 2023.

    Comments: 5 pages, 3 figures, Accepted at ICASSP 2024

  30. arXiv:2312.02480  [pdf, other

    cs.CV

    Differentiable Point-based Inverse Rendering

    Authors: Hoon-Gyu Chung, Seokjun Choi, Seung-Hwan Baek

    Abstract: We present differentiable point-based inverse rendering, DPIR, an analysis-by-synthesis method that processes images captured under diverse illuminations to estimate shape and spatially-varying BRDF. To this end, we adopt point-based rendering, eliminating the need for multiple samplings per ray, typical of volumetric rendering, thus significantly enhancing the speed of inverse rendering. To reali… ▽ More

    Submitted 25 March, 2024; v1 submitted 4 December, 2023; originally announced December 2023.

  31. arXiv:2311.18287  [pdf, other

    eess.IV cs.CV cs.GR

    Dispersed Structured Light for Hyperspectral 3D Imaging

    Authors: Suhyun Shin, Seokjun Choi, Felix Heide, Seung-Hwan Baek

    Abstract: Hyperspectral 3D imaging aims to acquire both depth and spectral information of a scene. However, existing methods are either prohibitively expensive and bulky or compromise on spectral and depth accuracy. In this work, we present Dispersed Structured Light (DSL), a cost-effective and compact method for accurate hyperspectral 3D imaging. DSL modifies a traditional projector-camera system by placin… ▽ More

    Submitted 25 March, 2024; v1 submitted 30 November, 2023; originally announced November 2023.

  32. arXiv:2311.17396  [pdf, other

    cs.CV eess.IV

    Spectral and Polarization Vision: Spectro-polarimetric Real-world Dataset

    Authors: Yujin Jeon, Eunsue Choi, Youngchan Kim, Yunseong Moon, Khalid Omer, Felix Heide, Seung-Hwan Baek

    Abstract: Image datasets are essential not only in validating existing methods in computer vision but also in developing new methods. Most existing image datasets focus on trichromatic intensity images to mimic human vision. However, polarization and spectrum, the wave properties of light that animals in harsh environments and with limited brain capacity often rely on, remain underrepresented in existing da… ▽ More

    Submitted 30 November, 2023; v1 submitted 29 November, 2023; originally announced November 2023.

  33. 3D Teeth Reconstruction from Panoramic Radiographs using Neural Implicit Functions

    Authors: Sihwa Park, Seongjun Kim, In-Seok Song, Seung Jun Baek

    Abstract: Panoramic radiography is a widely used imaging modality in dental practice and research. However, it only provides flattened 2D images, which limits the detailed assessment of dental structures. In this paper, we propose Occudent, a framework for 3D teeth reconstruction from panoramic radiographs using neural implicit functions, which, to the best of our knowledge, is the first work to do so. For… ▽ More

    Submitted 28 November, 2023; originally announced November 2023.

    Comments: 12 pages, 2 figures, accepted to International Conference on Medical Image Computing and Computer-Assisted Intervention MICCAI 2023

  34. arXiv:2311.08433   

    q-bio.QM cs.LG stat.AP

    Clinical Characteristics and Laboratory Biomarkers in ICU-admitted Septic Patients with and without Bacteremia

    Authors: Sangwon Baek, Seung Jun Lee

    Abstract: Few studies have investigated the diagnostic utilities of biomarkers for predicting bacteremia among septic patients admitted to intensive care units (ICU). Therefore, this study evaluated the prediction power of laboratory biomarkers to utilize those markers with high performance to optimize the predictive model for bacteremia. This retrospective cross-sectional study was conducted at the ICU dep… ▽ More

    Submitted 16 November, 2023; v1 submitted 14 November, 2023; originally announced November 2023.

    Comments: This article is not the right fit to be published as preprint in arXiv

  35. arXiv:2311.05161  [pdf, other

    cs.CL

    Enhancing Computation Efficiency in Large Language Models through Weight and Activation Quantization

    Authors: Jangwhan Lee, Minsoo Kim, Seungcheol Baek, Seok Joong Hwang, Wonyong Sung, Jungwook Choi

    Abstract: Large Language Models (LLMs) are proficient in natural language processing tasks, but their deployment is often restricted by extensive parameter sizes and computational demands. This paper focuses on post-training quantization (PTQ) in LLMs, specifically 4-bit weight and 8-bit activation (W4A8) quantization, to enhance computational efficiency -- a topic less explored compared to weight-only quan… ▽ More

    Submitted 9 November, 2023; originally announced November 2023.

    Comments: EMNLP 2023 Main Conference

  36. arXiv:2311.03383  [pdf, other

    cs.LG cs.AI cs.AR cs.HC

    Toward Reinforcement Learning-based Rectilinear Macro Placement Under Human Constraints

    Authors: Tuyen P. Le, Hieu T. Nguyen, Seungyeol Baek, Taeyoun Kim, Jungwoo Lee, Seongjung Kim, Hyun** Kim, Misu Jung, Daehoon Kim, Seokyong Lee, Daewoo Choi

    Abstract: Macro placement is a critical phase in chip design, which becomes more intricate when involving general rectilinear macros and layout areas. Furthermore, macro placement that incorporates human-like constraints, such as design hierarchy and peripheral bias, has the potential to significantly reduce the amount of additional manual labor required from designers. This study proposes a methodology tha… ▽ More

    Submitted 2 November, 2023; originally announced November 2023.

    Comments: Fast ML for Science @ ICCAD 2023

  37. arXiv:2309.14072  [pdf, other

    cs.CV

    BoIR: Box-Supervised Instance Representation for Multi-Person Pose Estimation

    Authors: Uyoung Jeong, Seungryul Baek, Hyung Jin Chang, Kwang In Kim

    Abstract: Single-stage multi-person human pose estimation (MPPE) methods have shown great performance improvements, but existing methods fail to disentangle features by individual instances under crowded scenes. In this paper, we propose a bounding box-level instance representation learning called BoIR, which simultaneously solves instance detection, instance disentanglement, and instance-keypoint associati… ▽ More

    Submitted 2 November, 2023; v1 submitted 25 September, 2023; originally announced September 2023.

    Comments: Accepted to BMVC 2023, 19 pages including the appendix, 6 figures, 7 tables

  38. arXiv:2309.12289  [pdf, ps, other

    cs.RO eess.SY

    Real-Time Capable Decision Making for Autonomous Driving Using Reachable Sets

    Authors: Niklas Kochdumper, Stanley Bak

    Abstract: Despite large advances in recent years, real-time capable motion planning for autonomous road vehicles remains a huge challenge. In this work, we present a decision module that is based on set-based reachability analysis: First, we identify all possible driving corridors by computing the reachable set for the longitudinal position of the vehicle along the lanelets of the road network, where lane c… ▽ More

    Submitted 21 September, 2023; originally announced September 2023.

  39. arXiv:2309.00237  [pdf, other

    cs.CL cs.AI

    Publicly Shareable Clinical Large Language Model Built on Synthetic Clinical Notes

    Authors: Sunjun Kweon, Junu Kim, Jiyoun Kim, Sujeong Im, Eunbyeol Cho, Seongsu Bae, Jungwoo Oh, Gyubok Lee, Jong Hak Moon, Seng Chan You, Seung** Baek, Chang Hoon Han, Yoon Bin Jung, Yohan Jo, Edward Choi

    Abstract: The development of large language models tailored for handling patients' clinical notes is often hindered by the limited accessibility and usability of these notes due to strict privacy regulations. To address these challenges, we first create synthetic large-scale clinical notes using publicly available case reports extracted from biomedical literature. We then use these synthetic notes to train… ▽ More

    Submitted 13 June, 2024; v1 submitted 1 September, 2023; originally announced September 2023.

    Comments: ACL 2024 (Findings)

  40. arXiv:2308.06957  [pdf, other

    eess.IV cs.CV cs.LG

    CEmb-SAM: Segment Anything Model with Condition Embedding for Joint Learning from Heterogeneous Datasets

    Authors: Dongik Shin, Beomsuk Kim, Seungjun Baek

    Abstract: Automated segmentation of ultrasound images can assist medical experts with diagnostic and therapeutic procedures. Although using the common modality of ultrasound, one typically needs separate datasets in order to segment, for example, different anatomical structures or lesions with different levels of malignancy. In this paper, we consider the problem of jointly learning from heterogeneous datas… ▽ More

    Submitted 14 August, 2023; originally announced August 2023.

  41. Mesh Density Adaptation for Template-based Shape Reconstruction

    Authors: Yucheol Jung, Hyomin Kim, Gyeongha Hwang, Seung-Hwan Baek, Seungyong Lee

    Abstract: In 3D shape reconstruction based on template mesh deformation, a regularization, such as smoothness energy, is employed to guide the reconstruction into a desirable direction. In this paper, we highlight an often overlooked property in the regularization: the vertex density in the mesh. Without careful control on the density, the reconstruction may suffer from under-sampling of vertices near shape… ▽ More

    Submitted 30 July, 2023; originally announced July 2023.

    Comments: To appear at SIGGRAPH 2023. Jung and Kim share equal contribution. For code, see https://github.com/ycjungSubhuman/density-adaptation/

    ACM Class: I.4.5; I.3.5

  42. arXiv:2307.08985  [pdf, other]

    cs.HC cs.AI

    PromptCrafter: Crafting Text-to-Image Prompt through Mixed-Initiative Dialogue with LLM

    Authors: Seungho Baek, Hyerin Im, Jiseung Ryu, Juhyeong Park, Takyeon Lee

    Abstract: Text-to-image generation model is able to generate images across a diverse range of subjects and styles based on a single prompt. Recent works have proposed a variety of interaction methods that help users understand the capabilities of models and utilize them. However, how to support users to efficiently explore the model's capability and to create effective prompts are still open-ended research…

    Submitted 18 July, 2023; originally announced July 2023.

    Comments: 5 pages, AI & HCI Workshop at the 40th International Conference on Machine Learning (ICML) 2023

  43. arXiv:2306.17618  [pdf, other]

    cs.CV

    Polarimetric iToF: Measuring High-Fidelity Depth through Scattering Media

    Authors: Daniel S. Jeon, Andreas Meuleman, Seung-Hwan Baek, Min H. Kim

    Abstract: Indirect time-of-flight (iToF) imaging allows us to capture dense depth information at a low cost. However, iToF imaging often suffers from multipath interference (MPI) artifacts in the presence of scattering media, resulting in severe depth-accuracy degradation. For instance, iToF cameras cannot measure depth accurately through fog because ToF active illumination scatters back to the sensor befor…

    Submitted 30 June, 2023; originally announced June 2023.

    Journal ref: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2023, pp. 12353-12362

  44. arXiv:2306.13361  [pdf, other]

    physics.optics cs.CV eess.IV

    Neural 360$^\circ$ Structured Light with Learned Metasurfaces

    Authors: Eunsue Choi, Gyeongtae Kim, Jooyeong Yun, Yujin Jeon, Junsuk Rho, Seung-Hwan Baek

    Abstract: Structured light has proven instrumental in 3D imaging, LiDAR, and holographic light projection. Metasurfaces, comprised of sub-wavelength-sized nanostructures, facilitate 180$^\circ$ field-of-view (FoV) structured light, circumventing the restricted FoV inherent in traditional optics like diffractive optical elements. However, extant metasurface-facilitated structured light exhibits sub-optimal p…

    Submitted 27 June, 2023; v1 submitted 23 June, 2023; originally announced June 2023.

  45. arXiv:2306.13325  [pdf, other]

    cs.CV

    Differentiable Display Photometric Stereo

    Authors: Seokjun Choi, Seungwoo Yoon, Giljoo Nam, Seungyong Lee, Seung-Hwan Baek

    Abstract: Photometric stereo leverages variations in illumination conditions to reconstruct surface normals. Display photometric stereo, which employs a conventional monitor as an illumination source, has the potential to overcome limitations often encountered in bulky and difficult-to-use conventional setups. In this paper, we present differentiable display photometric stereo (DDPS), addressing an often ov…

    Submitted 12 March, 2024; v1 submitted 23 June, 2023; originally announced June 2023.

  46. arXiv:2306.12562  [pdf, other]

    cs.CV eess.IV

    Neural Spectro-polarimetric Fields

    Authors: Youngchan Kim, Wonjoon Jin, Sunghyun Cho, Seung-Hwan Baek

    Abstract: Modeling the spatial radiance distribution of light rays in a scene has been extensively explored for applications, including view synthesis. Spectrum and polarization, the wave properties of light, are often neglected due to their integration into three RGB spectral bands and their non-perceptibility to human vision. However, these properties are known to encompass substantial material and geomet…

    Submitted 10 December, 2023; v1 submitted 21 June, 2023; originally announced June 2023.

  47. arXiv:2306.04957  [pdf, other]

    cs.CV

    IFaceUV: Intuitive Motion Facial Image Generation by Identity Preservation via UV map

    Authors: Hansol Lee, Yunhoe Ku, Eunseo Kim, Seungryul Baek

    Abstract: Reenacting facial images is an important task that can find numerous applications. We proposed IFaceUV, a fully differentiable pipeline that properly combines 2D and 3D information to conduct the facial reenactment task. The three-dimensional morphable face models (3DMMs) and corresponding UV maps are utilized to intuitively control facial motions and textures, respectively. Two-dimensional techni…

    Submitted 8 June, 2023; originally announced June 2023.

  48. arXiv:2306.04089  [pdf, other]

    cs.LO eess.SY math.DS

    Fully Automated Verification of Linear Time-Invariant Systems against Signal Temporal Logic Specifications via Reachability Analysis

    Authors: Niklas Kochdumper, Stanley Bak

    Abstract: While reachability analysis is one of the most promising approaches for formal verification of dynamic systems, a major disadvantage preventing a more widespread application is the requirement to manually tune algorithm parameters such as the time step size. Manual tuning is especially problematic if one aims to verify that the system satisfies complicated specifications described by signal tempor…

    Submitted 8 April, 2024; v1 submitted 6 June, 2023; originally announced June 2023.

  49. X-ray: Discovering DRAM Internal Structure and Error Characteristics by Issuing Memory Commands

    Authors: Hwayong Nam, Seungmin Baek, Minbok Wi, Michael Jaemin Kim, Jaehyun Park, Chihun Song, Nam Sung Kim, Jung Ho Ahn

    Abstract: The demand for accurate information about the internal structure and characteristics of dynamic random-access memory (DRAM) has been on the rise. Recent studies have explored the structure and characteristics of DRAM to improve processing in memory, enhance reliability, and mitigate a vulnerability known as rowhammer. However, DRAM manufacturers only disclose limited information through official d…

    Submitted 12 August, 2023; v1 submitted 5 June, 2023; originally announced June 2023.

    Comments: 4 pages, 7 figures, accepted at IEEE Computer Architecture Letters

  50. Improved Training for End-to-End Streaming Automatic Speech Recognition Model with Punctuation

    Authors: Hanbyul Kim, Seunghyun Seo, Lukas Lee, Seolki Baek

    Abstract: Punctuated text prediction is crucial for automatic speech recognition as it enhances readability and impacts downstream natural language processing tasks. In streaming scenarios, the ability to predict punctuation in real-time is particularly desirable but presents a difficult technical challenge. In this work, we propose a method for predicting punctuated text from input speech using a chunk-bas…

    Submitted 2 June, 2023; originally announced June 2023.

    Comments: Accepted at INTERSPEECH 2023

    Journal ref: Proc. INTERSPEECH 2023, 1653-1657