Skip to main content

Showing 1–6 of 6 results for author: Kang, D U

.
  1. arXiv:2404.04544  [pdf, other

    cs.CV cs.AI

    BeyondScene: Higher-Resolution Human-Centric Scene Generation With Pretrained Diffusion

    Authors: Gwanghyun Kim, Hayeon Kim, Hoigi Seo, Dong Un Kang, Se Young Chun

    Abstract: Generating higher-resolution human-centric scenes with details and controls remains a challenge for existing text-to-image diffusion models. This challenge stems from limited training image size, text encoder capacity (limited tokens), and the inherent difficulty of generating complex scenes involving multiple humans. While current methods attempted to address training size limit only, they often… ▽ More

    Submitted 6 April, 2024; originally announced April 2024.

    Comments: Project page: https://janeyeon.github.io/beyond-scene

  2. arXiv:2311.18654  [pdf, other

    cs.CV cs.AI

    Detailed Human-Centric Text Description-Driven Large Scene Synthesis

    Authors: Gwanghyun Kim, Dong Un Kang, Hoigi Seo, Hayeon Kim, Se Young Chun

    Abstract: Text-driven large scene image synthesis has made significant progress with diffusion models, but controlling it is challenging. While using additional spatial controls with corresponding texts has improved the controllability of large scene synthesis, it is still challenging to faithfully reflect detailed text descriptions without user-provided controls. Here, we propose DetText2Scene, a novel tex… ▽ More

    Submitted 30 November, 2023; originally announced November 2023.

  3. arXiv:2305.10732  [pdf, ps, other

    eess.IV cs.CV

    BlindHarmony: "Blind" Harmonization for MR Images via Flow model

    Authors: Hwihun Jeong, Heejoon Byun, Dong Un Kang, Jongho Lee

    Abstract: In MRI, images of the same contrast (e.g., T$_1$) from the same subject can exhibit noticeable differences when acquired using different hardware, sequences, or scan parameters. These differences in images create a domain gap that needs to be bridged by a step called image harmonization, to process the images successfully using conventional or deep learning-based image analysis (e.g., segmentation… ▽ More

    Submitted 16 August, 2023; v1 submitted 18 May, 2023; originally announced May 2023.

    Comments: ICCV 2023 accepted. 9 pages and 5 Figures for manuscipt, supplementary included

  4. arXiv:2207.01520  [pdf, other

    eess.IV cs.CV

    Adaptive GLCM sampling for transformer-based COVID-19 detection on CT

    Authors: Okchul Jung, Dong Un Kang, Gwanghyun Kim, Se Young Chun

    Abstract: The world has suffered from COVID-19 (SARS-CoV-2) for the last two years, causing much damage and change in people's daily lives. Thus, automated detection of COVID-19 utilizing deep learning on chest computed tomography (CT) scans became promising, which helps correct diagnosis efficiently. Recently, transformer-based COVID-19 detection method on CT is proposed to utilize 3D information in CT vol… ▽ More

    Submitted 4 July, 2022; originally announced July 2022.

    Comments: 6 pages

  5. arXiv:2012.12507  [pdf, other

    cs.CV

    Blur More To Deblur Better: Multi-Blur2Deblur For Efficient Video Deblurring

    Authors: Dongwon Park, Dong Un Kang, Se Young Chun

    Abstract: One of the key components for video deblurring is how to exploit neighboring frames. Recent state-of-the-art methods either used aligned adjacent frames to the center frame or propagated the information on past frames to the current frame recurrently. Here we propose multi-blur-to-deblur (MB2D), a novel concept to exploit neighboring frames for efficient video deblurring. Firstly, inspired by unsh… ▽ More

    Submitted 23 December, 2020; originally announced December 2020.

    Comments: 9 pages, 7 figures

  6. arXiv:1911.07410  [pdf, other

    eess.IV cs.CV

    Multi-Temporal Recurrent Neural Networks For Progressive Non-Uniform Single Image Deblurring With Incremental Temporal Training

    Authors: Dongwon Park, Dong Un Kang, Jisoo Kim, Se Young Chun

    Abstract: Multi-scale (MS) approaches have been widely investigated for blind single image / video deblurring that sequentially recovers deblurred images in low spatial scale first and then in high spatial scale later with the output of lower scales. MS approaches have been effective especially for severe blurs induced by large motions in high spatial scale since those can be seen as small blurs in low spat… ▽ More

    Submitted 17 November, 2019; originally announced November 2019.

    Comments: 10 pages, 8 figures, 6 tables, work in progress