Skip to main content

Showing 1–15 of 15 results for author: Ahn, P

.
  1. arXiv:2406.09388  [pdf, other

    cs.CV cs.AI cs.LG

    Exploring the Spectrum of Visio-Linguistic Compositionality and Recognition

    Authors: Youngtaek Oh, Pyunghwan Ahn, **hyung Kim, Gwangmo Song, Soonyoung Lee, In So Kweon, Junmo Kim

    Abstract: Vision and language models (VLMs) such as CLIP have showcased remarkable zero-shot recognition abilities yet face challenges in visio-linguistic compositionality, particularly in linguistic comprehension and fine-grained image-text alignment. This paper explores the intricate relationship between compositionality and recognition -- two pivotal aspects of VLM capability. We conduct a comprehensive… ▽ More

    Submitted 13 June, 2024; originally announced June 2024.

    Comments: Accepted to CVPRW 2024 on 'What is Next in Multimodal Foundation Models?'. Code: https://github.com/ytaek-oh/vl_compo

  2. ContextMix: A context-aware data augmentation method for industrial visual inspection systems

    Authors: Hyungmin Kim, Donghun Kim, Pyunghwan Ahn, Sungho Suh, Hansang Cho, Junmo Kim

    Abstract: While deep neural networks have achieved remarkable performance, data augmentation has emerged as a crucial strategy to mitigate overfitting and enhance network performance. These techniques hold particular significance in industrial manufacturing contexts. Recently, image mixing-based methods have been introduced, exhibiting improved performance on public benchmark datasets. However, their applic… ▽ More

    Submitted 18 January, 2024; originally announced January 2024.

    Comments: Accepted to EAAI

  3. arXiv:2309.01961  [pdf, other

    cs.CV

    NICE: CVPR 2023 Challenge on Zero-shot Image Captioning

    Authors: Taehoon Kim, Pyunghwan Ahn, Sangyun Kim, Sihaeng Lee, Mark Marsden, Alessandra Sala, Seung Hwan Kim, Bohyung Han, Kyoung Mu Lee, Honglak Lee, Kyounghoon Bae, Xiangyu Wu, Yi Gao, Hailiang Zhang, Yang Yang, Weili Guo, Jianfeng Lu, Youngtaek Oh, Jae Won Cho, Dong-** Kim, In So Kweon, Junmo Kim, Wooyoung Kang, Won Young Jhoo, Byungseok Roh , et al. (17 additional authors not shown)

    Abstract: In this report, we introduce NICE (New frontiers for zero-shot Image Captioning Evaluation) project and share the results and outcomes of 2023 challenge. This project is designed to challenge the computer vision community to develop robust image captioning models that advance the state-of-the-art both in terms of accuracy and fairness. Through the challenge, the image captioning models were tested… ▽ More

    Submitted 10 September, 2023; v1 submitted 5 September, 2023; originally announced September 2023.

    Comments: Tech report, project page https://nice.lgresearch.ai/

  4. arXiv:2211.06774  [pdf, other

    cs.CV cs.CL

    Large-Scale Bidirectional Training for Zero-Shot Image Captioning

    Authors: Taehoon Kim, Mark Marsden, Pyunghwan Ahn, Sangyun Kim, Sihaeng Lee, Alessandra Sala, Seung Hwan Kim

    Abstract: When trained on large-scale datasets, image captioning models can understand the content of images from a general domain but often fail to generate accurate, detailed captions. To improve performance, pretraining-and-finetuning has been a key strategy for image captioning. However, we find that large-scale bidirectional training between image and text enables zero-shot image captioning. In this pa… ▽ More

    Submitted 1 October, 2023; v1 submitted 12 November, 2022; originally announced November 2022.

    Comments: Arxiv Preprint. Work in progress

  5. Projection-based Point Convolution for Efficient Point Cloud Segmentation

    Authors: Pyunghwan Ahn, Juyoung Yang, Eo**dl Yi, Chanho Lee, Junmo Kim

    Abstract: Understanding point cloud has recently gained huge interests following the development of 3D scanning devices and the accumulation of large-scale 3D data. Most point cloud processing algorithms can be classified as either point-based or voxel-based methods, both of which have severe limitations in processing time or memory, or both. To overcome these limitations, we propose Projection-based Point… ▽ More

    Submitted 4 February, 2022; originally announced February 2022.

    Comments: Published in IEEE Access (Early Access)

  6. arXiv:2201.07436  [pdf, other

    cs.CV

    Global-Local Path Networks for Monocular Depth Estimation with Vertical CutDepth

    Authors: Doyeon Kim, Woonghyun Ka, Pyungwhan Ahn, Donggyu Joo, Sehwan Chun, Junmo Kim

    Abstract: Depth estimation from a single image is an important task that can be applied to various fields in computer vision, and has grown rapidly with the development of convolutional neural networks. In this paper, we propose a novel structure and training strategy for monocular depth estimation to further improve the prediction accuracy of the network. We deploy a hierarchical transformer encoder to cap… ▽ More

    Submitted 29 October, 2022; v1 submitted 19 January, 2022; originally announced January 2022.

    Comments: 11pages, 5 figures

  7. arXiv:2112.05213  [pdf, other

    cs.CV

    Progressive Seed Generation Auto-encoder for Unsupervised Point Cloud Learning

    Authors: Juyoung Yang, Pyunghwan Ahn, Doyeon Kim, Haeil Lee, Junmo Kim

    Abstract: With the development of 3D scanning technologies, 3D vision tasks have become a popular research area. Owing to the large amount of data acquired by sensors, unsupervised learning is essential for understanding and utilizing point clouds without an expensive annotation process. In this paper, we propose a novel framework and an effective auto-encoder architecture named "PSG-Net" for reconstruction… ▽ More

    Submitted 9 December, 2021; originally announced December 2021.

    Comments: ICCV2021

  8. arXiv:2011.00988  [pdf, other

    cs.CV

    PBP-Net: Point Projection and Back-Projection Network for 3D Point Cloud Segmentation

    Authors: JuYoung Yang, Chanho Lee, Pyunghwan Ahn, Haeil Lee, Eo**dl Yi, Junmo Kim

    Abstract: Following considerable development in 3D scanning technologies, many studies have recently been proposed with various approaches for 3D vision tasks, including some methods that utilize 2D convolutional neural networks (CNNs). However, even though 2D CNNs have achieved high performance in many 2D vision tasks, existing works have not effectively applied them onto 3D vision tasks. In particular, se… ▽ More

    Submitted 2 November, 2020; originally announced November 2020.

    Comments: 7 pages, accepted by IROS2020

  9. arXiv:2001.03414  [pdf

    physics.med-ph eess.IV

    Attenuation Coefficient Estimation for PET/MRI With Bayesian Deep Learning pseudo-CT and Maximum Likelihood Estimation of Activity and Attenuation

    Authors: Andrew P. Leynes, Sangtae P. Ahn, Kristen A. Wangerin, Sandeep S. Kaushik, Florian Wiesinger, Thomas A. Hope, Peder E. Z. Larson

    Abstract: A major remaining challenge for magnetic resonance-based attenuation correction methods (MRAC) is their susceptibility to sources of MRI artifacts (e.g. implants, motion) and uncertainties due to the limitations of MRI contrast (e.g. accurate bone delineation and density, and separation of air/bone). We propose using a Bayesian deep convolutional neural network that, in addition to generating an i… ▽ More

    Submitted 13 October, 2021; v1 submitted 10 January, 2020; originally announced January 2020.

    Comments: Accepted to the IEEE Transactions on Radiation and Plasma Medical Sciences on October 3, 2021. To be published under open access Creative Commons Attribution License (CC BY)

    Journal ref: IEEE Transactions on Radiation and Plasma Medical Sciences, Early access, 2021

  10. arXiv:1912.01237  [pdf, other

    cs.CV

    EDAS: Efficient and Differentiable Architecture Search

    Authors: Hyeong Gwon Hong, Pyunghwan Ahn, Junmo Kim

    Abstract: Transferrable neural architecture search can be viewed as a binary optimization problem where a single optimal path should be selected among candidate paths in each edge within the repeated cell block of the directed a cyclic graph form. Recently, the field of differentiable architecture search attempts to relax the search problem continuously using a one-shot network that combines all the candida… ▽ More

    Submitted 4 December, 2019; v1 submitted 3 December, 2019; originally announced December 2019.

  11. The Black Hole in the Most Massive Ultracompact Dwarf Galaxy M59-UCD3

    Authors: Christopher P. Ahn, Anil C. Seth, Michele Cappellari, Davor Krajnović, Jay Strader, Karina T. Voggel, Jonelle L. Walsh, Arash Bahramian, Holger Baumgardt, Jean Brodie, Igor Chilingarian, Laura Chomiuk, Mark den Brok, Matthias Frank, Michael Hilker, Richard M. McDermid, Steffen Mieske, Nadine Neumayer, Dieu D. Nguyen, Renuka Pechetti, Aaron J. Romanowsky, Lee Spitler

    Abstract: We examine the internal properties of the most massive ultracompact dwarf galaxy (UCD), M59-UCD3, by combining adaptive optics assisted near-IR integral field spectroscopy from Gemini/NIFS, and Hubble Space Telescope (HST) imaging. We use the multi-band HST imaging to create a mass model that suggests and accounts for the presence of multiple stellar populations and structural components. We combi… ▽ More

    Submitted 6 April, 2018; originally announced April 2018.

    Comments: 17 pages, 14 figures, 5 tables

  12. Detection of Supermassive Black Holes in Two Virgo Ultracompact Dwarf Galaxies

    Authors: Christopher P. Ahn, Anil C. Seth, Mark den Brok, Jay Strader, Holger Baumgardt, Remco van den Bosch, Igor Chilingarian, Matthias Frank, Michael Hilker, Richard McDermid, Steffen Mieske, Aaron J. Romanowsky, Lee Spitler, Jean Brodie, Nadine Neumayer, Jonelle L. Walsh

    Abstract: We present the detection of supermassive black holes (BHs) in two Virgo ultracompact dwarf galaxies (UCDs), VUCD3 and M59cO. We use adaptive optics assisted data from the Gemini/NIFS instrument to derive radial velocity dispersion profiles for both objects. Mass models for the two UCDs are created using multi-band Hubble Space Telescope (HST) imaging, including the modeling of mild color gradients… ▽ More

    Submitted 27 March, 2017; originally announced March 2017.

    Comments: 17 pages, 9 Figures, 3 Tables, accepted for publication in The Astrophysical Journal

  13. arXiv:1307.7735  [pdf, other

    astro-ph.IM astro-ph.CO astro-ph.GA

    The Tenth Data Release of the Sloan Digital Sky Survey: First Spectroscopic Data from the SDSS-III Apache Point Observatory Galactic Evolution Experiment

    Authors: Christopher P. Ahn, Rachael Alexandroff, Carlos Allende Prieto, Friedrich Anders, Scott F. Anderson, Timothy Anderton, Brett H. Andrews, Éric Aubourg, Stephen Bailey, Fabienne A. Bastien, Julian E. Bautista, Timothy C. Beers, Alessandra Beifiori, Chad F. Bender, Andreas A. Berlind, Florian Beutler, Vaishali Bhardwaj, Jonathan C. Bird, Dmitry Bizyaev, Cullen H. Blake, Michael R. Blanton, Michael Blomqvist, John J. Bochanski, Adam S. Bolton, Arnaud Borde , et al. (210 additional authors not shown)

    Abstract: The Sloan Digital Sky Survey (SDSS) has been in operation since 2000 April. This paper presents the tenth public data release (DR10) from its current incarnation, SDSS-III. This data release includes the first spectroscopic data from the Apache Point Observatory Galaxy Evolution Experiment (APOGEE), along with spectroscopic data from the Baryon Oscillation Spectroscopic Survey (BOSS) taken through… ▽ More

    Submitted 17 January, 2014; v1 submitted 29 July, 2013; originally announced July 2013.

    Comments: 15 figures; 1 table. Accepted to ApJS. DR10 is available at http://www.sdss3.org/dr10 v3 fixed 3 diacritic markings in the arXiv HTML listing of the author names

  14. The Baryon Oscillation Spectroscopic Survey of SDSS-III

    Authors: Kyle S. Dawson, David J. Schlegel, Christopher P. Ahn, Scott F. Anderson, Éric Aubourg, Stephen Bailey, Robert H. Barkhouser, Julian E. Bautista, Alessandra Beifiori, Andreas A. Berlind, Vaishali Bhardwaj, Dmitry Bizyaev, Cullen H. Blake, Michael R. Blanton, Michael Blomqvist, Adam S. Bolton, Arnaud Borde, Jo Bovy, W. N. Brandt, Howard Brewington, Jon Brinkmann, Peter J. Brown, Joel R. Brownstein, Kevin Bundy, N. G. Busca , et al. (140 additional authors not shown)

    Abstract: The Baryon Oscillation Spectroscopic Survey (BOSS) is designed to measure the scale of baryon acoustic oscillations (BAO) in the clustering of matter over a larger volume than the combined efforts of all previous spectroscopic surveys of large scale structure. BOSS uses 1.5 million luminous galaxies as faint as i=19.9 over 10,000 square degrees to measure BAO to redshifts z<0.7. Observations of ne… ▽ More

    Submitted 7 November, 2012; v1 submitted 31 July, 2012; originally announced August 2012.

    Comments: 49 pages, 16 figures, accepted by AJ

  15. arXiv:1207.7137  [pdf, ps, other

    astro-ph.IM astro-ph.CO

    The Ninth Data Release of the Sloan Digital Sky Survey: First Spectroscopic Data from the SDSS-III Baryon Oscillation Spectroscopic Survey

    Authors: SDSS-III Collaboration, :, Christopher P. Ahn, Rachael Alexandroff, Carlos Allende Prieto, Scott F. Anderson, Timothy Anderton, Brett H. Andrews, Éric Aubourg Stephen Bailey, Rory Barnes, Julian Bautista, Timothy C. Beers, Alessandra Beifiori, Andreas A. Berlind, Vaishali Bhardwaj, Dmitry Bizyaev, Cullen H. Blake, Michael R. Blanton, Michael Blomqvist, John J. Bochanski, Adam S. Bolton, Arnaud Borde, Jo Bovy, W. N. Brandt, J. Brinkmann , et al. (203 additional authors not shown)

    Abstract: The Sloan Digital Sky Survey III (SDSS-III) presents the first spectroscopic data from the Baryon Oscillation Spectroscopic Survey (BOSS). This ninth data release (DR9) of the SDSS project includes 535,995 new galaxy spectra (median z=0.52), 102,100 new quasar spectra (median z=2.32), and 90,897 new stellar spectra, along with the data presented in previous data releases. These spectra were obtain… ▽ More

    Submitted 30 July, 2012; originally announced July 2012.

    Comments: 9 figures; 2 tables. Submitted to ApJS. DR9 is available at http://www.sdss3.org/dr9