Showing 1–21 of 21 results for author: Kim, B J

  1. arXiv:2405.14126  [pdf, other]

    cs.LG cs.AI cs.CV

    The Disappearance of Timestep Embedding in Modern Time-Dependent Neural Networks

    Authors: Bum Jun Kim, Yoshinobu Kawahara, Sang Woo Kim

    Abstract: Dynamical systems are often time-varying, whose modeling requires a function that evolves with respect to time. Recent studies such as the neural ordinary differential equation proposed a time-dependent neural network, which provides a neural network varying with respect to time. However, we claim that the architectural choice to build a time-dependent neural network significantly affects its time…

    Submitted 22 May, 2024; originally announced May 2024.

    Comments: 14 pages, 7 figures

  2. arXiv:2405.14115  [pdf, other]

    cs.CV cs.AI cs.LG

    Configuring Data Augmentations to Reduce Variance Shift in Positional Embedding of Vision Transformers

    Authors: Bum Jun Kim, Sang Woo Kim

    Abstract: Vision transformers (ViTs) have demonstrated remarkable performance in a variety of vision tasks. Despite their promising capabilities, training a ViT requires a large amount of diverse data. Several studies empirically found that using rich data augmentations, such as Mixup, Cutmix, and random erasing, is critical to the successful training of ViTs. Now, the use of rich data augmentations has bec…

    Submitted 22 May, 2024; originally announced May 2024.

    Comments: 16 pages, 4 figures

  3. arXiv:2402.01149  [pdf, other]

    cs.CV

    Scale Equalization for Multi-Level Feature Fusion

    Authors: Bum Jun Kim, Sang Woo Kim

    Abstract: Deep neural networks have exhibited remarkable performance in a variety of computer vision fields, especially in semantic segmentation tasks. Their success is often attributed to multi-level feature fusion, which enables them to understand both global and local information from an image. However, we found that multi-level features from parallel branches are on different scales. The scale disequili…

    Submitted 2 February, 2024; originally announced February 2024.

    Comments: 10 pages, 3 figures

  4. arXiv:2311.03938  [pdf, other]

    cs.CV

    Analysis of NaN Divergence in Training Monocular Depth Estimation Model

    Authors: Bum Jun Kim, Hyeonah Jang, Sang Woo Kim

    Abstract: The latest advances in deep learning have facilitated the development of highly accurate monocular depth estimation models. However, when training a monocular depth estimation network, practitioners and researchers have observed not a number (NaN) loss, which disrupts gradient descent optimization. Although several practitioners have reported the stochastic and mysterious occurrence of NaN loss th…

    Submitted 7 November, 2023; originally announced November 2023.

    Comments: 10 pages, 3 figures

  5. arXiv:2307.14179  [pdf, other]

    cs.CV

    Resolution-Aware Design of Atrous Rates for Semantic Segmentation Networks

    Authors: Bum Jun Kim, Hyeyeon Choi, Hyeonah Jang, Sang Woo Kim

    Abstract: DeepLab is a widely used deep neural network for semantic segmentation, whose success is attributed to its parallel architecture called atrous spatial pyramid pooling (ASPP). ASPP uses multiple atrous convolutions with different atrous rates to extract both local and global information. However, fixed values of atrous rates are used for the ASPP module, which restricts the size of its field of vie…

    Submitted 26 July, 2023; originally announced July 2023.

    Comments: 18 pages, 12 figures

  6. arXiv:2305.04722  [pdf, other]

    cs.CV

    Understanding Gaussian Attention Bias of Vision Transformers Using Effective Receptive Fields

    Authors: Bum Jun Kim, Hyeyeon Choi, Hyeonah Jang, Sang Woo Kim

    Abstract: Vision transformers (ViTs) that model an image as a sequence of partitioned patches have shown notable performance in diverse vision tasks. Because partitioning patches eliminates the image structure, to reflect the order of patches, ViTs utilize an explicit component called positional embedding. However, we claim that the use of positional embedding does not simply guarantee the order-awareness o…

    Submitted 8 May, 2023; originally announced May 2023.

    Comments: 11 pages, 7 figures

  7. arXiv:2302.06112  [pdf, other]

    cs.LG cs.CV

    How to Use Dropout Correctly on Residual Networks with Batch Normalization

    Authors: Bum Jun Kim, Hyeyeon Choi, Hyeonah Jang, Donggeon Lee, Sang Woo Kim

    Abstract: For the stable optimization of deep neural networks, regularization methods such as dropout and batch normalization have been used in various tasks. Nevertheless, the correct position to apply dropout has rarely been discussed, and different positions have been employed depending on the practitioners. In this study, we investigate the correct position to apply dropout. We demonstrate that for a re…

    Submitted 13 February, 2023; originally announced February 2023.

    Comments: 10 pages, 4 figures

  8. arXiv:2302.03193  [pdf, other]

    cs.LG cs.CV

    On the Ideal Number of Groups for Isometric Gradient Propagation

    Authors: Bum Jun Kim, Hyeyeon Choi, Hyeonah Jang, Sang Woo Kim

    Abstract: Recently, various normalization layers have been proposed to stabilize the training of deep neural networks. Among them, group normalization is a generalization of layer normalization and instance normalization by allowing a degree of freedom in the number of groups it uses. However, to determine the optimal number of groups, trial-and-error-based hyperparameter tuning is required, and such experi…

    Submitted 6 February, 2023; originally announced February 2023.

    Comments: 10 pages, 2 figures

  9. arXiv:2205.07260  [pdf, other]

    cs.CV

    Guidelines for the Regularization of Gammas in Batch Normalization for Deep Residual Networks

    Authors: Bum Jun Kim, Hyeyeon Choi, Hyeonah Jang, Dong Gu Lee, Wonseok Jeong, Sang Woo Kim

    Abstract: L2 regularization for weights in neural networks is widely used as a standard training trick. However, L2 regularization for gamma, a trainable parameter of batch normalization, remains an undiscussed mystery and is applied in different ways depending on the library and practitioner. In this paper, we study whether L2 regularization for gamma is valid. To explore this issue, we consider two approa…

    Submitted 15 May, 2022; originally announced May 2022.

    Comments: 12 pages, 6 figures

  10. arXiv:2204.12687  [pdf, other]

    physics.soc-ph cs.SI

    Multiresolution community analysis of international trade networks

    Authors: Wonguk Cho, Daekyung Lee, Beom Jun Kim

    Abstract: The international trade network is a complex system where multiple trade blocs with varying sizes coexist and overlap with each other. However, the resulting structures of community detection in trade networks are often inconsistent and fail to capture the complex landscape of international trade. To address these problems, we propose a multiresolution framework that aggregates all the configurat…

    Submitted 26 April, 2022; originally announced April 2022.

    Comments: 19 pages, 5 figures, 1 table

  11. arXiv:2111.08413  [pdf, other]

    cs.CV

    Improved Robustness of Vision Transformer via PreLayerNorm in Patch Embedding

    Authors: Bum Jun Kim, Hyeyeon Choi, Hyeonah Jang, Dong Gu Lee, Wonseok Jeong, Sang Woo Kim

    Abstract: Vision transformers (ViTs) have recently demonstrated state-of-the-art performance in a variety of vision tasks, replacing convolutional neural networks (CNNs). Meanwhile, since ViT has a different architecture than CNN, it may behave differently. To investigate the reliability of ViT, this paper studies the behavior and robustness of ViT. We compared the robustness of CNN and ViT by assuming vari…

    Submitted 16 November, 2021; originally announced November 2021.

    Comments: 7 pages, 8 figures. Work in Progress

  12. arXiv:2108.13576  [pdf, other]

    cs.CV

    Dead Pixel Test Using Effective Receptive Field

    Authors: Bum Jun Kim, Hyeyeon Choi, Hyeonah Jang, Dong Gu Lee, Wonseok Jeong, Sang Woo Kim

    Abstract: Deep neural networks have been used in various fields, but their internal behavior is not well known. In this study, we discuss two counterintuitive behaviors of convolutional neural networks (CNNs). First, we evaluated the size of the receptive field. Previous studies have attempted to increase or control the size of the receptive field. However, we observed that the size of the receptive field d…

    Submitted 30 August, 2021; originally announced August 2021.

    Comments: 9 pages, 5 figures

  13. arXiv:2105.07562  [pdf, other]

    physics.soc-ph cs.LG eess.SY

    Power-grid stability predictions using transferable machine learning

    Authors: Seong-Gyu Yang, Beom Jun Kim, Seung-Woo Son, Heetae Kim

    Abstract: Complex network analyses have provided clues to improve power-grid stability with the help of numerical models. The high computational cost of numerical simulations, however, has inhibited the approach, especially when it deals with the dynamic properties of power grids such as frequency synchronization. In this study, we investigate machine learning techniques to estimate the stability of power-g…

    Submitted 7 December, 2021; v1 submitted 16 May, 2021; originally announced May 2021.

    Comments: 10 pages, 6 figures, 4 tables

  14. arXiv:2002.07239  [pdf, other]

    cs.SI physics.data-an

    Extracting hierarchical backbones from bipartite networks

    Authors: Woo Seong Jo, Jaehyuk Park, Arthur Luhur, Beom Jun Kim, Yong-Yeol Ahn

    Abstract: We propose a method for extracting hierarchical backbones from a bipartite network. Our method leverages the observation that a hierarchical relationship between two nodes in a bipartite network is often manifested as an asymmetry in the conditional probability of observing the connections to them from the other node set. Our method estimates both the importance and direction of the hierarchical r…

    Submitted 18 March, 2020; v1 submitted 17 February, 2020; originally announced February 2020.

    Comments: 8 pages, 4 figures

  15. arXiv:2001.05153  [pdf, other]

    cs.CV

    Extending Class Activation Mapping Using Gaussian Receptive Field

    Authors: Bum Jun Kim, Gyogwon Koo, Hyeyeon Choi, Sang Woo Kim

    Abstract: This paper addresses the visualization task of deep learning models. To improve Class Activation Mapping (CAM) based visualization method, we offer two options. First, we propose Gaussian upsampling, an improved upsampling method that can reflect the characteristics of deep learning models. Second, we identify and modify unnatural terms in the mathematical derivation of the existing CAM studies. B…

    Submitted 15 January, 2020; originally announced January 2020.

    Comments: 7 pages, 5 figures

  16. arXiv:1712.06074  [pdf, ps, other]

    cs.CL physics.soc-ph

    Benford's Law and First Letter of Word

    Authors: Xiaoyong Yan, Seong-Gyu Yang, Beom Jun Kim, Petter Minnhagen

    Abstract: A universal First-Letter Law (FLL) is derived and described. It predicts the percentages of first letters for words in novels. The FLL is akin to Benford's law (BL) of first digits, which predicts the percentages of first digits in a data collection of numbers. Both are universal in the sense that FLL only depends on the numbers of letters in the alphabet, whereas BL only depends on the number of…

    Submitted 17 December, 2017; originally announced December 2017.

    Comments: 10 pages, 11 figures

    Journal ref: Physica A 512, 305-315 (2018)

  17. arXiv:1310.7532  [pdf, other]

    physics.soc-ph cond-mat.dis-nn cs.CE nlin.AO q-bio.PE

    Matchmaker, Matchmaker, Make Me a Match: Migration of Populations via Marriages in the Past

    Authors: Sang Hoon Lee, Robyn Ffrancon, Daniel M. Abrams, Beom Jun Kim, Mason A. Porter

    Abstract: The study of human mobility is both of fundamental importance and of great potential value. For example, it can be leveraged to facilitate efficient city planning and improve prevention strategies when faced with epidemics. The newfound wealth of rich sources of data (including banknote flows, mobile phone records, and transportation data) has led to an explosion of attempts to characterize mode…

    Submitted 16 October, 2014; v1 submitted 28 October, 2013; originally announced October 2013.

    Comments: 24 pages, 23 figures, 5 tables

    Journal ref: Phys. Rev. X 4, 041009 (2014)

  18. A Wireless Channel Sounding System for Rapid Propagation Measurements

    Authors: Muhammad Nazmul Islam, Byoung-Jo J. Kim, Paul Henry, Eric Rozner

    Abstract: Wireless systems are getting deployed in many new environments with different antenna heights, frequency bands and multipath conditions. This has led to an increasing demand for more channel measurements to understand wireless propagation in specific environments and assist deployment engineering. We design and implement a rapid wireless channel sounding system, using the Universal Software Radio…

    Submitted 21 November, 2012; originally announced November 2012.

    Comments: Submitted to ICC 2013 (2012 AT&T Intellectual Property. All rights reserved.)

    Journal ref: IEEE International Conference on Communications 2013, page 5720-5725

  19. arXiv:1206.6921  [pdf, ps, other]

    physics.soc-ph cs.SI

    Dworkin's Paradox

    Authors: Seung Ki Baek, Jung-Kyoo Choi, Beom Jun Kim

    Abstract: How to distribute welfare in a society is a key issue in the subject of distributional justice, which is deeply involved with notions of fairness. Following a thought experiment by Dworkin, this work considers a society of individuals with different preferences on the welfare distribution and an official to mediate the coordination among them. Based on a simple assumption that an individual's welf…

    Submitted 28 June, 2012; originally announced June 2012.

    Comments: 15 pages, 4 figures

    Journal ref: PLoS One 7, e38529 (2012)

  20. arXiv:1109.6221  [pdf, ps, other]

    physics.soc-ph cs.DL

    The Ten Thousand Kims

    Authors: Seung Ki Baek, Petter Minnhagen, Beom Jun Kim

    Abstract: In the Korean culture the family members are recorded in special family books. This makes it possible to follow the distribution of Korean family names far back in history. It is here shown that these name distributions are well described by a simple null model, the random group formation (RGF) model. This model makes it possible to predict how the name distributions change and these predictions a…

    Submitted 28 September, 2011; originally announced September 2011.

    Comments: 13 pages, 8 figures

    Journal ref: New J. Phys. 13, 073036 (2011)

  21. Heterogeneous attachment strategies optimize the topology of dynamic wireless networks

    Authors: Beom Jun Kim, Petter Holme, Viktoria Fodor

    Abstract: In optimizing the topology of wireless networks built of a dynamic set of spatially embedded agents, there are many trade-offs to be dealt with. The network should preferably be as small (in the sense that the average, or maximal, pathlength is short) as possible, it should be robust to failures, not consume too much power, and so on. In this paper, we investigate simple models of how agents can…

    Submitted 18 May, 2009; originally announced May 2009.

    ACM Class: C.2.1; G.2.2

    Journal ref: Eur. Phys. J. B 73, 597-604 (2010)