Skip to main content

Showing 1–27 of 27 results for author: Jeong, H

Searching in archive stat. Search in all archives.
.
  1. arXiv:2406.00396  [pdf, other

    cs.LG cond-mat.stat-mech cs.AI stat.ML

    Stochastic Restarting to Overcome Overfitting in Neural Networks with Noisy Labels

    Authors: Youngkyoung Bae, Yeongwoo Song, Hawoong Jeong

    Abstract: Despite its prevalence, giving up and starting over may seem wasteful in many situations such as searching for a target or training deep neural networks (DNNs). Our study, though, demonstrates that restarting from a checkpoint can significantly improve generalization performance when training DNNs with noisy labels. In the presence of noisy labels, DNNs initially learn the general patterns of the… ▽ More

    Submitted 1 June, 2024; originally announced June 2024.

    Comments: 21 pages, 10 figures

  2. arXiv:2405.00642  [pdf, other

    stat.ML cond-mat.dis-nn cond-mat.stat-mech cs.LG

    From Empirical Observations to Universality: Dynamics of Deep Learning with Inputs Built on Gaussian mixture

    Authors: Jaeyong Bae, Hawoong Jeong

    Abstract: This study broadens the scope of theoretical frameworks in deep learning by delving into the dynamics of neural networks with inputs that demonstrate the structural characteristics to Gaussian Mixture (GM). We analyzed how the dynamics of neural networks under GM-structured inputs diverge from the predictions of conventional theories based on simple Gaussian structures. A revelation of our work is… ▽ More

    Submitted 1 May, 2024; originally announced May 2024.

    Comments: 19 pages, 9 figures

  3. arXiv:2403.01204  [pdf, ps, other

    cs.LG math.NA stat.ML

    Stochastic gradient descent for streaming linear and rectified linear systems with Massart noise

    Authors: Halyun Jeong, Deanna Needell, Elizaveta Rebrova

    Abstract: We propose SGD-exp, a stochastic gradient descent approach for linear and ReLU regressions under Massart noise (adversarial semi-random corruption model) for the fully streaming setting. We show novel nearly linear convergence guarantees of SGD-exp to the true parameter with up to $50\%$ Massart corruption rate, and with any corruption rate in the case of symmetric oblivious corruptions. This is t… ▽ More

    Submitted 2 March, 2024; originally announced March 2024.

    Comments: Submitted to a journal

    MSC Class: 65F10; 60-XX

  4. arXiv:2402.10482  [pdf, other

    cs.LG stat.ML

    Understanding Self-Distillation and Partial Label Learning in Multi-Class Classification with Label Noise

    Authors: Hyeonsu Jeong, Hye Won Chung

    Abstract: Self-distillation (SD) is the process of training a student model using the outputs of a teacher model, with both models sharing the same architecture. Our study theoretically examines SD in multi-class classification with cross-entropy loss, exploring both multi-round SD and SD with refined teacher outputs, inspired by partial label learning (PLL). By deriving a closed-form solution for the stude… ▽ More

    Submitted 16 February, 2024; originally announced February 2024.

  5. arXiv:2310.01107  [pdf, other

    cs.CV cs.AI cs.LG stat.ML

    Ground-A-Video: Zero-shot Grounded Video Editing using Text-to-image Diffusion Models

    Authors: Hyeonho Jeong, Jong Chul Ye

    Abstract: Recent endeavors in video editing have showcased promising results in single-attribute editing or style transfer tasks, either by training text-to-video (T2V) models on text-video data or adopting training-free methods. However, when confronted with the complexities of multi-attribute editing scenarios, they exhibit shortcomings such as omitting or overlooking intended attribute changes, modifying… ▽ More

    Submitted 24 February, 2024; v1 submitted 2 October, 2023; originally announced October 2023.

    Comments: Accepted to ICLR 2024, Project Page: http://ground-a-video.github.io

  6. arXiv:2308.16058  [pdf, other

    stat.ME math.ST stat.AP

    A Classification of Observation-Driven State-Space Count Models for Panel Data

    Authors: Jae Youn Ahn, Himchan Jeong, Yang Lu, Mario V. Wüthrich

    Abstract: State-space models are widely used in many applications. In the domain of count data, one such example is the model proposed by Harvey and Fernandes (1989). Unlike many of its parameter-driven alternatives, this model is observation-driven, leading to closed-form expressions for the predictive density. In this paper, we demonstrate the need to extend the model of Harvey and Fernandes (1989) by sho… ▽ More

    Submitted 30 August, 2023; originally announced August 2023.

    Comments: 28 pages, 2 figures

    MSC Class: 62M10 ACM Class: G.3

  7. arXiv:2304.10123  [pdf, other

    stat.ML math.NA

    Linear Convergence of Reshuffling Kaczmarz Methods With Sparse Constraints

    Authors: Halyun Jeong, Deanna Needell

    Abstract: The Kaczmarz method (KZ) and its variants, which are types of stochastic gradient descent (SGD) methods, have been extensively studied due to their simplicity and efficiency in solving linear equation systems. The iterative thresholding (IHT) method has gained popularity in various research fields, including compressed sensing or sparse linear regression, machine learning with additional structure… ▽ More

    Submitted 20 April, 2023; originally announced April 2023.

    Comments: Submitted to a journal

    MSC Class: 65F10; 65F22; 90C26

  8. arXiv:2302.03900  [pdf, other

    cs.CV cs.AI cs.LG stat.ML

    Zero-shot Generation of Coherent Storybook from Plain Text Story using Diffusion Models

    Authors: Hyeonho Jeong, Gihyun Kwon, Jong Chul Ye

    Abstract: Recent advancements in large scale text-to-image models have opened new possibilities for guiding the creation of images through human-devised natural language. However, while prior literature has primarily focused on the generation of individual images, it is essential to consider the capability of these models to ensure coherency within a sequence of images to fulfill the demands of real-world a… ▽ More

    Submitted 8 February, 2023; originally announced February 2023.

  9. arXiv:2301.00006  [pdf, other

    cs.HC cs.IT cs.LG stat.ML

    Recovering Top-Two Answers and Confusion Probability in Multi-Choice Crowdsourcing

    Authors: Hyeonsu Jeong, Hye Won Chung

    Abstract: Crowdsourcing has emerged as an effective platform for labeling large amounts of data in a cost- and time-efficient manner. Most previous work has focused on designing an efficient algorithm to recover only the ground-truth labels of the data. In this paper, we consider multi-choice crowdsourcing tasks with the goal of recovering not only the ground truth, but also the most confusing answer and th… ▽ More

    Submitted 31 May, 2023; v1 submitted 29 December, 2022; originally announced January 2023.

    Comments: ICML 2023

  10. arXiv:2212.01168  [pdf, other

    cs.LG cs.AI physics.comp-ph stat.ML

    Towards Cross Domain Generalization of Hamiltonian Representation via Meta Learning

    Authors: Yeongwoo Song, Hawoong Jeong

    Abstract: Recent advances in deep learning for physics have focused on discovering shared representations of target systems by incorporating physics priors or inductive biases into neural networks. While effective, these methods are limited to the system domain, where the type of system remains consistent and thus cannot ensure the adaptation to new, or unseen physical systems governed by different laws. Fo… ▽ More

    Submitted 27 April, 2024; v1 submitted 2 December, 2022; originally announced December 2022.

    Comments: Conference paper at ICLR 2024

  11. arXiv:2210.05816  [pdf, other

    stat.ME cs.AI cs.LG

    Finding and Listing Front-door Adjustment Sets

    Authors: Hyunchai Jeong, ** Tian, Elias Bareinboim

    Abstract: Identifying the effects of new interventions from data is a significant challenge found across a wide range of the empirical sciences. A well-known strategy for identifying such effects is Pearl's front-door (FD) criterion (Pearl, 1995). The definition of the FD criterion is declarative, only allowing one to decide whether a specific set satisfies the criterion. In this paper, we present algorithm… ▽ More

    Submitted 14 October, 2022; v1 submitted 11 October, 2022; originally announced October 2022.

    Comments: Pages: 18 (main paper 10, references 2, appendix 6), Figures: 9 (main paper 7, appendix 2), to be published in Proceedings of the 36th Annual Conference on Neural Information Processing Systems

  12. arXiv:2110.09657  [pdf, ps, other

    stat.AP

    A simple Bayesian state-space model for the collective risk model

    Authors: Jae Youn Ahn, Himchan Jeong, Yang Lu

    Abstract: The collective risk model (CRM) for frequency and severity is an important tool for retail insurance ratemaking, macro-level catastrophic risk forecasting, as well as operational risk in banking regulation. This model, which is initially designed for cross-sectional data, has recently been adapted to a longitudinal context to conduct both a priori and a posteriori ratemaking, through the introduct… ▽ More

    Submitted 18 October, 2021; originally announced October 2021.

  13. arXiv:2109.10431  [pdf, other

    cs.LG cs.CY cs.IT stat.ML

    Fairness without Imputation: A Decision Tree Approach for Fair Prediction with Missing Values

    Authors: Haewon Jeong, Hao Wang, Flavio P. Calmon

    Abstract: We investigate the fairness concerns of training a machine learning model using data with missing values. Even though there are a number of fairness intervention methods in the literature, most of them require a complete training set as input. In practice, data can have missing values, and data missing patterns can depend on group attributes (e.g. gender or race). Simply applying off-the-shelf fai… ▽ More

    Submitted 13 April, 2022; v1 submitted 21 September, 2021; originally announced September 2021.

  14. arXiv:2109.07956  [pdf, other

    stat.AP

    On the ordering of credibility factors

    Authors: Jae Youn Ahn, Himchan Jeong, Yang Lu

    Abstract: Traditional credibility analysis of risks in insurance is based on the random effects model, where the heterogeneity across the policyholders is assumed to be time-invariant. One popular extension is the dynamic random effects (or state-space) model. However, while the latter allows for time-varying heterogeneity, its application to the credibility analysis should be conducted with care due to the… ▽ More

    Submitted 16 September, 2021; originally announced September 2021.

  15. arXiv:2102.04008  [pdf, other

    cs.LG physics.class-ph physics.comp-ph physics.data-an stat.ML

    Discovering conservation laws from trajectories via machine learning

    Authors: Seungwoong Ha, Hawoong Jeong

    Abstract: Invariants and conservation laws convey critical information about the underlying dynamics of a system, yet it is generally infeasible to find them from large-scale data without any prior knowledge or human insight. We propose ConservNet to achieve this goal, a neural network that spontaneously discovers a conserved quantity from grouped data where the members of each group share invariants, simil… ▽ More

    Submitted 30 June, 2021; v1 submitted 8 February, 2021; originally announced February 2021.

    Comments: 12 pages, 9 figures

  16. arXiv:2102.03065  [pdf, other

    cs.LG cs.AI cs.CV stat.ML

    Co-Mixup: Saliency Guided Joint Mixup with Supermodular Diversity

    Authors: Jang-Hyun Kim, Wonho Choo, Hosan Jeong, Hyun Oh Song

    Abstract: While deep neural networks show great performance on fitting to the training distribution, improving the networks' generalization performance to the test distribution and robustness to the sensitivity to input perturbations still remain as a challenge. Although a number of mixup based augmentation strategies have been proposed to partially address them, it remains unclear as to how to best utilize… ▽ More

    Submitted 5 February, 2021; originally announced February 2021.

    Comments: Published at ICLR 2021 (Oral)

  17. arXiv:2006.12777  [pdf, other

    cs.LG stat.ML

    Clinical Risk Prediction with Temporal Probabilistic Asymmetric Multi-Task Learning

    Authors: A. Tuan Nguyen, Hyewon Jeong, Eunho Yang, Sung Ju Hwang

    Abstract: Although recent multi-task learning methods have shown to be effective in improving the generalization of deep neural networks, they should be used with caution for safety-critical applications, such as clinical risk prediction. This is because even if they achieve improved task-average performance, they may still yield degraded performance on individual tasks, which may be critical (e.g., predict… ▽ More

    Submitted 18 February, 2021; v1 submitted 23 June, 2020; originally announced June 2020.

    Comments: AAAI 2021. The first two authors contributed equally to this work. 10 pages, 4 figures, 4 tables

  18. arXiv:2006.10190  [pdf, other

    cs.LG cs.RO stat.ML

    Learning to Track Dynamic Targets in Partially Known Environments

    Authors: Hee** Jeong, Hamed Hassani, Manfred Morari, Daniel D. Lee, George J. Pappas

    Abstract: We solve active target tracking, one of the essential tasks in autonomous systems, using a deep reinforcement learning (RL) approach. In this problem, an autonomous agent is tasked with acquiring information about targets of interests using its onboard sensors. The classical challenges in this problem are system model dependence and the difficulty of computing information-theoretic cost functions… ▽ More

    Submitted 17 June, 2020; originally announced June 2020.

    Comments: IEEE Transaction on Robotics (under review); Demo video: https://youtu.be/0ZFyOWJ2ulo ; Source code: https://github.com/coco66/ttenv

  19. arXiv:2006.06151  [pdf, other

    stat.AP

    On a Multi-Year Microlevel Collective Risk Model

    Authors: Rosy Oh, Himchan Jeong, Jae Youn Ahn, Emiliano A. Valdez

    Abstract: For a typical insurance portfolio, the claims process for a short period, typically one year, is characterized by observing frequency of claims together with the associated claims severities. The collective risk model describes this portfolio as a random sum of the aggregation of the claim amounts. In the classical framework, for simplicity, the claim frequency and claim severities are assumed to… ▽ More

    Submitted 10 June, 2020; originally announced June 2020.

  20. arXiv:2006.05419  [pdf, other

    cs.LG cs.HC stat.ML

    Cost-effective Interactive Attention Learning with Neural Attention Processes

    Authors: Jay Heo, Junhyeon Park, Hyewon Jeong, Kwang Joon Kim, Juho Lee, Eunho Yang, Sung Ju Hwang

    Abstract: We propose a novel interactive learning framework which we refer to as Interactive Attention Learning (IAL), in which the human supervisors interactively manipulate the allocated attentions, to correct the model's behavior by updating the attention-generating network. However, such a model is prone to overfitting due to scarcity of human annotations, and requires costly retraining. Moreover, it is… ▽ More

    Submitted 9 June, 2020; originally announced June 2020.

  21. arXiv:2004.08032  [pdf, other

    stat.ME stat.AP

    A non-convex regularization approach for stable estimation of loss development factors

    Authors: Himchan Jeong, Hyunwoong Chang, Emiliano A. Valdez

    Abstract: In this article, we apply non-convex regularization methods in order to obtain stable estimation of loss development factors in insurance claims reserving. Among the non-convex regularization methods, we focus on the use of the log-adjusted absolute deviation (LAAD) penalty and provide discussion on optimization of LAAD penalized regression model, which we prove to converge with a coordinate desce… ▽ More

    Submitted 6 December, 2020; v1 submitted 16 April, 2020; originally announced April 2020.

    Comments: 23 pages, 11 Tables, 6 Figures

    MSC Class: 62P05

  22. arXiv:2003.04166  [pdf, other

    cond-mat.stat-mech cs.LG stat.ML

    Learning entropy production via neural networks

    Authors: Dong-Kyum Kim, Youngkyoung Bae, Sangyun Lee, Hawoong Jeong

    Abstract: This Letter presents a neural estimator for entropy production, or NEEP, that estimates entropy production (EP) from trajectories of relevant variables without detailed information on the system dynamics. For steady state, we rigorously prove that the estimator, which can be built up from different choices of deep neural networks, provides stochastic EP by optimizing the objective function propose… ▽ More

    Submitted 11 September, 2020; v1 submitted 9 March, 2020; originally announced March 2020.

    Comments: 6+8 pages, 4+8 figures

    Journal ref: Phys. Rev. Lett. 125, 140604 (2020)

  23. arXiv:2001.10631  [pdf, ps, other

    cs.IT math.ST stat.ML

    Sub-Gaussian Matrices on Sets: Optimal Tail Dependence and Applications

    Authors: Halyun Jeong, Xiaowei Li, Yaniv Plan, Özgür Yılmaz

    Abstract: Random linear map**s are widely used in modern signal processing, compressed sensing and machine learning. These map**s may be used to embed the data into a significantly lower dimension while at the same time preserving useful information. This is done by approximately preserving the distances between data points, which are assumed to belong to $\mathbb{R}^n$. Thus, the performance of these m… ▽ More

    Submitted 20 January, 2021; v1 submitted 28 January, 2020; originally announced January 2020.

  24. arXiv:1912.05827  [pdf, other

    cs.LG stat.ML

    An Efficient Explorative Sampling Considering the Generative Boundaries of Deep Generative Neural Networks

    Authors: Giyoung Jeon, Haedong Jeong, Jaesik Choi

    Abstract: Deep generative neural networks (DGNNs) have achieved realistic and high-quality data generation. In particular, the adversarial training scheme has been applied to many DGNNs and has exhibited powerful performance. Despite of recent advances in generative networks, identifying the image generation mechanism still remains challenging. In this paper, we present an explorative sampling algorithm to… ▽ More

    Submitted 12 December, 2019; originally announced December 2019.

    Comments: AAAI 2020

  25. arXiv:1910.10754  [pdf, other

    cs.LG cs.RO stat.ML

    Learning Q-network for Active Information Acquisition

    Authors: Hee** Jeong, Brent Schlotfeldt, Hamed Hassani, Manfred Morari, Daniel D. Lee, George J. Pappas

    Abstract: In this paper, we propose a novel Reinforcement Learning approach for solving the Active Information Acquisition problem, which requires an agent to choose a sequence of actions in order to acquire information about a process of interest using on-board sensors. The classic challenges in the information acquisition problem are the dependence of a planning algorithm on known models and the difficult… ▽ More

    Submitted 23 October, 2019; originally announced October 2019.

    Comments: IROS 2019, Video https://youtu.be/0ZFyOWJ2ulo

    Journal ref: IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), 2019

  26. arXiv:physics/0702148  [pdf, ps, other

    physics.soc-ph cond-mat.stat-mech physics.data-an stat.AP

    Reliability of rank order in sampled networks

    Authors: Pan-Jun Kim, Hawoong Jeong

    Abstract: In complex scale-free networks, ranking the individual nodes based upon their importance has useful applications, such as the identification of hubs for epidemic control, or bottlenecks for controlling traffic congestion. However, in most real situations, only limited sub-structures of entire networks are available, and therefore the reliability of the order relationships in sampled networks req… ▽ More

    Submitted 16 February, 2007; originally announced February 2007.

    Journal ref: Eur. Phys. J. B 55, 109-114 (2007)

  27. arXiv:cond-mat/0505232  [pdf, ps, other

    cond-mat.dis-nn physics.soc-ph stat.ME

    Statistical properties of sampled networks

    Authors: Sang Hoon Lee, Pan-Jun Kim, Hawoong Jeong

    Abstract: We study the statistical properties of the sampled scale-free networks, deeply related to the proper identification of various real-world networks. We exploit three methods of sampling and investigate the topological properties such as degree and betweenness centrality distribution, average path length, assortativity, and clustering coefficient of sampled networks compared with those of original… ▽ More

    Submitted 24 November, 2009; v1 submitted 10 May, 2005; originally announced May 2005.

    Comments: 8 pages, 11 figures

    Journal ref: Phys. Rev. E 73, 016102 (2006)