Skip to main content

Showing 1–14 of 14 results for author: An, J

Searching in archive stat. Search in all archives.
.
  1. arXiv:2304.09221  [pdf, ps, other

    cs.LG math.OC stat.ML

    Convergence of stochastic gradient descent under a local Lojasiewicz condition for deep neural networks

    Authors: **g An, Jianfeng Lu

    Abstract: We study the convergence of stochastic gradient descent (SGD) for non-convex objective functions. We establish the local convergence with positive probability under the local Łojasiewicz condition introduced by Chatterjee in \cite{chatterjee2022convergence} and an additional local structural assumption of the loss function landscape. A key component of our proof is to ensure that the whole traject… ▽ More

    Submitted 12 January, 2024; v1 submitted 18 April, 2023; originally announced April 2023.

    Comments: v2 fixed several mistakes. Some parts have been rewritten

  2. arXiv:2303.03027  [pdf, other

    stat.ML cs.LG

    Critical Points and Convergence Analysis of Generative Deep Linear Networks Trained with Bures-Wasserstein Loss

    Authors: Pierre Bréchet, Katerina Papagiannouli, **g An, Guido Montúfar

    Abstract: We consider a deep matrix factorization model of covariance matrices trained with the Bures-Wasserstein distance. While recent works have made advances in the study of the optimization problem for overparametrized low-rank matrix approximation, much emphasis has been placed on discriminative settings and the square loss. In contrast, our model considers another type of loss and connects with the g… ▽ More

    Submitted 13 July, 2023; v1 submitted 6 March, 2023; originally announced March 2023.

    Comments: 42 pages, 3 figures, accepted at ICML 2023

  3. arXiv:2103.02565  [pdf, other

    cs.LG cs.CY q-bio.BM q-bio.QM stat.ML

    GLAMOUR: Graph Learning over Macromolecule Representations

    Authors: Somesh Mohapatra, Joyce An, Rafael Gómez-Bombarelli

    Abstract: The near-infinite chemical diversity of natural and artificial macromolecules arises from the vast range of possible component monomers, linkages, and polymers topologies. This enormous variety contributes to the ubiquity and indispensability of macromolecules but hinders the development of general machine learning methods with macromolecules as input. To address this, we developed GLAMOUR, a fram… ▽ More

    Submitted 23 August, 2021; v1 submitted 3 March, 2021; originally announced March 2021.

    Comments: Main text: 4 pages, 2 figures; Appendix: 33 pages, 46 figures, 7 in-text tables, 4 supplementary tables

    ACM Class: J.2.4; J.3.1

  4. arXiv:2009.13447  [pdf, other

    cs.LG math.NA stat.ML

    Why resampling outperforms reweighting for correcting sampling bias with stochastic gradients

    Authors: **g An, Lexing Ying, Yuhua Zhu

    Abstract: A data set sampled from a certain population is biased if the subgroups of the population are sampled at proportions that are significantly different from their underlying proportions. Training machine learning models on biased data sets requires correction techniques to compensate for the bias. We consider two commonly-used techniques, resampling and reweighting, that rebalance the proportions of… ▽ More

    Submitted 27 August, 2021; v1 submitted 28 September, 2020; originally announced September 2020.

  5. Are You A Risk Taker? Adversarial Learning of Asymmetric Cross-Domain Alignment for Risk Tolerance Prediction

    Authors: Zhe Liu, Lina Yao, Xianzhi Wang, Lei Bai, Jake An

    Abstract: Most current studies on survey analysis and risk tolerance modelling lack professional knowledge and domain-specific models. Given the effectiveness of generative adversarial learning in cross-domain information, we design an Asymmetric cross-Domain Generative Adversarial Network (ADGAN) for domain scale inequality. ADGAN utilizes the information-sufficient domain to provide extra information to i… ▽ More

    Submitted 18 April, 2020; originally announced April 2020.

  6. arXiv:1911.08252  [pdf, other

    cs.LG stat.ML

    IC-Network: Efficient Structure for Convolutional Neural Networks

    Authors: Junyi An, Fengshan Liu, Jian Zhao, Furao Shen

    Abstract: Neural networks have been widely used, and most networks achieve excellent performance by stacking certain types of basic units. Compared to increasing the depth and width of the network, designing more effective basic units has become an important research topic. Inspired by the elastic collision model in physics, we present a universal structure that could be integrated into the existing network… ▽ More

    Submitted 4 June, 2020; v1 submitted 19 November, 2019; originally announced November 2019.

    Comments: 9 pages, 4 figures

  7. arXiv:1909.02391  [pdf, other

    cs.LG eess.SP stat.ML

    Data-driven simulation for general purpose multibody dynamics using deep neural networks

    Authors: Hee-Sun Choi, Junmo An, **-Gyun Kim, Jae-Yoon Jung, Juhwan Choi, Grzegorz Orzechowski, Aki Mikkola, ** Hwan Choi

    Abstract: In this paper, a machine learning-based simulation framework of general-purpose multibody dynamics is introduced. The aim of the framework is to generate a well-trained meta-model of multibody dynamics (MBD) systems. To this end, deep neural network (DNN) is employed to the framework so as to construct data-based meta-model representing multibody systems. Constructing well-defined training data se… ▽ More

    Submitted 2 September, 2019; originally announced September 2019.

    Comments: 32 pages, 17 figures, 11 tables

  8. arXiv:1907.05286  [pdf

    cs.CV cs.LG stat.ML

    Voxel-FPN: multi-scale voxel feature aggregation in 3D object detection from point clouds

    Authors: Bei Wang, Jian** An, Jiayan Cao

    Abstract: Object detection in point cloud data is one of the key components in computer vision systems, especially for autonomous driving applications. In this work, we present Voxel-FPN, a novel one-stage 3D object detector that utilizes raw data from LIDAR sensors only. The core framework consists of an encoder network and a corresponding decoder followed by a region proposal network. Encoder extracts mul… ▽ More

    Submitted 16 July, 2019; v1 submitted 28 June, 2019; originally announced July 2019.

  9. arXiv:1811.00246  [pdf, other

    cs.LG stat.ML

    SARN: Relational Reasoning through Sequential Attention

    Authors: **won An, Sungwon Lyu, Sungzoon Cho

    Abstract: This paper proposes an attention module augmented relational network called SARN(Sequential Attention Relational Network) that can carry out relational reasoning by extracting reference objects and making efficient pairing between objects. SARN greatly reduces the computational and memory requirements of the relational network, which computes all object pairs. It also shows high accuracy on the So… ▽ More

    Submitted 1 November, 2018; originally announced November 2018.

  10. Stochastic modified equations for the asynchronous stochastic gradient descent

    Authors: **g An, Jianfeng Lu, Lexing Ying

    Abstract: We propose a stochastic modified equations (SME) for modeling the asynchronous stochastic gradient descent (ASGD) algorithms. The resulting SME of Langevin type extracts more information about the ASGD dynamics and elucidates the relationship between different types of stochastic gradient algorithms. We show the convergence of ASGD to the SME in the continuous time limit, as well as the SME's prec… ▽ More

    Submitted 27 September, 2019; v1 submitted 21 May, 2018; originally announced May 2018.

    Comments: Final version. To appear in Information and Inference

  11. arXiv:1612.05021  [pdf, other

    stat.AP q-fin.GN

    Dynamic Modeling of Price Responsive Demand in Real-time Electricity Market: Empirical Analysis

    Authors: Jaeyong An, P. R. Kumar, Le Xie

    Abstract: In this paper, we study the price responsiveness of electricity consumption from empirical commercial and industrial load data obtained from Texas. Employing a dynamical system perspective, we show that price responsive demand can be modeled as a hybrid of a Hammerstein model with delay following a price surge, and a linear ARX model under moderate price changes. It is observed that electricity co… ▽ More

    Submitted 15 December, 2016; originally announced December 2016.

  12. arXiv:1409.7489  [pdf, other

    cs.SI cs.CY cs.HC physics.soc-ph stat.ML

    Recommending Investors for Crowdfunding Projects

    Authors: Jisun An, Daniele Quercia, Jon Crowcroft

    Abstract: To bring their innovative ideas to market, those embarking in new ventures have to raise money, and, to do so, they have often resorted to banks and venture capitalists. Nowadays, they have an additional option: that of crowdfunding. The name refers to the idea that funds come from a network of people on the Internet who are passionate about supporting others' projects. One of the most popular cro… ▽ More

    Submitted 12 October, 2014; v1 submitted 26 September, 2014; originally announced September 2014.

    Comments: Published in Proc. of WWW 2014

  13. arXiv:1308.4017  [pdf, other

    stat.ML cs.HC q-bio.NC

    A Study on Stroke Rehabilitation through Task-Oriented Control of a Haptic Device via Near-Infrared Spectroscopy-Based BCI

    Authors: Berdakh Abibullaev, **ung An, Seung-Hyun Lee, Jeon-Il Moon

    Abstract: This paper presents a study in task-oriented approach to stroke rehabilitation by controlling a haptic device via near-infrared spectroscopy-based brain-computer interface (BCI). The task is to command the haptic device to move in opposing directions of leftward and rightward movement. Our study consists of data acquisition, signal preprocessing, and classification. In data acquisition, we conduct… ▽ More

    Submitted 14 April, 2014; v1 submitted 19 August, 2013; originally announced August 2013.

    Comments: 13 pages, 6 figures

  14. arXiv:1209.5467   

    stat.ML cs.LG

    Minimizing inter-subject variability in fNIRS based Brain Computer Interfaces via multiple-kernel support vector learning

    Authors: Berdakh Abibullaev, **ung An, Seung-Hyun Lee, Sang-Hyeon **, Jeon-Il Moon

    Abstract: Brain signal variability in the measurements obtained from different subjects during different sessions significantly deteriorates the accuracy of most brain-computer interface (BCI) systems. Moreover these variabilities, also known as inter-subject or inter-session variabilities, require lengthy calibration sessions before the BCI system can be used. Furthermore, the calibration session has to be… ▽ More

    Submitted 7 May, 2013; v1 submitted 24 September, 2012; originally announced September 2012.

    Comments: This paper has been withdrawn by the author due to an error in equation 19