Skip to main content

Showing 1–49 of 49 results for author: Lee, W

Searching in archive stat. Search in all archives.
.
  1. arXiv:2312.01541  [pdf, other

    cs.LG cs.AI stat.ML

    Revisiting Non-separable Binary Classification and its Applications in Anomaly Detection

    Authors: Matthew Lau, Ismaila Seck, Athanasios P Meliopoulos, Wenke Lee, Eugene Ndiaye

    Abstract: The inability to linearly classify XOR has motivated much of deep learning. We revisit this age-old problem and show that linear classification of XOR is indeed possible. Instead of separating data between halfspaces, we propose a slightly different paradigm, equality separation, that adapts the SVM objective to distinguish data within or outside the margin. Our classifier can then be integrated i… ▽ More

    Submitted 18 June, 2024; v1 submitted 3 December, 2023; originally announced December 2023.

    Comments: Accepted in Transactions on Machine Learning Research (TMLR) 2024. Code: https://github.com/mattlaued/XOR-is-Linearly-Classifiable

    MSC Class: 68T37 (Primary); 68T07 (Secondary) ACM Class: I.2.6; I.5.1

  2. arXiv:2309.03097  [pdf, other

    stat.AP

    An Algorithm for Modelling Escalator Fixed Loss Energy for PHM and sustainable energy usage

    Authors: Xuwen Hu, Jiaqi Qiu, Yu Lin, Inez Maria Zwetsloot, William Ka Fai Lee, Edmond Yin San Yeung, Colman Yiu Wah Yeung, Chris Chun Long Wong

    Abstract: Prognostic Health Management (PHM) is designed to assess and monitor the health status of systems, anticipate the onset of potential failure, and prevent unplanned downtime. In recent decades, collecting massive amounts of real-time sensor data enabled condition monitoring (CM) and consequently, detection of abnormalities to support maintenance decision-making. Additionally, the utilization of PHM… ▽ More

    Submitted 6 September, 2023; originally announced September 2023.

  3. arXiv:2306.05436  [pdf, other

    stat.AP cs.CY

    Remaining Useful Life Modelling with an Escalator Health Condition Analytic System

    Authors: Inez M. Zwetsloot, Yu Lin, Jiaqi Qiu, Lishuai Li, William Ka Fai Lee, Edmond Yin San Yeung, Colman Yiu Wah Yeung, Chris Chun Long Wong

    Abstract: The refurbishment of an escalator is usually linked with its design life as recommended by the manufacturer. However, the actual useful life of an escalator should be determined by its operating condition which is affected by the runtime, workload, maintenance quality, vibration, etc., rather than age only. The objective of this project is to develop a comprehensive health condition analytic syste… ▽ More

    Submitted 7 June, 2023; originally announced June 2023.

    Comments: 14 pages, 12 figures, 7 tables

  4. arXiv:2301.13370  [pdf, other

    cs.LG stat.ML

    On the Correctness of Automatic Differentiation for Neural Networks with Machine-Representable Parameters

    Authors: Wonyeol Lee, Sejun Park, Alex Aiken

    Abstract: Recent work has shown that forward- and reverse- mode automatic differentiation (AD) over the reals is almost always correct in a mathematically precise sense. However, actual programs work with machine-representable numbers (e.g., floating-point numbers), not reals. In this paper, we study the correctness of AD when the parameter space of a neural network consists solely of machine-representable… ▽ More

    Submitted 6 June, 2023; v1 submitted 30 January, 2023; originally announced January 2023.

    Comments: To appear at ICML 2023

  5. arXiv:2210.05248  [pdf, other

    cs.LG cs.AI stat.ML

    Self-supervised debiasing using low rank regularization

    Authors: Geon Yeong Park, Chanyong Jung, Sangmin Lee, Jong Chul Ye, Sang Wan Lee

    Abstract: Spurious correlations can cause strong biases in deep neural networks, impairing generalization ability. While most existing debiasing methods require full supervision on either spurious attributes or target labels, training a debiased model from a limited amount of both annotations is still an open question. To address this issue, we investigate an interesting phenomenon using the spectral analys… ▽ More

    Submitted 8 October, 2023; v1 submitted 11 October, 2022; originally announced October 2022.

  6. arXiv:2210.05247  [pdf, other

    cs.LG cs.AI stat.ML

    Training Debiased Subnetworks with Contrastive Weight Pruning

    Authors: Geon Yeong Park, Sangmin Lee, Sang Wan Lee, Jong Chul Ye

    Abstract: Neural networks are often biased to spuriously correlated features that provide misleading statistical evidence that does not generalize. This raises an interesting question: ``Does an optimal unbiased functional subnetwork exist in a severely biased network? If so, how to extract such subnetwork?" While empirical evidence has been accumulated about the existence of such unbiased subnetworks, thes… ▽ More

    Submitted 26 June, 2023; v1 submitted 11 October, 2022; originally announced October 2022.

    Comments: CVPR 2023, code: https://github.com/ParkGeonYeong/DCWP

  7. arXiv:2205.07833  [pdf, other

    cs.LG stat.ML

    Decision Making for Hierarchical Multi-label Classification with Multidimensional Local Precision Rate

    Authors: Yuting Ye, Christine Ho, Ci-Ren Jiang, Wayne Tai Lee, Haiyan Huang

    Abstract: Hierarchical multi-label classification (HMC) has drawn increasing attention in the past few decades. It is applicable when hierarchical relationships among classes are available and need to be incorporated along with the multi-label classification whereby each object is assigned to one or more classes. There are two key challenges in HMC: i) optimizing the classification accuracy, and meanwhile i… ▽ More

    Submitted 16 May, 2022; originally announced May 2022.

    Comments: 34 pages, 11 figures, 9 tables

  8. arXiv:2110.14953  [pdf, other

    cs.LG stat.ML

    Multi-Task Neural Processes

    Authors: Donggyun Kim, Seongwoong Cho, Wonkwang Lee, Seunghoon Hong

    Abstract: Neural Processes (NPs) consider a task as a function realized from a stochastic process and flexibly adapt to unseen tasks through inference on functions. However, naive NPs can model data from only a single stochastic process and are designed to infer each task independently. Since many real-world data represent a set of correlated tasks from multiple sources (e.g., multiple attributes and multi-… ▽ More

    Submitted 25 March, 2022; v1 submitted 28 October, 2021; originally announced October 2021.

    Comments: 49 pages, 19 figures

  9. arXiv:2110.14374  [pdf, other

    physics.comp-ph cond-mat.dis-nn stat.ML

    A2I Transformer: Permutation-equivariant attention network for pairwise and many-body interactions with minimal featurization

    Authors: Ji Woong Yu, Min Young Ha, Bumjoon Seo, Won Bo Lee

    Abstract: The combination of neural network potential (NNP) with molecular simulations plays an important role in an efficient and thorough understanding of a molecular system's potential energy surface (PES). However, gras** the interplay between input features and their local contribution to NNP is growingly evasive due to heavy featurization. In this work, we suggest an end-to-end model which directly… ▽ More

    Submitted 27 October, 2021; originally announced October 2021.

  10. arXiv:2105.03228  [pdf, other

    stat.CO stat.ME

    SEAGLE: A Scalable Exact Algorithm for Large-Scale Set-Based GxE Tests in Biobank Data

    Authors: Jocelyn T. Chi, Ilse C. F. Ipsen, Tzu-Hung Hsiao, Ching-Heng Lin, Li-San Wang, Wan-** Lee, Tzu-Pin Lu, Jung-Ying Tzeng

    Abstract: The explosion of biobank data offers immediate opportunities for gene-environment (GxE) interaction studies of complex diseases because of the large sample sizes and the rich collection in genetic and non-genetic information. However, the extremely large sample size also introduces new computational challenges in GxE assessment, especially for set-based GxE variance component (VC) tests, which are… ▽ More

    Submitted 14 May, 2021; v1 submitted 7 May, 2021; originally announced May 2021.

  11. arXiv:2010.09283  [pdf, other

    cs.LG stat.ML

    Neuralizing Efficient Higher-order Belief Propagation

    Authors: Mohammed Haroon Dupty, Wee Sun Lee

    Abstract: Graph neural network models have been extensively used to learn node representations for graph structured data in an end-to-end setting. These models often rely on localized first order approximations of spectral graph convolutions and hence are unable to capture higher-order relational information between nodes. Probabilistic Graphical Models form another class of models that provide rich flexibi… ▽ More

    Submitted 19 October, 2020; originally announced October 2020.

  12. arXiv:2010.01799  [pdf, other

    cs.LG eess.IV stat.ML

    Understanding Catastrophic Overfitting in Single-step Adversarial Training

    Authors: Hoki Kim, Woo** Lee, Jaewook Lee

    Abstract: Although fast adversarial training has demonstrated both robustness and efficiency, the problem of "catastrophic overfitting" has been observed. This is a phenomenon in which, during single-step adversarial training, the robust accuracy against projected gradient descent (PGD) suddenly decreases to 0% after a few epochs, whereas the robust accuracy against fast gradient sign method (FGSM) increase… ▽ More

    Submitted 15 December, 2020; v1 submitted 5 October, 2020; originally announced October 2020.

    Comments: Accepted to AAAI 2021. Preprint

  13. arXiv:2009.10978  [pdf, other

    cs.LG cs.CV stat.ML

    Semantics-Preserving Adversarial Training

    Authors: Wonseok Lee, Hanbit Lee, Sang-goo Lee

    Abstract: Adversarial training is a defense technique that improves adversarial robustness of a deep neural network (DNN) by including adversarial examples in the training data. In this paper, we identify an overlooked problem of adversarial training in that these adversarial examples often have different semantics than the original data, introducing unintended biases into the model. We hypothesize that suc… ▽ More

    Submitted 23 September, 2020; originally announced September 2020.

    Comments: Preprint. Under Review

  14. arXiv:2009.10380  [pdf, other

    cs.LG stat.ML

    PS8-Net: A Deep Convolutional Neural Network to Predict the Eight-State Protein Secondary Structure

    Authors: Md Aminur Rab Ratul, Maryam Tavakol Elahi, M. Hamed Mozaffari, WonSook Lee

    Abstract: Protein secondary structure is crucial to creating an information bridge between the primary and tertiary (3D) structures. Precise prediction of eight-state protein secondary structure (PSS) has significantly utilized in the structural and functional analysis of proteins in bioinformatics. Deep learning techniques have been recently applied in this research area and raised the eight-state (Q8) pro… ▽ More

    Submitted 22 September, 2020; originally announced September 2020.

  15. arXiv:2008.02430  [pdf, other

    cs.LG stat.ML

    Contrastive Variational Reinforcement Learning for Complex Observations

    Authors: Xiao Ma, Siwei Chen, David Hsu, Wee Sun Lee

    Abstract: Deep reinforcement learning (DRL) has achieved significant success in various robot tasks: manipulation, navigation, etc. However, complex visual observations in natural environments remains a major challenge. This paper presents Contrastive Variational Reinforcement Learning (CVRL), a model-based method that tackles complex visual observations in DRL. CVRL learns a contrastive variational model b… ▽ More

    Submitted 9 November, 2020; v1 submitted 5 August, 2020; originally announced August 2020.

    Comments: CoRL 2020 camera ready

  16. arXiv:2007.01420  [pdf, other

    cs.LG physics.comp-ph quant-ph stat.ML

    CoPhy-PGNN: Learning Physics-guided Neural Networks with Competing Loss Functions for Solving Eigenvalue Problems

    Authors: Mohannad Elhamod, Jie Bu, Christopher Singh, Matthew Redell, Abantika Ghosh, Viktor Podolskiy, Wei-Cheng Lee, Anuj Karpatne

    Abstract: Physics-guided Neural Networks (PGNNs) represent an emerging class of neural networks that are trained using physics-guided (PG) loss functions (capturing violations in network outputs with known physics), along with the supervision contained in data. Existing work in PGNNs has demonstrated the efficacy of adding single PG loss functions in the neural network objectives, using constant trade-off p… ▽ More

    Submitted 16 December, 2021; v1 submitted 2 July, 2020; originally announced July 2020.

  17. arXiv:2006.07107  [pdf, other

    cs.LG stat.ML

    Understanding and Resolving Performance Degradation in Graph Convolutional Networks

    Authors: Kuangqi Zhou, Yanfei Dong, Kaixin Wang, Wee Sun Lee, Bryan Hooi, Huan Xu, Jiashi Feng

    Abstract: A Graph Convolutional Network (GCN) stacks several layers and in each layer performs a PROPagation operation (PROP) and a TRANsformation operation (TRAN) for learning node representations over graph-structured data. Though powerful, GCNs tend to suffer performance drop when the model gets deep. Previous works focus on PROPs to study and mitigate this issue, but the role of TRANs is barely investig… ▽ More

    Submitted 13 September, 2021; v1 submitted 12 June, 2020; originally announced June 2020.

    Comments: CIKM 2021

  18. arXiv:2006.06903  [pdf, other

    cs.LG stat.ML

    On Correctness of Automatic Differentiation for Non-Differentiable Functions

    Authors: Wonyeol Lee, Hangyeol Yu, Xavier Rival, Hongseok Yang

    Abstract: Differentiation lies at the core of many machine-learning algorithms, and is well-supported by popular autodiff systems, such as TensorFlow and PyTorch. Originally, these systems have been developed to compute derivatives of differentiable functions, but in practice, they are commonly applied to functions with non-differentiabilities. For instance, neural networks using ReLU define non-differentia… ▽ More

    Submitted 26 October, 2020; v1 submitted 11 June, 2020; originally announced June 2020.

    Comments: To appear at NeurIPS 2020

  19. arXiv:2005.08701  [pdf, other

    q-bio.QM cs.LG eess.SP stat.ML

    Machine learning for the diagnosis of early stage diabetes using temporal glucose profiles

    Authors: Woo Seok Lee, Junghyo Jo, Taegeun Song

    Abstract: Machine learning shows remarkable success for recognizing patterns in data. Here we apply the machine learning (ML) for the diagnosis of early stage diabetes, which is known as a challenging task in medicine. Blood glucose levels are tightly regulated by two counter-regulatory hormones, insulin and glucagon, and the failure of the glucose homeostasis leads to the common metabolic disease, diabetes… ▽ More

    Submitted 18 May, 2020; originally announced May 2020.

    Comments: 4 pages, 2 figure

  20. arXiv:2004.10980  [pdf, other

    cs.LG nlin.CD physics.comp-ph stat.ML

    Deep Learning of Chaos Classification

    Authors: Woo Seok Lee, Sergej Flach

    Abstract: We train an artificial neural network which distinguishes chaotic and regular dynamics of the two-dimensional Chirikov standard map. We use finite length trajectories and compare the performance with traditional numerical methods which need to evaluate the Lyapunov exponent. The neural network has superior performance for short periods with length down to 10 Lyapunov times on which the traditional… ▽ More

    Submitted 23 April, 2020; originally announced April 2020.

    Comments: 8 pages, 8 figures

  21. arXiv:2003.00218  [pdf, other

    cs.LG stat.ML

    Multiplicative Gaussian Particle Filter

    Authors: Xuan Su, Wee Sun Lee, Zhen Zhang

    Abstract: We propose a new sampling-based approach for approximate inference in filtering problems. Instead of approximating conditional distributions with a finite set of states, as done in particle filters, our approach approximates the distribution with a weighted sum of functions from a set of continuous functions. Central to the approach is the use of sampling to approximate multiplications in the Baye… ▽ More

    Submitted 29 February, 2020; originally announced March 2020.

  22. arXiv:2002.09884  [pdf, other

    cs.LG cs.AI stat.ML

    Discriminative Particle Filter Reinforcement Learning for Complex Partial Observations

    Authors: Xiao Ma, Peter Karkus, David Hsu, Wee Sun Lee, Nan Ye

    Abstract: Deep reinforcement learning is successful in decision making for sophisticated games, such as Atari, Go, etc. However, real-world decision making often requires reasoning with partial information extracted from complex visual observations. This paper presents Discriminative Particle Filter Reinforcement Learning (DPFRL), a new reinforcement learning framework for complex partial observations. DPFR… ▽ More

    Submitted 23 February, 2020; originally announced February 2020.

    Comments: Accepted to ICLR 2020

  23. arXiv:2002.01171   

    cs.LG cs.HC eess.SP stat.ML

    Towards a Fast Steady-State Visual Evoked Potentials (SSVEP) Brain-Computer Interface (BCI)

    Authors: Aung Aung Phyo Wai, Yangsong Zhang, Heng Guo, Ying Chi, Lei Zhang, Xian-Sheng Hua, Seong Whan Lee, Cuntai Guan

    Abstract: Steady-state visual evoked potentials (SSVEP) brain-computer interface (BCI) provides reliable responses leading to high accuracy and information throughput. But achieving high accuracy typically requires a relatively long time window of one second or more. Various methods were proposed to improve sub-second response accuracy through subject-specific training and calibration. Substantial performan… ▽ More

    Submitted 12 May, 2020; v1 submitted 4 February, 2020; originally announced February 2020.

    Comments: Further improvements or modifications required to algorithm design

  24. arXiv:1911.09839  [pdf, other

    cs.LG stat.CO stat.ML

    Differentiable Algorithm for Marginalising Changepoints

    Authors: Hyoung** Lim, Gwonsoo Che, Wonyeol Lee, Hongseok Yang

    Abstract: We present an algorithm for marginalising changepoints in time-series models that assume a fixed number of unknown changepoints. Our algorithm is differentiable with respect to its inputs, which are the values of latent random variables other than changepoints. Also, it runs in time O(mn) where n is the number of time steps and m the number of changepoints, an improvement over a naive marginalisat… ▽ More

    Submitted 21 November, 2019; originally announced November 2019.

    Comments: To appear at AAAI 2020

  25. arXiv:1911.05522  [pdf, other

    stat.ME cs.CR cs.SI stat.AP stat.ML

    Anomaly Detection in Large Scale Networks with Latent Space Models

    Authors: Wesley Lee, Tyler H. McCormick, Joshua Neil, Cole Sodja, Yanran Cui

    Abstract: We develop a real-time anomaly detection algorithm for directed activity on large, sparse networks. We model the propensity for future activity using a dynamic logistic model with interaction terms for sender- and receiver-specific latent factors in addition to sender- and receiver-specific popularity scores; deviations from this underlying model constitute potential anomalies. Latent nodal attrib… ▽ More

    Submitted 29 January, 2021; v1 submitted 13 November, 2019; originally announced November 2019.

  26. arXiv:1911.03972  [pdf

    eess.IV cs.CV cs.LG stat.ML

    IrisNet: Deep Learning for Automatic and Real-time Tongue Contour Tracking in Ultrasound Video Data using Peripheral Vision

    Authors: M. Hamed Mozaffari, Md. Aminur Rab Ratul, Won-Sook Lee

    Abstract: The progress of deep convolutional neural networks has been successfully exploited in various real-time computer vision tasks such as image classification and segmentation. Owing to the development of computational units, availability of digital datasets, and improved performance of deep learning models, fully automatic and accurate tracking of tongue contours in real-time ultrasound data became p… ▽ More

    Submitted 17 April, 2020; v1 submitted 10 November, 2019; originally announced November 2019.

  27. arXiv:1909.12326  [pdf, other

    cs.LG cs.DC stat.ML

    Model Pruning Enables Efficient Federated Learning on Edge Devices

    Authors: Yuang Jiang, Shiqiang Wang, Victor Valls, Bong Jun Ko, Wei-Han Lee, Kin K. Leung, Leandros Tassiulas

    Abstract: Federated learning (FL) allows model training from local data collected by edge/mobile devices while preserving data privacy, which has wide applicability to image and vision applications. A challenge is that client devices in FL usually have much more limited computation and communication resources compared to servers in a datacenter. To overcome this challenge, we propose PruneFL -- a novel FL a… ▽ More

    Submitted 6 April, 2022; v1 submitted 26 September, 2019; originally announced September 2019.

    Comments: Accepted for publication in IEEE Transactions on Neural Networks and Learning Systems (TNNLS)

  28. arXiv:1909.05637  [pdf, other

    cs.CV cs.LG stat.ML

    DeepIST: Deep Image-based Spatio-Temporal Network for Travel Time Estimation

    Authors: Tao-yang Fu, Wang-Chien Lee

    Abstract: Estimating the travel time for a given path is a fundamental problem in many urban transportation systems. However, prior works fail to well capture moving behaviors embedded in paths and thus do not estimate the travel time accurately. To fill in this gap, in this work, we propose a novel neural network framework, namely {\em Deep Image-based Spatio-Temporal network (DeepIST)}, for travel time es… ▽ More

    Submitted 5 September, 2019; originally announced September 2019.

    Comments: 10 pages, accepted by The 28th ACM International Conference on Information and Knowledge Management (CIKM) 2019

    Journal ref: The 28th ACM International Conference on Information and Knowledge Management (CIKM) 2019

  29. arXiv:1907.04711  [pdf, other

    cs.AI cs.LG stat.ML

    Data-driven Policy on Feasibility Determination for the Train Shunting Problem

    Authors: Paulo R. de O. da Costa, J. Rhuggenaath, Y. Zhang, A. Akcay, W. Lee, U. Kaymak

    Abstract: Parking, matching, scheduling, and routing are common problems in train maintenance. In particular, train units are commonly maintained and cleaned at dedicated shunting yards. The planning problem that results from such situations is referred to as the Train Unit Shunting Problem (TUSP). This problem involves matching arriving train units to service tasks and determining the schedule for departin… ▽ More

    Submitted 10 July, 2019; originally announced July 2019.

    Comments: Accepted as conference paper at ECML PKDD 2019

  30. arXiv:1906.04301  [pdf, other

    cs.LG cs.CL cs.SD eess.AS eess.IV stat.ML

    Transfer Learning for Ultrasound Tongue Contour Extraction with Different Domains

    Authors: M. Hamed Mozaffari, Won-Sook Lee

    Abstract: Medical ultrasound technology is widely used in routine clinical applications such as disease diagnosis and treatment as well as other applications like real-time monitoring of human tongue shapes and motions as visual feedback in second language training. Due to the low-contrast characteristic and noisy nature of ultrasound images, it might require expertise for non-expert users to recognize tong… ▽ More

    Submitted 10 June, 2019; originally announced June 2019.

    Comments: 3 figures, 9 pages, 1 table, 16 references

    Journal ref: The Journal of the Acoustical Society of America 146, 2940 (2019)

  31. arXiv:1906.03604  [pdf, other

    stat.AP

    On Copula-based Collective Risk Models

    Authors: Rosy Oh, Jae Youn Ahn, Woojoo Lee

    Abstract: Several collective risk models have recently been proposed by relaxing the widely used but controversial assumption of independence between claim frequency and severity. Approaches include the bivariate copula model, random effect model, and two-part frequency-severity model. This study focuses on the copula approach to develop collective risk models that allow a flexible dependence structure for… ▽ More

    Submitted 9 June, 2019; originally announced June 2019.

  32. arXiv:1906.00554  [pdf, other

    cs.LG stat.ML

    Factor Graph Neural Network

    Authors: Zhen Zhang, Fan Wu, Wee Sun Lee

    Abstract: Most of the successful deep neural network architectures are structured, often consisting of elements like convolutional neural networks and gated recurrent neural networks. Recently, graph neural networks have been successfully applied to graph structured data such as point cloud and molecular data. These networks often only consider pairwise dependencies, as they operate on a graph structure. We… ▽ More

    Submitted 2 June, 2019; originally announced June 2019.

  33. arXiv:1905.12885  [pdf, other

    cs.LG stat.ML

    Particle Filter Recurrent Neural Networks

    Authors: Xiao Ma, Peter Karkus, David Hsu, Wee Sun Lee

    Abstract: Recurrent neural networks (RNNs) have been extraordinarily successful for prediction with sequential data. To tackle highly variable and noisy real-world data, we introduce Particle Filter Recurrent Neural Networks (PF-RNNs), a new RNN family that explicitly models uncertainty in its internal structure: while an RNN relies on a long, deterministic latent state vector, a PF-RNN maintains a latent s… ▽ More

    Submitted 1 December, 2019; v1 submitted 30 May, 2019; originally announced May 2019.

    Comments: Accepted to AAAI 2020

  34. arXiv:1905.11602  [pdf, other

    cs.LG cs.AI cs.RO stat.ML

    Differentiable Algorithm Networks for Composable Robot Learning

    Authors: Peter Karkus, Xiao Ma, David Hsu, Leslie Pack Kaelbling, Wee Sun Lee, Tomas Lozano-Perez

    Abstract: This paper introduces the Differentiable Algorithm Network (DAN), a composable architecture for robot learning systems. A DAN is composed of neural network modules, each encoding a differentiable robot algorithm and an associated model; and it is trained end-to-end from data. DAN combines the strengths of model-driven modular system design and data-driven end-to-end learning. The algorithms and mo… ▽ More

    Submitted 28 May, 2019; originally announced May 2019.

    Comments: RSS 2019 camera ready. Video is available at https://youtu.be/4jcYlTSJF4Y

  35. arXiv:1904.11761  [pdf, other

    cs.LG cs.AI stat.ML

    Factored Contextual Policy Search with Bayesian Optimization

    Authors: Robert Pinsler, Peter Karkus, Andras Kupcsik, David Hsu, Wee Sun Lee

    Abstract: Scarce data is a major challenge to scaling robot learning to truly complex tasks, as we need to generalize locally learned policies over different task contexts. Contextual policy search offers data-efficient learning and generalization by explicitly conditioning the policy on a parametric context space. In this paper, we further structure the contextual policy representation. We propose to facto… ▽ More

    Submitted 26 April, 2019; originally announced April 2019.

    Comments: To appear in ICRA 2019

  36. arXiv:1904.09816  [pdf, other

    cs.LG stat.ML

    Adversarial Dropout for Recurrent Neural Networks

    Authors: Sungrae Park, Kyungwoo Song, Mingi Ji, Wonsung Lee, Il-Chul Moon

    Abstract: Successful application processing sequential data, such as text and speech, requires an improved generalization performance of recurrent neural networks (RNNs). Dropout techniques for RNNs were introduced to respond to these demands, but we conjecture that the dropout on RNNs could have been improved by adopting the adversarial concept. This paper investigates ways to improve the dropout for RNNs… ▽ More

    Submitted 22 April, 2019; originally announced April 2019.

    Comments: published in AAAI19

  37. arXiv:1904.00136  [pdf, other

    stat.ME

    Estimating spillovers using imprecisely measured networks

    Authors: Morgan Hardy, Rachel M. Heath, Wesley Lee, Tyler H. McCormick

    Abstract: In many experimental contexts, whether and how network interactions impact the outcome of interest for both treated and untreated individuals are key concerns. Networks data is often assumed to perfectly represent these possible interactions. This paper considers the problem of estimating treatment effects when measured connections are, instead, a noisy representation of the true spillover pathway… ▽ More

    Submitted 8 March, 2024; v1 submitted 29 March, 2019; originally announced April 2019.

  38. arXiv:1901.02739  [pdf, other

    cs.LG stat.ML

    Dirichlet Variational Autoencoder

    Authors: Weonyoung Joo, Wonsung Lee, Sungrae Park, Il-Chul Moon

    Abstract: This paper proposes Dirichlet Variational Autoencoder (DirVAE) using a Dirichlet prior for a continuous latent variable that exhibits the characteristic of the categorical probabilities. To infer the parameters of DirVAE, we utilize the stochastic gradient method by approximating the Gamma distribution, which is a component of the Dirichlet distribution, with the inverse Gamma CDF approximation. A… ▽ More

    Submitted 9 January, 2019; originally announced January 2019.

  39. arXiv:1810.07954  [pdf, other

    stat.ML cs.LG

    HierLPR: Decision making in hierarchical multi-label classification with local precision rates

    Authors: Christine Ho, Yuting Ye, Ci-Ren Jiang, Wayne Tai Lee, Haiyan Huang

    Abstract: In this article we propose a novel ranking algorithm, referred to as HierLPR, for the multi-label classification problem when the candidate labels follow a known hierarchical structure. HierLPR is motivated by a new metric called eAUC that we design to assess the ranking of classification decisions. This metric, associated with the hit curve and local precision rate, emphasizes the accuracy of the… ▽ More

    Submitted 18 October, 2018; originally announced October 2018.

    Comments: 27 pages, 9 figures

  40. arXiv:1806.00176  [pdf, other

    cs.LG stat.ML

    Reparameterization Gradient for Non-differentiable Models

    Authors: Wonyeol Lee, Hangyeol Yu, Hongseok Yang

    Abstract: We present a new algorithm for stochastic variational inference that targets at models with non-differentiable densities. One of the key challenges in stochastic variational inference is to come up with a low-variance estimator of the gradient of a variational objective. We tackle the challenge by generalizing the reparameterization trick, one of the most effective techniques for addressing the va… ▽ More

    Submitted 25 October, 2018; v1 submitted 31 May, 2018; originally announced June 2018.

    Comments: To appear at Neural Information Processing Systems (NIPS) 2018

  41. arXiv:1805.08975  [pdf, other

    cs.RO cs.AI cs.CV cs.LG stat.ML

    Particle Filter Networks with Application to Visual Localization

    Authors: Peter Karkus, David Hsu, Wee Sun Lee

    Abstract: Particle filtering is a powerful approach to sequential state estimation and finds application in many domains, including robot localization, object tracking, etc. To apply particle filtering in practice, a critical challenge is to construct probabilistic system models, especially for systems with complex dynamics or rich sensory inputs such as camera images. This paper introduces the Particle Fil… ▽ More

    Submitted 25 October, 2018; v1 submitted 23 May, 2018; originally announced May 2018.

    Comments: CoRL 2018 camera ready

  42. arXiv:1802.09197  [pdf

    q-bio.QM cs.LG stat.ML

    AI4AI: Quantitative Methods for Classifying Host Species from Avian Influenza DNA Sequence

    Authors: Woo Yong Choi, Kyu Ye Song, Chan Woo Lee

    Abstract: Avian Influenza breakouts cause millions of dollars in damage each year globally, especially in Asian countries such as China and South Korea. The impact magnitude of a breakout directly correlates to time required to fully understand the influenza virus, particularly the interspecies pathogenicity. The procedure requires laboratory tests that require resources typically lacking in a breakout emer… ▽ More

    Submitted 26 February, 2018; originally announced February 2018.

  43. arXiv:1802.08727  [pdf, other

    stat.ME

    Bayesian Semiparametric Functional Mixed Models for Serially Correlated Functional Data, with Application to Glaucoma Data

    Authors: Wonyul Lee, Michelle F. Miranda, Phlip Rausch, Veerbhadran Baladandayuthapani, Massimo Fazio, J. Crawford Downs, Jeffrey S. Morris

    Abstract: Glaucoma, a leading cause of blindness, is characterized by optic nerve damage related to intraocular pressure (IOP), but its full etiology is unknown. Researchers at UAB have devised a custom device to measure scleral strain continuously around the eye under fixed levels of IOP, which here is used to assess how strain varies around the posterior pole, with IOP, and across glaucoma risk factors su… ▽ More

    Submitted 7 May, 2018; v1 submitted 23 February, 2018; originally announced February 2018.

    Comments: paper accepted in Journal of the American Statistical Association, 2018 -- to appear

  44. arXiv:1801.05394  [pdf, ps, other

    cs.LG cs.AI stat.ML

    Time Series Segmentation through Automatic Feature Learning

    Authors: Wei-Han Lee, Jorge Ortiz, Bongjun Ko, Ruby Lee

    Abstract: Internet of things (IoT) applications have become increasingly popular in recent years, with applications ranging from building energy monitoring to personal health tracking and activity recognition. In order to leverage these data, automatic knowledge extraction - whereby we map from observations to interpretable states and transitions - must be done at scale. As such, we have seen many recent Io… ▽ More

    Submitted 26 January, 2018; v1 submitted 16 January, 2018; originally announced January 2018.

  45. arXiv:1703.06692  [pdf, other

    cs.AI cs.LG cs.NE stat.ML

    QMDP-Net: Deep Learning for Planning under Partial Observability

    Authors: Peter Karkus, David Hsu, Wee Sun Lee

    Abstract: This paper introduces the QMDP-net, a neural network architecture for planning under partial observability. The QMDP-net combines the strengths of model-free learning and model-based planning. It is a recurrent policy network, but it represents a policy for a parameterized set of tasks by connecting a model with a planning algorithm that solves the model, thus embedding the solution structure of p… ▽ More

    Submitted 2 November, 2017; v1 submitted 20 March, 2017; originally announced March 2017.

    Comments: NIPS 2017 camera-ready

  46. arXiv:1612.01746  [pdf, other

    cs.LG cs.AI cs.RO stat.ML

    Factored Contextual Policy Search with Bayesian Optimization

    Authors: Peter Karkus, Andras Kupcsik, David Hsu, Wee Sun Lee

    Abstract: Scarce data is a major challenge to scaling robot learning to truly complex tasks, as we need to generalize locally learned policies over different "contexts". Bayesian optimization approaches to contextual policy search (CPS) offer data-efficient policy learning that generalize over a context space. We propose to improve data-efficiency by factoring typically considered contexts into two componen… ▽ More

    Submitted 28 May, 2019; v1 submitted 6 December, 2016; originally announced December 2016.

    Comments: BayesOpt 2016, NeurIPS Workshop. A full paper extension is available at arXiv:1904.11761

  47. arXiv:1609.02629  [pdf, other

    stat.ME stat.AP

    Inferring social structure from continuous-time interaction data

    Authors: Wesley Lee, Bailey K. Fosdick, Tyler H. McCormick

    Abstract: Relational event data, which consist of events involving pairs of actors over time, are now commonly available at the finest of temporal resolutions. Existing continuous-time methods for modeling such data are based on point processes and directly model interaction "contagion," whereby one interaction increases the propensity of future interactions among actors, often as dictated by some latent va… ▽ More

    Submitted 15 January, 2018; v1 submitted 8 September, 2016; originally announced September 2016.

    Journal ref: Applied Stochastic Models in Business and Industry 2018, Vol. 34, No. 2, 87-104

  48. arXiv:1603.09050  [pdf, other

    cs.LG stat.ML

    Robustness of Bayesian Pool-based Active Learning Against Prior Misspecification

    Authors: Nguyen Viet Cuong, Nan Ye, Wee Sun Lee

    Abstract: We study the robustness of active learning (AL) algorithms against prior misspecification: whether an algorithm achieves similar performance using a perturbed prior as compared to using the true prior. In both the average and worst cases of the maximum coverage setting, we prove that all $α$-approximate algorithms are robust (i.e., near $α$-approximate) if the utility is Lipschitz continuous in th… ▽ More

    Submitted 30 March, 2016; originally announced March 2016.

    Comments: This paper is published at AAAI Conference on Artificial Intelligence (AAAI 2016)

  49. arXiv:1206.6449  [pdf

    cs.LG stat.ML

    Monte Carlo Bayesian Reinforcement Learning

    Authors: Yi Wang, Kok Sung Won, David Hsu, Wee Sun Lee

    Abstract: Bayesian reinforcement learning (BRL) encodes prior knowledge of the world in a model and represents uncertainty in model parameters by maintaining a probability distribution over them. This paper presents Monte Carlo BRL (MC-BRL), a simple and general approach to BRL. MC-BRL samples a priori a finite set of hypotheses for the model parameter values and forms a discrete partially observable Markov… ▽ More

    Submitted 27 June, 2012; originally announced June 2012.

    Comments: Appears in Proceedings of the 29th International Conference on Machine Learning (ICML 2012)