Skip to main content

Showing 1–39 of 39 results for author: Moon, K

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.04421  [pdf, other

    cs.LG stat.ML

    Enhancing Supervised Visualization through Autoencoder and Random Forest Proximities for Out-of-Sample Extension

    Authors: Shuang Ni, Adrien Aumon, Guy Wolf, Kevin R. Moon, Jake S. Rhodes

    Abstract: The value of supervised dimensionality reduction lies in its ability to uncover meaningful connections between data features and labels. Common dimensionality reduction methods embed a set of fixed, latent points, but are not capable of generalizing to an unseen test set. In this paper, we provide an out-of-sample extension method for the random forest-based supervised dimensionality reduction met… ▽ More

    Submitted 6 June, 2024; originally announced June 2024.

    Comments: 7 pages, 3 figures

  2. arXiv:2406.03619  [pdf, other

    cs.LG stat.ML

    Symmetry Discovery Beyond Affine Transformations

    Authors: Ben Shaw, Abram Magner, Kevin R. Moon

    Abstract: Symmetry detection has been shown to improve various machine learning tasks. In the context of continuous symmetry detection, current state of the art experiments are limited to the detection of affine transformations. Under the manifold assumption, we outline a framework for discovering continuous symmetry in data beyond the affine transformation group. We also provide a similar framework for dis… ▽ More

    Submitted 5 June, 2024; originally announced June 2024.

  3. arXiv:2406.03396  [pdf, other

    cs.LG math.FA stat.ML

    Noisy Data Visualization using Functional Data Analysis

    Authors: Haozhe Chen, Andres Felipe Duque Correa, Guy Wolf, Kevin R. Moon

    Abstract: Data visualization via dimensionality reduction is an important tool in exploratory data analysis. However, when the data are noisy, many existing methods fail to capture the underlying structure of the data. The method called Empirical Intrinsic Geometry (EIG) was previously proposed for performing dimensionality reduction on high dimensional dynamical processes while theoretically eliminating al… ▽ More

    Submitted 5 June, 2024; originally announced June 2024.

  4. arXiv:2404.01954  [pdf, other

    cs.CL cs.AI

    HyperCLOVA X Technical Report

    Authors: Kang Min Yoo, Jaegeun Han, Sookyo In, Heewon Jeon, Jisu Jeong, Jaewook Kang, Hyunwook Kim, Kyung-Min Kim, Munhyong Kim, Sungju Kim, Donghyun Kwak, Hanock Kwak, Se Jung Kwon, Bado Lee, Dongsoo Lee, Gichang Lee, Jooho Lee, Baeseong Park, Seong** Shin, Joonsang Yu, Seolki Baek, Sumin Byeon, Eungsup Cho, Dooseok Choe, Jeesung Han , et al. (371 additional authors not shown)

    Abstract: We introduce HyperCLOVA X, a family of large language models (LLMs) tailored to the Korean language and culture, along with competitive capabilities in English, math, and coding. HyperCLOVA X was trained on a balanced mix of Korean, English, and code data, followed by instruction-tuning with high-quality human-annotated datasets while abiding by strict safety guidelines reflecting our commitment t… ▽ More

    Submitted 13 April, 2024; v1 submitted 2 April, 2024; originally announced April 2024.

    Comments: 44 pages; updated authors list and fixed author names

  5. arXiv:2403.13253  [pdf, other

    cs.CL eess.AS

    Document Author Classification Using Parsed Language Structure

    Authors: Todd K Moon, Jacob H. Gunther

    Abstract: Over the years there has been ongoing interest in detecting authorship of a text based on statistical properties of the text, such as by using occurrence rates of noncontextual words. In previous work, these techniques have been used, for example, to determine authorship of all of \emph{The Federalist Papers}. Such methods may be useful in more modern times to detect fake or AI authorship. Progres… ▽ More

    Submitted 19 March, 2024; originally announced March 2024.

    Journal ref: International Journal on Natural Language Computing (IJNLC), Feb. 24, 2024

  6. arXiv:2402.06441  [pdf, other

    cs.LG

    Incorporating Taylor Series and Recursive Structure in Neural Networks for Time Series Prediction

    Authors: Jarrod Mau, Kevin Moon

    Abstract: Time series analysis is relevant in various disciplines such as physics, biology, chemistry, and finance. In this paper, we present a novel neural network architecture that integrates elements from ResNet structures, while introducing the innovative incorporation of the Taylor series framework. This approach demonstrates notable enhancements in test accuracy across many of the baseline datasets in… ▽ More

    Submitted 9 February, 2024; originally announced February 2024.

  7. arXiv:2402.04440  [pdf, other

    cs.LG stat.ML

    Exploring higher-order neural network node interactions with total correlation

    Authors: Thomas Kerby, Teresa White, Kevin Moon

    Abstract: In domains such as ecological systems, collaborations, and the human brain the variables interact in complex ways. Yet accurately characterizing higher-order variable interactions (HOIs) is a difficult problem that is further exacerbated when the HOIs change across the data. To solve this problem we propose a new method called Local Correlation Explanation (CorEx) to capture HOIs at a local scale… ▽ More

    Submitted 6 February, 2024; originally announced February 2024.

  8. arXiv:2401.13068  [pdf, other

    cs.CV cs.AI

    Local Background Estimation for Improved Gas Plume Identification in Hyperspectral Images

    Authors: Scout Jarman, Zigfried Hampel-Arias, Adra Carr, Kevin R. Moon

    Abstract: Deep learning identification models have shown promise for identifying gas plumes in Longwave IR hyperspectral images of urban scenes, particularly when a large library of gases are being considered. Because many gases have similar spectral signatures, it is important to properly estimate the signal from a detected plume. Typically, a scene's global mean spectrum and covariance matrix are estimate… ▽ More

    Submitted 23 January, 2024; originally announced January 2024.

    Comments: Submitted to International Geoscience and Remote Sensing Symposium (IGARSS), 2024. 5 pages, 2 figures

  9. arXiv:2304.04598  [pdf

    cs.SD eess.AS eess.SP

    In-situ crack and keyhole pore detection in laser directed energy deposition through acoustic signal and deep learning

    Authors: Lequn Chen, Xiling Yao, Chaolin Tan, Weiyang He, **long Su, Fei Weng, Youxiang Chew, Nicholas Poh Huat Ng, Seung Ki Moon

    Abstract: Cracks and keyhole pores are detrimental defects in alloys produced by laser directed energy deposition (LDED). Laser-material interaction sound may hold information about underlying complex physical events such as crack propagation and pores formation. However, due to the noisy environment and intricate signal content, acoustic-based monitoring in LDED has received little attention. This paper pr… ▽ More

    Submitted 10 April, 2023; originally announced April 2023.

    Comments: 36 Pages, 16 Figures, accepted at journal Additive Manufacturing

  10. arXiv:2212.12756  [pdf, ps, other

    cs.DM cs.CC math.DS

    Computational Complexity of Minimal Trap Spaces in Boolean Networks

    Authors: Kyungduk Moon, Kangbok Lee, Loïc Paulevé

    Abstract: A Boolean network (BN) is a discrete dynamical system defined by a Boolean function that maps to the domain itself. A trap space of a BN is a generalization of a fixed point, which is defined as the sub-hypercubes closed by the function of the BN. A trap space is minimal if it does not contain any smaller trap space. Minimal trap spaces have applications for the analysis of attractors of BNs with… ▽ More

    Submitted 14 March, 2023; v1 submitted 24 December, 2022; originally announced December 2022.

    MSC Class: 68Q17; 68R07; 94C11; 37M22; 37N25

  11. arXiv:2210.12774  [pdf, other

    stat.ML cs.LG

    Manifold Alignment with Label Information

    Authors: Andres F. Duque, Myriam Lizotte, Guy Wolf, Kevin R. Moon

    Abstract: Multi-domain data is becoming increasingly common and presents both challenges and opportunities in the data science community. The integration of distinct data-views can be used for exploratory data analysis, and benefit downstream analysis including machine learning related tasks. With this in mind, we present a novel manifold alignment method called MALI (Manifold alignment with label informati… ▽ More

    Submitted 30 October, 2022; v1 submitted 23 October, 2022; originally announced October 2022.

  12. arXiv:2208.06360   

    q-bio.BM cs.LG

    3D Graph Contrastive Learning for Molecular Property Prediction

    Authors: Kisung Moon, Sunyoung Kwon

    Abstract: Self-supervised learning (SSL) is a method that learns the data representation by utilizing supervision inherent in the data. This learning method is in the spotlight in the drug field, lacking annotated data due to time-consuming and expensive experiments. SSL using enormous unlabeled data has shown excellent performance for molecular property prediction, but a few issues exist. (1) Existing SSL… ▽ More

    Submitted 18 August, 2022; v1 submitted 31 May, 2022; originally announced August 2022.

    Comments: need to be edited

  13. arXiv:2206.07305  [pdf, other

    stat.ML cs.LG

    Diffusion Transport Alignment

    Authors: Andres F. Duque, Guy Wolf, Kevin R. Moon

    Abstract: The integration of multimodal data presents a challenge in cases when the study of a given phenomena by different instruments or conditions generates distinct but related domains. Many existing data integration methods assume a known one-to-one correspondence between domains of the entire dataset, which may be unrealistic. Furthermore, existing manifold alignment methods are not suited for cases w… ▽ More

    Submitted 15 June, 2022; originally announced June 2022.

  14. arXiv:2201.12682  [pdf, other

    stat.ML cs.LG stat.AP stat.ME

    Geometry- and Accuracy-Preserving Random Forest Proximities

    Authors: Jake S. Rhodes, Adele Cutler, Kevin R. Moon

    Abstract: Random forests are considered one of the best out-of-the-box classification and regression algorithms due to their high level of predictive performance with relatively little tuning. Pairwise proximities can be computed from a trained random forest and measure the similarity between data points relative to the supervised task. Random forest proximities have been used in many applications including… ▽ More

    Submitted 28 February, 2023; v1 submitted 29 January, 2022; originally announced January 2022.

  15. arXiv:2010.12108  [pdf, other

    cs.CV eess.IV

    GPS-Denied Navigation Using SAR Images and Neural Networks

    Authors: Teresa White, Jesse Wheeler, Colton Lindstrom, Randall Christensen, Kevin R. Moon

    Abstract: Unmanned aerial vehicles (UAV) often rely on GPS for navigation. GPS signals, however, are very low in power and easily jammed or otherwise disrupted. This paper presents a method for determining the navigation errors present at the beginning of a GPS-denied period utilizing data from a synthetic aperture radar (SAR) system. This is accomplished by comparing an online-generated SAR image with a re… ▽ More

    Submitted 22 October, 2020; originally announced October 2020.

    Comments: 5 pages, 5 figures

  16. Extendable and invertible manifold learning with geometry regularized autoencoders

    Authors: Andrés F. Duque, Sacha Morin, Guy Wolf, Kevin R. Moon

    Abstract: A fundamental task in data exploration is to extract simplified low dimensional representations that capture intrinsic geometry in data, especially for faithfully visualizing data in two or three dimensions. Common approaches to this task use kernel methods for manifold learning. However, these methods typically only provide an embedding of fixed input data and cannot extend to new data points. Au… ▽ More

    Submitted 22 November, 2020; v1 submitted 14 July, 2020; originally announced July 2020.

    Comments: 10 pages, 6 figures

    Journal ref: IEEE International Conference on Big Data, pp. 5027-5036, Dec. 2020

  17. arXiv:2006.08701  [pdf, other

    stat.ML cs.HC cs.LG stat.AP

    Supervised Visualization for Data Exploration

    Authors: Jake S. Rhodes, Adele Cutler, Guy Wolf, Kevin R. Moon

    Abstract: Dimensionality reduction is often used as an initial step in data exploration, either as preprocessing for classification or regression or for visualization. Most dimensionality reduction techniques to date are unsupervised; they do not take class labels into account (e.g., PCA, MDS, t-SNE, Isomap). Such methods require large amounts of data and are often sensitive to noise that may obfuscate impo… ▽ More

    Submitted 15 June, 2020; originally announced June 2020.

    Comments: 21 pages, 9 figures

  18. arXiv:1912.00342  [pdf, other

    cs.CL

    Machines Getting with the Program: Understanding Intent Arguments of Non-Canonical Directives

    Authors: Won Ik Cho, Young Ki Moon, Sangwhan Moon, Seok Min Kim, Nam Soo Kim

    Abstract: Modern dialog managers face the challenge of having to fulfill human-level conversational skills as part of common user expectations, including but not limited to discourse with no clear objective. Along with these requirements, agents are expected to extrapolate intent from the user's dialogue even when subjected to non-canonical forms of speech. This depends on the agent's comprehension of parap… ▽ More

    Submitted 7 October, 2020; v1 submitted 1 December, 2019; originally announced December 2019.

    Comments: Findings of ACL: EMNLP 2020

  19. arXiv:1907.04463  [pdf, other

    cs.HC cs.CV cs.LG q-bio.QM

    Coarse Graining of Data via Inhomogeneous Diffusion Condensation

    Authors: Nathan Brugnone, Alex Gonopolskiy, Mark W. Moyle, Manik Kuchroo, David van Dijk, Kevin R. Moon, Daniel Colon-Ramos, Guy Wolf, Matthew J. Hirn, Smita Krishnaswamy

    Abstract: Big data often has emergent structure that exists at multiple levels of abstraction, which are useful for characterizing complex interactions and dynamics of the observations. Here, we consider multiple levels of abstraction via a multiresolution geometry of data points at different granularities. To construct this geometry we define a time-inhomogeneous diffusion process that effectively condense… ▽ More

    Submitted 9 March, 2020; v1 submitted 9 July, 2019; originally announced July 2019.

    Comments: 14 pages, 7 figures

    ACM Class: I.5.3

    Journal ref: Proceedings of the 2019 IEEE International Conference on Big Data, pages 2624-2633, 2019

  20. arXiv:1906.10725  [pdf, ps, other

    stat.ML cs.LG eess.SP

    Visualizing High Dimensional Dynamical Processes

    Authors: Andrés F. Duque, Guy Wolf, Kevin R. Moon

    Abstract: Manifold learning techniques for dynamical systems and time series have shown their utility for a broad spectrum of applications in recent years. While these methods are effective at learning a low-dimensional representation, they are often insufficient for visualizing the global and local structure of the data. In this paper, we present DIG (Dynamical Information Geometry), a visualization method… ▽ More

    Submitted 25 June, 2019; originally announced June 2019.

    Comments: 7 pages, 3 figures

    Journal ref: IEEE International Workshop on Machine Learning for Signal Processing, Oct. 2019

  21. arXiv:1902.00033  [pdf, other

    cs.LG stat.ML

    Compressed Diffusion

    Authors: Scott Gigante, Jay S. Stanley III, Ngan Vu, David van Dijk, Kevin Moon, Guy Wolf, Smita Krishnaswamy

    Abstract: Diffusion maps are a commonly used kernel-based method for manifold learning, which can reveal intrinsic structures in data and embed them in low dimensions. However, as with most kernel methods, its implementation requires a heavy computational load, reaching up to cubic complexity in the number of data points. This limits its usability in modern data analysis. Here, we present a new approach to… ▽ More

    Submitted 10 June, 2019; v1 submitted 31 January, 2019; originally announced February 2019.

    Comments: 4 pages double column, published in SampTA 2019

    Journal ref: Sampling Theory & Applications (2019)

  22. arXiv:1810.04631  [pdf, ps, other

    cs.CL

    Extracting Arguments from Korean Question and Command: An Annotated Corpus for Structured Paraphrasing

    Authors: Won Ik Cho, Young Ki Moon, Woo Hyun Kang, Nam Soo Kim

    Abstract: Intention identification is a core issue in dialog management. However, due to the non-canonicality of the spoken language, it is difficult to extract the content automatically from the conversation-style utterances. This is much more challenging for languages like Korean and Japanese since the agglutination between morphemes make it difficult for the machines to parse the sentence and understand… ▽ More

    Submitted 9 July, 2019; v1 submitted 10 October, 2018; originally announced October 2018.

    Comments: 5 pages and 2 tables, Annotation guideline for Seoul Korean sentences

  23. arXiv:1810.01015  [pdf, other

    cs.IT cs.LG

    Convergence Rates for Empirical Estimation of Binary Classification Bounds

    Authors: Salimeh Yasaei Sekeh, Morteza Noshad, Kevin R. Moon, Alfred O. Hero

    Abstract: Bounding the best achievable error probability for binary classification problems is relevant to many applications including machine learning, signal processing, and information theory. Many bounds on the Bayes binary classification error rate depend on information divergences between the pair of class distributions. Recently, the Henze-Penrose (HP) divergence has been proposed for bounding classi… ▽ More

    Submitted 1 October, 2018; originally announced October 2018.

    Comments: 27 pages, 8 figures

  24. arXiv:1802.03497  [pdf, other

    cs.LG stat.ML

    Modeling Global Dynamics from Local Snapshots with Deep Generative Neural Networks

    Authors: Scott Gigante, David van Dijk, Kevin Moon, Alexander Strzalkowski, Guy Wolf, Smita Krishnaswamy

    Abstract: Complex high dimensional stochastic dynamic systems arise in many applications in the natural sciences and especially biology. However, while these systems are difficult to describe analytically, "snapshot" measurements that sample the output of the system are often available. In order to model the dynamics of such systems given snapshot data, or local transitions, we present a deep neural network… ▽ More

    Submitted 10 June, 2019; v1 submitted 9 February, 2018; originally announced February 2018.

    Comments: Published in SampTA 2019

    Journal ref: Sampling Theory & Applications (2019)

  25. arXiv:1707.03083  [pdf, ps, other

    cs.IT

    Ensemble Estimation of Distributional Functionals via $k$-Nearest Neighbors

    Authors: Kevin R. Moon, Kumar Sricharan, Alfred O. Hero III

    Abstract: The problem of accurate nonparametric estimation of distributional functionals (integral functionals of one or more probability distributions) has received recent interest due to their wide applicability in signal processing, information theory, machine learning, and statistics. In particular, $k$-nearest neighbor (nn) based methods have received a lot of attention due to their adaptive nature and… ▽ More

    Submitted 10 July, 2017; originally announced July 2017.

    Comments: 26 pages

  26. arXiv:1705.06315  [pdf, ps, other

    cs.IT

    Direct Ensemble Estimation of Density Functionals

    Authors: Alan Wisler, Kevin Moon, Visar Berisha

    Abstract: Estimating density functionals of analog sources is an important problem in statistical signal processing and information theory. Traditionally, estimating these quantities requires either making parametric assumptions about the underlying distributions or using non-parametric density estimation followed by integration. In this paper we introduce a direct nonparametric approach which bypasses the… ▽ More

    Submitted 17 May, 2017; originally announced May 2017.

    Comments: 5 pages

  27. arXiv:1702.05222  [pdf, other

    cs.IT cs.AI stat.ML

    Direct Estimation of Information Divergence Using Nearest Neighbor Ratios

    Authors: Morteza Noshad, Kevin R. Moon, Salimeh Yasaei Sekeh, Alfred O. Hero III

    Abstract: We propose a direct estimation method for Rényi and f-divergence measures based on a new graph theoretical interpretation. Suppose that we are given two sample sets $X$ and $Y$, respectively with $N$ and $M$ samples, where $η:=M/N$ is a constant value. Considering the $k$-nearest neighbor ($k$-NN) graph of $Y$ in the joint data set $(X,Y)$, we show that the average powered ratio of the number of… ▽ More

    Submitted 20 November, 2017; v1 submitted 16 February, 2017; originally announced February 2017.

    Comments: 2017 IEEE International Symposium on Information Theory (ISIT)

    Journal ref: In Information Theory (ISIT), 2017 IEEE International Symposium on (pp. 903-907). IEEE

  28. Ensemble Estimation of Generalized Mutual Information with Applications to Genomics

    Authors: Kevin R. Moon, Kumar Sricharan, Alfred O. Hero III

    Abstract: Mutual information is a measure of the dependence between random variables that has been used successfully in myriad applications in many fields. Generalized mutual information measures that go beyond classical Shannon mutual information have also received much interest in these applications. We derive the mean squared error convergence rates of kernel density-based plug-in estimators of general m… ▽ More

    Submitted 29 July, 2021; v1 submitted 27 January, 2017; originally announced January 2017.

    Comments: Published in IEEE Transactions on Information Theory in 2021; 42 pages, 3 figures; a shorter version of this paper was published at IEEE ISIT 2017 under the title "Ensemble estimation of mutual information"

  29. arXiv:1609.03912  [pdf, ps, other

    cs.IT cs.LG stat.ML

    Information Theoretic Structure Learning with Confidence

    Authors: Kevin R. Moon, Morteza Noshad, Salimeh Yasaei Sekeh, Alfred O. Hero III

    Abstract: Information theoretic measures (e.g. the Kullback Liebler divergence and Shannon mutual information) have been used for exploring possibly nonlinear multivariate dependencies in high dimension. If these dependencies are assumed to follow a Markov factor graph model, this exploration process is called structure discovery. For discrete-valued samples, estimates of the information divergence over the… ▽ More

    Submitted 13 September, 2016; originally announced September 2016.

    Comments: 10 pages, 3 figures

    Journal ref: IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), pp. 6095-6099, Mar. 2017

  30. arXiv:1602.00374  [pdf, ps, other

    cs.LG

    ConfidentCare: A Clinical Decision Support System for Personalized Breast Cancer Screening

    Authors: Ahmed M. Alaa, Kyeong H. Moon, William Hsu, Mihaela van der Schaar

    Abstract: Breast cancer screening policies attempt to achieve timely diagnosis by the regular screening of apparently healthy women. Various clinical decisions are needed to manage the screening process; those include: selecting the screening tests for a woman to take, interpreting the test outcomes, and deciding whether or not a woman should be referred to a diagnostic test. Such decisions are currently gu… ▽ More

    Submitted 31 January, 2016; originally announced February 2016.

  31. Ensemble Estimation of Information Divergence

    Authors: Kevin R. Moon, Kumar Sricharan, Kristjan Greenewald, Alfred O. Hero III

    Abstract: Recent work has focused on the problem of nonparametric estimation of information divergence functionals. Many existing approaches are restrictive in their assumptions on the density support set or require difficult calculations at the support boundary which must be known a priori. The MSE convergence rate of a leave-one-out kernel density plug-in divergence functional estimator for general bounde… ▽ More

    Submitted 4 June, 2018; v1 submitted 25 January, 2016; originally announced January 2016.

    Comments: 27 pages, 4 figures; A previous version of this paper was posted under the title of "Improving Convergence of Divergence Functional Ensemble Estimators"

    Journal ref: Entropy, vol. 20, no. 8, pp. 560, July 2018

  32. arXiv:1510.03507  [pdf, ps, other

    q-bio.NC cs.LG stat.ML

    The intrinsic value of HFO features as a biomarker of epileptic activity

    Authors: Stephen V. Gliske, Kevin R. Moon, William C. Stacey, Alfred O. Hero III

    Abstract: High frequency oscillations (HFOs) are a promising biomarker of epileptic brain tissue and activity. HFOs additionally serve as a prototypical example of challenges in the analysis of discrete events in high-temporal resolution, intracranial EEG data. Two primary challenges are 1) dimensionality reduction, and 2) assessing feasibility of classification. Dimensionality reduction assumes that the da… ▽ More

    Submitted 12 October, 2015; originally announced October 2015.

    Comments: 5 pages, 5 figures

    Journal ref: IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), pp. 6290-6294, Mar. 2016

  33. arXiv:1504.07116  [pdf, ps, other

    cs.LG astro-ph.SR cs.CV cs.IT

    Meta learning of bounds on the Bayes classifier error

    Authors: Kevin R. Moon, Veronique Delouille, Alfred O. Hero III

    Abstract: Meta learning uses information from base learners (e.g. classifiers or estimators) as well as information about the learning problem to improve upon the performance of a single base learner. For example, the Bayes error rate of a given feature space, if known, can be used to aid in choosing a classifier, as well as in feature selection and model selection for the base classifiers and the meta clas… ▽ More

    Submitted 3 July, 2015; v1 submitted 27 April, 2015; originally announced April 2015.

    Comments: 6 pages, 3 figures, to appear in proceedings of 2015 IEEE Signal Processing and SP Education Workshop

    Journal ref: IEEE Signal Processing and SP Education Workshop, pp. 13-18, Aug. 2015

  34. arXiv:1504.02762  [pdf, other

    astro-ph.SR cs.CV

    Image patch analysis of sunspots and active regions. II. Clustering via matrix factorization

    Authors: Kevin R. Moon, Veronique Delouille, Jimmy J. Li, Ruben De Visscher, Fraser Watson, Alfred O. Hero III

    Abstract: Separating active regions that are quiet from potentially eruptive ones is a key issue in Space Weather applications. Traditional classification schemes such as Mount Wilson and McIntosh have been effective in relating an active region large scale magnetic configuration to its ability to produce eruptive events. However, their qualitative nature prevents systematic studies of an active region's ev… ▽ More

    Submitted 10 December, 2015; v1 submitted 10 April, 2015; originally announced April 2015.

    Comments: Accepted for publication in the Journal of Space Weather and Space Climate (SWSC). 33 pages, 12 figures

    Journal ref: Journal of Space Weather and Space Climate, Vol. 6, A3 (2016)

  35. arXiv:1503.04127  [pdf, other

    astro-ph.SR cs.CV

    Image patch analysis of sunspots and active regions. I. Intrinsic dimension and correlation analysis

    Authors: Kevin R. Moon, Jimmy J. Li, Veronique Delouille, Ruben De Visscher, Fraser Watson, Alfred O. Hero III

    Abstract: The flare-productivity of an active region is observed to be related to its spatial complexity. Mount Wilson or McIntosh sunspot classifications measure such complexity but in a categorical way, and may therefore not use all the information present in the observations. Moreover, such categorical schemes hinder a systematic study of an active region's evolution for example. We propose fine-scale qu… ▽ More

    Submitted 14 December, 2015; v1 submitted 13 March, 2015; originally announced March 2015.

    Comments: Accepted for publication in the Journal of Space Weather and Space Climate (SWSC). 23 pages, 11 figures

    Journal ref: Journal of Space Weather and Space Climate, Vol. 6, A2 (2016)

  36. arXiv:1411.2045  [pdf, other

    cs.IT stat.ML

    Multivariate f-Divergence Estimation With Confidence

    Authors: Kevin R. Moon, Alfred O. Hero III

    Abstract: The problem of f-divergence estimation is important in the fields of machine learning, information theory, and statistics. While several nonparametric divergence estimators exist, relatively few have known convergence properties. In particular, even for those estimators whose MSE convergence rates are known, the asymptotic distributions are unknown. We establish the asymptotic normality of a recen… ▽ More

    Submitted 7 November, 2014; originally announced November 2014.

    Comments: 20 pages, 1 figure. Accepted to NIPS 2014 (supplementary material is included in the appendices)

    Journal ref: K.R. Moon and A.O. Hero III, "Multivariate f-Divergence Estimation With Confidence," In Advances in Neural Information Processing Systems, pp. 2420-2428, 2014

  37. arXiv:1406.6390  [pdf, other

    cs.CV astro-ph.SR

    Image patch analysis and clustering of sunspots: a dimensionality reduction approach

    Authors: Kevin R. Moon, Jimmy J. Li, Veronique Delouille, Fraser Watson, Alfred O. Hero III

    Abstract: Sunspots, as seen in white light or continuum images, are associated with regions of high magnetic activity on the Sun, visible on magnetogram images. Their complexity is correlated with explosive solar activity and so classifying these active regions is useful for predicting future solar activity. Current classification of sunspot groups is visually based and suffers from bias. Supervised learnin… ▽ More

    Submitted 24 June, 2014; originally announced June 2014.

    Comments: 5 pages, 7 figures, accepted to ICIP 2014

    Journal ref: K.R. Moon, J.J. Li, V. Delouille, F. Watson, and A.O. Hero III, "Image patch analysis and clustering of sunspots: a dimensionality reduction approach," In Image Processing (ICIP), 2014 IEEE Conference on, pp. 1623-1627, 2014

  38. Ensemble estimation of multivariate f-divergence

    Authors: Kevin R. Moon, Alfred O. Hero III

    Abstract: f-divergence estimation is an important problem in the fields of information theory, machine learning, and statistics. While several divergence estimators exist, relatively few of their convergence rates are known. We derive the MSE convergence rate for a density plug-in estimator of f-divergence. Then by applying the theory of optimally weighted ensemble estimation, we derive a divergence estimat… ▽ More

    Submitted 9 June, 2014; v1 submitted 24 April, 2014; originally announced April 2014.

    Comments: 14 pages, 6 figures, a condensed version of this paper was accepted to ISIT 2014, Version 2: Moved the proofs of the theorems from the main body to appendices at the end

    Journal ref: K.R. Moon and A.O. Hero III, "Ensemble estimation of multivariate f-divergence," In Information Theory (ISIT), 2014 IEEE International Symposium on, pp. 356-360, 2014

  39. Training-Free Non-Intrusive Load Monitoring of Electric Vehicle Charging with Low Sampling Rate

    Authors: Zhilin Zhang, Jae Hyun Son, Ying Li, Mark Trayer, Zhouyue Pi, Dong Yoon Hwang, Joong Ki Moon

    Abstract: Non-intrusive load monitoring (NILM) is an important topic in smart-grid and smart-home. Many energy disaggregation algorithms have been proposed to detect various individual appliances from one aggregated signal observation. However, few works studied the energy disaggregation of plug-in electric vehicle (EV) charging in the residential environment since EVs charging at home has emerged only rece… ▽ More

    Submitted 6 August, 2014; v1 submitted 20 April, 2014; originally announced April 2014.

    Comments: Accepted by The 40th Annual Conference of the IEEE Industrial Electronics Society (IECON 2014)