Skip to main content

Showing 1–28 of 28 results for author: Fokoué, E

.
  1. arXiv:2307.14285  [pdf, other

    stat.AP stat.ML

    Emerging Statistical Machine Learning Techniques for Extreme Temperature Forecasting in U.S. Cities

    Authors: Kameron B. Kinast, Ernest Fokoué

    Abstract: In this paper, we present a comprehensive analysis of extreme temperature patterns using emerging statistical machine learning techniques. Our research focuses on exploring and comparing the effectiveness of various statistical models for climate time series forecasting. The models considered include Auto-Regressive Integrated Moving Average, Exponential Smoothing, Multilayer Perceptrons, and Gaus… ▽ More

    Submitted 26 July, 2023; originally announced July 2023.

    Comments: 13 pages, 4 large figures

  2. arXiv:2211.02600  [pdf, other

    stat.ML cs.LG

    Improving the Predictive Performances of $k$ Nearest Neighbors Learning by Efficient Variable Selection

    Authors: Eddie Pei, Ernest Fokoue

    Abstract: This paper computationally demonstrates a sharp improvement in predictive performance for $k$ nearest neighbors thanks to an efficient forward selection of the predictor variables. We show both simulated and real-world data that this novel repeatedly approaches outperformance regression models under stepwise selection

    Submitted 4 November, 2022; originally announced November 2022.

    Comments: 11 pages, 7 figures

  3. arXiv:2208.04732  [pdf, other

    cs.CY cs.LG

    Efficient Novelty Detection Methods for Early Warning of Potential Fatal Diseases

    Authors: Sèdjro Salomon Hotegni, Ernest Fokoué

    Abstract: Fatal diseases, as Critical Health Episodes (CHEs), represent real dangers for patients hospitalized in Intensive Care Units. These episodes can lead to irreversible organ damage and death. Nevertheless, diagnosing them in time would greatly reduce their inconvenience. This study therefore focused on building a highly effective early warning system for CHEs such as Acute Hypotensive Episodes and T… ▽ More

    Submitted 6 August, 2022; originally announced August 2022.

    Comments: 12 pages, 3 figures

  4. arXiv:2208.03373  [pdf, other

    stat.ML cs.LG

    A Computational Exploration of Emerging Methods of Variable Importance Estimation

    Authors: Louis Mozart Kamdem, Ernest Fokoue

    Abstract: Estimating the importance of variables is an essential task in modern machine learning. This help to evaluate the goodness of a feature in a given model. Several techniques for estimating the importance of variables have been developed during the last decade. In this paper, we proposed a computational and theoretical exploration of the emerging methods of variable importance estimation, namely: Le… ▽ More

    Submitted 5 August, 2022; originally announced August 2022.

  5. arXiv:2102.04421  [pdf, other

    stat.OT

    A Text Mining Discovery of Similarities and Dissimilarities Among Sacred Scriptures

    Authors: Younous Mofenjou Peuriekeu, Victoire Djimna Noyum, Cyrille Feudjio, Alkan Goktug, Ernest Fokoue

    Abstract: The careful examination of sacred texts gives valuable insights into human psychology, different ideas regarding the organization of societies as well as into terms like truth and God. To improve and deepen our understanding of sacred texts, their comparison, and their separation is crucial. For this purpose, we use our data set has nine sacred scriptures. This work deals with the separation of th… ▽ More

    Submitted 8 February, 2021; originally announced February 2021.

  6. arXiv:2102.01647  [pdf, other

    cs.CE cs.LG eess.SP stat.OT

    A Novel Use of Discrete Wavelet Transform Features in the Prediction of Epileptic Seizures from EEG Data

    Authors: Cyrille Feudjio, Victoire Djimna Noyum, Younous Perieukeu Mofendjou, Rockefeller, Ernest Fokoué

    Abstract: This paper demonstrates the predictive superiority of discrete wavelet transform (DWT) over previously used methods of feature extraction in the diagnosis of epileptic seizures from EEG data. Classification accuracy, specificity, and sensitivity are used as evaluation metrics. We specifically show the immense potential of 2 combinations (DWT-db4 combined with SVM and DWT-db2 combined with RF) as c… ▽ More

    Submitted 31 January, 2021; originally announced February 2021.

  7. arXiv:2102.00550  [pdf, other

    cs.SD cs.LG eess.AS

    Boosting the Predictive Accurary of Singer Identification Using Discrete Wavelet Transform For Feature Extraction

    Authors: Victoire Djimna Noyum, Younous Perieukeu Mofenjou, Cyrille Feudjio, Alkan Göktug, Ernest Fokoué

    Abstract: Facing the diversity and growth of the musical field nowadays, the search for precise songs becomes more and more complex. The identity of the singer facilitates this search. In this project, we focus on the problem of identifying the singer by using different methods for feature extraction. Particularly, we introduce the Discrete Wavelet Transform (DWT) for this purpose. To the best of our knowle… ▽ More

    Submitted 31 January, 2021; originally announced February 2021.

  8. Nonnegative Matrix Factorization with Zellner Penalty

    Authors: Matthew Corsetti, Ernest Fokoué

    Abstract: Nonnegative matrix factorization (NMF) is a relatively new unsupervised learning algorithm that decomposes a nonnegative data matrix into a parts-based, lower dimensional, linear representation of the data. NMF has applications in image processing, text mining, recommendation systems and a variety of other fields. Since its inception, the NMF algorithm has been modified and explored by numerous au… ▽ More

    Submitted 7 December, 2020; originally announced December 2020.

    Comments: 10 pages, 4 figures, 2 tables

    MSC Class: 62H35

    Journal ref: Open Journal of Statistics 5 (2015) 777-786

  9. Nonnegative Matrix Factorization with Toeplitz Penalty

    Authors: Matthew Corsetti, Ernest Fokoué

    Abstract: Nonnegative Matrix Factorization (NMF) is an unsupervised learning algorithm that produces a linear, parts-based approximation of a data matrix. NMF constructs a nonnegative low rank basis matrix and a nonnegative low rank matrix of weights which, when multiplied together, approximate the data matrix of interest using some cost function. The NMF algorithm can be modified to include auxiliary const… ▽ More

    Submitted 7 December, 2020; originally announced December 2020.

    Comments: 15 pages, 6 figures, 3 tables

    MSC Class: 62H35

    Journal ref: Journal.of.Informatics.and.Mathematical.Sciences 10 (2018) 201-215

  10. arXiv:2011.01990  [pdf, other

    stat.ML cs.LG

    Graph Enhanced High Dimensional Kernel Regression

    Authors: E. Pei, E. Fokoué

    Abstract: In this paper, the flexibility, versatility and predictive power of kernel regression are combined with now lavishly available network data to create regression models with even greater predictive performances. Building from previous work featuring generalized linear models built in the presence of network cohesion data, we construct a kernelized extension that captures subtler nonlinearities in e… ▽ More

    Submitted 3 November, 2020; originally announced November 2020.

    Comments: 11 pages, 6 figures

    MSC Class: 47B34

  11. arXiv:1912.10847  [pdf, other

    cs.CL cs.LG stat.AP

    What do Asian Religions Have in Common? An Unsupervised Text Analytics Exploration

    Authors: Preeti Sah, Ernest Fokoué

    Abstract: The main source of various religious teachings is their sacred texts which vary from religion to religion based on different factors like the geographical location or time of the birth of a particular religion. Despite these differences, there could be similarities between the sacred texts based on what lessons it teaches to its followers. This paper attempts to find the similarity using text mini… ▽ More

    Submitted 20 December, 2019; originally announced December 2019.

    Comments: 18 pages, 22 figures

  12. arXiv:1903.06700  [pdf, other

    cs.LG stat.ML

    Multi-Stage Fault Warning for Large Electric Grids Using Anomaly Detection and Machine Learning

    Authors: Sanjeev Raja, Ernest Fokoué

    Abstract: In the monitoring of a complex electric grid, it is of paramount importance to provide operators with early warnings of anomalies detected on the network, along with a precise classification and diagnosis of the specific fault type. In this paper, we propose a novel multi-stage early warning system prototype for electric grid fault detection, classification, subgroup discovery, and visualization.… ▽ More

    Submitted 15 March, 2019; originally announced March 2019.

    Comments: 13 pages, 14 figures

    MSC Class: 62H30; 62F25; 62F40; 62J12 ACM Class: I.2.6; G.3; I.5.3

  13. arXiv:1811.12802  [pdf, other

    cs.IR cs.LG cs.SD eess.AS stat.ML

    Naive Dictionary On Musical Corpora: From Knowledge Representation To Pattern Recognition

    Authors: Qiuyi Wu, Ernest Fokoue

    Abstract: In this paper, we propose and develop the novel idea of treating musical sheets as literary documents in the traditional text analytics parlance, to fully benefit from the vast amount of research already existing in statistical text mining and topic modelling. We specifically introduce the idea of representing any given piece of music as a collection of "musical words" that we codenamed "muselets"… ▽ More

    Submitted 28 November, 2018; originally announced November 2018.

    Comments: 25 pages

    MSC Class: 62P15; 62P25; 62P99; 68W40; 68W01; 91E10; 91E45; 82-08; 62-07 ACM Class: E.2; F.1.1; F.2.0; I.1.3; I.1.4; I.2.4; I.2.1; I.2.6; I.5.5; I.7.0

  14. arXiv:1805.11012  [pdf, ps, other

    stat.OT

    To Bayes or Not To Bayes? That's no longer the question!

    Authors: Ernest Fokoue

    Abstract: This paper seeks to provide a thorough account of the ubiquitous nature of the Bayesian paradigm in modern statistics, data science and artificial intelligence. Once maligned, on the one hand by those who philosophically hated the very idea of subjective probability used in prior specification, and on the other hand because of the intractability of the computations needed for Bayesian estimation a… ▽ More

    Submitted 28 May, 2018; originally announced May 2018.

    Comments: 14 pages, 4 figures

    MSC Class: 62A01

  15. arXiv:1805.08462  [pdf, other

    cs.LG stat.ML

    Meta-Learning with Hessian-Free Approach in Deep Neural Nets Training

    Authors: Boyu Chen, Wenlian Lu, Ernest Fokoue

    Abstract: Meta-learning is a promising method to achieve efficient training method towards deep neural net and has been attracting increases interests in recent years. But most of the current methods are still not capable to train complex neuron net model with long-time training process. In this paper, a novel second-order meta-optimizer, named Meta-learning with Hessian-Free(MLHF) approach, is proposed bas… ▽ More

    Submitted 7 September, 2018; v1 submitted 22 May, 2018; originally announced May 2018.

  16. arXiv:1802.07369  [pdf, other

    stat.ML cs.LG

    On the Statistical Challenges of Echo State Networks and Some Potential Remedies

    Authors: Qiuyi Wu, Ernest Fokoue, Dhireesha Kudithipudi

    Abstract: Echo state networks are powerful recurrent neural networks. However, they are often unstable and shaky, making the process of finding an good ESN for a specific dataset quite hard. Obtaining a superb accuracy by using the Echo State Network is a challenging task. We create, develop and implement a family of predictably optimal robust and stable ensemble of Echo State Networks via regularizing the… ▽ More

    Submitted 20 February, 2018; originally announced February 2018.

    Comments: 18 pages, 10 figures

    MSC Class: 62F40; 62G90; 62M10; 62M20; 62M45; 62M15; 68T05; 68T10

  17. arXiv:1601.06116  [pdf, other

    stat.ML cs.LG q-bio.NC

    A Mathematical Formalization of Hierarchical Temporal Memory's Spatial Pooler

    Authors: James Mnatzaganian, Ernest Fokoué, Dhireesha Kudithipudi

    Abstract: Hierarchical temporal memory (HTM) is an emerging machine learning algorithm, with the potential to provide a means to perform predictions on spatiotemporal data. The algorithm, inspired by the neocortex, currently does not have a comprehensive mathematical framework. This work brings together all aspects of the spatial pooler (SP), a critical learning component in HTM, under a single unifying fra… ▽ More

    Submitted 8 September, 2016; v1 submitted 22 January, 2016; originally announced January 2016.

    Comments: This work was submitted for publication and is currently under review. For associated code, see https://github.com/tehtechguy/mHTM

  18. arXiv:1503.06370  [pdf, other

    stat.ME

    Bayesian Variable Selection for Linear Regression with the $κ$-$G$ Priors

    Authors: Zichen Ma, Ernest Fokoué

    Abstract: In this paper, we introduce a new methodology for Bayesian variable selection in linear regression that is independent of the traditional indicator method. A diagonal matrix $\mathbf{G}$ is introduced to the prior of the coefficient vector $\boldsymbolβ$, with each of the $g_j$'s, bounded between $0$ and $1$, on the diagonal serves as a stabilizer of the corresponding $β_j$. Mathematically, a prom… ▽ More

    Submitted 18 October, 2016; v1 submitted 21 March, 2015; originally announced March 2015.

    Comments: 19 pages, 3 figures, 1 table, 1 algorithm

    MSC Class: 62C10; 62J05; 62J07

  19. arXiv:1502.04742  [pdf, ps, other

    stat.ML stat.ME

    On the Predictive Properties of Binary Link Functions

    Authors: Necla Gunduz, Ernest Fokoue

    Abstract: This paper provides a theoretical and computational justification of the long held claim that of the similarity of the probit and logit link functions often used in binary classification. Despite this widespread recognition of the strong similarities between these two link functions, very few (if any) researchers have dedicated time to carry out a formal study aimed at establishing and characteriz… ▽ More

    Submitted 16 February, 2015; originally announced February 2015.

    Comments: 17 pages, 10 figures. arXiv admin note: text overlap with arXiv:math-ph/0607066 by other authors

  20. arXiv:1502.04416  [pdf, ps, other

    stat.ML

    Random Subspace Learning Approach to High-Dimensional Outliers Detection

    Authors: Bohan Liu, Ernest Fokoue

    Abstract: We introduce and develop a novel approach to outlier detection based on adaptation of random subspace learning. Our proposed method handles both high-dimension low-sample size and traditional low-dimensional high-sample size datasets. Essentially, we avoid the computational bottleneck of techniques like minimum covariance determinant (MCD) by computing the needed determinants and associated measur… ▽ More

    Submitted 3 May, 2015; v1 submitted 15 February, 2015; originally announced February 2015.

    Comments: 13 pages, 18 figures

    MSC Class: 62H25; 62H30

  21. arXiv:1502.02599  [pdf, other

    cs.LG

    Adaptive Random SubSpace Learning (RSSL) Algorithm for Prediction

    Authors: Mohamed Elshrif, Ernest Fokoue

    Abstract: We present a novel adaptive random subspace learning algorithm (RSSL) for prediction purpose. This new framework is flexible where it can be adapted with any learning technique. In this paper, we tested the algorithm for regression and classification problems. In addition, we provide a variety of weighting schemes to increase the robustness of the developed algorithm. These different wighting flav… ▽ More

    Submitted 9 February, 2015; originally announced February 2015.

  22. arXiv:1501.07866  [pdf

    cs.SD stat.AP

    A Comparison of Classifiers in Performing Speaker Accent Recognition Using MFCCs

    Authors: Zichen Ma, Ernest Fokoue

    Abstract: An algorithm involving Mel-Frequency Cepstral Coefficients (MFCCs) is provided to perform signal feature extraction for the task of speaker accent recognition. Then different classifiers are compared based on the MFCC feature. For each signal, the mean vector of MFCC matrix is used as an input vector for pattern recognition. A sample of 330 signals, containing 165 US voice and 165 non-US voice, is… ▽ More

    Submitted 28 January, 2015; originally announced January 2015.

    Comments: 9 pages, 7 figures

    MSC Class: 62H25; 62H30

    Journal ref: Open Journal of Statistics, 2014, 4, 258-266

  23. arXiv:1501.06116  [pdf, ps, other

    stat.ML

    Prediction Error Reduction Function as a Variable Importance Score

    Authors: Ernest Fokoué

    Abstract: This paper introduces and develops a novel variable importance score function in the context of ensemble learning and demonstrates its appeal both theoretically and empirically. Our proposed score function is simple and more straightforward than its counterpart proposed in the context of random forest, and by avoiding permutations, it is by design computationally more efficient than the random for… ▽ More

    Submitted 25 January, 2015; originally announced January 2015.

    Comments: 7 pages, 12 figures

    MSC Class: 62H25; 62H30

  24. arXiv:1501.04070  [pdf, ps, other

    math.ST stat.ME

    An Information-Theoretic Alternative to the Cronbach's Alpha Coefficient of Item Reliability

    Authors: Ernest Fokoue, Necla Gunduz

    Abstract: We propose an information-theoretic alternative to the popular Cronbach alpha coefficient of reliability. Particularly suitable for contexts in which instruments are scored on a strictly nonnumeric scale, our proposed index is based on functions of the entropy of the distributions of defined on the sample space of responses. Our reliability index tracks the Cronbach alpha coefficient uniformly whi… ▽ More

    Submitted 16 January, 2015; originally announced January 2015.

    Comments: 8 pages, 2 tables, 1 figure

    MSC Class: 62H30; 62H25

  25. arXiv:1501.02263  [pdf, ps, other

    stat.AP

    Pattern Discovery in Students' Evaluations of Professors: A Statistical Data Mining Approach

    Authors: Necla Gunduz, Ernest Fokoue

    Abstract: The evaluation of instructors by their students has been practiced at most universities for many decades, and there has always been a great interest in a variety of aspects of the evaluations. Are students matured and knowledgeable enough to provide useful and dependable feedback for the improvement of their instructors' teaching skills/abilities? Does the level of difficulty of the course have a… ▽ More

    Submitted 9 January, 2015; originally announced January 2015.

    Comments: 20 pages, 5 figures, 11 tables

    MSC Class: 62H30; 62H25

  26. arXiv:1501.00604  [pdf, ps, other

    stat.ML

    A Taxonomy of Big Data for Optimal Predictive Machine Learning and Data Mining

    Authors: Ernest Fokoue

    Abstract: Big data comes in various ways, types, shapes, forms and sizes. Indeed, almost all areas of science, technology, medicine, public health, economics, business, linguistics and social science are bombarded by ever increasing flows of data begging to analyzed efficiently and effectively. In this paper, we propose a rough idea of a possible taxonomy of big data, along with some of the most commonly us… ▽ More

    Submitted 3 January, 2015; originally announced January 2015.

    Comments: 18 pages, 2 figures 3 tables

    MSC Class: 60K35

  27. arXiv:1501.00592  [pdf, ps, other

    stat.AP stat.ME

    Robust Classification of High Dimension Low Sample Size Data

    Authors: Necla Gunduz, Ernest Fokoue

    Abstract: The robustification of pattern recognition techniques has been the subject of intense research in recent years. Despite the multiplicity of papers on the subject, very few articles have deeply explored the topic of robust classification in the high dimension low sample size context. In this work, we explore and compare the predictive performances of robust classification techniques with a special… ▽ More

    Submitted 3 January, 2015; originally announced January 2015.

    Comments: 17 pages, 29 figures

    MSC Class: 60K35

  28. arXiv:1410.0908  [pdf, ps, other

    stat.ML cs.IR cs.LG

    Probit Normal Correlated Topic Models

    Authors: Xingchen Yu, Ernest Fokoue

    Abstract: The logistic normal distribution has recently been adapted via the transformation of multivariate Gaus- sian variables to model the topical distribution of documents in the presence of correlations among topics. In this paper, we propose a probit normal alternative approach to modelling correlated topical structures. Our use of the probit model in the context of topic discovery is novel, as many a… ▽ More

    Submitted 3 October, 2014; originally announced October 2014.

    Comments: 11 pages, 2 figures and 2 tables

    MSC Class: 62H25; 62H30