Showing 1–12 of 12 results for author: Kakde, D

  1. arXiv:1910.01150

    eess.SP stat.ML

    Fault Detection Using Nonlinear Low-Dimensional Representation of Sensor Data

    Authors: Kai Shen, Anya Mcguirk, Yuwei Liao, Arin Chaudhuri, Deovrat Kakde

    Abstract: Sensor data analysis plays a key role in health assessment of critical equipment. Such data are multivariate and exhibit nonlinear relationships. This paper describes how one can exploit nonlinear dimension reduction techniques, such as the t-distributed stochastic neighbor embedding (t-SNE) and kernel principal component analysis (KPCA) for fault detection. We show that using anomaly detection wi…

    Submitted 2 October, 2019; originally announced October 2019.

  2. Automatic Hyperparameter Tuning Method for Local Outlier Factor, with Applications to Anomaly Detection

    Authors: Zekun Xu, Deovrat Kakde, Arin Chaudhuri

    Abstract: In recent years, there have been many practical applications of anomaly detection such as in predictive maintenance, detection of credit fraud, network intrusion, and system failure. The goal of anomaly detection is to identify in the test data anomalous behaviors that are either rare or unseen in the training data. This is a common goal in predictive maintenance, which aims to forecast the immine…

    Submitted 1 February, 2019; originally announced February 2019.

    Comments: 15 pages, 5 figures

  3. arXiv:1811.06838

    stat.ML cs.LG math.NA

    The Trace Criterion for Kernel Bandwidth Selection for Support Vector Data Description

    Authors: Arin Chaudhuri, Carol Sadek, Deovrat Kakde, Wenhao Hu, Hansi Jiang, Seunghyun Kong, Yuwei Liao, Sergiy Peredriy, Haoyu Wang

    Abstract: Support vector data description (SVDD) is a popular anomaly detection technique. The SVDD classifier partitions the whole data space into an inlier region, which consists of the region near the training data, and an outlier region, which consists of points away from the training data. The computation of the SVDD classifier requires a kernel function, for which the Gaussian kernel is a common choic…

    Submitted 5 February, 2020; v1 submitted 15 November, 2018; originally announced November 2018.

    Comments: note: some text overlap with arXiv:1708.05106 because common background material is covered in both papers

  4. arXiv:1811.05561

    stat.AP cs.LG stat.ML

    A New SVDD-Based Multivariate Non-parametric Process Capability Index

    Authors: Deovrat Kakde, Arin Chaudhuri, Diana Shaw

    Abstract: Process capability index (PCI) is a commonly used statistic to measure ability of a process to operate within the given specifications or to produce products which meet the required quality specifications. PCI can be univariate or multivariate depending upon the number of process specifications or quality characteristics of interest. Most PCIs make distributional assumptions which are often unreal…

    Submitted 13 November, 2018; originally announced November 2018.

  5. A new bandwidth selection criterion for using SVDD to analyze hyperspectral data

    Authors: Yuwei Liao, Deovrat Kakde, Arin Chaudhuri, Hansi Jiang, Carol Sadek, Seunghyun Kong

    Abstract: This paper presents a method for hyperspectral image classification that uses support vector data description (SVDD) with the Gaussian kernel function. SVDD has been a popular machine learning technique for single-class classification, but selecting the proper Gaussian kernel bandwidth to achieve the best classification performance is always a challenging problem. This paper proposes a new automat…

    Submitted 5 April, 2019; v1 submitted 8 March, 2018; originally announced March 2018.

  6. arXiv:1709.00139

    stat.ML cs.LG

    Fast Incremental SVDD Learning Algorithm with the Gaussian Kernel

    Authors: Hansi Jiang, Haoyu Wang, Wenhao Hu, Deovrat Kakde, Arin Chaudhuri

    Abstract: Support vector data description (SVDD) is a machine learning technique that is used for single-class classification and outlier detection. The idea of SVDD is to find a set of support vectors that defines a boundary around data. When dealing with online or large data, existing batch SVDD methods have to be rerun in each iteration. We propose an incremental learning algorithm for SVDD that uses the…

    Submitted 1 November, 2018; v1 submitted 31 August, 2017; originally announced September 2017.

    Comments: 18 pages, 1 table, 4 figures

  7. arXiv:1708.05106

    cs.LG cs.AI stat.ML

    The Mean and Median Criterion for Automatic Kernel Bandwidth Selection for Support Vector Data Description

    Authors: Arin Chaudhuri, Deovrat Kakde, Carol Sadek, Laura Gonzalez, Seunghyun Kong

    Abstract: Support vector data description (SVDD) is a popular technique for detecting anomalies. The SVDD classifier partitions the whole space into an inlier region, which consists of the region near the training data, and an outlier region, which consists of points away from the training data. The computation of the SVDD classifier requires a kernel function, and the Gaussian kernel is a common choice for…

    Submitted 21 August, 2017; v1 submitted 16 August, 2017; originally announced August 2017.

    ACM Class: I.2.7

  8. Kernel Bandwidth Selection for SVDD: Peak Criterion Approach for Large Data

    Authors: Sergiy Peredriy, Deovrat Kakde, Arin Chaudhuri

    Abstract: Support Vector Data Description (SVDD) provides a useful approach to construct a description of multivariate data for single-class classification and outlier detection with various practical applications. Gaussian kernel used in SVDD formulation allows flexible data description defined by observations designated as support vectors. The data boundary of such description is non-spherical and conform…

    Submitted 19 May, 2017; v1 submitted 31 October, 2016; originally announced November 2016.

    MSC Class: 68T10; 62H99; 65Y20; 68T05 ACM Class: G.3; G.4; I.2.6

  9. arXiv:1607.07745

    cs.AI stat.AP stat.ME stat.ML

    Leveraging Unstructured Data to Detect Emerging Reliability Issues

    Authors: Deovrat Kakde, Arin Chaudhuri

    Abstract: Unstructured data refers to information that does not have a predefined data model or is not organized in a pre-defined manner. Loosely speaking, unstructured data refers to text data that is generated by humans. In after-sales service businesses, there are two main sources of unstructured data: customer complaints, which generally describe symptoms, and technician comments, which outline diagnost…

    Submitted 26 July, 2016; originally announced July 2016.

  10. arXiv:1607.07423

    cs.LG stat.AP stat.ME stat.ML

    A Non-Parametric Control Chart For High Frequency Multivariate Data

    Authors: Deovrat Kakde, Sergiy Peredriy, Arin Chaudhuri, Anya Mcguirk

    Abstract: Support Vector Data Description (SVDD) is a machine learning technique used for single class classification and outlier detection. SVDD based K-chart was first introduced by Sun and Tsung for monitoring multivariate processes when underlying distribution of process parameters or quality characteristics depart from Normality. The method first trains a SVDD model on data obtained from stable or in-c…

    Submitted 29 July, 2016; v1 submitted 25 July, 2016; originally announced July 2016.

    MSC Class: 62N05; 90B25 ACM Class: G.3; H.2.8

  11. arXiv:1606.05382

    cs.LG stat.AP stat.ML

    Sampling Method for Fast Training of Support Vector Data Description

    Authors: Arin Chaudhuri, Deovrat Kakde, Maria Jahja, Wei Xiao, Hansi Jiang, Seunghyun Kong, Sergiy Peredriy

    Abstract: Support Vector Data Description (SVDD) is a popular outlier detection technique which constructs a flexible description of the input data. SVDD computation time is high for large training datasets which limits its use in big-data process-monitoring applications. We propose a new iterative sampling-based method for SVDD training. The method incrementally learns the training data description at each…

    Submitted 25 September, 2016; v1 submitted 16 June, 2016; originally announced June 2016.

  12. arXiv:1602.05257

    cs.LG stat.AP stat.ML

    Peak Criterion for Choosing Gaussian Kernel Bandwidth in Support Vector Data Description

    Authors: Deovrat Kakde, Arin Chaudhuri, Seunghyun Kong, Maria Jahja, Hansi Jiang, Jorge Silva

    Abstract: Support Vector Data Description (SVDD) is a machine-learning technique used for single class classification and outlier detection. SVDD formulation with kernel function provides a flexible boundary around data. The value of kernel function parameters affects the nature of the data boundary. For example, it is observed that with a Gaussian kernel, as the value of kernel bandwidth is lowered, the da…

    Submitted 8 August, 2017; v1 submitted 16 February, 2016; originally announced February 2016.