Skip to main content

Showing 1–3 of 3 results for author: Sadek, C

Searching in archive cs. Search in all archives.
.
  1. arXiv:1811.06838  [pdf, other

    stat.ML cs.LG math.NA

    The Trace Criterion for Kernel Bandwidth Selection for Support Vector Data Description

    Authors: Arin Chaudhuri, Carol Sadek, Deovrat Kakde, Wenhao Hu, Hansi Jiang, Seunghyun Kong, Yuewei Liao, Sergiy Peredriy, Haoyu Wang

    Abstract: Support vector data description (SVDD) is a popular anomaly detection technique. The SVDD classifier partitions the whole data space into an inlier region, which consists of the region near the training data, and an outlier region, which consists of points away from the training data. The computation of the SVDD classifier requires a kernel function, for which the Gaussian kernel is a common choic… ▽ More

    Submitted 5 February, 2020; v1 submitted 15 November, 2018; originally announced November 2018.

    Comments: note: some text overlap with arXiv:1708.05106 because common background material is covered in both papers

  2. arXiv:1708.05106  [pdf, other

    cs.LG cs.AI stat.ML

    The Mean and Median Criterion for Automatic Kernel Bandwidth Selection for Support Vector Data Description

    Authors: Arin Chaudhuri, Deovrat Kakde, Carol Sadek, Laura Gonzalez, Seunghyun Kong

    Abstract: Support vector data description (SVDD) is a popular technique for detecting anomalies. The SVDD classifier partitions the whole space into an inlier region, which consists of the region near the training data, and an outlier region, which consists of points away from the training data. The computation of the SVDD classifier requires a kernel function, and the Gaussian kernel is a common choice for… ▽ More

    Submitted 21 August, 2017; v1 submitted 16 August, 2017; originally announced August 2017.

    ACM Class: I.2.7

  3. arXiv:1408.5427  [pdf, other

    stat.ML cs.CL cs.IR cs.LG

    A Case Study in Text Mining: Interpreting Twitter Data From World Cup Tweets

    Authors: Daniel Godfrey, Caley Johns, Carl Meyer, Shaina Race, Carol Sadek

    Abstract: Cluster analysis is a field of data analysis that extracts underlying patterns in data. One application of cluster analysis is in text-mining, the analysis of large collections of text to find similarities between documents. We used a collection of about 30,000 tweets extracted from Twitter just before the World Cup started. A common problem with real world text data is the presence of linguistic… ▽ More

    Submitted 21 August, 2014; originally announced August 2014.

    ACM Class: I.5.4; I.2.7; H.2.8; H.3.3