Skip to main content

Showing 1–4 of 4 results for author: Thirumuruganathan, S

Searching in archive stat. Search in all archives.
.
  1. arXiv:2006.13025   

    cs.LG stat.ML

    Fair Active Learning

    Authors: Hadis Anahideh, Abolfazl Asudeh, Saravanan Thirumuruganathan

    Abstract: Machine learning (ML) is increasingly being used in high-stakes applications impacting society. Therefore, it is of critical importance that ML models do not propagate discrimination. Collecting accurate labeled data in societal applications is challenging and costly. Active learning is a promising approach to build an accurate classifier by interactively querying an oracle within a labeling budge… ▽ More

    Submitted 1 July, 2020; v1 submitted 20 June, 2020; originally announced June 2020.

    Comments: This was intended as a replacement of arXiv:2001.01796 please see the updated version there

  2. arXiv:2001.01796  [pdf, other

    cs.LG stat.ML

    Fair Active Learning

    Authors: Hadis Anahideh, Abolfazl Asudeh, Saravanan Thirumuruganathan

    Abstract: Machine learning (ML) is increasingly being used in high-stakes applications impacting society. Therefore, it is of critical importance that ML models do not propagate discrimination. Collecting accurate labeled data in societal applications is challenging and costly. Active learning is a promising approach to build an accurate classifier by interactively querying an oracle within a labeling budge… ▽ More

    Submitted 31 March, 2021; v1 submitted 6 January, 2020; originally announced January 2020.

  3. arXiv:1907.13276  [pdf, other

    cs.LG stat.ML

    Are Outlier Detection Methods Resilient to Sampling?

    Authors: Laure Berti-Equille, Ji Meng Loh, Saravanan Thirumuruganathan

    Abstract: Outlier detection is a fundamental task in data mining and has many applications including detecting errors in databases. While there has been extensive prior work on methods for outlier detection, modern datasets often have sizes that are beyond the ability of commonly used methods to process the data within a reasonable time. To overcome this issue, outlier detection methods can be trained over… ▽ More

    Submitted 30 July, 2019; originally announced July 2019.

    Comments: 18 pages

  4. arXiv:1809.11084  [pdf, other

    cs.DB cs.LG stat.ML

    Reuse and Adaptation for Entity Resolution through Transfer Learning

    Authors: Saravanan Thirumuruganathan, Shameem A Puthiya Parambath, Mourad Ouzzani, Nan Tang, Shafiq Joty

    Abstract: Entity resolution (ER) is one of the fundamental problems in data integration, where machine learning (ML) based classifiers often provide the state-of-the-art results. Considerable human effort goes into feature engineering and training data creation. In this paper, we investigate a new problem: Given a dataset D_T for ER with limited or no training data, is it possible to train a good ML classif… ▽ More

    Submitted 28 September, 2018; originally announced September 2018.