Skip to main content

Showing 1–1 of 1 results for author: Mittal, R S

.
  1. arXiv:2108.05935  [pdf, other

    cs.LG

    Data Quality Toolkit: Automatic assessment of data quality and remediation for machine learning datasets

    Authors: Nitin Gupta, Hima Patel, Shazia Afzal, Naveen Panwar, Ruhi Sharma Mittal, Shanmukha Guttula, Abhinav Jain, Lokesh Nagalapatti, Sameep Mehta, Sandeep Hans, Pranay Lohia, Aniya Aggarwal, Diptikalyan Saha

    Abstract: The quality of training data has a huge impact on the efficiency, accuracy and complexity of machine learning tasks. Various tools and techniques are available that assess data quality with respect to general cleaning and profiling checks. However these techniques are not applicable to detect data issues in the context of machine learning tasks, like noisy labels, existence of overlap** classes… ▽ More

    Submitted 5 September, 2021; v1 submitted 12 August, 2021; originally announced August 2021.