Skip to main content

Showing 1–1 of 1 results for author: Naeem, Z A

Searching in archive cs. Search in all archives.
.
  1. arXiv:2303.16909  [pdf, other

    cs.DB cs.AI

    RetClean: Retrieval-Based Data Cleaning Using Foundation Models and Data Lakes

    Authors: Mohammad Shahmeer Ahmad, Zan Ahmad Naeem, Mohamed Eltabakh, Mourad Ouzzani, Nan Tang

    Abstract: Can foundation models (such as ChatGPT) clean your data? In this proposal, we demonstrate that indeed ChatGPT can assist in data cleaning by suggesting corrections for specific cells in a data table (scenario 1). However, ChatGPT may struggle with datasets it has never encountered before (e.g., local enterprise data) or when the user requires an explanation of the source of the suggested clean val… ▽ More

    Submitted 29 March, 2023; originally announced March 2023.