Scalability aspects of data cleaning