A fast and effective partitional clustering algorithm for large categorical datasets using a k-means based approach. (May 2018)