Grid-aware approach to data statistics, data understanding and data preprocessing. (8th June 2009)