A comparison of data preparation approaches for e-mail categorisation. (21st August 2007)