A novel approach for effective web page classification. (1st January 2013)