Understanding web documents: finding pagelets for transformation using structural patterns. (16th July 2008)