A benchmark comparison of deterministic and probabilistic methods for defining manual review datasets in duplicate records reconciliation. (23rd May 2013)