A formal and empirical comparison of two score measures for best–worst scaling. (December 2016)