Scalable multi-relaxation-time lattice Boltzmann simulations on multi-GPU cluster. (30th March 2015)