Batched matrix computations on hardware accelerators based on GPUs. (May 2015)