Optimizations in a high-performance conjugate gradient benchmark for IA-based multi- and many-core processors. (February 2016)