AMD interview question

How to optimize matrix multiplication on CPU & GPU