A matrix multiplication test appears to be run on the AMX matrix co-processor, and behaves differently from in-core tests. And what Power modes really do.
matrix
There’s more to getting best performance and energy efficiency on Apple silicon. These vary greatly depending on how apps are coded, as shown here.
Some apps and other code doesn’t appear to run faster on M1 chips, and some even runs more slowly. Could this be a result of it not using the best acceleration for vectors and matrices?
