What changed CPU performance from the Macintosh 128K to the M3?

How have the CPUs in our Macs become faster since the Macintosh 128K was launched by Steve Jobs forty years ago?

Code in ARM Assembly: Lanes and loads in NEON

How ARM64 uses its special SIMD registers in lanes, and how they can be loaded with and without de-interleaving.

Three recent WWDC sessions extolling Apple’s “extensive reference material” and Xcode can’t find anything on these rich and extensive libraries.

More cores are great for running more processes, but how can you make individual operations within a process faster? SIMD is one solution.

Benchmarking 32-bit Float vector dot-product calculations using Swift, NEON assembly, and Apple’s SIMD libraries, on Intel and M1 Macs.