LordOfChaos
Member
Let's see here...
-An in-order CPU with SMT commanding wide SIMD units, cutting complexity compared with out-of-order designs so that more transistors go to SIMD and other functions that make things fast
-Little or no cache, it largely uses local memory instead, same idea as above
-No GPU in the mix, everything done on a CPU. GPUs just ended up being good at massively parallel compute, but you don't /need/ a GPU to do that if you're not a GPU design company to start with.
-Heavy focus on fabric bandwidth: a unit can do a job and quickly pass it off, doing both a calculation and a transfer in the same cycle
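For anyone who hasn't programmed this style of machine, here's a rough toy sketch of what the list above means in practice: the program itself copies a chunk into a small local buffer, runs the same operation over every element, then hands the chunk off. It's plain Python, so it only models the pattern, not real SIMD or DMA hardware, and LOCAL_STORE_WORDS and process_stream are names I made up for illustration, not anything from Cell, A64FX, or Dojo.

```python
# Toy model only: LOCAL_STORE_WORDS and process_stream are hypothetical names,
# not part of any real Cell, A64FX, or Dojo toolchain.
LOCAL_STORE_WORDS = 8  # pretend the core's local memory holds 8 values


def process_stream(main_memory, scale, offset):
    """Stream data through a small software-managed buffer, chunk by chunk."""
    out = []
    for start in range(0, len(main_memory), LOCAL_STORE_WORDS):
        # Explicit copy-in: the program, not a cache, decides what is local.
        local = main_memory[start:start + LOCAL_STORE_WORDS]
        # Same operation on every element of the chunk (the SIMD-friendly part).
        local = [x * scale + offset for x in local]
        # Explicit copy-out: pass the finished chunk along to the next stage.
        out.extend(local)
    return out


result = process_stream(list(range(20)), scale=2, offset=1)
```

The point is that nothing here depends on a cache guessing what you'll need next. The code says exactly what moves where, which is the part that made Cell a pain to program and also what lets the hardware stay simple.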
The world's top supercomputer, Fugaku, shares a lot of similar principles: there's no GPU in the mix, but the A64FX CPUs have a heavy focus on SIMD. A CPU-only system becoming the top supercomputer in the world is wild!
Of course, this is not a thread saying Cell was comparable in performance to this 16 years ago, but the ideas that seemed bizarre and complicated back then are now well represented in the world's top supercomputer and in Tesla's Dojo, which may also have a shot up there at a targeted 1.1 exaflops.
Whether it was worth it in a game console is a whole different debate, and I'd tend to side with no. But until 2009 it was also powering another top supercomputer before development waned, and now these ideas are back in vogue. I wonder whether, if they had kept going with it, it could have been represented in something like these.
Just thought this was interesting while watching AI Day.