Support NeoGAF

Argyle · Aug 25, 2005

Not sure if it's old, but...

http://www-128.ibm.com/developerworks/power/cell/

Registration required for the good stuff.

Enjoy

Panajev2001a · Aug 25, 2005

Oh YEAH!

*Does the Pana dance*

duderon · Aug 25, 2005

holy shit, he's alive.

Argyle · Aug 25, 2005

C'mon, if anything was going to bring Pana back it was gonna be the public release of information on CELL

There should be more posts in here! This is the true Wednesdayton...

Panajev2001a · Aug 25, 2005

Argyle said:
C'mon, if anything was going to bring Pana back it was gonna be the public release of information on CELL

Hey, I like Signal and not the usual GAF/B3D noise, so you hit the nail on the head

.

ChryZ · Aug 25, 2005

Same documents, no registration:

http://cell.scei.co.jp/e_download.html

Argyle · Aug 25, 2005

*cough*

(Hey, for once I wasn't beaten to the punch at something here, I gotta bump my own thread...

)

trmas · Aug 25, 2005

So can anyone decipher it, and put it in normal people terms?

Fafalada · Aug 25, 2005

IBM is being nice - now if only NVidia was HALF this nice and at least released technical documentation to developers

inpHilltr8r · Aug 25, 2005

Hey Argyle! Nice avatar

TheInkyVoid · Aug 25, 2005

trmas said:
So can anyone decipher it, and put it in normal people terms?

Of the five docs released, only:

CBE_Architecture_v10.pdf
SPU_ISA_v10.pdf

are going to have much info interesting to a non-engineer. Section 1. of those two docs are general overviews that should be somewhat understandable by someone with a moderate amount of computer knowledge.

The rest of the docs are really targeted at people writing code for Cell chips. Interesting to quickly browse through if you aren't a software engineer, but 90 percent of the info in the pdfs is very low level.

Probably the most interesting thing to look at in the docs is the diagram on page 20 of CBE_Architecture. Note how the SPUs and PPEs are labled 0..N connected to the Element Interconnect Bus. The Broadband Engine in the PS3 is just the first of many Cell chips to come. The architecture of Cell is designed to scale at will by adding more and more cores.

Once you have migrated your code to one Cell chip, it should be trivial to move it to more powerful Cell chips with more cores in the future. The PS3 will not be upgraded over the life of the platform, but taking a wider view of what Sony and NVidia are planning on working on together beyond the PS3, Cell will become a common media platform that scales from the smallest to the largest computing devices.

antipode · Aug 25, 2005

Does this surprise anyone, from the language extensions doc (p. 20)?

-----------------
... Programmer-directed branch prediction is provided using an enhanced version of GCCs __builtin_expect function.... For dynamic prediction, the value argument can be either a compile-time constant or a variable....

Dynamic Prediction Example
cond2 = ... /* predict a value for cond1 */
...
cond1 = ...
if (__builtin_expect(cond1, cond2)) {
foo();
}
cond2 = cond1; /* predict that next branch is the same as the previous */
------------------

In comparison, this is what the GCC 4.0 documentation says
(http://gcc.gnu.org/onlinedocs/gcc-4.0.0/gcc/Other-Builtins.html#Other-Builtins):

-----------------
The value of c must be a compile-time constant.
-----------------

That example Sony gave seems like a pretty powerful enhancement.

Argyle · Aug 26, 2005

inpHilltr8r said:
Hey Argyle! Nice avatar

Hey, thanks! I made it myself - total programmer art.

aaaaa0 · Aug 26, 2005

antipode said:
That example Sony gave seems like a pretty powerful enhancement.

I expect you can get much of the same effect via profiler guided optimization (which is in VC++ 2005) without having to add any language extensions.

This would also tend to be less effort for a dev to implement, because it's pretty much automatic and you can apply it over your entire program, not just the pieces you sit down to optimize.

Fafalada · Aug 26, 2005

aaaaa0 said:
I expect you can get much of the same effect via profiler guided optimization (which is in VC++ 2005) without having to add any language extensions.

Erhm - the point is that branch hints are dynamic in hardware - if you can only issue static hints no language extension would make dynamic ones happen.

Remember this is CPUs with no branch predictor, I completely agree profiler guided optimization would be great to have, but this extension will IMO still be usefull with or without it.

Anyway, having gone through the docs, I see no microarchitecture details, instruction latencies etc

Yes we've seen some numbers about that before, but there were also some conflicting info (on load/store in particular), I'd like to see final word on it

aaaaa0 · Aug 26, 2005

Fafalada said:
Remember this is CPUs with no branch predictor, I completely agree profiler guided optimization would be great to have, but this extension will IMO still be usefull with or without it.

Good point, it is a nice extension to have on a CPU with no branch predictor.

PGO is really nice. If your profiler runs are representative of real data sets your app is going to process, then the branch hints inserted by the compiler will tend to be right more of the time, plus the optimizer will be able to do nice things like automatically reorder your code to improve locality and all sorts of other goodies -- best of all it doesn't make me do any extra work.

antipode · Aug 26, 2005

Yeah, I agree that PGO is going to be what you want to use 99% of the time. I could see some useful game things this dynamic branch prediction could do that the static b.p. that (I presume) PGO is currently doing couldn't though - like dynamically predicting whether a collison happens or not based upon whether a region is more or less sparse or a counter of the last few collisions. Even on-chip branch prediction can only get so close to that ideal.

I wonder if PGO is going to be improved to insert dynamic branch prediction. I also wonder - is it useful to expect a variable is going to be equal to itself?
__builtin_expect(cond1,cond1)
I'm curious how this is implemented.

Support NeoGAF

CELL technical documentation

Argyle

Member

Panajev2001a

GAF's Pleasant Genius

duderon

rollin' in the gutter

Argyle

Member

Panajev2001a

GAF's Pleasant Genius

ChryZ

Member

Argyle

Member

trmas

Banned

Fafalada

Fafracer forever

inpHilltr8r

Member

TheInkyVoid

Member

antipode

Member

Argyle

Member

aaaaa0

Member

Fafalada

Fafracer forever

aaaaa0

Member

antipode

Member

Similar threads