With that being said, so far these are the SE count of RDNA4.
Navi44 - 16WGP (2SE, 32CU, 64ROPs)
Navi48 - 32WGP (4SE, 64CU, 128ROPs)
So the WGP per SE is still the same as RDNA3.
8WGP or 16CU per SE.
PS5 having 4SE is more probable than 2SE.
2SE with that many CU seems to be inefficient as well.
For higher end RDNA4 going by Navi44/48, it appears to be still 8WGP or 16CU per SE.
And it seems each SED houses 2 Shader Engines.
Navi41
1SED = 2SE/16WGP(32CU)
6SED = 12SE/96WGP(192CU)
1AID = 2×64bit or (4×32bit GDDR PHY)
2AID = 4×64bit GDDR PHY = 256bit
Navi40
1SED = 2SE/16WGP(32CU)
9SED = 18SE/144WGP(288CU)
1AID = 2×64bit or (4×32bit GDDR PHY)
3AID = 6×64bit GDDR PHY = 384bit