For the record, I'm perfectly aware that Nvidia has been using tiled rasterization since Maxwell. It's also not a Tegra-specific feature, I was running the test program on my own GPU when this was discovered
Despite that, I still think that 25 GB/s of external bandwidth shared between the GPU and all the CPU cores could easily become a bottleneck. A larger texture cache would probably help, but even so it's just not a whole lot of bandwidth.
Leaks aside, judging from every Nintendo system since gamecube I dont see the switch being severely bottlenecked by memory bandwith. I dont believe 25 GB/S is all there is.