No, I'm saying you look at the buttons (I suppose I should have said "button prompts") because in a QTE the button prompts are there on screen.
And yes, of course music games are all about being in the zone. That's what I was saying. In music games, your responses become reflexive: you take in what you're seeing on screen and hearing, interpret it, and hit the correct button without really thinking about it. In a QTE, you're explicitly told what button to hit. There's no interpretation step, and, for me, there's no way to get "in the zone", because the screen is constantly telling me what to do. For me, "the zone" is about knowing instinctively what to do and executing it, and QTEs don't allow that.