I'm curious to see how small the box is.
I know that may sound completely trivial to most people, but if this thing is a media center that wants to compete with Apple TV's and Roku's, then it has to be smaller than a 360. Doesn't have to be the size of an Apple TV, but somewhere in between.
Speaking of that, I think that MS is going to be ahead of the curve on how you control the box compared to Apple. I don't know how many of you have seen the Apple 'rings,' but they basically exist (or existed if they were can) because they're a more intelligent way to navigate the UI of an Apple TV. Now what's the problem with that? People don't want to wear dumb rings if all they do is control the TV. I think wearable tech is the future, but not like that.
Now, the reason I bring this up is that Kinect 2.0 may possibly be a huge breakthrough in this field. Microsoft basically did a nation-wide field test with the original Kinect, but now that they're going to have better tech (who knows how detailed the scanning/reading will be) and people who better understand (remote)controller-less navigation than they did when the Kinect came out. Not to mention all of this is built from the ground up with Kinect 2.0 in mind, where as they did their best to change and mold things about the 360 that were already in place.
It truly is a great time to love technology.