Let me explain. When the IGF chose TestFlight for iOS distribution, they made a big mistake. We were given a list of all of our judges’ email addresses, which revealed their identities. Out of respect for the judges, we aren’t going to release those names, but let’s just say we had a heavy-hitter. For every judge, we could see how much they played, or whether they even started the game at all. How do we know this?
TestFlight records NSLog output (the iOS equivalent of console logging) and custom “checkpoints,” and uploads this data seamlessly for us to see. In addition, when a judge opened the TestFlight invitation email, downloaded the game, and installed it on their iDevice, we could see all of that. I believe that the IGF organizers, who are usually tight-lipped about the judging process, did not know about this functionality. We could see exactly when a judge installed the game, when they started playing, how long they played, and how far they got.
As you can imagine, this was an opportunity for us to see what really goes on behind closed doors at the IGF. How much do games really get played? Does hype count for everything? Is it true that to be a contender in the current IGF, your game has to already be widely known in indie circles? Does this mean that most of the judges won’t end up playing your game in these circumstances regardless of the quality of the title?
Here are the statistics:
Eight (8) judges were assigned to Kale In Dinoland. Of those judges, 1 didn’t install the game or respond to any of our invitations (which we had to send multiple times before judges joined). In all, 3 judges didn’t play the game, counting the one who never installed it. Of the remaining 5 judges who played the game, 3 played it very close to the IGF deadline, which was December 5th. One judge, our outlier, played the game for 53.2 minutes. Excluding the outlier, the other 7 judges, including the 3 who didn’t play, averaged almost 5 minutes each. Back in that build, Kale’s intro cutscene took about a minute. So we’re talking almost 4 minutes of actual game time per judge.
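To make the arithmetic above easy to check, here is a minimal sketch in Python. It uses only the figures quoted in this post; the one assumption is that "almost 5 minutes" is rounded to exactly 5.0, so every derived number is approximate. The function and constant names are just for illustration.

```python
# Figures quoted in the post. AVG_EXCL_OUTLIER is an assumption:
# the post says "almost 5 minutes", rounded here to 5.0.
JUDGES_TOTAL = 8
NON_PLAYERS = 3
OUTLIER_MINUTES = 53.2
AVG_EXCL_OUTLIER = 5.0
CUTSCENE_MINUTES = 1.0


def judge_summary():
    """Reproduce the post's per-judge averages from the quoted figures."""
    judges_excl_outlier = JUDGES_TOTAL - 1                  # 7 judges
    players = JUDGES_TOTAL - NON_PLAYERS                    # 5 judges played
    total_excl_outlier = AVG_EXCL_OUTLIER * judges_excl_outlier
    # The post subtracts the one-minute cutscene from the overall average.
    gameplay_avg = AVG_EXCL_OUTLIER - CUTSCENE_MINUTES
    # For comparison: the average if the outlier is left in.
    avg_incl_outlier = (total_excl_outlier + OUTLIER_MINUTES) / JUDGES_TOTAL
    return {
        "players": players,
        "total_minutes_excl_outlier": total_excl_outlier,
        "gameplay_avg_excl_outlier": gameplay_avg,
        "avg_incl_outlier": avg_incl_outlier,
    }


if __name__ == "__main__":
    for name, value in judge_summary().items():
        print(f"{name}: {value:.2f}")
```

Under that rounding, the 7 non-outlier judges logged roughly 35 minutes between them, and leaving the outlier in would lift the average to about 11 minutes, which is why the outlier is reported separately.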
Granted, they could have decided the game was absolutely terrible and didn’t deserve their time. Around this time, though, we were also running a beta that was being played by anonymous iOS gamers from the community. These helpful gamers were all interested in the game, having seen it on TouchArcade and IndieGames.com. What is the influence of prior marketing? The average play time for these external beta testers was 34 minutes, accounting for that one minute of cutscene time.
So, a large group of anonymous gamers who were not required to play the game averaged about 30 minutes more play time than the 7 judges who were required to play the game, 3 of whom did not even play it. Is 4 minutes enough time for someone to give a fair assessment of a 2-hour-long game? How many more games were given similar treatment? Had we not taken the initiative and sent multiple emails urging judges to download the game via TestFlight, how many judges would have ended up playing the game?
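For a sense of scale, the comparison can be worked through in a few lines of Python. This is only a sketch using the rounded figures from this post (4 minutes per judge, 34 minutes per beta tester, a 2-hour game). It assumes the two averages are measured the same way, and the function name is hypothetical.

```python
# Rounded figures from the post; all values are in minutes.
JUDGE_AVG_GAMEPLAY = 4.0    # judges, excluding the outlier
BETA_AVG_PLAYTIME = 34.0    # anonymous beta testers
GAME_LENGTH = 120.0         # "2-hour-long game"


def compare_playtime():
    """Return the gap, the ratio, and the share of the game judges saw."""
    gap = BETA_AVG_PLAYTIME - JUDGE_AVG_GAMEPLAY
    ratio = BETA_AVG_PLAYTIME / JUDGE_AVG_GAMEPLAY
    share_seen = JUDGE_AVG_GAMEPLAY / GAME_LENGTH
    return gap, ratio, share_seen


if __name__ == "__main__":
    gap, ratio, share_seen = compare_playtime()
    print(f"Beta testers played {gap:.0f} minutes longer ({ratio:.1f}x).")
    print(f"Judges saw about {share_seen:.1%} of the game on average.")
```

On these numbers the beta testers played 8.5 times as long as the judges, and the average judge saw a little over 3% of the game.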