9 AI models were given the same prompt: build a Game Hub with Tic-Tac-Toe and Rock-Paper-Scissors, complete with multiplayer rooms and a dark theme. These are the actual apps they built, unedited.
Anthropic · Code: 37/40 · Tests: 11/11 · $1.42
Anthropic · Code: 36/40 · Tests: 10/11 · $5.06
OpenAI · Code: 35/40 · Tests: 8/11 · $0.28
Moonshot AI · Code: 35/40 · Tests: 11/11 · $0.50
MiniMax · Code: 33/40 · Tests: — · $0.20
Zhipu AI · Code: 30/40 · Tests: 7/11
Alibaba · Code: 29/40 · Tests: 10/11
Google · Build failed in R2
DeepSeek · Code: 22/40 · Build failed in R2
Part of ronnierocha.dev — Built by Ronnie Rocha