Google’s new Android Bench ranks the top AI models for Android coding, with Gemini 3.1 Pro Preview leading Claude Opus 4.6 and GPT-5.2-Codex.
OpenAI teases “5.4” as reports point to a 1M+ token context window and “extreme reasoning,” amid scrutiny over its Pentagon deal.
Some results have been hidden because they may be inaccessible to you
Show inaccessible results