Claude Code Skills 2.0 adds evals and benchmark test sets; the changes target skill reliability as models update over time.
OpenAI launches GPT-5.4 across ChatGPT, API, and Codex with stronger reasoning, coding, and computer-use capabilities.
Called Automations, the new system gives users a way to automatically launch agents within their coding environment, triggered by a new addition to the codebase, a Slack message, or a simple timer.