The Shortcut
GPT-5.4 is the first general-purpose model to surpass humans at operating a computer. When agents use screens instead of APIs, the integration layer that protected SaaS companies for two decades be...

Source: DEV Community
GPT-5.4 is the first general-purpose model to surpass humans at operating a computer. When agents use screens instead of APIs, the integration layer that protected SaaS companies for two decades becomes optional. OpenAI released GPT-5.4 on March 5, 2026. On OSWorld-Verified, a benchmark that measures whether a model can navigate a real desktop environment — opening applications, clicking menus, filling forms, switching windows — GPT-5.4 scored 75.0 percent. The human baseline is 72.4 percent. The previous version, GPT-5.2, scored 47.3 percent. This is the first general-purpose AI model to outperform the average human at using a computer. The same week, the model shipped with native financial plugins: ChatGPT embedded directly in Microsoft Excel and Google Sheets, plus integrations with FactSet, MSCI, Third Bridge, and Moody's. On OpenAI's internal investment banking benchmark, performance jumped from 43.7 percent with GPT-5 to 87.3 percent with GPT-5.4 Thinking. Google had already ship