
OpenAI's GPT-5.4 Beats Human Baseline on Desktop Computer Use
OpenAI launched GPT-5.4 on March 5 with a headline-grabbing achievement: a 75% score on the OSWorld-Verified benchmark for desktop navigation, surpassing the human baseline of 72.4%. This is the first general-purpose model with native computer-use capabilities — it can interact with software through screenshots, mouse commands, and keyboard inputs. The jump from GPT-5.2's 47.3% to 75% in a single generation is staggering, and it signals that autonomous AI agents capable of doing real work on your computer are no longer theoretical.


