testified.ai Logo

OpenAI Launches GPT-5.4 with Desktop Automation Skills

OpenAI's latest GPT-5.4 release marks a significant step forward, introducing a model that now outperforms the average human in desktop navigation and complex professional tasks. This update is part of a broader wave of innovation, with new specialized tools emerging for creative work, development, agriculture, and personal productivity, signaling a major expansion of AI's practical applications.

OpenAI's GPT-5.4 Release: A New Benchmark in AI Capability

it has been only been a few days since OpenAI has rolled out GPT-5.3 Instant as the new default model for all ChatGPT users, and now they have officially launched GPT-5.4 and GPT-5.4 Pro, its most advanced models to date, now available across ChatGPT, the API, and Codex. The new models integrate significant improvements in reasoning, coding, and tool use, with a particular focus on professional workflows involving documents, spreadsheets, and software environments. This is a key part of the GPT-5.4 release, aiming to make AI a more capable coworker.

The model's ability to operate a computer now surpasses the average person. It scored 75% on the OSWorld benchmark for desktop navigation, outperforming the human baseline of 72.4%. For knowledge work, GPT-5.4 matched or exceeded human professionals 83% of the time on the GDPval benchmark, a notable increase from 70.9%.

Key Features and Performance

GPT-5.4 supports up to 1 million tokens of context and introduces a new "x-high" reasoning effort setting, enabling it to handle complex, multi-hour tasks. Community feedback has been strong, with experts calling it the "best model in the world" for its Opus-level planning and coding abilities. Additionally, OpenAI has introduced a ChatGPT add-in for Excel, allowing users to build models and analyze data directly within their workbooks.

The launch comes with a strong statement from OAI researcher Noam Brown: "We see no wall” suggesting that the pace of AI advancement is not slowing down.

Creative and Developer Tools See Major Upgrades

The industry is buzzing with new tools for creators and developers. Luma AI unveiled Luma Agents, a system powered by its Uni-1 model that handles end-to-end creative campaigns. These agents can manage text, images, video, and audio, and even coordinate with external models from Google and ElevenLabs, refining their output through self-critique.

For video professionals, Lightricks released LTX-2.3, an upgraded open-source video model offering more detail and cleaner audio, alongside a free local video editor, LTX Desktop. In the developer space, Cursor Automations now allows for the creation of always-on agents that run on schedules or event triggers, such as PR merges or Slack messages, to continuously improve a codebase.

Specialized Tools for Niche Applications

Innovation isn't limited to creative and coding tasks. The agricultural sector is seeing new AI-powered solutions to reduce herbicide use.

  • LaserWeeder G2 by Carbon Robotics uses cameras and lasers to identify and destroy 450,000 weeds per hour with 99% accuracy.

  • Stout Industrial's Smart Cultivator uses AI-guided mechanical blades to physically remove weeds.

  • John Deere’s See & Spray system identifies and sprays only weeds, significantly cutting down on chemical usage.

Other notable tool launches focus on productivity and business operations.

Tool Name

Primary Function

CFO X

An AI Personal CFO for financial decision-making.

Textio

Assists in recruiting and developing high-performing teams.

Monodesk

An AI-powered workspace for freelance creative professionals.

Daily Stack

Summarizes emails, tasks, and calendars into a daily briefing.

Krisp

Records, transcribes, and summarizes calls into action items.

Willow Teams

Converts speech into formatted text with enterprise-grade security.

Vela

Automates complex multi-party scheduling across various platforms.

This wave of new and updated tools demonstrates a clear trend toward specialized AI assistants that are deeply integrated into professional workflows. You can find more in-depth analysis in our AI tool reviews section.

#AI Models#OpenAI#GPT-5.4#AI Tools#Luma Agents#Automation
Tamás Bőzsöny
Partnership Manager, System Auditor

Meet Tamás Bőzsöny, Senior Systems Auditor at testified.ai. With 22 years in digital media forensics and 15 years as a software workflow coach, Tamás leverages his background as a professional accountant to audit AI tools for UI efficiency, technical integrity, and financial ROI.