testified.ai Logo

Top AI Tool Updates: Gemini Computer Use & Claude Tag

The ecosystem of AI tool updates has expanded drastically this week. Google has introduced native computer-use capabilities for Gemini 3.5 Flash, while Anthropic launched Claude Tag to bring context-aware AI directly into Slack. Developers are also seeing a massive influx of specialized agent frameworks.

Native Desktop Automation and Agentic Browsing

One of the most consequential AI tool updates is the introduction of native computer use for the Gemini 3.5 Flash platform. Google has enabled this lightweight model to interact directly with digital desktop interfaces by processing continuous screenshots. This allows the AI to seamlessly execute clicks, scrolls, and typing actions across entirely different software environments.

A parallel development in browser automation comes from Aside's new agentic browser. Backed by YC, this browser turns your history into local on-device memory and uses autofill to sign into accounts without human intervention. Its launch video notably demonstrated the AI identifying and canceling unused subscriptions independently, showcasing a new era of task-oriented web surfing.

However, automation efforts have also sparked internal friction. Former Google engineer Justin Poehnelt was reportedly fired after creating the open-source Google Workspace CLI. This wildly popular tool allowed humans and AI agents to control Gmail, Drive, and Docs from a single command line, highlighting a massive demand for better administrative controls.

Team Collaboration and Development Workflows

For team environments, Anthropic has released Claude Tag for its Team and Enterprise users. By mentioning the assistant in a Slack channel, the AI jumps into the conversation while retaining the context of the entire thread. It even features an Ambient Mode that proactively flags issues, such as spotting login errors in support channels and alerting the engineering team.

Design and productivity platforms are also rolling out extensive AI tool updates. Figma's recent Config event introduced the ability to turn design layers directly into code, editable shaders, and third-party connections for Figma Agent. Meanwhile, Notion's new developer platform is adding code-based workflows and the ability to integrate external agents like the Cursor editor directly into shared task boards.

Model Frameworks and Fine-Tuning

Engineers have several new infrastructure tools at their disposal. NVIDIA launched NeMo AutoModel on Hugging Face to optimize the fine-tuning of massive Mixture-of-Experts architectures like Qwen3. By utilizing Expert Parallelism, it delivers up to a 3.7x increase in training throughput. Other significant AI tool updates in the model space include:

  • GLM-5.2: A general agent framework that has received high praise for its coding harness capabilities.
  • Qwen-AgentWorld: Alibaba's language world models trained on over 10 million interaction trajectories to simulate agent environments.
  • Orca: An open-source Agent Development Environment built to manage fleets of parallel coding agents.
  • Modal Auto Endpoints: A tool for running open models in production using a single command.
  • Executor: An open-source gateway designed to connect AI agents to external services.

Specialized Industry Solutions

Vertical-specific AI tool updates are pushing boundaries in law, biology, and voice communications. In the legal sector, the Perplexity Computer for Counsel platform automates administrative research and contract triage. Harvey Labs is also contributing to this space by developing legal foundation models tailored for firm-owned intelligence.

In the life sciences, NVIDIA released the BioNeMo Agent Toolkit, enabling AI agents to act as junior scientists by reading papers and generating hypotheses. Nabla Bio's JAM-2 model has successfully designed drug-quality antibodies directly from a computer, matching traditional lab discovery rates. Additionally, the Arc Institute released Proto, an open framework combining multiple AI biology tools for complex protein and RNA design.

Voice and Audio Innovations

Voice AI is becoming more reliable thanks to targeted infrastructure improvements. AssemblyAI launched Universal-3.5 Pro Realtime, which uniquely uses the AI agent's side of a call as context for better transcription.

To improve these inputs, tools like AI-coustics are actively cleaning up background noise in real-time. For outbound tasks, AgenticCalling and Asmi allow users to deploy agents to handle real phone calls, such as booking hotels or managing customer service lines.

Rapid-Fire Productivity Apps

Finally, the market is flooded with high-utility applications designed to automate daily friction. Here is a summary of the latest productivity releases:

Tool NamePrimary Function
Mercury CommandAI built into banking for automated invoice payments.
RelineGenerates comprehensive meeting notes without deploying a visible bot.
TesanaAllows users to generate entire playable games via text prompts.
HaroldExtracts and validates invoice data before importing it into ERPs.
SnapVee StudioAn all-in-one content workspace for marketers and educators.
Exa ConnectWeb agents designed to seamlessly query Crunchbase and ZoomInfo.
Genspark DesignGenerates UI prototypes, videos, and complex HTML animations.
HubbleA markdown notepad for agents featuring live HTML previews.
LocalClickyAn offline, open-source Mac voice assistant with zero data tracking.

Tracking these AI tool updates is critical for professionals looking to maintain a competitive edge. From browser manipulation to complex biological modeling, the scope of automated intelligence is broadening at an unprecedented pace.

#AI Tools#Agentic Workflows#Browser Automation#Coding Frameworks
Tamás Bőzsöny
Partnership Manager, System Auditor

Meet Tamás Bőzsöny, Senior Systems Auditor at testified.ai. With 22 years in digital media forensics and 15 years as a software workflow coach, Tamás leverages his background as a professional accountant to audit AI tools for UI efficiency, technical integrity, and financial ROI.

Frequently Asked Questions

Google has given Gemini 3.5 Flash native computer-use capabilities. It processes continuous screenshots to seamlessly execute click, scroll, and typing actions across varied digital desktop interfaces.