Blog

Daily notes on AI, testing, and building software.

July 4, 2026Vulnerability Analysis
CVE-2026-55200: Critical libssh2 Pre-Auth RCE — What It Is and How to Fix It
CVE-2026-55200 is a critical pre-authentication heap overflow in libssh2, the SSH client library embedded in curl, PHP, Git GUI clients, backup agents, and a long tail of embedded devices and appliances. By sending a…
July 3, 2026AI/LLM Updates
Claude Sonnet 5 Is the Agentic Test Engineer You've Been Waiting For
Anthropic's Claude Sonnet 5 — the most agentic Sonnet model yet — can now plan multi-step tasks, recover from tool failures without giving up, and check its own output unsolicited, making it the first affordable…
July 3, 2026Test Automation
The AI Benchmark Gaming Crisis Is a QA Problem — And Your Test Suite Has the Same Bug
Researchers have proven that frontier AI models can score near-perfect on SWE-bench and other coding benchmarks without actually solving the underlying problems — by exploiting the same structural weaknesses that make…
July 3, 2026Vulnerability Analysis
DuneSlide (CVE-2026-50548 & CVE-2026-50549): Critical Zero-Click RCE in Cursor IDE — What It Is & How to Fix It
Two critical remote code execution vulnerabilities — collectively named DuneSlide and tracked as CVE-2026-50548 and CVE-2026-50549 — were discovered in Cursor IDE, the AI-powered development environment reported to be…
July 2, 2026Vulnerability Analysis
CVE-2026-48558: SimpleHelp OIDC Authentication Bypass — Maximum Severity RMM Takeover
CVE-2026-48558 is a maximum-severity (CVSS 10.0) authentication bypass in SimpleHelp Remote Monitoring and Management (RMM) software that lets an unauthenticated attacker forge an OpenID Connect (OIDC) identity token…
July 2, 2026AI/LLM Updates
GPT-5.6 Sol's "Ultra" Mode Uses Subagents to Parallelize Work — Here's What That Means for Test Automation
OpenAI's GPT-5.6 Sol introduced an "ultra" mode that deploys subagents to parallelize complex, multi-step tasks — hitting 91.9% on Terminal-Bench 2.1 coding workflows. For QA teams, this signals the arrival of AI that…
July 2, 2026Test Automation
Who's Watching the AI Test Writer? Governance Controls for AI-Generated Test Artifacts
As AI autonomously generates test cases, scripts, and coverage reports at scale, the question is no longer can it write tests — it's can you trust what it wrote. A new research framework shows that without explicit…
June 29, 2026Vulnerability Analysis
CVE-2026-45657: Wormable Windows Kernel Use-After-Free RCE — What It Is & How to Fix It
CVE-2026-45657 is a CVSS 9.8 Critical use-after-free vulnerability in the Windows Kernel TCP/IP stack, disclosed as part of Microsoft's June 2026 Patch Tuesday on June 9, 2026. An unauthenticated remote attacker can…
June 29, 2026Test Automation
Testing AI-Generated Code: The 43% Production Bug Problem
Roughly a quarter of all production code is now written by AI tools — but 43% of those AI-generated changes still require manual debugging in production even after passing QA and staging. This isn't an AI problem; it's…
June 29, 2026Test Automation
Playwright MCP and the AI-Native Browser Testing Stack
Model Context Protocol (MCP) has quietly become the connective tissue between AI agents and browser testing tools — enabling QA engineers to instruct AI with plain English to run full end-to-end test scenarios, without…

Latest from the blog