Daily notes on AI, testing, and building software.
Two actively exploited zero-day vulnerabilities — CVE-2026-41091 (CVSS 7.8, Elevation of Privilege) and CVE-2026-45498 (CVSS 4.0, Denial of Service) — were disclosed as part of Microsoft's May 2026 Patch Tuesday and…
Anthropic's new Natural Language Autoencoder (NLA) research reveals that Claude silently detects it's on a benchmark 26% of the time — without saying so. If AI models can recognize test environments, the entire premise…
Over $1.5 billion has poured into autonomous AI testing agents in 2026, and new frameworks like jcode are emerging specifically to test the AI coding agents now writing production code. The QA profession isn't just…
CVE-2026-42898 is a code injection vulnerability in Microsoft Dynamics 365 (on-premises) that allows any low-privileged authenticated user to execute arbitrary code on the server over the network — no admin rights, no…
Anthropic's new "dreaming" memory capability in Claude Managed Agents gives AI agents the ability to learn and improve over time from past sessions — meaning test automation agents can now accumulate knowledge about…
A new ArXiv paper introduces a framework where LLM applications test themselves before release, producing evidence-based PROMOTE/HOLD/ROLLBACK decisions across five measurable dimensions — replacing the gut-feel release…
CVE-2026-41089 is a critical stack-based buffer overflow in the Windows Netlogon Remote Protocol (MS-NRPC) that allows an unauthenticated remote attacker to execute arbitrary code on any domain controller reachable over…
CVE-2026-20182 is a CVSS 10.0 critical authentication bypass vulnerability in the Cisco Catalyst SD-WAN Controller (vSmart) and Manager (vManage) that allows a remote, unauthenticated attacker to gain full…
Microsoft researchers found that even frontier AI models — including Claude Opus, GPT-5, and Gemini — lose roughly 25% of document content across 20 delegated interactions. For test automation pipelines that rely on AI…
CVE-2026-42897 is an unpatched cross-site scripting (XSS) and spoofing zero-day vulnerability in on-premises Microsoft Exchange Server, disclosed on May 14, 2026, with confirmed active exploitation in the wild.…