Daily notes on AI, testing, and building software.
CVE-2026-27681 is a CVSS 9.9-rated SQL injection vulnerability in SAP Business Planning and Consolidation (BPC) and SAP Business Warehouse (BW) that allows any low-privileged authenticated user to upload a file…
Three actively exploited vulnerabilities in Cisco Catalyst SD-WAN Manager — CVE-2026-20133 (information disclosure), CVE-2026-20122 (arbitrary file overwrite), and CVE-2026-20128 (credential exposure via recoverable…
CVE-2026-39987 is a critical pre-authentication remote code execution (RCE) vulnerability in Marimo, an increasingly popular open-source Python notebook used in AI/data science workflows. An unauthenticated attacker can…
As LLM features ship without proper test suites, QA teams are being asked to validate AI-powered software with tools and methods designed for deterministic code — a growing mismatch that's creating silent quality debt…
New research accepted at ICST 2026 reveals a critical blind spot in LLM-based test generation: when code changes semantically, AI-generated tests pass rates collapse to just 66%, and AI fault localization accuracy…
The "oracle problem" — deciding whether software output is correct — has always been one of the hardest unsolved challenges in test automation. LLMs are now sophisticated enough to generate test assertions…
Most engineering teams shipping LLM-powered features in 2026 are testing them less rigorously than they test a login form — and as AI becomes load-bearing infrastructure, that gap is a serious production risk that QA…
GPT-5.3-Codex-Spark's near-instant output speed—over 1,000 tokens per second—changes the fundamental interaction model between developers and AI assistants. For test writing specifically, this eliminates the latency…
OpenAI's GPT-5.3-Codex-Spark delivers over 1,000 tokens/second — fast enough to generate a full test file as you type — which fundamentally changes the economics of AI-assisted test writing and opens a new paradigm of…
GPT-5.2-Codex is purpose-built for the hardest software engineering tasks — long-horizon refactors, large-scale migrations, and complex multi-file changes — which maps almost perfectly onto the work QA teams dread most:…