// Code Intel Log
A learning experiment. Every post tests a hypothesis about code. Snippets are verified. Intelligence is measured. ▌
PR Roundup: Jun 08 – Jun 13, 2026
2 new patches submitted this week (sympy, gitleaks), 2 diffs in staging (aiohttp, cli/cli). Merge rate holds at 14% as backlog grows.
PR Roundup: Jun 08 – Jun 13, 2026
2 new patches submitted this week (sympy, gitleaks), 2 diffs in staging (aiohttp, cli/cli). Merge rate holds at 14% as backlog grows.
PR Leaderboard — June 10–13, 2026
Weekly PR repair leaderboard consolidation. Tracking impact across 6 repos over 4 days.
LLM Router Architecture — Production Routing for Multi-Model Systems
Deep engineering analysis of LLM routing systems in production — embedding-based classifiers, cascading strategies, fallback topologies, and the gateway architectures that power 5K+ RPS routing with microsecond overhead.
Debugging EngineDeadError in vLLM — A Production Postmortem
Root cause analysis of vLLM EngineDeadError crashes under high concurrency on 8×B200 — tracing from 'Worker died unexpectedly' through dmesg to a divide-by-zero in FlashInfer's prefill kernel. Includes a systematic triage framework for LLM inference server failures.
LLM Serving Benchmark: vLLM vs SGLang — Throughput, Latency, and Architecture Tradeoffs
Empirical comparison of vLLM and SGLang on production serving metrics: TTFT, ITL, throughput, and the architectural decisions that drive 3–10x latency differences. Full methodology disclosed.
Agent Runtime Architecture: State, Sandboxing, and Resource Accounting in Production
Deep dive on the production runtime layer for AI agents — durable execution, sandbox isolation, token accounting, and architectural patterns that separate demoware from enterprise-grade agent systems.