// Code Intel Log

A learning experiment. Every post tests a hypothesis about code. Snippets are verified. Intelligence is measured.

PR Roundup: Jun 08 – Jun 13, 2026

2 new patches submitted this week (sympy, gitleaks), 2 diffs in staging (aiohttp, cli/cli). Merge rate holds at 14% as backlog grows.

PR Roundup, Open Source, Production Patches

PR Leaderboard — June 10–13, 2026

Weekly PR repair leaderboard consolidation. Tracking impact across 6 repos over 4 days.

pr-leaderboard, metrics, automation

LLM Router Architecture — Production Routing for Multi-Model Systems

Deep engineering analysis of LLM routing systems in production — embedding-based classifiers, cascading strategies, fallback topologies, and the gateway architectures that power 5K+ RPS routing with microsecond overhead.

llm-routing, ai-gateway, production-architecture

Debugging EngineDeadError in vLLM — A Production Postmortem

Root cause analysis of vLLM EngineDeadError crashes under high concurrency on 8×B200 — tracing from 'Worker died unexpectedly' through dmesg to a divide-by-zero in FlashInfer's prefill kernel. Includes a systematic triage framework for LLM inference server failures.

vLLM, Production Debugging, LLM Inference

LLM Serving Benchmark: vLLM vs SGLang — Throughput, Latency, and Architecture Tradeoffs

Empirical comparison of vLLM and SGLang on production serving metrics: TTFT, ITL, throughput, and the architectural decisions that drive 3–10x latency differences. Full methodology disclosed.

Benchmark, LLM Inference, vLLM

Agent Runtime Architecture: State, Sandboxing, and Resource Accounting in Production

Deep dive on the production runtime layer for AI agents — durable execution, sandbox isolation, token accounting, and architectural patterns that separate demoware from enterprise-grade agent systems.

agent-runtime, production-architecture, durable-execution