Throughput: 160–170k RPS locally; ~250k RPS sustained on 24‑core bare‑metal with a 50GB cache.
Memory safety: 1.5–3GB overhead at 50GB (no traces); ~7GB at 100% OTEL sampling.
Hot path discipline: zero allocations, sharded counters, per‑shard LRU, TinyLFU admission.
Control plane: runtime API for toggles (admission, eviction, refresh, compression, observability).
Observability: Prometheus/VictoriaMetrics metrics + OpenTelemetry tracing.
Kubernetes‑friendly: health probes, config via ConfigMap, Docker image.
Throughput: 160–170k RPS locally; ~250k RPS sustained on 24‑core bare‑metal with a 50GB cache. Memory safety: 1.5–3GB overhead at 50GB (no traces); ~7GB at 100% OTEL sampling. Hot path discipline: zero allocations, sharded counters, per‑shard LRU, TinyLFU admission. Control plane: runtime API for toggles (admission, eviction, refresh, compression, observability). Observability: Prometheus/VictoriaMetrics metrics + OpenTelemetry tracing. Kubernetes‑friendly: health probes, config via ConfigMap, Docker image.