A distributed, partitioned, replicated commit log — hand-built from the disk up. Segmented append-only log with sparse index and CRC, ISR replication, epoch-based leader election, idempotent producer, and group-commit batching. Zero external dependencies on the data path.
A benchmark-proven retrieval-augmented generation system over a 6M+ vector Wikipedia corpus. Hand-profiled with py-spy — the flamegraph exposed a batching bottleneck, yielding a 3× QPS gain. Agentic multi-hop reasoning (iterative retrieve-reason-retrieve with self-reflection) lifts HotpotQA exact-match by +15 points over single-shot RAG.
A real-time multi-agent architecture review system. Eight specialised LLM agents — planner, four parallel specialists (scalability, security, cost, maintainability), conflict detector, critic, synthesizer — run concurrently via LangGraph fan-out and stream verdicts over WebSocket. A human-in-the-loop revision loop re-runs the synthesizer without re-invoking the critic.
Systems depth
- Distributed storage engines — segmented logs, WAL, CRC checksums, crash recovery
- Replication protocols — ISR, high-watermark, epoch fencing, log truncation
- Leader election — term-based, controller architecture, failover <5s
- ANN indexing — HNSW, IVF-PQ, benchmarked at 6M vectors
- Binary TCP protocols — framing, batching, backpressure
- Multi-agent orchestration — LangGraph parallel fan-out
Production practice
- Go — goroutines, channels, race-clean, golangci-lint, pprof
- Python — FastAPI, Pydantic, mypy strict, pytest / hypothesis
- Observability — Prometheus, Grafana, OpenTelemetry, SLOs
- CI/CD — GitHub Actions, Docker multi-stage, Hetzner deploys
- Testing — property-based, deterministic fault injection
- ML serving — RAG, reranking, eval-gated pipelines, Claude API