Selected writeups

Client engagements and products I build. Each writeup covers a real system, not a hypothetical architecture. When numbers appear, they come from measured deployments. These get technical. Thehome page has the plain-English version.

2026-06-30 / 6 min / whatsapp / crm / automation / saas / small-business

Building a WhatsApp CRM for small-business operations

A productised CRM for turning WhatsApp conversations into owned sales and support work: one customer record, a shared inbox, pipeline stages, follow-up automation, and clean handoffs to the systems that complete the job.

2026-04-08 / 4 min / ai / llm / underwriting / fintech / production

An AI underwriting assistant adopted by a 120-person credit operation in 10 weeks

Not a model demo. A workflow tool the credit team actually opened every morning. Built in 10 weeks, took manual review off the top decile of cases, and saved roughly five minutes of handling time per accepted draft against the pre-launch six-minute baseline. Here is how it shipped without an LLM-replaces-humans pitch.

2025-09-12 / 2 min / llm / infra / cost / latency

Routing inference across LLM providers without breaking latency

An orchestration layer that picks the right provider per request. 28% lower provider/API spend against the prior single-provider baseline, normalised for request volume and token mix. p95 latency stayed sub-second. Caller code never changed.

2025-05-22 / 2 min / llm / evals / rag / production

Building an eval harness that actually catches regressions

Retrieval and prompt evaluation pipelines that drove an 18% relative lift in rubric pass rate over the prior eval harness, measured on production-derived canary sets. Plus why most eval setups silently lie to you.

2024-02-18 / 3 min / payments / fraud / ml / production

Fraud ML for a payments platform at 20M transactions a month

Fraudulent transactions fell 70% and manual review load fell 55% relative to the pre-model rules-and-review baseline, normalised for volume over the post-rollout measurement window. What worked, what the model could not solve on its own, and the three pieces we built before the model went live.

2023-09-04 / 3 min / payments / reliability / postgres / production

Hitting five-nines on a payment settlement service

What 99.999% actually means at 20M transactions a month. The Postgres patterns, the idempotency surface, and the operational tax that nobody talks about until they have already missed an SLA.