From the Inventory Allocation Simulator system
A planner approved a transfer after the warehouse data changed. The only honest answer was to freeze the run inputs before solving.
May 28, 2026
5 min read
Essays on architectural decisions, engineering trade-offs, and the thinking behind the systems.
Showing 29 entries
Technical essays compared by failure mode, constraints, code evidence, and system context.
From the Returns & Claims Orchestration Engine system
A Redis ack landed before PostgreSQL committed. One bad event could erase a good one from recovery while leaving no claim behind.
May 22, 2026
6 min read
From the Compliance Document Substrate system
The docs CLI proved that a document type can be reachable while still bypassing the service that owns its legal behavior.
May 11, 2026
3 min read
From the Compliance Document Substrate system
Entity snapshots turned legal identity from template text into durable document state, including the dormant Nigerian Ltd constraint.
May 11, 2026
3 min read
From the Compliance Document Substrate system
The invoice send path refuses to invent a valid-looking document when XML, PDF/A, or signing checks fail.
May 11, 2026
3 min read
From the Compliance Document Substrate system
Each finalized document enters a per-entity chain so later verification can prove order, content, and tamper evidence.
May 11, 2026
2 min read
From the Compliance Document Substrate system
Document numbers are reserved before finalization, reclaimed after failure, and isolated by entity, type, and year.
May 11, 2026
2 min read
From the Delivery Tracking Gateway system
Redis can disappear after Postgres accepts the event. The gateway had to make live delivery disposable without losing shipment truth.
May 6, 2026
9 min read
From the Dispute Resolution Workbench system
A 0.70 correlation score can still attach the wrong exception to the wrong case. I made confidence create a decision, not mutate the dispute.
May 6, 2026
8 min read
From the SLA Penalty Settlement Engine system
A penalty row without its mirror is worse than a missing accrual. The ledger had to reject half-truths before SQL ever saw them.
May 5, 2026
8 min read
From the Event-Driven Notification Hub system
The Hub and the tenant hashed the same JSON payload to different signatures. JSON.stringify follows each object's property order rather than a canonical one, so callback verification silently failed.
Apr 26, 2026
8 min read
From the Event-Driven Notification Hub system
Five suppression rows seeded in one transaction shared a microsecond-level timestamp. Cursor pagination returned three of them. Date.toISOString keeps only milliseconds, so the cursor lost the rest.
Apr 26, 2026
7 min read
From the Event-Driven Notification Hub system
The per-tenant rate limit silently capped every tenant at the default 200/minute. The rate check ran before auth populated the tenant on the request.
Apr 26, 2026
8 min read
From the Vendor Performance Intelligence Engine system
A Sidekiq retry runs three days after the alert fired. The tenant has been renamed twice. The Hub email had better not show the new name.
Apr 26, 2026
9 min read
From the Contract Lifecycle Engine system
False-positive AI obligations create phantom compliance records, which are worse than missed real ones. Every extraction lands in Pending until a human confirms it.
Apr 18, 2026
8 min read
From the Invoice Reconciliation Engine system
Vendors copy-paste PO numbers differently, omit sales orders, or pad them with formatting noise. A single exact match catches 60%. A five-strategy cascade catches 98%.
Apr 16, 2026
10 min read
From the Sensor Telemetry Engine system
A downstream HTTP timeout inside synchronous anomaly evaluation stalled the NATS consumer. 3,000 messages piled up before fire-and-forget tasks fixed it.
Apr 15, 2026
8 min read
From the Subscription Lifecycle Engine system
Stripe sent past_due on a paused subscription. The local state machine rejected the whole payload and lost everything downstream of the status field.
Apr 15, 2026
8 min read
From the Clinical Scheduling Engine system
Pre-generated slot tables drift the moment a booking or availability change misses a trigger. Computing from scratch per request sidesteps the sync problem.
Apr 13, 2026
7 min read
From the Financial Compliance Ledger system
Concurrent escalation rules produced duplicate closure events when ordered by timestamp alone. Row locks plus BIGSERIAL sequence ordering killed the race.
Apr 13, 2026
7 min read
From the Webhook Ingestion Engine system
Creation-time URL validation missed DNS rebinding. The destination hostname resolved to 169.254.169.254 weeks later and hit the cloud metadata endpoint.
Apr 13, 2026
8 min read
From the Workflow Automation Engine system
DAGs have no native conditional branching. Encoding branches in graph structure produced path explosion. Mutable skip sets at runtime kept the topological sort flat.
Apr 13, 2026
8 min read
From the Event-Driven Notification Hub system
Redpanda burned 150-200MB of a 1GB VPS before the application even started. The Kafka consumer sits behind a flag until the RAM budget grows.
Apr 9, 2026
7 min read
From the Multi-Agent RAG Platform system
The knowledge graph added two hundredths of a point to retrieval relevance for 512MB of extra memory. In production, it runs behind a flag.
Mar 30, 2026
7 min read
From the Transaction Reconciliation Engine system
Bank settlement dates differ from charge dates on 23% of transactions, and by two days on 4%. Exact-match reconciliation flags all of them as discrepancies.
Mar 21, 2026
5 min read
From the Centralized Property Intelligence Hub system
A 58-table logical replication subscription crashed the cluster because 3 subscriptions and N temp sync workers fought over 6 worker slots.
Mar 3, 2026
6 min read
From the Idealo Price Optimization Platform system
A single undercut strategy made rank-1 products cheaper every run, burning margin that the price gap already gave away for free.
Mar 3, 2026
7 min read
From the NBA Scenario Engine system
NBA game logs hide every DNP, so the regressor never learns zero-minute players and predicts 11 minutes for anyone ruled out. A two-stage model fixes it.
Mar 3, 2026
7 min read
From the Solar Pricing Engine system
Fuzzy matching on the last 15% of solar SKUs paired STP8.0 with STP10.0. Running the pipeline twice beat every fuzzy threshold I tuned.
Mar 3, 2026
7 min read