Skip to main content
Build Log

From the Workshop

Engineering notes from the field — architecture, automation, and agentic AI.

Four-stage flow diagram showing a living business entity resolving through a fingerprint of signals, into a graph of weighted connections, through change detection, and finally into business value
AIEntity ResolutionArchitecture

The Same Entity Problem: Why every payments, fraud, and compliance company is solving the same thing

Businesses are shape-shifters — they merge, rebrand, relocate, change banks. The universal problem underneath fraud, compliance, payments, and supply chain isn't string matching. It's tracking entities through time.

Apr 20, 202613 min read
Diagram of a cascade verification pipeline wrapping an AI model, with each stage validating the previous stage's output
AIProduction AIVerification

The Verification Layer Is the Product

I ran my production KYB pipeline at FP16, INT8, and INT4. Accuracy held at 92% across all three. A LoRA fine-tune dropped it to 12%. The model wasn't the variable.

Apr 15, 20266 min read
AIArchitectureProduction AI

The Three Enterprise Layers Are Collapsing Into One

For twenty years, enterprise software separated CRM, orchestration, and back-office into three layers with three vendor contracts. A confidence-gated decisioning layer is collapsing all three — and the vendors know it.

Apr 13, 20266 min read
Scatter plot showing accuracy vs cost for 7 model variants including base Llama 3 8B, Llama 3.2 3B challenger, and three LoRA fine-tuned models clustered at 12% accuracy
AIProduction AIBenchmark

Quantization, LoRA, and the 8% Problem: Benchmarking Local LLMs for Production AI

I benchmarked 7 model variants across 3 quantization levels, tested a 3B challenger, built a self-training pipeline, and attempted QLoRA fine-tuning. The fine-tuning failed catastrophically. Here's what I learned.

Apr 12, 202615 min read
Scatter plot showing accuracy vs cost per 1K queries for three quantization levels, with Q4_0 marked as Pareto-optimal
AIProduction AICost Optimization

I Ran My KYB Engine at Three Quantization Levels. Accuracy Didn't Move. Cost Dropped 6x.

I benchmarked FP16 vs INT8 vs INT4 quantization on a real KYB verification pipeline. Zero accuracy loss. 5.6x faster. 6x cheaper. Here's the data.

Apr 11, 20268 min read
Defensive AI Architecture flowchart showing model output passing through validate, constrain, and escalate gates, with paths to deliver, deliver with caveat, or reject
AIArchitecture

Making Small LLMs Production-Safe: Defensive AI and Glass Box Architecture

Small open-weight models like OLMo are fast and cheap. They're also confidently wrong. The fix isn't a better model — it's two architectural patterns: Defensive AI, where every output is untrusted until proven otherwise, and Glass Box Architecture, where everything around the model is transparent and auditable.

Apr 10, 202610 min read
Architecture diagram showing MCP clients connecting to the Blueprint KYB server, which runs a 4-layer discovery cascade and returns structured JSON
AIMCPArchitecture

Exposing an Agentic Pipeline as an MCP Server

I had a working KYB verification cascade — stealth browsers, LangGraph, the whole thing. It sat inside my application and nothing else could use it. Wrapping it as an MCP server took 160 lines and an evening.

Apr 8, 20268 min read
Diagram showing two implementations — custom async Python and LangGraph StateGraph — feeding into the same 4-layer discovery cascade with identical outputs
AIArchitectureLangGraph

From Custom Orchestration to LangGraph: Why the Framework Didn't Change My Architecture

I built the same AI verification cascade two ways — 1,497 lines of hand-rolled async Python and a 9-node LangGraph StateGraph. You can toggle between them with an environment variable. The outputs are identical. Here's what that tells you about frameworks vs. architecture.

Apr 5, 20269 min read
Bar chart comparing precision and recall across four fraud detection models
AIArchitectureData Science

Three Layers Deep: Building a Fraud Detection System That Knows What It Doesn't Know

A single fraud model gives you a number. It doesn't tell you why it's suspicious, or what to do when it's not sure. Sentinel is a fraud detection system built in three layers — rules that catch what's obvious, models that catch what's subtle, and a routing layer that sends the uncertain cases to a human instead of guessing.

Apr 2, 20269 min read
Spectrum showing general patterns on the left, your specific case on the right, with the triage layer in the gap between them
AIArchitectureData Science

When Models Disagree: General Patterns, Specific Problems

Every model is trained on general patterns. Your business runs on specific ones. When they match, it's magic. When they don't, it's 'Shop Now' instead of 'Careers.' The value isn't in the model — it's in knowing where your case falls.

Apr 1, 20267 min read
Predicted test effects by location and gender for win and conversion models
Machine LearningAd TechData Science

From Proving to Predicting: When Two Models Disagree

We ran logistic regression and neural networks on the same ad tech data. They told different stories. Which one do you believe? That's the same trust problem behind every AI agent and every automated decision.

Mar 31, 20268 min read
Conversion probability scatter plot by location, segmented by gender and test group
Data ScienceAd TechStatistics

When the Numbers Lie: Statistical Modeling in Ad Tech

A new ad design 'won' overall — until we broke it down by segment. Two groups actually converted better on the old design. Aggregate metrics hide where your model actually fails.

Mar 30, 20269 min read
Side-by-side architecture comparison: supply chain parts classification vs KYB verification engine
AIArchitectureAutomation

The 80% Problem: What Supply Chain Taught Me About AI Agents

AI agents process thousands of companies a day — but still need a human to verify the results. I built this same escalation pattern a decade ago in global supply chain. The tools changed. The architecture didn't.

Mar 29, 202610 min read
KYB verification engine dashboard screenshot
AIAutomation

Know Your Business: Building an AI Engine That Verifies 1,900 Companies a Day

How we built a fully automated KYB verification engine that processes thousands of companies daily — from data sources and tiered processing to stealth browsing and eight verification signals.

Mar 8, 20265 min read
Sovereign infrastructure stack powering Walsenburg Tech
InfrastructureSelf-Hosting

The Sovereign Stack: Why a Lead Architect Builds His Own Infrastructure in the Rockies

Why Walsenburg Tech self-hosts on Hetzner and Coolify — bare metal strategy, identity management, monitoring challenges, and the cost wins that make it all worthwhile.

Mar 5, 20264 min read

Want to talk engineering?

Have questions about anything you've read here? I'm always up for a good architecture conversation.