Engineering Blog

Building AI That Ships

Technical deep dives, architecture decisions, and lessons learned from building a production AI agent platform.

FeaturedArchitecture

The Create-From-Steps Breakthrough: Why AI Should Describe Intent, Not Generate Code

We tried having AI generate XML workflows. It failed — truncation, escaping issues, hallucinated success. So we flipped the architecture: AI describes WHAT in ~150 tokens of JSON, and the server handles HOW. One API call. 2 seconds. $0.001. Here's the full story of how we got there.

February 4, 20268 min read

All Posts

EngineeringJanuary 29, 202612 min read

Building a Multi-Agent Healthcare Platform: 6 Phases in 82 Hours

How we went from zero to 250+ production actions across 12 domains. The sprint that created a comprehensive AI agent platform for healthcare administration.

ArchitectureJanuary 25, 202610 min read

Why We Replaced AutoInvoke with a Manual Tool Call Loop

Semantic Kernel's AutoInvokeKernelFunctions is a black box. We built ManualToolCallLoop for visibility, guardrails, cost tracking, and per-call trace recording.

SecurityJanuary 20, 20268 min read

Designing a Fail-Closed Guardrail Pipeline for AI Agents

Permission checks, cost controls, and PII scanning on every tool call. How we built enterprise-grade safety into the agent execution engine.

Case StudyJanuary 15, 20266 min read

From 9 Minutes to 16 Seconds: The Architecture Behind 94% Time Reduction

The story of how AI agents transformed healthcare claims processing from a manual bottleneck to automated precision at scale.

LLMJanuary 10, 20267 min read

Why We Chose xAI Grok for Production AI Agents

4M tokens/min throughput, $0.20/M input pricing, and reliable tool calling. How grok-4-1-fast became our primary production LLM.

Want These Systems Built for You?

Everything in these posts is running in production. Let's talk about building the same for your organization.

Book a Discovery Call