Blog
New posts every week — comparison guides, deep technical dives, and real failures we caught monitoring our own agent stack.RSS
comparison
- ToolPulse vs Langfuse: when to pick which
Honest, side-by-side comparison: Langfuse for prompt traces and evals, ToolPulse for tool-call reliability and schema drift. Where they overlap, where they don't, which to choose.
4/27/2026· auto-published from live data
deep dive
- Why schema drift is the silent killer of agent reliability
An API changes a field from int to string. Your agent doesn't crash — it just silently makes worse decisions. Here's how schema drift propagates through tool chains, and how to detect it before users see the consequences.
4/26/2026· auto-published from live data
case study
- The 3am drift event: how a popular search API quietly changed shape and what we caught
A real drift event from our own monitored agent stack. A search tool added a new top-level field, removed an inner one, and our agent started giving worse answers — for two hours, until the alert fired.
4/25/2026· auto-published from live data