| Management number | 231874904 | Release Date | 2026/06/18 | List Price | US$10.34 | Model Number | 231874904 | ||
|---|---|---|---|---|---|---|---|---|---|
| Category | |||||||||
Your agent works in the demo. It breaks in production.You built it. You tested it. You shipped it. Then the on-call alert fired at 2am: a loop had made 47 identical tool calls and burned $800 in API charges. Another session had been silently truncating the user's original instructions since turn 6. A third had answered confidently with fabricated data that never existed in any tool result.You're not doing anything wrong. You built what the tutorials showed you — a system that was never designed to survive real users, real load, and real failure.This book fixes that.Based on real benchmarks from production agent systems, LLM Agent Engineering documents the exact patterns that eliminate the seven failure modes that destroy agent deployments — loops, context overflow, tool hallucination, memory drift, silent wrong answers, cost explosion, and observability blindness.Every number in this book was measured, not estimated.You'll build a complete production agent stack from scratch:Failure mode diagnostics — classify what broke and why in under 10 minutes, from a structured trace fileAgent loop architecture — ReAct, Plan-and-Execute, Supervisor: benchmarked on real tasks so you choose before you build, not afterCost visibility — WARN at 50%, CRITICAL at 80%, HALT at 95%: budget guards with 3–13µs overhead that never missContext management — three truncation strategies with measured quality impact; summary-based compression at 6.3x ratioLong-term memory — hybrid retrieval +2.44% precision over dense-only, semantic supersession, TTL and staleness lifecycleOutput quality — three-layer validation: schema, semantic completeness, evidence grounding; confidence calibration that corrects systematic overconfidenceTool reliability — typed schemas cut hallucination from 8.4% to 0.9%; per-tool retry budgets, result validation at 0.045ms overheadMulti-agent orchestration — parallel execution at 2.99x speedup, wave-based dispatch, conflict resolution at $0 additional costError recovery — checkpoint and resume with zero re-execution, fabrication detection, chaos suite with resilience scoringObservability — turn-level tracing, span hierarchy, structured logging at 0.0011ms overhead, Streamlit debug dashboardCost optimization — model routing cuts cost 61% with <2% quality delta; semantic caching at 30% hit rate saves $109/year per agent instanceProduction operations — session isolation verified at 500 concurrent users, queue back-pressure, canary rollout, five on-call runbooksEvery chapter ships with production-ready Python code and a complete implementation in the companion code repository. Not pseudocode. Not simplified examples. Code you can run today.This is not a book about what agents are. It's a book for engineers who already know — and need systems that don't loop, don't fabricate, and don't generate invoices nobody signed off on.If your agent is in production and the failure modes are winning, this is the book that changes that.The classifier in Chapter 1 takes 15 minutes to instrument. Most engineers find their first loop vulnerability the same day. Read more
| ASIN | B0H4WC18JK |
|---|---|
| XRay | Not Enabled |
| Language | English |
| File size | 2.4 MB |
| Page Flip | Enabled |
| Word Wise | Not Enabled |
| Print length | 425 pages |
| Accessibility | Learn more |
| Screen Reader | Supported |
| Publication date | June 11, 2026 |
| Enhanced typesetting | Enabled |
If you notice any omissions or errors in the product information on this page, please use the correction request form below.
Correction Request Form