AI Observability, Part 2: The Grounding Layer
RAG systems fail silently when retrieval breaks. Learn to monitor Azure AI Search, vector stores, and the retrieval pipeline that feeds your models.
New to the blog? This is your guide to understanding my approach to cloud architecture, technical leadership, and navigating anxiety in tech.
Start with Leadership, Drum Major Style - my framework for technical leadership that actually works.
Check out From Base Camp to Summit to understand how to build cloud environments that scale.
Read Debugging Myself - my personal journey with anxiety and what I've learned.
Start with Decide or Drown - the framework for strategic technology decisions.
Cloud architecture patterns, governance frameworks, and real-world implementation strategies.
Frameworks that enable speed, not slow it down. Security, compliance, and organizational patterns.
Modern service management, operational change, and building resilient systems.
Technical leadership, decision-making frameworks, and mental health in tech.
KQL queries, monitoring strategies, and making sense of your data.
RAG systems fail silently when retrieval breaks. Learn to monitor Azure AI Search, vector stores, and the retrieval pipeline that feeds your models.
Spec quality isn't a skill you can workshop. It's an emergent property of organizational health. Why training fails and what actually produces clarity.
Skip the Netflix-scale chaos. Learn to start chaos engineering in Azure with simple, safe experiments that actually improve your systems without breaking production.
Stop building useless dashboards. Learn to create Azure Workbooks that actually help your team make decisions and solve problems faster.
AI can accelerate your growth or replace it. The productivity discourse gives you metrics that feel good but tell you nothing about what's happening to your capability.
Stop monitoring AI infrastructure like web servers. Learn to instrument Azure OpenAI with queries that reveal token consumption, content filters, and cost attribution.