<?xml version="1.0" encoding="utf-8" standalone="yes"?><rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>AIOps - Tag - Shengxu · Cloud Architecture &amp; DevOps</title><link>https://shengxu.pages.dev/en/tags/aiops/</link><description>Cloud architecture &amp; DevOps notes by Shengxu: Kubernetes, Cilium, observability, LLM infra, AI agents.</description><generator>Hugo 0.153.2 &amp; FixIt v0.4.0-alpha.3-20251225101113-8ffb9a95</generator><language>en</language><lastBuildDate>Mon, 19 Jan 2026 15:00:00 +0800</lastBuildDate><atom:link href="https://shengxu.pages.dev/en/tags/aiops/index.xml" rel="self" type="application/rss+xml"/><item><title>From Traffic Gatekeeping to Quality Insight: A 2026 Guide to Building Enterprise-Grade LLM Observability Systems</title><link>https://shengxu.pages.dev/en/posts/llm-observability-guide-2026/</link><pubDate>Mon, 19 Jan 2026 15:00:00 +0800</pubDate><guid>https://shengxu.pages.dev/en/posts/llm-observability-guide-2026/</guid><category domain="https://shengxu.pages.dev/en/categories/ai/">AI</category><category domain="https://shengxu.pages.dev/en/categories/observability/">Observability</category><description>&lt;p&gt;As large language models (LLMs) evolve from &amp;ldquo;novelty toys&amp;rdquo; into the &amp;ldquo;productivity backbone&amp;rdquo; of enterprises, one question keeps coming back to every technical leader: &lt;strong&gt;When API calls become a black box, how do we manage these massive, expensive AI models with the same rigor we apply to databases or microservices?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;If 2024 was the year everyone was busy &amp;ldquo;getting demos to work,&amp;rdquo; then 2026 marks the dawn of &amp;ldquo;fine-grained governance.&amp;rdquo; Simple &amp;ldquo;call succeeded/failed&amp;rdquo; logs can no longer answer today&amp;rsquo;s operational questions: &lt;em&gt;&amp;ldquo;Why was this agent so smart yesterday, but today it&amp;rsquo;s spouting nonsense?&amp;rdquo;&lt;/em&gt;, &lt;em&gt;&amp;ldquo;Why did our token costs suddenly double last month?&amp;rdquo;&lt;/em&gt;, &lt;em&gt;&amp;ldquo;Is someone trying to attack our customer service bot with prompt injection?&amp;rdquo;&lt;/em&gt;&lt;/p&gt;</description></item></channel></rss>