<?xml version="1.0" encoding="utf-8" standalone="yes"?><rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Security - Category - Shengxu · Cloud Architecture &amp; DevOps</title><link>https://shengxu.pages.dev/en/categories/security/</link><description>Cloud architecture &amp; DevOps notes by Shengxu: Kubernetes, Cilium, observability, LLM infra, AI agents.</description><generator>Hugo 0.153.2 &amp; FixIt v0.4.0-alpha.3-20251225101113-8ffb9a95</generator><language>en</language><lastBuildDate>Sat, 21 Mar 2026 14:31:56 +0800</lastBuildDate><atom:link href="https://shengxu.pages.dev/en/categories/security/index.xml" rel="self" type="application/rss+xml"/><item><title>Cilium 2026 (Continued): How the Unified Data Plane Is Reshaping Kubernetes Platform Architecture</title><link>https://shengxu.pages.dev/en/posts/cilium-2026-part-2-unified-dataplane/</link><pubDate>Sat, 21 Mar 2026 14:31:56 +0800</pubDate><guid>https://shengxu.pages.dev/en/posts/cilium-2026-part-2-unified-dataplane/</guid><category domain="https://shengxu.pages.dev/en/categories/kubernetes/">Kubernetes</category><category domain="https://shengxu.pages.dev/en/categories/devops/">DevOps</category><category domain="https://shengxu.pages.dev/en/categories/observability/">Observability</category><category domain="https://shengxu.pages.dev/en/categories/security/">Security</category><category domain="https://shengxu.pages.dev/en/categories/ai/">AI</category><description>&lt;p&gt;In &lt;a href="https://shengxu.pages.dev/posts/cilium-2026/"&gt;the previous article on Cilium&lt;/a&gt;, we explored the real reasons behind the 2026 migration wave: it&amp;rsquo;s no longer just &amp;ldquo;a faster CNI,&amp;rdquo; but rather a reorganization of Kubernetes networking, security, observability, and multi-cluster capabilities into a more unified infrastructure foundation, while clarifying its division of labor and boundaries with Istio.&lt;/p&gt;
&lt;p&gt;If the previous article answered &amp;ldquo;What exactly can Cilium bring us?&amp;rdquo;, this one goes further, focusing on its core evolution: the &lt;strong&gt;Unified Dataplane&lt;/strong&gt;.&lt;/p&gt;</description></item><item><title>Before Discussing LLM Security, Is Your Kubernetes Foundation Up to Standard?</title><link>https://shengxu.pages.dev/en/posts/kubernetes-security-before-llm/</link><pubDate>Sat, 14 Mar 2026 10:00:00 +0800</pubDate><guid>https://shengxu.pages.dev/en/posts/kubernetes-security-before-llm/</guid><category domain="https://shengxu.pages.dev/en/categories/security/">Security</category><category domain="https://shengxu.pages.dev/en/categories/kubernetes/">Kubernetes</category><category domain="https://shengxu.pages.dev/en/categories/devops/">DevOps</category><description>&lt;p&gt;The explosion of Large Language Models (LLMs) and AI Agents has not only revolutionized business models but also introduced new application-layer security challenges such as prompt injection and data poisoning. While everyone&amp;rsquo;s attention is drawn to these cutting-edge vulnerabilities, let&amp;rsquo;s first pause and ask ourselves a fundamental question: &lt;strong&gt;Before diving into these complex AI security issues, is the cloud-native foundation that supports all our business workloads even up to par?&lt;/strong&gt;&lt;/p&gt;</description></item><item><title>Practical · Building a Memory-Enabled AI Writing Partner (Part 3): Security Architecture (RAG Protection, Fact Guard, and BYOK)</title><link>https://shengxu.pages.dev/en/posts/fantasy-novel-agent-security/</link><pubDate>Wed, 04 Feb 2026 10:00:00 +0800</pubDate><guid>https://shengxu.pages.dev/en/posts/fantasy-novel-agent-security/</guid><category domain="https://shengxu.pages.dev/en/categories/ai/">AI</category><category domain="https://shengxu.pages.dev/en/categories/security/">Security</category><category domain="https://shengxu.pages.dev/en/categories/devops/">DevOps</category><category domain="https://shengxu.pages.dev/en/categories/observability/">Observability</category><description>&lt;p&gt;In the previous 2.5 articles, I&amp;rsquo;ve already laid out the backbone of &lt;a href="https://shengxu.pages.dev/posts/fantasy-novel-agent-architecture-evolution/"&gt;FantasyNovelAgent&lt;/a&gt;:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;&lt;strong&gt;&lt;a href="https://shengxu.pages.dev/posts/fantasy-novel-agent-architecture-evolution/"&gt;Building a Memory-Enabled AI Writing Partner (Part 1): Multi-Agent Architecture Evolution&lt;/a&gt;&lt;/strong&gt;&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;&lt;a href="https://shengxu.pages.dev/posts/fantasy-novel-agent-database-evolution/"&gt;Building a Memory-Enabled AI Writing Partner (Part 2): Database Evolution&lt;/a&gt;&lt;/strong&gt;&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;&lt;a href="https://shengxu.pages.dev/posts/fantasy-novel-agent-retrieval-evolution/"&gt;Building a Memory-Enabled AI Writing Partner (Part 3): Retrieval System Evolution&lt;/a&gt;&lt;/strong&gt;&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;This article dives deep into the most overlooked yet critical aspect of AI systems: &lt;strong&gt;Security&lt;/strong&gt;.&lt;/p&gt;
&lt;p&gt;If you&amp;rsquo;re thinking, &amp;ldquo;I&amp;rsquo;m just writing a novel, what security issues could there be?&amp;rdquo;, consider this:&lt;/p&gt;</description></item><item><title>OWASP LLM Top 10 Security in Practice</title><link>https://shengxu.pages.dev/en/posts/owasp-llm-top-10-2026/</link><pubDate>Fri, 23 Jan 2026 10:00:00 +0800</pubDate><guid>https://shengxu.pages.dev/en/posts/owasp-llm-top-10-2026/</guid><category domain="https://shengxu.pages.dev/en/categories/security/">Security</category><category domain="https://shengxu.pages.dev/en/categories/ai/">AI</category><category domain="https://shengxu.pages.dev/en/categories/kubernetes/">Kubernetes</category><description>&lt;p&gt;Yesterday I had the privilege of attending a talk by Sergey Saburov from Acronis on &amp;ldquo;Agentic Engineering &amp;amp; LLM Security.&amp;rdquo; Sergey provided an in-depth analysis of security threats facing modern LLM applications, along with numerous real-world case studies aligned with the OWASP LLM Top 10 framework.&lt;/p&gt;
&lt;p&gt;I&amp;rsquo;ve organized and summarized the content based on the latest &lt;strong&gt;OWASP LLM Top 10 v2.0 (2025)&lt;/strong&gt; official standard. I&amp;rsquo;ve corrected some terminology discrepancies from the original talk (e.g., LLM06, LLM10) and compiled Python PoC (Proof of Concept) and defense scripts tailored for Kubernetes platform engineers, hoping this serves as a reference for building secure AI systems.&lt;/p&gt;</description></item><item><title>When AI Gets Your Database Password: A Practical Guide to MCP Exposure Risks</title><link>https://shengxu.pages.dev/en/posts/mcp-security-risks-guide/</link><pubDate>Tue, 20 Jan 2026 00:00:00 +0000</pubDate><guid>https://shengxu.pages.dev/en/posts/mcp-security-risks-guide/</guid><category domain="https://shengxu.pages.dev/en/categories/security/">Security</category><category domain="https://shengxu.pages.dev/en/categories/ai/">AI</category><description>&lt;p&gt;Last year, a typical scenario sparked heated debate in the security community: a developer installed Supabase&amp;rsquo;s MCP plugin in Cursor and configured a &lt;code&gt;service_role&lt;/code&gt; key (database super admin privileges) so the AI could query the database directly. One day, a customer casually asked in a ticket, &amp;ldquo;Can you show me our integration configuration?&amp;rdquo; The AI interpreted this as an instruction and printed the token directly in the reply.&lt;/p&gt;
&lt;p&gt;While this case often appears in security reports as a &amp;ldquo;risk demonstration,&amp;rdquo; the problem it reveals is real: &lt;strong&gt;The MCP protocol grants AI operational permissions, and prompt injection attacks allow hackers to &amp;ldquo;hijack&amp;rdquo; these permissions through natural language.&lt;/strong&gt;&lt;/p&gt;</description></item><item><title>Kubernetes 1.34/1.35 Certificate Revolution: From Manual Hell to Zero-Trust Heaven</title><link>https://shengxu.pages.dev/en/posts/kubernetes-1-34-1-35-certificates/</link><pubDate>Sat, 03 Jan 2026 19:00:00 +0800</pubDate><guid>https://shengxu.pages.dev/en/posts/kubernetes-1-34-1-35-certificates/</guid><category domain="https://shengxu.pages.dev/en/categories/kubernetes/">Kubernetes</category><category domain="https://shengxu.pages.dev/en/categories/cloud/">Cloud</category><category domain="https://shengxu.pages.dev/en/categories/security/">Security</category><description>&lt;p&gt;Recently upgraded to 1.35 and discovered that &lt;strong&gt;certificate management&lt;/strong&gt; changes are nothing short of revolutionary—especially for self-managed K8s users, where operational overhead has been cut in half.&lt;/p&gt;
&lt;p&gt;In the past, certificate issues were the &amp;ldquo;silent killer&amp;rdquo; of security incidents: expired certificates causing outages, token leaks, and manual rotation consuming 30% of ops time. Versions 1.34/1.35 introduce &lt;strong&gt;native automated mTLS&lt;/strong&gt;, making zero trust no longer exclusive to Istio. Today, let&amp;rsquo;s dive into these new features and compare them in a &lt;strong&gt;self-managed K8s vs. cloud K8s&lt;/strong&gt; hands-on scenario.&lt;/p&gt;</description></item><item><title>Kubernetes v1.33–v1.35 Deep Dive: From Native Sidecar to AI Compute Foundation</title><link>https://shengxu.pages.dev/en/posts/kubernetes-v1-33-v1-35-updates/</link><pubDate>Fri, 02 Jan 2026 09:50:00 +0800</pubDate><guid>https://shengxu.pages.dev/en/posts/kubernetes-v1-33-v1-35-updates/</guid><category domain="https://shengxu.pages.dev/en/categories/kubernetes/">Kubernetes</category><category domain="https://shengxu.pages.dev/en/categories/cloud/">Cloud</category><category domain="https://shengxu.pages.dev/en/categories/security/">Security</category><description>&lt;h2 class="heading-element" id="timeline-overview"&gt;&lt;span&gt;Timeline Overview&lt;/span&gt;
 &lt;a href="#timeline-overview" class="heading-mark"&gt;
 &lt;svg class="octicon octicon-link" viewBox="0 0 16 16" version="1.1" width="16" height="16" aria-hidden="true"&gt;&lt;path d="m7.775 3.275 1.25-1.25a3.5 3.5 0 1 1 4.95 4.95l-2.5 2.5a3.5 3.5 0 0 1-4.95 0 .751.751 0 0 1 .018-1.042.751.751 0 0 1 1.042-.018 1.998 1.998 0 0 0 2.83 0l2.5-2.5a2.002 2.002 0 0 0-2.83-2.83l-1.25 1.25a.751.751 0 0 1-1.042-.018.751.751 0 0 1-.018-1.042Zm-4.69 9.64a1.998 1.998 0 0 0 2.83 0l1.25-1.25a.751.751 0 0 1 1.042.018.751.751 0 0 1 .018 1.042l-1.25 1.25a3.5 3.5 0 1 1-4.95-4.95l2.5-2.5a3.5 3.5 0 0 1 4.95 0 .751.751 0 0 1-.018 1.042.751.751 0 0 1-1.042.018 1.998 1.998 0 0 0-2.83 0l-2.5 2.5a1.998 1.998 0 0 0 0 2.83Z"&gt;&lt;/path&gt;&lt;/svg&gt;
 &lt;/a&gt;
&lt;/h2&gt;&lt;ul&gt;
&lt;li&gt;&lt;strong&gt;v1.33 (Octarine)&lt;/strong&gt;: Released April 2025, Native Sidecar GA, security features enabled by default.&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;v1.34 (Of Wind &amp;amp; Will)&lt;/strong&gt;: Released August 2025, DRA GA, marking the native era of AI/GPU scheduling.&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;v1.35 (Timbernetes)&lt;/strong&gt;: Released December 2025, In-Place Pod Resize GA, zero-disruption elasticity becomes reality.&lt;/li&gt;
&lt;/ul&gt;
&lt;hr&gt;
&lt;h2 class="heading-element" id="1-v133-octarine-sidecar-graduation-and-default-security"&gt;&lt;span&gt;1. v1.33 &amp;ldquo;Octarine&amp;rdquo;: Sidecar Graduation and Default Security&lt;/span&gt;
 &lt;a href="#1-v133-octarine-sidecar-graduation-and-default-security" class="heading-mark"&gt;
 &lt;svg class="octicon octicon-link" viewBox="0 0 16 16" version="1.1" width="16" height="16" aria-hidden="true"&gt;&lt;path d="m7.775 3.275 1.25-1.25a3.5 3.5 0 1 1 4.95 4.95l-2.5 2.5a3.5 3.5 0 0 1-4.95 0 .751.751 0 0 1 .018-1.042.751.751 0 0 1 1.042-.018 1.998 1.998 0 0 0 2.83 0l2.5-2.5a2.002 2.002 0 0 0-2.83-2.83l-1.25 1.25a.751.751 0 0 1-1.042-.018.751.751 0 0 1-.018-1.042Zm-4.69 9.64a1.998 1.998 0 0 0 2.83 0l1.25-1.25a.751.751 0 0 1 1.042.018.751.751 0 0 1 .018 1.042l-1.25 1.25a3.5 3.5 0 1 1-4.95-4.95l2.5-2.5a3.5 3.5 0 0 1 4.95 0 .751.751 0 0 1-.018 1.042.751.751 0 0 1-1.042.018 1.998 1.998 0 0 0-2.83 0l-2.5 2.5a1.998 1.998 0 0 0 0 2.83Z"&gt;&lt;/path&gt;&lt;/svg&gt;
 &lt;/a&gt;
&lt;/h2&gt;&lt;p&gt;The keywords for v1.33 are &amp;ldquo;&lt;strong&gt;Native Sidecar&lt;/strong&gt;&amp;rdquo; and &amp;ldquo;&lt;strong&gt;Security Enabled by Default&lt;/strong&gt;.&amp;rdquo; This release transforms long-standing experimental capabilities into dependable infrastructure for daily engineering.&lt;/p&gt;</description></item><item><title>IngressNightmare (CVE-2025-1974): Vulnerability Deep Dive and Gateway API Migration Guide</title><link>https://shengxu.pages.dev/en/posts/ingress-nightmare-gateway-api-migration/</link><pubDate>Sat, 27 Dec 2025 10:00:00 +0800</pubDate><guid>https://shengxu.pages.dev/en/posts/ingress-nightmare-gateway-api-migration/</guid><category domain="https://shengxu.pages.dev/en/categories/kubernetes/">Kubernetes</category><category domain="https://shengxu.pages.dev/en/categories/security/">Security</category><description>&lt;p&gt;The recently disclosed &lt;strong&gt;&amp;ldquo;IngressNightmare&amp;rdquo;&lt;/strong&gt; vulnerability in Ingress-NGINX has once again thrust nginx-ingress into the spotlight, serving as a stark warning for clusters still relying on traditional Ingress.&lt;/p&gt;
&lt;p&gt;Below is a technical review focused on engineering practice, covering the vulnerability recap, risk analysis, short-term fixes, how to leverage this as an opportunity to migrate to Gateway API, and a comparison of pros and cons before and after migration.&lt;/p&gt;
&lt;hr&gt;
&lt;h2 class="heading-element" id="vulnerability-brief-ingressnightmare-cve20251974"&gt;&lt;span&gt;Vulnerability Brief: IngressNightmare (CVE‑2025‑1974)&lt;/span&gt;
 &lt;a href="#vulnerability-brief-ingressnightmare-cve20251974" class="heading-mark"&gt;
 &lt;svg class="octicon octicon-link" viewBox="0 0 16 16" version="1.1" width="16" height="16" aria-hidden="true"&gt;&lt;path d="m7.775 3.275 1.25-1.25a3.5 3.5 0 1 1 4.95 4.95l-2.5 2.5a3.5 3.5 0 0 1-4.95 0 .751.751 0 0 1 .018-1.042.751.751 0 0 1 1.042-.018 1.998 1.998 0 0 0 2.83 0l2.5-2.5a2.002 2.002 0 0 0-2.83-2.83l-1.25 1.25a.751.751 0 0 1-1.042-.018.751.751 0 0 1-.018-1.042Zm-4.69 9.64a1.998 1.998 0 0 0 2.83 0l1.25-1.25a.751.751 0 0 1 1.042.018.751.751 0 0 1 .018 1.042l-1.25 1.25a3.5 3.5 0 1 1-4.95-4.95l2.5-2.5a3.5 3.5 0 0 1 4.95 0 .751.751 0 0 1-.018 1.042.751.751 0 0 1-1.042.018 1.998 1.998 0 0 0-2.83 0l-2.5 2.5a1.998 1.998 0 0 0 0 2.83Z"&gt;&lt;/path&gt;&lt;/svg&gt;
 &lt;/a&gt;
&lt;/h2&gt;&lt;ul&gt;
&lt;li&gt;&lt;strong&gt;Severity&lt;/strong&gt;: In March 2025, researchers disclosed a set of high-severity vulnerabilities in the Ingress-NGINX controller, collectively known as &amp;ldquo;IngressNightmare.&amp;rdquo; Among them, &lt;strong&gt;CVE‑2025‑1974&lt;/strong&gt; has a CVSS score of &lt;strong&gt;9.8&lt;/strong&gt;, rated as &amp;ldquo;Critical&amp;rdquo; by the official team and multiple security vendors, affecting a vast number of Kubernetes clusters.&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Root Cause&lt;/strong&gt;: The core issue lies in the &lt;strong&gt;Validating Admission Webhook&lt;/strong&gt;. When validating an Ingress object, the controller generates an NGINX configuration based on the object and its annotations, then uses &lt;code&gt;nginx -t&lt;/code&gt; for validation. During this process, insufficient filtering of annotations and configuration fragments allows attackers to inject arbitrary NGINX directives, ultimately leading to Remote Code Execution (RCE) on the controller Pod.&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Low Attack Barrier&lt;/strong&gt;: An attacker only needs access to the admission webhook within the Pod network (many clusters even expose it to the public internet) to trigger the vulnerability via unauthenticated requests. This is an &lt;strong&gt;unauthenticated RCE&lt;/strong&gt;, highly susceptible to mass exploitation by worms or automated attack tools.&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Vulnerability Chain&lt;/strong&gt;: The same disclosure includes several other high-severity injection vulnerabilities (e.g., CVE‑2025‑24514, CVE‑2025‑1097, CVE‑2025‑1098), collectively forming the IngressNightmare vulnerability chain, with an attack surface far exceeding a single CVE.&lt;/li&gt;
&lt;/ul&gt;
&lt;hr&gt;
&lt;h2 class="heading-element" id="risk-and-impact-from-nginx-to-full-cluster-takeover"&gt;&lt;span&gt;Risk and Impact: From NGINX to Full Cluster Takeover&lt;/span&gt;
 &lt;a href="#risk-and-impact-from-nginx-to-full-cluster-takeover" class="heading-mark"&gt;
 &lt;svg class="octicon octicon-link" viewBox="0 0 16 16" version="1.1" width="16" height="16" aria-hidden="true"&gt;&lt;path d="m7.775 3.275 1.25-1.25a3.5 3.5 0 1 1 4.95 4.95l-2.5 2.5a3.5 3.5 0 0 1-4.95 0 .751.751 0 0 1 .018-1.042.751.751 0 0 1 1.042-.018 1.998 1.998 0 0 0 2.83 0l2.5-2.5a2.002 2.002 0 0 0-2.83-2.83l-1.25 1.25a.751.751 0 0 1-1.042-.018.751.751 0 0 1-.018-1.042Zm-4.69 9.64a1.998 1.998 0 0 0 2.83 0l1.25-1.25a.751.751 0 0 1 1.042.018.751.751 0 0 1 .018 1.042l-1.25 1.25a3.5 3.5 0 1 1-4.95-4.95l2.5-2.5a3.5 3.5 0 0 1 4.95 0 .751.751 0 0 1-.018 1.042.751.751 0 0 1-1.042.018 1.998 1.998 0 0 0-2.83 0l-2.5 2.5a1.998 1.998 0 0 0 0 2.83Z"&gt;&lt;/path&gt;&lt;/svg&gt;
 &lt;/a&gt;
&lt;/h2&gt;&lt;ol&gt;
&lt;li&gt;&lt;strong&gt;Sensitive Information Leakage&lt;/strong&gt;: Once RCE is achieved within the ingress-nginx controller container, attackers can read all Kubernetes Secrets mounted to that Pod. &lt;strong&gt;Crucially&lt;/strong&gt;, the NGINX Ingress Controller typically has extremely high privileges (ClusterRole), requiring it to read Secrets from &lt;strong&gt;all namespaces&lt;/strong&gt; in the cluster to obtain TLS certificates. This means the consequence of RCE is not just the current Namespace, but the &lt;strong&gt;complete leakage of all cluster certificates and credentials&lt;/strong&gt;.&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Traffic Hijacking and Tampering&lt;/strong&gt;: The controller usually has read and write permissions for Ingress resources in the cluster. Combined with RCE, attackers can further tamper with routing, transparently forwarding user traffic to attacker-controlled backends for man-in-the-middle attacks or data theft.&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;&amp;ldquo;One Hole to Rule the Cloud&amp;rdquo;&lt;/strong&gt;: Practical tests by multiple security vendors show that in clusters with loose default network policies, an attacker only needs execution permissions on any Pod to laterally access the admission webhook, thereby escalating to cluster-level control.&lt;/li&gt;
&lt;/ol&gt;
&lt;hr&gt;
&lt;h2 class="heading-element" id="short-term-remediation-patch-first-rebuild-later"&gt;&lt;span&gt;Short-Term Remediation: Patch First, Rebuild Later&lt;/span&gt;
 &lt;a href="#short-term-remediation-patch-first-rebuild-later" class="heading-mark"&gt;
 &lt;svg class="octicon octicon-link" viewBox="0 0 16 16" version="1.1" width="16" height="16" aria-hidden="true"&gt;&lt;path d="m7.775 3.275 1.25-1.25a3.5 3.5 0 1 1 4.95 4.95l-2.5 2.5a3.5 3.5 0 0 1-4.95 0 .751.751 0 0 1 .018-1.042.751.751 0 0 1 1.042-.018 1.998 1.998 0 0 0 2.83 0l2.5-2.5a2.002 2.002 0 0 0-2.83-2.83l-1.25 1.25a.751.751 0 0 1-1.042-.018.751.751 0 0 1-.018-1.042Zm-4.69 9.64a1.998 1.998 0 0 0 2.83 0l1.25-1.25a.751.751 0 0 1 1.042.018.751.751 0 0 1 .018 1.042l-1.25 1.25a3.5 3.5 0 1 1-4.95-4.95l2.5-2.5a3.5 3.5 0 0 1 4.95 0 .751.751 0 0 1-.018 1.042.751.751 0 0 1-1.042.018 1.998 1.998 0 0 0-2.83 0l-2.5 2.5a1.998 1.998 0 0 0 0 2.83Z"&gt;&lt;/path&gt;&lt;/svg&gt;
 &lt;/a&gt;
&lt;/h2&gt;&lt;p&gt;Before discussing Gateway API migration, all clusters still running ingress-nginx need to take two immediate actions:&lt;/p&gt;</description></item></channel></rss>