<?xml version="1.0" encoding="utf-8" standalone="yes"?><rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Cloudflare - Tag - Shengxu · Cloud Architecture &amp; DevOps</title><link>https://shengxu.pages.dev/en/tags/cloudflare/</link><description>Cloud architecture &amp; DevOps notes by Shengxu: Kubernetes, Cilium, observability, LLM infra, AI agents.</description><generator>Hugo 0.153.2 &amp; FixIt v0.4.0-alpha.3-20251225101113-8ffb9a95</generator><language>en</language><lastBuildDate>Sun, 25 Jan 2026 10:00:00 +0800</lastBuildDate><atom:link href="https://shengxu.pages.dev/en/tags/cloudflare/index.xml" rel="self" type="application/rss+xml"/><item><title>Practical Guide · Building a Memory-Enabled AI Writing Partner (Part 1): Multi-Agent Architecture Evolution</title><link>https://shengxu.pages.dev/en/posts/fantasy-novel-agent-architecture-evolution/</link><pubDate>Sun, 25 Jan 2026 10:00:00 +0800</pubDate><guid>https://shengxu.pages.dev/en/posts/fantasy-novel-agent-architecture-evolution/</guid><category domain="https://shengxu.pages.dev/en/categories/ai/">AI</category><category domain="https://shengxu.pages.dev/en/categories/devops/">DevOps</category><description>&lt;p&gt;When writing a long novel, the most painful part isn&amp;rsquo;t &amp;ldquo;not being able to write,&amp;rdquo; but &amp;ldquo;forgetting what you&amp;rsquo;ve already written&amp;rdquo;: Did I set up that foreshadowing properly? Was the character already injured in the last chapter? When exactly was that specific rule established? Once the word count reaches hundreds of thousands, relying solely on human memory and scattered notes quickly spirals out of control.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;FantasyNovelAgent&lt;/strong&gt; grew out of this very need, evolving step by step: starting as a simple Python script, then adding dynamic memory and automatic archiving, followed by multi-device sync support, and finally moving toward a front-end/back-end separation with a cloud-native storage prototype. This article reviews that evolutionary path and explains the key trade-offs made along the way, offering a reference for similar projects.&lt;/p&gt;</description></item><item><title>Hands-On: Building an Automated AI Semantic Search with Cloudflare Vectorize and Gemini</title><link>https://shengxu.pages.dev/en/posts/building-ai-search-with-cloudflare-and-gemini/</link><pubDate>Fri, 23 Jan 2026 15:30:00 +0800</pubDate><guid>https://shengxu.pages.dev/en/posts/building-ai-search-with-cloudflare-and-gemini/</guid><category domain="https://shengxu.pages.dev/en/categories/ai/">AI</category><category domain="https://shengxu.pages.dev/en/categories/devops/">DevOps</category><description>&lt;p&gt;In 2026, adding AI search to a personal blog is nothing new. But achieving it with &lt;strong&gt;zero cost&lt;/strong&gt;, &lt;strong&gt;full automation&lt;/strong&gt;, and &lt;strong&gt;high performance&lt;/strong&gt; remains a technical topic worth exploring.&lt;/p&gt;
&lt;p&gt;This article breaks down the technical architecture behind this site&amp;rsquo;s AI Search feature, showing how to combine &lt;strong&gt;Cloudflare Workers&lt;/strong&gt;, &lt;strong&gt;Vectorize&lt;/strong&gt;, &lt;strong&gt;D1&lt;/strong&gt;, and &lt;strong&gt;Google Gemini&lt;/strong&gt; to build a closed-loop RAG (Retrieval-Augmented Generation) system.&lt;/p&gt;
&lt;h2 class="heading-element" id="1-core-architecture-design"&gt;&lt;span&gt;1. Core Architecture Design&lt;/span&gt;
 &lt;a href="#1-core-architecture-design" class="heading-mark"&gt;
 &lt;svg class="octicon octicon-link" viewBox="0 0 16 16" version="1.1" width="16" height="16" aria-hidden="true"&gt;&lt;path d="m7.775 3.275 1.25-1.25a3.5 3.5 0 1 1 4.95 4.95l-2.5 2.5a3.5 3.5 0 0 1-4.95 0 .751.751 0 0 1 .018-1.042.751.751 0 0 1 1.042-.018 1.998 1.998 0 0 0 2.83 0l2.5-2.5a2.002 2.002 0 0 0-2.83-2.83l-1.25 1.25a.751.751 0 0 1-1.042-.018.751.751 0 0 1-.018-1.042Zm-4.69 9.64a1.998 1.998 0 0 0 2.83 0l1.25-1.25a.751.751 0 0 1 1.042.018.751.751 0 0 1 .018 1.042l-1.25 1.25a3.5 3.5 0 1 1-4.95-4.95l2.5-2.5a3.5 3.5 0 0 1 4.95 0 .751.751 0 0 1-.018 1.042.751.751 0 0 1-1.042.018 1.998 1.998 0 0 0-2.83 0l-2.5 2.5a1.998 1.998 0 0 0 0 2.83Z"&gt;&lt;/path&gt;&lt;/svg&gt;
 &lt;/a&gt;
&lt;/h2&gt;&lt;p&gt;Our goal is a fully automated workflow: &lt;strong&gt;write and deploy&lt;/strong&gt;. The author only needs to push Markdown articles; everything else—vector generation, index updates, frontend deployment—is automated.&lt;/p&gt;</description></item></channel></rss>