<?xml version="1.0" encoding="utf-8" standalone="yes"?><rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Gemini - Tag - Shengxu · Cloud Architecture &amp; DevOps</title><link>https://shengxu.pages.dev/en/tags/gemini/</link><description>Cloud architecture &amp; DevOps notes by Shengxu: Kubernetes, Cilium, observability, LLM infra, AI agents.</description><generator>Hugo 0.153.2 &amp; FixIt v0.4.0-alpha.3-20251225101113-8ffb9a95</generator><language>en</language><lastBuildDate>Fri, 23 Jan 2026 15:30:00 +0800</lastBuildDate><atom:link href="https://shengxu.pages.dev/en/tags/gemini/index.xml" rel="self" type="application/rss+xml"/><item><title>Hands-On: Building an Automated AI Semantic Search with Cloudflare Vectorize and Gemini</title><link>https://shengxu.pages.dev/en/posts/building-ai-search-with-cloudflare-and-gemini/</link><pubDate>Fri, 23 Jan 2026 15:30:00 +0800</pubDate><guid>https://shengxu.pages.dev/en/posts/building-ai-search-with-cloudflare-and-gemini/</guid><category domain="https://shengxu.pages.dev/en/categories/ai/">AI</category><category domain="https://shengxu.pages.dev/en/categories/devops/">DevOps</category><description>&lt;p&gt;In 2026, adding AI search to a personal blog is nothing new. But achieving it with &lt;strong&gt;zero cost&lt;/strong&gt;, &lt;strong&gt;full automation&lt;/strong&gt;, and &lt;strong&gt;high performance&lt;/strong&gt; remains a technical topic worth exploring.&lt;/p&gt;
&lt;p&gt;This article breaks down the technical architecture behind this site&amp;rsquo;s AI Search feature, showing how to combine &lt;strong&gt;Cloudflare Workers&lt;/strong&gt;, &lt;strong&gt;Vectorize&lt;/strong&gt;, &lt;strong&gt;D1&lt;/strong&gt;, and &lt;strong&gt;Google Gemini&lt;/strong&gt; to build a closed-loop RAG (Retrieval-Augmented Generation) system.&lt;/p&gt;
&lt;h2 class="heading-element" id="1-core-architecture-design"&gt;&lt;span&gt;1. Core Architecture Design&lt;/span&gt;
 &lt;a href="#1-core-architecture-design" class="heading-mark"&gt;
 &lt;svg class="octicon octicon-link" viewBox="0 0 16 16" version="1.1" width="16" height="16" aria-hidden="true"&gt;&lt;path d="m7.775 3.275 1.25-1.25a3.5 3.5 0 1 1 4.95 4.95l-2.5 2.5a3.5 3.5 0 0 1-4.95 0 .751.751 0 0 1 .018-1.042.751.751 0 0 1 1.042-.018 1.998 1.998 0 0 0 2.83 0l2.5-2.5a2.002 2.002 0 0 0-2.83-2.83l-1.25 1.25a.751.751 0 0 1-1.042-.018.751.751 0 0 1-.018-1.042Zm-4.69 9.64a1.998 1.998 0 0 0 2.83 0l1.25-1.25a.751.751 0 0 1 1.042.018.751.751 0 0 1 .018 1.042l-1.25 1.25a3.5 3.5 0 1 1-4.95-4.95l2.5-2.5a3.5 3.5 0 0 1 4.95 0 .751.751 0 0 1-.018 1.042.751.751 0 0 1-1.042.018 1.998 1.998 0 0 0-2.83 0l-2.5 2.5a1.998 1.998 0 0 0 0 2.83Z"&gt;&lt;/path&gt;&lt;/svg&gt;
 &lt;/a&gt;
&lt;/h2&gt;&lt;p&gt;Our goal is a fully automated workflow: &lt;strong&gt;write and deploy&lt;/strong&gt;. The author only needs to push Markdown articles; everything else—vector generation, index updates, frontend deployment—is automated.&lt;/p&gt;</description></item></channel></rss>