Weekend Project: Building a Local Load Balancer for LLM API Keys

Sat, 14 Feb 2026 10:18:00 +0800

Lately, because I’ve been using various LLM services (OpenAI, Gemini, DeepSeek, etc.) intensively, I’ve run into a very real pain point: being broke.

To save money, I applied for multiple free API keys (like Google Gemini’s Free Tier or DeepSeek’s complimentary credits), but these free keys often come with strict rate limits (RPM/TPM). Just when I’m in the flow writing code, a 429 Too Many Requests error pops up, completely breaking my train of thought. It’s really frustrating.

MacOS - Tag - Shengxu · Cloud Architecture & DevOps

Weekend Project: Building a Local Load Balancer for LLM API Keys