Building a Distributed Persistent Queue That Scaled AI Workloads 5x Under LLM Rate Limits - Enggist