This routing proxy sits between your application and multiple LLM providers and automatically selects the cheapest model capable of handling each request. You'd reach for it when you're making hundreds or thousands of LLM calls and want to stop overpaying for simple tasks that don't need GPT-4-level intelligence. It analyzes each prompt and routes it to the most cost-effective option, which the maintainers claim cuts costs by 60-90% in typical workloads. Think of it as a smart load balancer that optimizes for your wallet instead of latency. It's useful for any project where LLM costs are adding up but you don't want to hand-tune model selection for every call.
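To make the routing idea concrete, here is a minimal sketch of cost-based model selection. It is not the proxy's actual implementation: the model names, prices, capability tiers, and the keyword heuristic in `estimate_difficulty` are all hypothetical placeholders, and a real router would presumably use a much richer prompt analysis.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Model:
    name: str
    cost_per_1k_tokens: float  # hypothetical price in USD, not a real quote
    capability: int            # hypothetical tier: 1 = basic, 3 = strongest


# Hypothetical catalog; names and prices are illustrative only.
CATALOG = [
    Model("small-model", 0.0005, 1),
    Model("medium-model", 0.003, 2),
    Model("large-model", 0.03, 3),
]


def estimate_difficulty(prompt: str) -> int:
    """Crude stand-in for prompt analysis (assumed heuristic)."""
    text = prompt.lower()
    if any(k in text for k in ("prove", "refactor", "step by step")):
        return 3
    if len(prompt.split()) > 50 or "explain" in text:
        return 2
    return 1


def route(prompt: str, catalog=CATALOG) -> Model:
    """Return the cheapest model whose tier meets the estimated difficulty."""
    needed = estimate_difficulty(prompt)
    capable = [m for m in catalog if m.capability >= needed]
    return min(capable, key=lambda m: m.cost_per_1k_tokens)


if __name__ == "__main__":
    for p in ("What is the capital of France?",
              "Explain how TCP handshakes work"):
        print(p, "->", route(p).name)
```

The key design choice is to filter by capability first and only then minimize cost, so a cheap model is never chosen for a request it is judged unable to handle.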
io.github.ericm1018/skillfm-llm-cost-optimizer-openai-anthropic-usage
io.github.mikerawsonnz/llm-orchestration-agent
io.github.mikerawsonnz/authenticated-llm-agent
labforgedev/copilot-memory-mcp
csoai-org/agent-prompt-injection-firewall-mcp
io.github.mikerawsonnz/authenticated-multi-llm-agent