Open-source LLM router & AI cost optimizer. Automatically routes simple prompts to cheap or local models and complex ones to premium models. Drop-in OpenAI-compatible proxy for Claude Code, Codex, Cursor, OpenClaw. Saves 40-70% on AI API costs. Self-hosted, no middleman.
SmarterRouter: An intelligent LLM gateway and VRAM-aware router for Ollama, llama.cpp, and OpenAI. Features semantic caching, model profiling, and automatic failover for local AI labs.
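The semantic caching mentioned above can be sketched roughly as follows: a new prompt's embedding is compared against stored ones, and a cached response is returned on a sufficiently close match. This is an illustrative sketch, not SmarterRouter's actual code; the `SemanticCache` class and the injected `embed` function are hypothetical names.

```python
import math

def cosine(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    na = math.sqrt(sum(x * x for x in a))
    nb = math.sqrt(sum(x * x for x in b))
    return dot / (na * nb) if na and nb else 0.0

class SemanticCache:
    """Return a cached answer when a new prompt's embedding is close
    enough to a previously seen one. `embed` is injected so any
    embedding model can be plugged in."""

    def __init__(self, embed, threshold=0.92):
        self.embed = embed
        self.threshold = threshold
        self.entries = []  # list of (embedding, response) pairs

    def get(self, prompt):
        v = self.embed(prompt)
        for e, resp in self.entries:
            if cosine(v, e) >= self.threshold:
                return resp
        return None  # cache miss: caller forwards the request upstream

    def put(self, prompt, response):
        self.entries.append((self.embed(prompt), response))
```

A real implementation would use a vector index instead of a linear scan, but the threshold-on-similarity logic is the core idea.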
A personal LLM gateway with fault-tolerant calls to models from any provider with an OpenAI-compatible API. Advanced features include retries, model sequencing, and request-body parameter injection. Especially useful when working with AI coders like Cline and RooCode and providers like OpenRouter.
Use any LLM with Claude Code: a proxy that translates the Anthropic API to OpenAI, Gemini, DeepSeek, Ollama, and more. Full tool calling, streaming, ReAct XML fallback, and hot-reload config.
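The core of such a proxy is translating an Anthropic Messages request into an OpenAI chat-completions payload. A minimal sketch, using field names from the two public APIs; the `anthropic_to_openai` helper itself and the fallback model name are hypothetical, and model-name mapping, tool calls, and streaming are omitted:

```python
def anthropic_to_openai(body: dict) -> dict:
    """Convert an Anthropic Messages API request body into an
    OpenAI chat-completions request body (text messages only)."""
    messages = []
    # Anthropic carries the system prompt in a top-level "system" field;
    # OpenAI expects it as the first message in the list.
    if "system" in body:
        messages.append({"role": "system", "content": body["system"]})
    for msg in body.get("messages", []):
        content = msg["content"]
        # Anthropic content may be a list of typed blocks; keep text blocks.
        if isinstance(content, list):
            content = "".join(
                b.get("text", "") for b in content if b.get("type") == "text"
            )
        messages.append({"role": msg["role"], "content": content})
    return {
        "model": body.get("model", "gpt-4o"),  # model mapping left out
        "messages": messages,
        "max_tokens": body.get("max_tokens", 1024),
    }
```

The reverse direction (OpenAI response back into Anthropic's response shape) follows the same pattern.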
Python proxy for the Gemini API. Works around the tight free-tier rate limits of Gemini Pro via key pooling and provides full OpenAI compatibility for OpenWebUI.
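Key pooling of this kind usually means rotating requests across several API keys and benching a key when it hits a rate limit. A minimal sketch under those assumptions; the `KeyPool` class and its method names are illustrative, not this project's API:

```python
import itertools
import time

class KeyPool:
    """Round-robin pool of API keys. A key that reports a rate limit
    is benched until a cooldown expires, spreading free-tier quota
    across the whole pool."""

    def __init__(self, keys, cooldown=60.0):
        self._cycle = itertools.cycle(keys)
        self._benched = {}  # key -> monotonic time when it becomes usable
        self._cooldown = cooldown
        self._size = len(keys)

    def acquire(self):
        """Return the next usable key, skipping benched ones."""
        for _ in range(self._size):
            key = next(self._cycle)
            if self._benched.get(key, 0) <= time.monotonic():
                return key
        raise RuntimeError("all keys are currently rate-limited")

    def report_rate_limited(self, key):
        """Bench a key after an HTTP 429 until its cooldown passes."""
        self._benched[key] = time.monotonic() + self._cooldown
```

Typical use: `key = pool.acquire()` before each upstream call, and `pool.report_rate_limited(key)` on a 429 response.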
AI development environment with 90% cost savings. Routes between 8 LLM providers while defaulting to free local models. Production-ready with automated testing.
Ready to go local AI gateway: LiteLLM proxy + Langfuse observability + Caddy reverse proxy. Per-tool tracing, session evaluation, and zero-config trace enrichment.
LLM cost-optimization proxy that routes requests across 11 models from 4 providers using complexity classification, two-level semantic caching, and automatic provider failover.
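Complexity classification for routing is often a cheap heuristic scorer run before dispatch: long prompts, embedded code, or "hard" task verbs push the request to a premium model, everything else goes to a cheap one. A hedged sketch; the keyword list, thresholds, and the model names `llama-3.1-8b` and `gpt-4o` are illustrative assumptions, not this proxy's actual configuration:

```python
# Hypothetical two-tier routing based on a simple complexity score.
CHEAP_MODEL = "llama-3.1-8b"   # assumed local/cheap tier
PREMIUM_MODEL = "gpt-4o"       # assumed premium tier

HARD_TASK_WORDS = ("prove", "refactor", "analyze", "debug")

def classify(prompt: str) -> str:
    """Score prompt complexity with cheap heuristics and pick a model tier."""
    score = 0
    if len(prompt) > 500:
        score += 1                       # long prompts tend to be harder
    if any(w in prompt.lower() for w in HARD_TASK_WORDS):
        score += 1                       # task verbs signalling deep work
    if "```" in prompt:
        score += 1                       # embedded code usually means harder work
    return PREMIUM_MODEL if score >= 2 else CHEAP_MODEL
```

Real routers often replace the heuristics with a small learned classifier, but the routing decision sits at the same point in the request path.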