A lightweight reverse proxy for unified access to multiple LLM backends. Supports streaming, token tracking, and multi-provider fallback.
- OpenAI (gpt-5.5, gpt-5.4-mini, gpt-5.4-nano) - Anthropic (claude-opus-4, claude-sonnet-4, claude-haiku-4) - Google Gemini (gemini-2.5-flash, gemini-2.5-pro) - Ollama (local models: llama3.3, qwen3, gemma3)
All endpoints require Bearer Token authentication:
Authorization: Bearer <your-token>
System: operational
curl https://api.ai-relay-croft.cloud/v1/chat/completions \
-H "Authorization: Bearer sk-your-token" \
-H "Content-Type: application/json" \
-d '{
"model": "gpt-5.5",
"messages": [{"role": "user", "content": "Hello"}]
}'