Skip to main content

Documentation Index

Fetch the complete documentation index at: https://metrion.mintlify.app/llms.txt

Use this file to discover all available pages before exploring further.

Metrion acts as a transparent proxy between your application and AI providers. All proxy endpoints accept the same request body as the original provider API — Metrion forwards your request and returns the original response unchanged.

Endpoints

ProviderMethodEndpointSDK base URL
Anthropic (SDK)POSThttps://www.metrion.dev/api/proxy/v1/messageshttps://www.metrion.dev/api/proxy
Anthropic (direct)POSThttps://www.metrion.dev/api/proxy/messages— for curl / direct HTTP requests
OpenAIPOSThttps://www.metrion.dev/api/proxy/openai/v1/chat/completionshttps://www.metrion.dev/api/proxy/openai
Google GeminiPOSThttps://www.metrion.dev/api/proxy/gemini/v1/chat/completionshttps://www.metrion.dev/api/proxy/gemini
MistralPOSThttps://www.metrion.dev/api/proxy/mistral/v1/chat/completionshttps://www.metrion.dev/api/proxy/mistral
Grok (xAI)POSThttps://www.metrion.dev/api/proxy/grok/v1/chat/completionshttps://www.metrion.dev/api/proxy/grok

Request body

Send the same request body you would send directly to the provider. Metrion does not modify your request payload.
// Anthropic example
{
  "model": "claude-opus-4-6",
  "max_tokens": 1024,
  "messages": [{ "role": "user", "content": "Hello" }]
}
// OpenAI-compatible example (OpenAI, Gemini, Mistral, Grok)
{
  "model": "gpt-4o",
  "messages": [{ "role": "user", "content": "Hello" }]
}

Response body

The response body is identical to what the provider returns. Metrion only reads usage metadata (token counts, latency) from the response — it does not alter the response structure or content.

Streaming

Streaming is supported for all providers. Pass stream: true in the request body exactly as you would with the provider’s API directly — no additional configuration is needed.
{
  "model": "gpt-4o",
  "stream": true,
  "messages": [{ "role": "user", "content": "Hello" }]
}

Limits

ParameterValue
Rate limit100 requests per minute, per Metrion token
Timeout30 seconds
Free plan monthly cap10,000 requests per month
If you exceed the monthly cap on the Free plan, the proxy returns a 429 Too Many Requests response. Your counter resets on the first of each month. To remove the monthly cap, upgrade to Pro.