Documentation
Merius API
Merius serves open models on our own Blackwell GPUs over one OpenAI-compatible API. If you have written against the OpenAI SDK, you already know the request shape — change the base URL, use an Merius key, and call any model we serve.
Introduction #
The Merius API speaks the OpenAI HTTP schema end to end: the same endpoints, the same request and response bodies, the same streaming format. You do not install an Merius SDK or learn a new client — any OpenAI-compatible library works once you point it at our base URL and pass an Merius API key.
Under that familiar surface, every request runs on hardware we own and operate: B200/B300 (NVIDIA Blackwell) GPUs in EU and US datacenters. You get transparent per-token pricing, EU data residency on request, and no prompt or completion retention.
Already using the OpenAI SDK? Skip ahead to Authentication and Quickstart — the only lines that change are your base URL and key.
What you get #
A short list of what the API gives you, and where to read more:
OpenAI-compatible
Drop-in /chat/completions, /completions, and /models endpoints. Keep your SDK and request shape; change one line.
Open models, our GPUs
Qwen3, GLM, and Mistral open models served on owned B200/A100 hardware — not resold cloud capacity. See Models.
Streaming, tools, JSON
Server-sent-event streaming, function calling, and structured outputs, all in the OpenAI format you already use.
EU + US, zero retention
EU-only routing on request, GDPR-compliant, no prompt or completion storage. See Data residency.
Make your first request #
Three steps and you are calling a model:
curl https://api.merius.ai/v1/chat/completions \
-H "Authorization: Bearer $MERIUS_API_KEY" \
-H "Content-Type: application/json" \
-d '{
"model": "qwen/qwen3-30b-a3b",
"messages": [{"role": "user", "content": "Hello"}]
}'
- Create an Merius account and generate an API key in the dashboard.
- Point your OpenAI client at the Merius base URL (the one line that changes).
- Call /chat/completions with any model slug we serve.
The Quickstart shows the same call in Python and TypeScript. Authentication covers creating and storing keys.