Access frontier open-source LLMs through a single API. Pay per token, no subscription required.
deepseek/deepseek-v3.2
moonshotai/kimi-k2.5
z-ai/glm-5
minimax/minimax-m2.5
489 tokens/sec means your AI assistant thinks faster. Cursor autocomplete feels instant, Claude Code edits land quicker, and coding agents iterate in seconds instead of minutes.
Drop-in replacement for the OpenAI SDK. Change one line of code to switch.
Scale without restrictions. Send as many requests as you need.
Latest hardware for the fastest inference available today.