About
Run large language models locally with a simple CLI and API, supporting Llama, Mistral, and more.
Replaces
OpenAI API (OpenAI): Partial
Anthropic API (Anthropic): Partial
Google Gemini API: Partial
DeepSeek: Full
Hugging Face: Partial
Fireworks AI: Partial