model one-click apps
Deploy model apps with a click, get started with Koyeb in seconds.
/Featured
Ollama
Ollama is a self-hosted AI solution to run open-source large language models on your own infrastructure.
Gemma 2 9b
Deploy Gemma 2 9B on Koyeb high-performance GPU.
Hermes 3 Llama-3.1 8B
Deploy NousResearch Hermes 3 on Koyeb high-performance GPU.
Llama 3.1 8B Instruct
Deploy Llama 3.1 8B Instruct on Koyeb high-performance GPU.
Mistral 7B Instruct v0.3
Deploy Mistral 7B Instruct v0.3 on Koyeb high-performance GPU.
Qwen 2.5 7B Instruct
Deploy Qwen 2.5 7B Instruct on Koyeb high-performance GPU
SmolLM2 1.7B Instruct
Deploy SmolLM2 1.7B Instruct on Koyeb high-performance GPU.