Inference tutorials

Discover how to build, deploy, and run inference applications in production on Koyeb, the fastest way to deploy applications globally.

Deploy the vLLM Inference Engine to Run Large Language Models (LLMs) on Koyeb
Justin Ellingwood

Learn how to set up a vLLM Instance to run inference workloads and host your own OpenAI-compatible API on Koyeb.

Jun 12, 2024
12 min read
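
Once a vLLM service like the one in this tutorial is deployed, it exposes an OpenAI-compatible API, so any OpenAI client library can talk to it. A minimal sketch using the official openai Python client might look like the following; the base URL and model name are placeholders for your own deployment:

```python
# Minimal sketch: query a vLLM deployment through its OpenAI-compatible API.
# The base_url and model name are placeholders for your own Koyeb deployment.
from openai import OpenAI

client = OpenAI(
    base_url="https://your-app.koyeb.app/v1",  # hypothetical public app URL
    api_key="placeholder-key",  # vLLM only verifies this if started with --api-key
)

# A standard chat completion request, served by vLLM instead of OpenAI.
response = client.chat.completions.create(
    model="mistralai/Mistral-7B-Instruct-v0.2",  # example: whichever model vLLM loaded
    messages=[{"role": "user", "content": "Explain what an inference engine does."}],
)

print(response.choices[0].message.content)
```

Because vLLM implements the same request and response shapes as the OpenAI API, swapping the base URL is usually the only change needed to point existing OpenAI client code at your own deployment.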
