DeepSparse Server
DeepSparse is an inference runtime taking advantage of sparsity with neural networks offering GPU-class performance on CPUs.
Open WebUI is an extensible, feature-rich and user-friendly WebUI for LLMs.
GPUs are available on Koyeb! Get ready to deploy serverless AI apps on high-performance infrastructure. Deploy today!
Open WebUI is a ChatGPT-like web UI for various LLM runners, including Ollama and other OpenAI-compatible APIs.
It's hard to name all of the features supported by Open WebUI, but to name a few:
Open WebUI also offers great flexibility and configurability, so in practice, you can have your own private ChatGPT suited to your needs.
This example app deploys Open WebUI together with Ollama, which simplifies running the latest state-of-the-art models.
Once the Open WebUI server is deployed, you can start interacting with it via your Koyeb App URL similar to: https://<YOUR_APP_NAME>-<YOUR_KOYEB_ORG>.koyeb.app
.
Pull the model you want from Ollama and start using your private ChatGPT lookalike!
DeepSparse is an inference runtime taking advantage of sparsity with neural networks offering GPU-class performance on CPUs.
Deploy Fooocus, a powerful AI image generation tool, on Koyeb
LangServe makes it easy to deploy LangChain applications as RESTful APIs.