February 13th, 2025
Best Open Source LLMs in 2025
Explore top open LLMs in 2025: DeepSeek-R1-Qwen-32b, Mistral Small 3, and Qwen 2.5 Coder. Deploy on serverless GPUs for scalable, dedicated inference endpoints.
70 posts
February 13th, 2025
Explore top open LLMs in 2025: DeepSeek-R1-Qwen-32b, Mistral Small 3, and Qwen 2.5 Coder. Deploy on serverless GPUs for scalable, dedicated inference endpoints.
January 16th, 2025
Discover how high-performance serverless GPUs, autoscaling, scale-to-zero, and other cutting-edge features simplify AI deployment in 2025. Learn how our next-gen platform empowers developers to build faster, smarter, and more cost-efficient AI solutions.
January 8th, 2025
Top PostgreSQL providers with free hosting solutions. Discover free options ideal for building side projects and MVPs.
December 19th, 2024
Since we've released so much goodness in the past couple of weeks, we prepared a recap to make sure you don't miss out on any of our serverless news.
December 18th, 2024
Discover our new plans designed for developers and teams to provide more flexibility, clearer pricing, and effortless scalability as your needs grow.
December 12th, 2024
We are dropping prices across our range of Serverless GPUs including L4, L40S, and A100 GPUs. Build and deploy your AI applications for less with serverless using Scale to Zero and Autoscaling.
September 2nd, 2024
Discover how Beavr scaled their SaaS platform globally with Koyeb and why they chose Koyeb infrastructure over Vercel for scalability, autoscaling, and enhanced developer experience.
June 17th, 2024
Today, we are announcing autoscaling in GA! Get dynamic, flexible, and responsive resource allocations for production. Automatically scale your AI and full stack applications.
May 20th, 2024
Looking to boost the performance of your AI workloads using LLMs in productions? Explore the best inference engines and servers like vLLM, RayLLM with RayServe, TensorRT-LLM, HuggingFace Text Generation Inference, and more to see which one you should be using when performing inference.
May 6th, 2024
Wonder how we added the ability to automatically scale Services running on our platform? Learn about the engineering behind autoscaling Instances on a global and serverless platform.
April 25th, 2024
Trying to keep up with AI and all the buzzwords surrounding it? Learn some essential AI terminology like parameters, weights, tokens, quantization, sparsity and more with our intro to LLMs.
April 10th, 2024
The Koyeb team is growing! Get to know Paweł, software engineer building the Koyeb serverless platform.
April 4th, 2024
Learn how toddl.co reduced its build and deployment times, automated manual proccesses, and regained time to focus on its business.
March 26th, 2024
Last Thursday night, we co-organized a Local and Open Source AI developer meetup with Ollama and Dagger at Station F. Over 450 developers attended, both from the local and international scene.
March 18th, 2024
KubeCon EU 2024 is upon us! We are gearing up for a week full of cloud native and open source fun! Check out our handcrafted list of this week's awesome events.
March 7th, 2024
Learn all about RAG, the AI framework addressing major LLM limitations by supplementing an LLM's knowledge source with external resources. Discover the benefits of RAG, its origins and ideal use cases, how to implement it and deploy RAG-powered AI applications on Koyeb.
February 27th, 2024
We are thrilled to announce that our Singapore location is generally available to deploy your full stack applications, low-latency AI workloads, APIs, and databases.
February 16th, 2024
Discover how FoxSell optimized time, cost, and performance with Koyeb's innovative cloud solutions. From streamlined global deployments to exceptional performance on bare metal machines, explore their journey to efficiency in e-commerce.
February 7th, 2024
Discover how the University of British Columbia Housing Assessment Resource Tools (UBC HART) leverages Koyeb for rapid deployments of the housing assessment resource tools they are developing in the face of Canada's housing crisis.
January 31st, 2024
Our team is growing! Get to know David, Koyeb's first Product Manager, and find out how he is helping us make our vision for serverless a reality.
January 23rd, 2024
Our team is growing! Get to know Leo, Koyeb's newest Software Engineer, and find out how he is helping us make our vision for serverless a reality.
December 19th, 2023
Kong leverages Koyeb for rapid global deployments to highlight its API lifecycle management platform.
December 15th, 2023
Discover how Qwigo leverages global serverless deployments on Koyeb to deploy their CPU-intensive services across the US and Europe.
December 13th, 2023
Using a managed database saves time and every when building applications that need persistent storage. While there are numerous benefits to using a managed database, there are serious and long-term considerations to have before choosing one for your app. Our analysis aims to equip you with insights that will help when selecting a managed database for your next project.
October 27th, 2023
Today, we reaffirm our commitment to maintaining a free tier. We explain how we intend to sustain it and why we are so committed to providing one. We provide a free tier so users can explore the platform and deploy a hobby project before deploying a production-grade application. This is a story about bare metal, abuse, credit cards, and scale-to-zero.
October 25th, 2023
Deep dive into how we built a global serverless engine with Nomad, Kuma/Envoy and Golang to make multi-region deployments easy.
September 13th, 2023
With Koyeb Metrics, you get a high-level overview of what is happening in your services running on Koyeb. Discover how you can use Koyeb Metrics to understand your services and diagnose performance issues.
September 5th, 2023
Today, we are excited to announce the support of Dockerfile based deployments in general availability. Building and deploying using Dockerfiles offer more flexibility and control over the build process of your applications to let you deploy any kind of applications, frameworks, and runtimes.
August 31st, 2023
Today, we are thrilled to announce workers are generally available on Koyeb! You can now easily deploy workers to process any background jobs with high-end performance in all of our locations.
August 10th, 2023
The latest version of the Koyeb CLI is available and brings many helpful improvements! Discover the reworked error messages, smoother flow for creating and updating services, and how we improved the foundations of our CLI to continue to building the developer experience we envision.
July 3rd, 2023
The Koyeb team is growing! Get to know Justin, our technical writer and driving-force behind our documentation and technical guides.
June 26th, 2023
Exciting news: We have a new teammate! Get to know Sebastian, our Customer Success Engineer helping Koyeb users deploy successfully!
June 2nd, 2023
Want to learn more about microVMs? Learn all about this lightweight virtualization technology and get all of your microVM questions answered with this short and sweet primer.
May 10th, 2023
Get to know our new team member, Kamil! He is our product designer helping us build an optimal developer experience and make our vision for serverless a reality.
April 27th, 2023
Learn all about gRPC, a high-performance remote procedure call framework, and how it improves communication between services. Discover the benefits of using gRPC, its use cases, how to implement it in various programming languages and deploy gRPC applications on Koyeb.
April 24th, 2023
We have a new team member! Get to know Julia, our talent partner helping us make Koyeb's vision for serverless a reality. Learn what she looks for in a candidate, her favorite part of the hiring process, and what she gets up to outside of work.
April 20th, 2023
eBPF lets you run sandboxed programs in a kernel's operating system. Catch up on the heated debate taking place in the service mesh world about how this technology will shape the future of the service mesh and network innovation.
April 4th, 2023
We have a new team member! Get to know Julien, a Software Engineer helping us make Koyeb's vision for serverless a reality.
December 15th, 2022
You can now deploy your full-stack applications and APIs in our newest US-East location near Washington, DC. Enjoy high-end performance and all of the platform's built-in features for your apps in the world's largest connectivity hub.
December 14th, 2022
Today, we are super excited to announce the grand opening of our Frankfurt core location in Germany! Frankfurt is the largest connectivity hub in Europe and is an amazing place to run high-performance and low latency applications.
October 28th, 2022
Learn about continuous deployment, what goes into building a strong continuous deployment pipeline, the value it adds, how it differs from continuous delivery, and getting it built into your application development.
October 24th, 2022
Wondering what is a service mesh? Get all of your service mesh questions answered with this short and sweet primer. We take a look at how the two crucial parts of a service mesh, the control plane and the data plane, work together to handle interservice communication.
September 28th, 2022
Discover how we added end-to-end tracing to all requests for Koyeb Apps. We explain how we implemented end-to-end tracing, why we chose Jaeger and Grafana Agent to power our observability stack, and how we overcame the challenges we encountered along the way.
August 31st, 2022
Heroku's free tier changed the way developers, hobbyists, students, and indie hackers deployed applications. Heroku's announcement to sunset their free tier marks the end of an era. We take the time to reflect on the rise and impact of Heroku’s legendary free tier.
August 2nd, 2022
Sitting between clients and backend services, API gateways have a number of uses and benefits. Get the lowdown with this short and simple post on API gateways.
July 25th, 2022
Our team is growing! Get to know Nils, our first Full Stack Engineer, and find out how he is helping us make our vision for serverless a reality.
July 18th, 2022
We have a new team member! Get to know Anthony, a Senior Software Engineer helping us make Koyeb's vision for serverless a reality.
July 5th, 2022
Koyeb has a new team member! Get to know Diego, our Software Engineering Intern, and find out how he is helping us make our vision for serverless a reality.
December 21st, 2021
While writing a comparison of Kubernetes and Koyeb, we tried to determine how much operating a Kubernetes cluster really costs. This section of our comparison took us hours to write and ended up being so long that we decided to write a dedicated post about it. Kubernetes is a proven technology, but the true cost is often underestimated: this post investigates the actual financial costs of using Kubernetes.
November 9th, 2021
If you're afraid to push to production on a Friday, rely on big-bang deployments, or find recovering from an infrastructure failure is a painful and time-consuming incident, then it is seriously time to talk about continuous deployment best practices. Discover the different go-to continuous deployment strategies and how you can get a continuous deployment pipeline built-in to your application by deploying on the Koyeb Serverless Platform.
October 20th, 2021
The Koyeb team is growing! Get to know our new team member, Nicolas, in this interview. He is a Senior Backend Engineer helping make Koyeb's vision for serverless a reality.
October 19th, 2021
We recently wrote about how the Koyeb Serverless Engine runs microVMS to host your Services but we skipped a big subject: Global Networking. This is a deep dive to understand the life of an end user's request for a service hosted on Koyeb. We explore the technology and components that make up our internal architecture by following the journey of a request from an end-user, through Koyeb's Global Edge Network, and to the application running in one of our Core locations.
October 8th, 2021
The Koyeb team is growing! Get to know our new team member, Thomas, in this interview. He is a Senior Backend Engineer helping make Koyeb's vision for serverless a reality.
September 16th, 2021
Users have certain expectations for modern web services and applications. Discover how building distributed and global architectures enable you to respond to those standards and and what it really means to deploy globally..
August 10th, 2021
There are several different architecture designs for Web APIs. While REST and RPC remain two popular choices, the arrival of GraphQL and OpenAPI bring new possibilities in the realm of performance, functionality, productivity of your web APIs.
July 7th, 2021
We decided to build our own serverless engine, one that would not be limited by existing implementations. The first version of Koyeb was built on top of Kubernetes and allowed us to quickly build a working cloud platform. After a few months of operating with this version, we decided to move user workloads from Kubernetes to a custom stack based on Nomad, Firecracker, and Kuma.
May 27th, 2021
An API gateway is an API management tool that provides several benefits in a microservice architecture. Learn more about how API gateways work, their typical use cases, and what you should consider before implementing one.
May 18th, 2021
Caching is an effective technique for improving performance and reducing latency speeds for the requests of your web services and apps. CDNs bring your content even closer to end-users. Learn about cache control: what it is, how to configure it, and when to use it.
April 29th, 2021
A service mesh is a dedicated layer of infrastructure that simplifies network management and increases visibility into typically complex microservice architectures. We explore this emerging technology by reviewing its history, purpose-built design, and implementations.
April 22nd, 2021
Virtualization optimizes the use of computing resources. Firecracker, a lightweight virtualization technology, is transforming the possibilities of serverless workloads.
March 31st, 2021
Event routers are the middlemen in an event-driven system. RabbitMQ and Apache Kafka are two popular event routers with very different implementations. Learn about their difference to make better decisions for powering modern apps.
March 23rd, 2021
Service discovery is the vital component in a microservice architecture that enables communication between services. Discover the influence of DNS on service discovery as well as learn about the different models of service discovery and their real-world implementations.
March 15th, 2021
Sync and async are two popular types of programming models when building event-based architectures, APIs, and handling long-running tasks. This blog posts compares async and sync processing as well as covers when to use them.
March 4th, 2021
Event-driven architectures are a great model to align your business with the real-world. Pairing it with serverless technology is a dream come true for your developers and your business.
February 25th, 2021
When considering a FaaS or CaaS deployment strategy, it is worthwhile to consider the difference in the managed responsiblities between FaaS and CaaS offerings. Learn more about ideal use cases and when to use FaaS or CaaS solutions.
February 23rd, 2021
Firecracker is a virtualization technology with that is setting the serverless world ablaze. While there are many perks with Firecracker, here are our top ten reasons why we love Firecracker.
February 8th, 2021
FaaS and CaaS are two popular deployment strategies with their own unique advantages and ideal use cases. Knowing what distinguishes them can help when deciding how to build and deploy your web apps.
January 27th, 2021
Virtualization technology is evolving. Firecracker is an emerging solution that combines the security and isolation of bare metal instances with the density and performance of containers.
January 18th, 2021
The serverless computing era is here. Learn about the implications of going serverless as well as the benefits and existing challenges to implementing this emerging technology.
January 7th, 2021
The serverless computing era is here. Learn about the history and evolution of cloud computing to see why developers and businesses are excited about serverless technology.