Vercel BlogMay 15, 2026 1

Sort providers by cost, latency, or throughput on AI Gateway

Sort providers by cost, latency, or throughput on AI Gateway
Summary
The AI Gateway now allows users to sort providers by cost, time to first token (TTFT), or throughput (TPS), enabling better control over ranking criteria for model requests. This feature automatically adapts to changes in provider availability and performance metrics without requiring code modifications. The sorting options enhance the ability to optimize for specific needs, such as cost-effectiveness or latency sensitivity.

Related Articles

Avoid Unnecessary Re-renders in Vue with `v-memo`

Avoid Unnecessary Re-renders in Vue with `v-memo`

Jakub Andrzejewski

Nuxt Tip: Difference Between useFetch and event.$fetch

Nuxt Tip: Difference Between useFetch and event.$fetch

Michael Hoffmann

Why Loaders Matter for Performance

Why Loaders Matter for Performance

Jakub Andrzejewski