Compare
Gateway Tax = (gateway - direct).
Auto-pick: Direct + GW
Mini trend ยท last samples
Methodology
Each latency figure comes from full chat prompts sent to the provider; we wait for streamed completions, capturing provider overhead, the model's thinking time, and transport.
- Samples aggregate real conversation flows, not pings.
- Gateway tax = gateway latency - direct latency.
- Freshness badges combine sample count and last-run timestamps.
Avg heute (UTC)
Top 10 ยท min 3 samples
#1
Groq Direct
n=24
97 ms
#2
Groq Direct
n=23
117 ms
#3
Groq Direct
n=36
130 ms
#4
Mistral Direct
n=23
169 ms
#5
Groq Direct
n=24
179 ms
#6
Mistral Direct
n=24
181 ms
#7
Groq Direct
n=23
189 ms
#8
Mistral Direct
n=35
190 ms
#9
Groq Direct
n=24
194 ms
#10
Mistral Direct
n=35
201 ms
Schnellste Messung (letzte 20m)
Top 10 ยท min 1 sample
#1
Groq Direct
n=7
64 ms
#2
Groq Direct
n=10
67 ms
#3
Groq Direct
n=6
93 ms
#4
Groq Direct
n=7
125 ms
#5
Groq Direct
n=7
134 ms
#6
Mistral Direct
n=7
135 ms
#7
Mistral Direct
n=6
135 ms
#8
Groq Direct
n=10
137 ms
#9
Mistral Direct
n=9
137 ms
#10
Groq Direct
n=6
138 ms
Latency Trend
Legend
API Server Map
Live endpoints
Top regions
Data lรคdt โฆ
We measure from Germany; our server is pinned on the map.