Compare
Gateway Tax = (gateway - direct).
Auto-pick: Direct + GW
Mini trend ยท last samples
Methodology
Each latency figure comes from full chat prompts sent to the provider; we wait for streamed completions, capturing provider overhead, the model's thinking time, and transport.
- Samples aggregate real conversation flows, not pings.
- Gateway tax = gateway latency - direct latency.
- Freshness badges combine sample count and last-run timestamps.
Avg heute (UTC)
Top 10 ยท min 3 samples
#1
Groq Direct
n=33
101 ms
#2
Groq Direct
n=33
120 ms
#3
Groq Direct
n=50
121 ms
#4
Groq Direct
n=33
177 ms
#5
Mistral Direct
n=33
181 ms
#6
Mistral Direct
n=33
184 ms
#7
Groq Direct
n=33
185 ms
#8
Mistral Direct
n=49
186 ms
#9
Groq Direct
n=33
194 ms
#10
Mistral Direct
n=49
196 ms
Schnellste Messung (letzte 20m)
Top 10 ยท min 1 sample
#1
Groq Direct
n=10
64 ms
#2
Groq Direct
n=7
71 ms
#3
Groq Direct
n=7
109 ms
#4
Groq Direct
n=7
118 ms
#5
Mistral Direct
n=7
132 ms
#6
Groq Direct
n=7
132 ms
#7
Mistral Direct
n=10
134 ms
#8
Mistral Direct
n=7
135 ms
#9
Groq Direct
n=7
144 ms
#10
Mistral Direct
n=7
144 ms
Latency Trend
Legend
API Server Map
Live endpoints
Top regions
Data lรคdt โฆ
We measure from Germany; our server is pinned on the map.