AI Dashboard

Model usage, token economics and inference health across the Helios Pro AI gateway.

Total Calls

6.45M

24h window

+18.2%vs last month

Tokens Processed

3.24B

input + output

+22.4%vs last month

Avg. Latency

284ms

p50 across models

-38msvs last month

Inference Cost

$32,900

$0.0051 / 1k tokens

+4.1%vs last month

Daily calls, last 14 days

+18.2%

Where tokens are spent

p50 / p95 / p99 (ms)

-38ms p50

Gateway event stream

Live

Calls, tokens, latency, cost & health

6 models

Model	Calls↕	Tokens↕	Latency	Cost↕	Health
Helios-2 Turbo Helios Labs	2.4M	1.2B	184ms	$8,420	healthy
GPT-4o OpenAI	1.8M	920M	412ms	$12,400	healthy
Claude 3.5 Sonnet Anthropic	940k	480M	396ms	$6,820	degraded
Llama 3.1 70B Self-hosted	620k	320M	248ms	$1,240	healthy
Mistral Large Mistral AI	410k	180M	322ms	$2,180	down
Gemini 1.5 Pro Google	280k	140M	288ms	$1,840	healthy