AI Dashboard
Model usage, token economics and inference health across the Helios Pro AI gateway.
Total Calls
6.45M
24h window
+18.2% vs last month
Tokens Processed
3.24B
input + output
+22.4% vs last month
Avg. Latency
284ms
p50 across models
-38ms vs last month
Inference Cost
$32,900
$0.0051 / 1k tokens
+4.1% vs last month
Model Usage
Daily calls, last 14 days
+18.2%
Tokens by Category
Where tokens are spent
Latency Trend
p50 / p95 / p99 (ms)
-38ms p50
Live Inference
Gateway event stream
Live
- Aarav Sharma ran inference on Helios-2 Turbo (4.2k tokens), 12 sec ago
- Priya Patel fine-tuned Llama 3.1 70B, 2 min ago
- Daniel Chen deployed Mistral Large v3, 8 min ago
- Sofia Mendes hit rate limit on Claude 3.5 Sonnet, 14 min ago
- Marcus Bell optimized prompt for GPT-4o, 26 min ago
Model Performance
Calls, tokens, latency, cost & health
6 models
| Model | Calls | Tokens | Latency | Cost | Health |
|---|---|---|---|---|---|
| Helios-2 Turbo (Helios Labs) | 2.4M | 1.2B | 184ms | | healthy |
| GPT-4o (OpenAI) | 1.8M | 920M | 412ms | | healthy |
| Claude 3.5 Sonnet (Anthropic) | 940k | 480M | 396ms | | degraded |
| Llama 3.1 70B (Self-hosted) | 620k | 320M | 248ms | | healthy |
| Mistral Large (Mistral AI) | 410k | 180M | 322ms | | down |
| Gemini 1.5 Pro | 280k | 140M | 288ms | | healthy |