Agent Manager’s FinOps tooling gives you real-time visibility into LLM token usage and associated costs. You can monitor spending per agent and session, set expected burn-rate baselines, and receive alerts when agent costs deviate from normal behavior.Documentation Index
Fetch the complete documentation index at: https://operativusai.mintlify.app/llms.txt
Use this file to discover all available pages before exploring further.
What FinOps tracks
Valuation rates
Token-to-USD conversion rates per model. Agent Manager uses these rates to compute cost estimates from raw token counts reported by the LLM provider.
Burn rates
Real-time USD spend velocity per active session. Sliding-window accumulators track cost per hour so you can detect runaway sessions before they become costly.
Historical trends
Daily cost aggregations over configurable trailing windows (7, 30, or up to 90 days) broken down by agent and organization.
Anomaly detection
Sessions whose burn rate exceeds a registered agent baseline by a configurable multiplier are flagged as anomalies, visible in the dashboard and via API.
Viewing cost data
Historical cost trends (trailing N days, default 7):Configuring valuation rates
Agent Manager computes cost estimates using per-model token-to-USD rates. Retrieve the current rate table:Setting agent burn-rate baselines
Baselines define the expected normal USD/hour spend for an agent. Agent Manager uses baselines to identify anomalous sessions.Set baselines after running your agents in normal conditions for a few days. Use the historical trends endpoint to determine a representative USD/hour figure for each agent.
Anomaly detection
When a session’s burn rate exceeds its agent’s baseline by a configured multiplier, it appears as an active anomaly:Prometheus metrics
Agent Manager exposes FinOps data via Prometheus at the standard actuator endpoint:| Metric | Type | Description |
|---|---|---|
agent.runs | Counter | Total agent run count |
agent.tool.calls | Counter | Total tool invocations |
finops.cache.savings.usd | Counter | Cumulative USD saved via semantic cache |
finops.embedding.cost.usd | Summary | Cumulative USD spent on embeddings |