Studio — Web Dashboard¶
The web dashboard exposes every Studio capability through a browser — designed for team members who don't live in a terminal, and for visual inspection of complex multi-agent systems.
/overview¶
Blueprint summary and quick-action panel.
Blueprint: customer-support-system v1.2.0
Owner: platform-team | Tags: support, production
Resources
Agents: 5 Workflows: 1
Providers: 3 Contracts: 2
Compliance Score: 83% (5/6 checks passed)
[Validate] [Simulate] [View Traces] [View Diff]
The compliance score comes from GovernanceService — see Services. Click Validate to rerun checks live. Click Simulate to jump to /simulate with the current blueprint pre-loaded.
/agents¶
Agent roster with full prompt viewer.
Agents (5)
classifier fast / claude-haiku-3-5-20241022
Route to billing, technical, returns, or general
System prompt:
Classify the customer message into exactly one of:
billing, technical, returns, general.
Respond with ONLY the category name.
billing_agent expert / claude-sonnet-4-20250514
Handle billing disputes and subscription issues
System prompt:
You are a billing specialist. Handle disputes, refunds,
and subscription questions. Always acknowledge frustration first.
...
Tools: lookup_invoice, issue_refund
Each agent card shows: - Name, description, provider, model - Full system prompt (expandable) - Tools and guardrails attached
/workflows¶
Pattern diagrams with interactive agent inspection.
Workflows (1)
main [supervisor] ── [Run Simulation]
flowchart TD
IN[Input] --> classifier
classifier -->|billing| billing_agent
classifier -->|technical| technical_agent
...
formatter --> OUT[Output]
- Rendered Mermaid diagram for each workflow
- Click any agent node to see its prompt and provider inline
- Pattern type badge:
supervisor,pipeline,fan_out_fan_in, etc. - Run Simulation button launches
/simulatewith this workflow pre-selected
/simulate¶
Interactive workflow runner.
Workflow: [main ▼]
Mode: [◉ Mock (free)] [○ Live (real API)]
Task: [I was charged twice this month ] [Run]
─────────────────────────────────────────────────────
Step 1 classifier 0.012s MockLLM
Output: billing
Step 2 billing_agent 0.024s MockLLM
Output: "I understand your frustration. Let me look into the
double charge on your account..."
Step 3 formatter 0.008s MockLLM
Output: "Thank you for reaching out. I understand your frustration
regarding the duplicate charge..."
─────────────────────────────────────────────────────
Final: "Thank you for reaching out. I understand..."
Duration: 0.044s | Cost: $0.00 (mock) | Route: billing
Switch between Mock and Live mode. Live mode uses real providers and shows actual latency and cost.
/traces¶
Trace explorer for recorded .jsonl files.
See the dedicated Traces page for full detail.
Loaded: traces/production_runs.jsonl
1,240 spans | 180 runs | last run: 2025-06-09T14:32
Run #174 supervisor 3 steps 3.4s $0.0092
├── classifier 0.3s claude-haiku 12→1 tok $0.00001
├── billing_agent 2.4s claude-sonnet 180→140 tok $0.00840
└── formatter 0.7s claude-haiku 145→80 tok $0.00051
Cost by model (last 7 days):
claude-sonnet-4-20250514 ████████████████ $24.80 (72%)
gpt-4o-mini ████ $6.20 (18%)
claude-haiku-3-5-20241022 ██ $3.40 (10%)
/governance¶
Compliance scores and diff view.
Blueprint: customer-support-system v1.2.0
Compliance Score: 83% (5/6 checks passed)
✓ Agent refs All workflow agent references resolve
✓ Provider refs All agent provider references resolve
✓ Pattern names All patterns are registered
✓ Contract refs All contract references resolve
✗ SLA values timeout_seconds: 30s may be too low for expert provider
(claude-sonnet p99 observed at 2.4s)
✓ Security No hardcoded API keys detected
Diff vs previous version:
~ agents.billing_agent.provider: expert → premium (+$0.008/call)
- agents.legacy_agent (removed)
/providers¶
Real-time provider health and budget dashboard.
Provider Health
anthropic / claude-haiku-3-5-20241022
Status: ✓ HEALTHY Avg latency: 340ms p99: 820ms
Errors (1h): 0 Budget used: $1.24 / $50.00 ██░░░░░░░░ 2.5%
anthropic / claude-sonnet-4-20250514
Status: ✓ HEALTHY Avg latency: 1,100ms p99: 2,400ms
Errors (1h): 2 (rate limit) Budget used: $18.60 / $50.00 ██████░░░ 37%
openai / gpt-4o-mini
Status: ✓ HEALTHY Avg latency: 480ms p99: 1,100ms
Errors (1h): 0 Budget used: $4.30 / $50.00 █░░░░░░░░░ 9%
Budget data comes from the cost_budget.daily_usd field in the blueprint's observability: block.
/diff¶
Visual semantic diff between two blueprint versions.
Upload two YAML files or paste YAML directly into the editor panels. Changes are categorised:
- BREAKING — pattern changed, required field removed
- WARNING — prompt changed, provider swapped
- INFO — metadata, descriptions, tags
/docs¶
Auto-generated documentation for the loaded blueprint — produced by BlueprintRenderer.to_markdown(). Exportable as a Markdown file.
Launching with a Blueprint Pre-Loaded¶
# Load blueprint and traces on startup
pyagent dashboard \
--blueprint customer-support.yaml \
--trace traces/production_runs.jsonl \
--port 8000