Tool Use & MCP Playground
Three scenarios explore tool design, MCP servers, and tool selection at scale. Toggle components on and off to see how each one affects tool call accuracy.
Choose a Scenario
Tool Design Scenario
A file management agent must execute a complex multi-step operation. Each tool design principle you enable improves the model's ability to select the correct tool and call it with valid arguments. Watch tool call accuracy climb from random guessing to precise execution.
Agent Type: File management agent
Tools Available: 8 file operations
Challenge: Multi-step file ops with rename
Token Usage: 0 / 1,580 (0%)
Context Components (0/6 active)
- Names
- Schemas
- Descriptions
- Errors
- Validation
- Examples
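To make the six components concrete, here is a minimal sketch of what a single tool might look like once names, schemas, descriptions, and examples are switched on. The playground does not list its eight tools, so the tool name `move_file`, the field layout, and the helper `check_arguments` are all illustrative assumptions, not the playground's actual definitions or any particular API's format.

```python
# Hypothetical tool definition. The name, fields, and wording are assumptions
# made for illustration; the playground's real tools are not shown in the text.
MOVE_FILE_TOOL = {
    # Names: a verb_noun name that says exactly what the tool does.
    "name": "move_file",
    # Descriptions: what it does, its constraints, and how it fails.
    "description": (
        "Move one file to a new location, optionally under a new name. "
        "Both paths must be absolute. Fails if the destination exists."
    ),
    # Schemas: typed, required arguments with no extras allowed.
    "input_schema": {
        "type": "object",
        "properties": {
            "source": {
                "type": "string",
                "description": "Absolute path of the file to move.",
            },
            "destination": {
                "type": "string",
                "description": "Absolute path, including the new file name.",
            },
        },
        "required": ["source", "destination"],
        "additionalProperties": False,
    },
    # Examples: one known-good call (the date is a made-up sample value).
    "examples": [
        {
            "source": "/downloads/q3.pdf",
            "destination": "/documents/reports/2024-01-31_q3.pdf",
        },
    ],
}


def check_arguments(tool, args):
    """Return a list of problems; an empty list means the call looks valid."""
    schema = tool["input_schema"]
    problems = []
    for key in schema["required"]:
        if key not in args:
            problems.append(f"missing required argument: {key}")
    for key, value in args.items():
        if key not in schema["properties"]:
            problems.append(f"unknown argument: {key}")
        elif not isinstance(value, str):
            problems.append(f"{key} must be a string")
    return problems
```

The checker only covers the string-typed, flat schema used here; a real system would more likely hand the schema to a full JSON Schema validator.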
Score: 0/100 (Broken: Guessing Blind)
Engineer: Move all .pdf files from /downloads to /documents/reports and rename them with today's date prefix.
AI Agent: Guessing Blind
Issues (5)
- Invents a non-existent tool name
- Uses relative paths instead of absolute
- Tries to do everything in one call
- No awareness of available tools
- Cannot recover from errors
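The five issues above map fairly directly onto validation and error design. The sketch below is one possible way to reject invented tool names, catch relative paths, return errors the model can act on, and split the engineer's request into one call per file. The tool names `list_files` and `move_file`, the error fields, and the `YYYY-MM-DD_` prefix format are assumptions; the playground specifies none of them.

```python
import posixpath
from datetime import date

# Hypothetical registry: the playground's eight tools are not named in the text.
AVAILABLE_TOOLS = {"list_files", "move_file"}


def validate_call(name, args):
    """Return None if the call is acceptable, else a structured error dict."""
    if name not in AVAILABLE_TOOLS:
        # Listing the real tools lets the model retry with a valid name.
        return {
            "error": "unknown_tool",
            "message": f"No tool named '{name}'.",
            "available_tools": sorted(AVAILABLE_TOOLS),
        }
    for key, value in args.items():
        if not posixpath.isabs(value):
            # The hint tells the model how to repair the argument.
            return {
                "error": "relative_path",
                "message": f"'{key}' must be an absolute path, got '{value}'.",
                "hint": "Start the path at the root, e.g. /downloads/report.pdf.",
            }
    return None


def plan_moves(file_names, src_dir, dst_dir, today):
    """Build one move_file call per PDF, prefixing each name with the ISO date."""
    calls = []
    for file_name in sorted(file_names):
        if not file_name.lower().endswith(".pdf"):
            continue  # Only .pdf files are part of the request.
        new_name = f"{today.isoformat()}_{file_name}"  # Assumed prefix format.
        calls.append({
            "tool": "move_file",
            "args": {
                "source": posixpath.join(src_dir, file_name),
                "destination": posixpath.join(dst_dir, new_name),
            },
        })
    return calls
```

A typical use would be to list /downloads first, pass the resulting names to `plan_moves` with `date.today()`, and run `validate_call` on each planned call before executing it, so a failure on one file does not block the rest.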