13

Tool Use & MCP Playground

3 scenarios exploring tool design, MCP servers, and tool selection at scale. Toggle components on and off to see how they affect tool call accuracy.

Choose a Scenario

3 scenarios available

Tool Design Scenario

A file management agent must execute a complex multi-step operation. Each tool design principle you enable improves the model's ability to select the correct tool and call it with valid arguments. Watch tool call accuracy climb from random guessing to precise execution.

Agent Type

File management agent

Tools Available

8 file operations

Challenge

Multi-step file ops with rename

Token Usage

0/ 1,580 (0%)
Names
Schemas
Descriptions
Errors
Validation
Examples

Context Components

0/6 active

0/100

Broken

Guessing Blind

No components

Engineer

Move all .pdf files from /downloads to /documents/reports and rename them with today's date prefix.

AI Agent

Guessing Blind

Issues (5)

  • ×Invents a non-existent tool name
  • ×Uses relative paths instead of absolute
  • ×Tries to do everything in one call
  • ×No awareness of available tools
  • ×Cannot recover from errors