Playground

Test your prompts against AI models and compare results. Open the Playground by clicking Playground in the editor's Actions tab.

The Playground uses a grid layout where prompt versions are rows and AI models are columns. You can run individual cells, entire rows or columns, or the full grid at once.

In this section

Model & Settings — configure models with temperature, max tokens, and reasoning effort.
Batch Comparison — add multiple models and versions to the grid for side-by-side comparison.
AI Judge Scoring — automatic scoring of responses on accuracy, helpfulness, relevance, coherence, and safety.
Run History — browse past test runs and copy responses.