Episode Configuration

Toggle on to use a worker trained with GRPO (Group Relative Policy Optimization) instead of a naive heuristic worker.


Final Scores


Security Metrics