Agent Evaluation Runner
Instructions:
- Please clone this space, then modify the code to define your agent's logic.
- Ensure metadata.jsonl is available with question-answer pairs.
- Log in to your Hugging Face account using the button below. This uses your HF username for submission.
- Click 'Run Evaluation & Submit All Answers' to fetch questions, run your agent, submit answers, and see the score.
Agent Configuration:
- 📄 Uses metadata.jsonl for answer lookup
- ❓ Returns 'unknown' for unmatched questions
Questions and Agent Answers