GAIA Agent Evaluation Runner
GAIA Agent - 基于 LangGraph 的智能体,支持:
- RAG 知识库检索(高相似度直接返回答案)
- 网络搜索(DuckDuckGo)
- 文件处理(文本、ZIP、PDF、Excel)
- 代码执行(沙箱环境)
Instructions:
- Log in to your Hugging Face account using the button below.
- Click 'Run Evaluation & Submit All Answers' to start evaluation.
- Wait for the agent to process all questions (this may take a while).