GAIA Agent Evaluation Runner

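GAIA Agent - an agent built on LangGraph, with support for:

  • RAG knowledge base retrieval (returns the stored answer directly on a high-similarity match)
  • Web search (DuckDuckGo)
  • File processing (text, ZIP, PDF, Excel)
  • Code execution (sandboxed environment)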
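
The "return the answer directly on a high-similarity match" behavior in the feature list can be pictured with a minimal sketch. This is not the agent's actual implementation: the function names, the 0.9 threshold, the cosine-similarity metric, and the toy two-dimensional vectors are all assumptions chosen for illustration.

```python
import math
from typing import Optional, Sequence

# Hypothetical cutoff; the threshold the agent really uses is not stated.
SIMILARITY_THRESHOLD = 0.9


def cosine_similarity(a: Sequence[float], b: Sequence[float]) -> float:
    """Cosine similarity of two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def lookup_answer(
    query_vec: Sequence[float],
    knowledge_base: Sequence[tuple],
    threshold: float = SIMILARITY_THRESHOLD,
) -> Optional[str]:
    """Return the stored answer of the best match if it clears the threshold.

    knowledge_base is a sequence of (embedding, answer) pairs. Returning None
    signals that the caller should fall back to other tools (search, files, code).
    """
    best_score = -1.0
    best_answer = None
    for entry_vec, answer in knowledge_base:
        score = cosine_similarity(query_vec, entry_vec)
        if score > best_score:
            best_score, best_answer = score, answer
    return best_answer if best_score >= threshold else None


# Toy knowledge base with hand-made embeddings (illustrative only).
kb = [([1.0, 0.0], "Paris"), ([0.0, 1.0], "42")]
direct = lookup_answer([1.0, 0.0], kb)    # exact match, clears the threshold
fallback = lookup_answer([1.0, 1.0], kb)  # similarity is about 0.71, below 0.9
```

In this sketch a `None` result is the cue to continue with the other capabilities; in a real setup the embeddings would come from an embedding model rather than hand-written vectors.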

Instructions:

  1. Log in to your Hugging Face account using the button below.
  2. Click 'Run Evaluation & Submit All Answers' to start evaluation.
  3. Wait for the agent to process all questions (this may take a while).