SETA: Scaling Environments for Terminal Agents

— Designing resilient toolkits and scalable RL environments for CAMEL terminal agents

Authors: Qijia Shen 1,2, Jay Rainton 3, Aznaur Aliev 4, Ahmed Awelkair 1,2,4,5, Boyuan Ma 3, Zhiqi (Julie) Huang 6,7, Yuzhen Mao 8, Wendong Fan 1,2, Philip Torr 9, Bernard Ghanem 4, Changran Hu 3, Urmish Thakker 3, ****Guohao Li 1,2

  1. CAMEL-AI.org
  2. Eigent.AI
  3. SambaNova
  4. KAUST
  5. University of Malaya
  6. Imperial College London
  7. University College London
  8. Stanford University
  9. University of Oxford

TL;DR: In SETA, we start with building robust toolkits to empower the agent’s planning and terminal capability, and achieved SOTA performance among the same model families. We performed extensive analysis on the results, and built up the scalable environment synthesis pipeline for RL training to further improve the models’ terminal capability.

Key contributions: