SEAL stands for Synergistic Co-Evolution of Agents and Learning Environments. It is a research project that comes with a paper, a poster, and a project homepage, written by authors from Ant Group, Westlake University, the University of Michigan, and the University of Science and Technology of China. The license is Apache 2.0. The project is about making AI agents that use tools, the kind of agents that call functions, query APIs, or run commands to finish a task. The idea is that the agent and the training environment improve together in a closed loop. The agent runs through tasks, the system watches which steps fail, and the failures are sorted into categories such as invalid tool calls, wrong arguments, missed tool calls, failed recovery attempts, and responses that do not match what was expected. These labels then feed back into both the training interface and the model itself. The training method uses something called GRPO, a reinforcement learning approach, where the diagnostic categories reweight the rewards given during training. The README says the actual tool definitions, task labels, and verifier stay the same during evaluation, so the comparison to other methods remains fair. The training environment is built on BFCL, a public benchmark for function-calling agents. To run it, you clone the repo, create a Python 3.10 conda environment called seal, install the requirements, and then set up a second conda environment for the BFCL benchmark using its setup script. After both are ready, you launch the BFCL service and then start the training run with python launcher.py pointing at exp/SEAL.yaml. The repository layout is organized into folders for the experiment config, the BFCL environment service, modules for diagnostic state and reward reweighting, task adapters, and the released data splits.
Generated 2026-05-21 · Model: sonnet-4-6 · Verify against the repo before relying on details.