The agent is packaged as a CLI, so it can run in any environment: locally, on specific hardware, or in Docker.
- Input: an artifact. Output: a playable environment (local or Docker) plus the reproduced results.
Researchers and students can then experiment with that environment, gain a deeper understanding of the work, and possibly propose better solutions.
This makes the agent useful beyond artifact evaluation.
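
A rough sketch of what the CLI shape could look like, in Python. Everything named here is an assumption for illustration and not part of the notes: the program name `artifact-agent`, the flags `--backend`, `--gpu`, and `--out`, the image tag `artifact-env`, and a `reproduce.sh` entry point inside the artifact. It only plans the commands and does not execute them; the Docker invocations use the standard `docker build -t` and `docker run --rm --gpus all -v` forms.

```python
import argparse
import os
import shlex


def build_parser() -> argparse.ArgumentParser:
    """Hypothetical CLI: artifact in, playable environment and results out."""
    parser = argparse.ArgumentParser(prog="artifact-agent")
    parser.add_argument("artifact", help="path to the input artifact directory")
    parser.add_argument("--backend", choices=["local", "docker"], default="docker",
                        help="where the playable environment is created")
    parser.add_argument("--gpu", action="store_true",
                        help="request GPU access (one example of specific hardware)")
    parser.add_argument("--out", default="results",
                        help="directory that receives the reproduced results")
    return parser


def plan_commands(args: argparse.Namespace) -> list[str]:
    """Return the shell commands the agent would run, without executing them."""
    out_dir = os.path.abspath(args.out)  # absolute path, needed for a bind mount
    if args.backend == "local":
        # Assumption: the artifact ships a reproduce.sh entry point.
        return [
            f"cd {shlex.quote(args.artifact)} && ./reproduce.sh {shlex.quote(out_dir)}"
        ]
    image = "artifact-env"  # hypothetical image tag
    run = ["docker", "run", "--rm"]
    if args.gpu:
        run += ["--gpus", "all"]
    # Mount the results directory so outputs survive after the container exits.
    run += ["-v", f"{out_dir}:/results", image]
    return [
        shlex.join(["docker", "build", "-t", image, args.artifact]),
        shlex.join(run),
    ]
```

For example, `plan_commands(build_parser().parse_args(["./paper-artifact", "--gpu"]))` returns a `docker build` command followed by a `docker run` command. Keeping planning separate from execution lets a user inspect the commands before anything is run.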