This is a workflow for evaluating NVIDIA's Cosmos Policy model on robotics simulation benchmarks, specifically the LIBERO and RoboCasa environments. It covers the full setup from scratch, including headless GPU evaluation and inference profiling. You'd reach for this if you're testing Cosmos Policy's performance or comparing it against other robotic manipulation models in standardized sim environments. Its 192 installs and passing results on most security audits suggest it's being used in actual research workflows. It pulls from the public cosmos-policy repository, so you're working with the official eval setup rather than someone's fork.
npx -y skills add orchestra-research/ai-research-skills --skill evaluating-cosmos-policy --agent claude-code
Installs into .claude/skills of the current project.
sickn33/antigravity-awesome-skills
moizibnyousaf/ai-agent-skills
github/awesome-copilot