Routes you through Karpathy's autonomous ML search loop: write program.md with your research goal, let the agent edit train.py, keep runs that lower val_bpb in 300 seconds, revert everything else. The skill enforces the immutable harness rule (prepare.py and eval stay frozen once your session starts) and picks the right mode whether you're doing first-time setup, refining your research charter, running the overnight loop, or interpreting results.tsv. It draws a hard line between this and prompt eval tooling like LangSmith or Braintrust. Works best when you have a real training repo, a GPU with ~40GB VRAM, and want disciplined ratcheting instead of hero rewrites.
npx -y skills add akillness/oh-my-skills --skill autoresearch --agent claude-codeInstalls into .claude/skills of the current project.
Select a file.
juliusbrussee/caveman
mattpocock/skills
shadcn/improve
obra/superpowers
forrestchang/andrej-karpathy-skills
vercel-labs/skills