When you've got a folder full of experiment JSONs and CSVs scattered across results directories, this skill pulls them together into comparison tables with proper baselines and deltas. It handles the tedious parts: computing means and standard deviations across seeds, flagging outliers, identifying trends in hyperparameter sweeps. What I like is the structured insight format that forces you to go from observation to interpretation to next experiment, which keeps you from just staring at numbers. It'll also propose documentation updates when findings look significant. Basically saves you from writing the same pandas aggregation code every time you need to compare model runs.
npx -y skills add wanshuiyin/auto-claude-code-research-in-sleep --skill analyze-results --agent claude-codeInstalls into .claude/skills of the current project.
Select a file.
sickn33/antigravity-awesome-skills