ML Training Recipes

174 installs, 9.2k stars

Summary

This is a comprehensive PyTorch training reference that covers everything from optimizer selection (Muon vs AdamW for different parameter types) to domain-specific patterns for LLMs, vision, diffusion, and biomedical applications. It includes actual code snippets for training loops, learning rate schedules, and mixed precision setup, plus decision tables for architecture selection based on data scale. The scaling laws section gives you Chinchilla-optimal token counts, and there's practical debugging advice for loss spikes and OOM errors. What makes this useful is the specificity: it tells you to use lr * (d_model / 768)^(-0.5) for dimension scaling and eps=1e-10 for AdamW in bfloat16, not just "tune your hyperparameters." If you're setting up training from scratch or debugging why your loss won't converge, this beats hunting through papers and GitHub issues.
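As a rough illustration of the specifics quoted above, here is a minimal Python sketch of the width-based learning-rate rule and the AdamW epsilon setting. The function names (scale_lr, chinchilla_tokens, adamw_kwargs) and the base learning rate are hypothetical and not taken from the skill itself. The default of 20 tokens per parameter is the commonly cited Chinchilla rule of thumb, assumed here because the summary does not state the exact figure the skill uses.

```python
def scale_lr(base_lr: float, d_model: int, ref_dim: int = 768) -> float:
    """Apply lr * (d_model / ref_dim)^(-0.5), the rule quoted in the summary.

    base_lr is assumed to have been tuned at width ref_dim (768 in the summary).
    """
    return base_lr * (d_model / ref_dim) ** -0.5


def chinchilla_tokens(n_params: float, tokens_per_param: float = 20.0) -> float:
    """Estimate a compute-optimal token budget.

    tokens_per_param=20 is an assumed rule of thumb, not a value from the skill.
    """
    return n_params * tokens_per_param


def adamw_kwargs(base_lr: float, d_model: int) -> dict:
    """Build keyword arguments that could be passed to torch.optim.AdamW.

    eps=1e-10 is the bfloat16 setting mentioned in the summary.
    """
    return {"lr": scale_lr(base_lr, d_model), "eps": 1e-10}


if __name__ == "__main__":
    # A model 4x wider than the reference gets half the base learning rate.
    print(scale_lr(3e-4, d_model=3072))
    # Approximate token budget for a 1B-parameter model under the assumed ratio.
    print(chinchilla_tokens(1e9))
    print(adamw_kwargs(3e-4, d_model=1536))
```

With PyTorch installed, the dictionary from adamw_kwargs could be unpacked as torch.optim.AdamW(model.parameters(), **adamw_kwargs(3e-4, d_model)). Whether the skill applies the scaling to every parameter group or only some of them is not stated in the summary.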

Install to Claude Code

npx -y skills add orchestra-research/ai-research-skills --skill ml-training-recipes --agent claude-code

Installs into .claude/skills of the current project.

Files

SKILL.md