Slides

302 installs59 stars

Summary

Generates slide decks from topics, URLs, or text using AI visuals. Defaults to slides only, but you can add voice narration if needed. It handles the full workflow: asks what you want slides about, picks a language, optionally configures a speaker, then runs the generation in the background and notifies you when done. Supports inline viewing or downloading the script (and audio if narrated) to your current directory with sensible filenames. The hard gate pattern means it walks you through choices one at a time before executing anything. Useful when you need a quick visual presentation without manually designing slides, or when you want to turn a URL or concept into something you can actually present.

Install to Claude Code

npx -y skills add marswaveai/skills --skill slides --agent claude-code

Installs into .claude/skills of the current project.

CodeRabbit

AI writes the code. CodeRabbit catches the slop.

Try For Free →

Keep your Mac awake

Keep your Mac awake while Claude Code and 40+ AI agents run. Sleeps when they're idle.

One time payment $9 →

Context.dev

Integrate web data into your AI product. One API to scrape website & brand data.

Get API Key Now →

Make your agent a DeFi expert

Agent, run crypto. Access onchain data & trade routes via 1inch.

Install now →

Make money from your Skills

On Capafy, your Skill runs online 24/7 as an agent product, and you get paid every time someone uses it.

Start earning →

AppSignal

Monitor with ease. Code with confidence.

Start Free Trial →

CodeRabbit

AI writes the code. CodeRabbit catches the slop.

Try For Free →

Keep your Mac awake

Keep your Mac awake while Claude Code and 40+ AI agents run. Sleeps when they're idle.

One time payment $9 →

Context.dev

Integrate web data into your AI product. One API to scrape website & brand data.

Get API Key Now →

Make your agent a DeFi expert

Agent, run crypto. Access onchain data & trade routes via 1inch.

Install now →

Make money from your Skills

On Capafy, your Skill runs online 24/7 as an agent product, and you get paid every time someone uses it.

Start earning →

AppSignal

Monitor with ease. Code with confidence.

Start Free Trial →

Files

SKILL.mdView on GitHub

When to Use

User wants to create a slide deck or presentation
User asks for "slides", "幻灯片", "PPT", or "presentation"
User wants visual content organized into slides from a topic or URL

When NOT to Use

User wants a narrated video without slides (use /explainer)
User wants audio-only content (use /speech or /podcast)
User wants a podcast-style discussion (use /podcast)
User wants to generate a standalone image (use /image-gen)

Purpose

Generate slide decks with AI-generated visuals from topics, URLs, or text. By default, slides are generated without audio narration. Narration can be optionally enabled. Ideal for presentations, summaries, and visual storytelling.

Hard Constraints

Always read config following shared/config-pattern.md before any interaction
Follow shared/cli-patterns.md for execution modes, error handling, and interaction patterns
Always follow shared/cli-authentication.md for auth checks
Follow shared/speaker-selection.md when narration is enabled
Never hardcode speaker IDs — always fetch from the speakers CLI when the user wants to change voice
Never save files to ~/Downloads/ or .listenhub/ — save artifacts to the current working directory with friendly topic-based names (see shared/config-pattern.md § Artifact Naming)
Mode is always slides — never info or story (those are for /explainer)
Only 1 speaker supported (when narration is enabled)
Default behavior: skip audio (no narration). Only add narration when the user explicitly requests it via --no-skip-audio

Use the AskUserQuestion tool for every multiple-choice step — do NOT print options as plain text. Ask one question at a time. Wait for the user's answer before proceeding to the next step. After all parameters are collected, summarize the choices and ask the user to confirm. Do NOT call any CLI command until the user has explicitly confirmed.

Step -1: CLI Auth Check

Follow shared/cli-authentication.md. If the CLI is not installed or the user is not logged in, auto-install and auto-login — never ask the user to run commands manually.

Step 0: Config Setup

Follow shared/config-pattern.md Step 0 (Zero-Question Boot).

If file doesn't exist — silently create with defaults and proceed:

mkdir -p ".listenhub/slides"
echo '{"outputMode":"inline","language":null,"defaultSpeakers":{}}' > ".listenhub/slides/config.json"
CONFIG_PATH=".listenhub/slides/config.json"
CONFIG=$(cat "$CONFIG_PATH")

Do NOT ask any setup questions. Proceed directly to the Interaction Flow.

If file exists — read config silently and proceed:

CONFIG_PATH=".listenhub/slides/config.json"
[ ! -f "$CONFIG_PATH" ] && CONFIG_PATH="$HOME/.listenhub/slides/config.json"
CONFIG=$(cat "$CONFIG_PATH")

Setup Flow (user-initiated reconfigure only)

Only run when the user explicitly asks to reconfigure. Display current settings:

当前配置 (slides)：
  输出方式：{inline / download / both}
  语言偏好：{zh / en / 未设置}
  默认主播：{speakerName / 使用内置默认}

Then ask:

outputMode: Follow shared/output-mode.md § Setup Flow Question.
Language (optional): "默认语言？"
- "中文 (zh)"
- "English (en)"
- "每次手动选择" → keep null

After collecting answers, save immediately:

NEW_CONFIG=$(echo "$CONFIG" | jq --arg m "$OUTPUT_MODE" '. + {"outputMode": $m}')
echo "$NEW_CONFIG" > "$CONFIG_PATH"
CONFIG=$(cat "$CONFIG_PATH")

Interaction Flow

Step 1: Topic / Content

Free text input. Ask the user:

What would you like to create slides about?

Accept: topic description, text content, URL(s), or any combination.

Step 2: Language

If config.language is set, pre-fill and show in summary — skip this question. Otherwise ask:

Question: "What language?"
Options:
  - "Chinese (zh)" — Content in Mandarin Chinese
  - "English (en)" — Content in English
  - "Japanese (ja)" — Content in Japanese

Step 3: Narration

Ask:

Question: "需要语音旁白吗？（默认否）"
Options:
  - "不需要" — Slides only, no narration
  - "需要" — Add voice narration to slides

Default is no narration. If the user says yes, proceed to Step 4. Otherwise skip to Step 5.

Step 4: Speaker Selection (only if narration enabled)

Skip this step entirely if narration is not enabled.

Follow shared/speaker-selection.md:

If config.defaultSpeakers.{language} is set → use saved speaker silently
If not set → use built-in default from shared/speaker-selection.md for the language
Show the speaker in the confirmation summary (Step 5) — user can change from there if desired
Only show the full speaker list if the user explicitly asks to change voice

Only 1 speaker is supported for slides narration.

Step 5: Confirm & Generate

Summarize all choices:

Without narration:

Ready to generate slides:

  Topic: {topic}
  Language: {language}
  Narration: No

  Proceed?

With narration:

Ready to generate slides:

  Topic: {topic}
  Language: {language}
  Narration: Yes
  Speaker: {speaker name}

  Proceed?

Wait for explicit confirmation before running any CLI command.

Workflow

Submit (background): Run the CLI command with run_in_background: true and timeout: 660000:

Without narration (default):

listenhub slides create \
  --query "{topic}" \
  --lang {en|zh|ja} \
  --image-size 2K \
  --aspect-ratio 16:9 \
  --timeout 600 \
  --json

With narration:

listenhub slides create \
  --query "{topic}" \
  --lang {en|zh|ja} \
  --image-size 2K \
  --aspect-ratio 16:9 \
  --no-skip-audio \
  --speaker "{name}" \
  --timeout 600 \
  --json

If the user provided a source URL, add --source-url "{url}".

The CLI handles polling internally and returns the final result when generation completes.

Tell the user the task is submitted and that they will be notified when it finishes.
When notified of completion, parse and present the result:

Parse the CLI JSON output for key fields:
```
EPISODE_ID=$(echo "$RESULT" | jq -r '.episodeId')
AUDIO_URL=$(echo "$RESULT" | jq -r '.audioUrl // empty')
CREDITS=$(echo "$RESULT" | jq -r '.credits // empty')
```
Read OUTPUT_MODE from config. Follow shared/output-mode.md for behavior.

Without narration:

inline or both: Present the online link.
```
幻灯片已生成！

在线查看：https://listenhub.ai/app/slides/{episodeId}
消耗积分：{credits}
```
download or both: Also save the script file. Generate a topic slug following shared/config-pattern.md § Artifact Naming.
- Save as {slug}-slides.md in cwd (dedup if exists)
- Present the save path in addition to the above summary.
With narration:

inline or both: Display audio URL as a clickable link.
```
幻灯片已生成！

在线查看：https://listenhub.ai/app/slides/{episodeId}
音频链接：{audioUrl}
消耗积分：{credits}
```
download or both: Also save files. Generate a topic slug following shared/config-pattern.md § Artifact Naming.
- Create {slug}-slides/ folder (dedup if exists)
- Write script.md inside
- Download audio:
```
curl -sS -o "{slug}-slides/audio.mp3" "{audioUrl}"
```
- Present:
```
已保存到当前目录：
  {slug}-slides/
    script.md
    audio.mp3
```

After Successful Generation

Update config with the choices made this session:

NEW_CONFIG=$(echo "$CONFIG" | jq \
  --arg lang "{language}" \
  '. + {"language": $lang}')
echo "$NEW_CONFIG" > "$CONFIG_PATH"

If narration was used, also save the speaker:

NEW_CONFIG=$(echo "$CONFIG" | jq \
  --arg lang "{language}" \
  --arg speakerId "{speakerId}" \
  '. + {"language": $lang, "defaultSpeakers": (.defaultSpeakers + {($lang): [$speakerId]})}')
echo "$NEW_CONFIG" > "$CONFIG_PATH"

Estimated times:

Slides without narration: 2-4 minutes
Slides with narration: 4-8 minutes

Resources

CLI authentication: shared/cli-authentication.md
CLI patterns: shared/cli-patterns.md
Speaker query: shared/cli-speakers.md
Speaker selection guide: shared/speaker-selection.md
Config pattern: shared/config-pattern.md
Output mode: shared/output-mode.md

Composability

Invokes: speakers CLI (for speaker selection when narration enabled)
Invoked by: content-planner (Phase 3)

Example

User: "帮我做一个关于量子计算的幻灯片"

Agent workflow:

Topic: "量子计算"
Language: pre-filled from config or ask → "zh"
Narration: ask → "不需要"
Confirm and generate

listenhub slides create \
  --query "量子计算" \
  --lang zh \
  --image-size 2K \
  --aspect-ratio 16:9 \
  --timeout 600 \
  --json

Wait for CLI to return result, then present the online link.

User: "Create slides about React hooks with narration"

Agent workflow:

Topic: "React hooks"
Language: ask → "en"
Narration: ask → "需要"
Speaker: use built-in default for English
Confirm and generate

listenhub slides create \
  --query "React hooks" \
  --lang en \
  --image-size 2K \
  --aspect-ratio 16:9 \
  --no-skip-audio \
  --speaker "Mars" \
  --timeout 600 \
  --json

Wait for CLI to return result, then present the online link and audio link.

Featured

CodeRabbit

AI writes the code. CodeRabbit catches the slop.

Try For Free →

Keep your Mac awake

Keep your Mac awake while Claude Code and 40+ AI agents run. Sleeps when they're idle.

One time payment $9 →

Context.dev

Integrate web data into your AI product. One API to scrape website & brand data.

Get API Key Now →

Make your agent a DeFi expert

Agent, run crypto. Access onchain data & trade routes via 1inch.

Install now →

Make money from your Skills

On Capafy, your Skill runs online 24/7 as an agent product, and you get paid every time someone uses it.

Start earning →

AppSignal

Monitor with ease. Code with confidence.

Start Free Trial →

When to Use

User wants to create a slide deck or presentation
User asks for "slides", "幻灯片", "PPT", or "presentation"
User wants visual content organized into slides from a topic or URL

When NOT to Use

User wants a narrated video without slides (use /explainer)
User wants audio-only content (use /speech or /podcast)
User wants a podcast-style discussion (use /podcast)
User wants to generate a standalone image (use /image-gen)

Purpose

Hard Constraints

Always read config following shared/config-pattern.md before any interaction
Follow shared/cli-patterns.md for execution modes, error handling, and interaction patterns
Always follow shared/cli-authentication.md for auth checks
Follow shared/speaker-selection.md when narration is enabled
Never hardcode speaker IDs — always fetch from the speakers CLI when the user wants to change voice
Never save files to ~/Downloads/ or .listenhub/ — save artifacts to the current working directory with friendly topic-based names (see shared/config-pattern.md § Artifact Naming)
Mode is always slides — never info or story (those are for /explainer)
Only 1 speaker supported (when narration is enabled)
Default behavior: skip audio (no narration). Only add narration when the user explicitly requests it via --no-skip-audio

Step -1: CLI Auth Check

Follow shared/cli-authentication.md. If the CLI is not installed or the user is not logged in, auto-install and auto-login — never ask the user to run commands manually.

Step 0: Config Setup

Follow shared/config-pattern.md Step 0 (Zero-Question Boot).

If file doesn't exist — silently create with defaults and proceed:

mkdir -p ".listenhub/slides"
echo '{"outputMode":"inline","language":null,"defaultSpeakers":{}}' > ".listenhub/slides/config.json"
CONFIG_PATH=".listenhub/slides/config.json"
CONFIG=$(cat "$CONFIG_PATH")

Do NOT ask any setup questions. Proceed directly to the Interaction Flow.

If file exists — read config silently and proceed:

CONFIG_PATH=".listenhub/slides/config.json"
[ ! -f "$CONFIG_PATH" ] && CONFIG_PATH="$HOME/.listenhub/slides/config.json"
CONFIG=$(cat "$CONFIG_PATH")

Setup Flow (user-initiated reconfigure only)

Only run when the user explicitly asks to reconfigure. Display current settings:

当前配置 (slides)：
  输出方式：{inline / download / both}
  语言偏好：{zh / en / 未设置}
  默认主播：{speakerName / 使用内置默认}

Then ask:

outputMode: Follow shared/output-mode.md § Setup Flow Question.
Language (optional): "默认语言？"
- "中文 (zh)"
- "English (en)"
- "每次手动选择" → keep null

After collecting answers, save immediately:

NEW_CONFIG=$(echo "$CONFIG" | jq --arg m "$OUTPUT_MODE" '. + {"outputMode": $m}')
echo "$NEW_CONFIG" > "$CONFIG_PATH"
CONFIG=$(cat "$CONFIG_PATH")

Interaction Flow

Step 1: Topic / Content

Free text input. Ask the user:

What would you like to create slides about?

Accept: topic description, text content, URL(s), or any combination.

Step 2: Language

If config.language is set, pre-fill and show in summary — skip this question. Otherwise ask:

Question: "What language?"
Options:
  - "Chinese (zh)" — Content in Mandarin Chinese
  - "English (en)" — Content in English
  - "Japanese (ja)" — Content in Japanese

Step 3: Narration

Ask:

Question: "需要语音旁白吗？（默认否）"
Options:
  - "不需要" — Slides only, no narration
  - "需要" — Add voice narration to slides

Default is no narration. If the user says yes, proceed to Step 4. Otherwise skip to Step 5.

Step 4: Speaker Selection (only if narration enabled)

Skip this step entirely if narration is not enabled.

Follow shared/speaker-selection.md:

If config.defaultSpeakers.{language} is set → use saved speaker silently
If not set → use built-in default from shared/speaker-selection.md for the language
Show the speaker in the confirmation summary (Step 5) — user can change from there if desired
Only show the full speaker list if the user explicitly asks to change voice

Only 1 speaker is supported for slides narration.

Step 5: Confirm & Generate

Summarize all choices:

Without narration:

Ready to generate slides:

  Topic: {topic}
  Language: {language}
  Narration: No

  Proceed?

With narration:

Ready to generate slides:

  Topic: {topic}
  Language: {language}
  Narration: Yes
  Speaker: {speaker name}

  Proceed?

Wait for explicit confirmation before running any CLI command.

Workflow

Submit (background): Run the CLI command with run_in_background: true and timeout: 660000:

Without narration (default):

listenhub slides create \
  --query "{topic}" \
  --lang {en|zh|ja} \
  --image-size 2K \
  --aspect-ratio 16:9 \
  --timeout 600 \
  --json

With narration:

listenhub slides create \
  --query "{topic}" \
  --lang {en|zh|ja} \
  --image-size 2K \
  --aspect-ratio 16:9 \
  --no-skip-audio \
  --speaker "{name}" \
  --timeout 600 \
  --json

If the user provided a source URL, add --source-url "{url}".

The CLI handles polling internally and returns the final result when generation completes.

Tell the user the task is submitted and that they will be notified when it finishes.
When notified of completion, parse and present the result:

Parse the CLI JSON output for key fields:
```
EPISODE_ID=$(echo "$RESULT" | jq -r '.episodeId')
AUDIO_URL=$(echo "$RESULT" | jq -r '.audioUrl // empty')
CREDITS=$(echo "$RESULT" | jq -r '.credits // empty')
```
Read OUTPUT_MODE from config. Follow shared/output-mode.md for behavior.

Without narration:

inline or both: Present the online link.
```
幻灯片已生成！

在线查看：https://listenhub.ai/app/slides/{episodeId}
消耗积分：{credits}
```
download or both: Also save the script file. Generate a topic slug following shared/config-pattern.md § Artifact Naming.
- Save as {slug}-slides.md in cwd (dedup if exists)
- Present the save path in addition to the above summary.
With narration:

inline or both: Display audio URL as a clickable link.
```
幻灯片已生成！

在线查看：https://listenhub.ai/app/slides/{episodeId}
音频链接：{audioUrl}
消耗积分：{credits}
```
download or both: Also save files. Generate a topic slug following shared/config-pattern.md § Artifact Naming.
- Create {slug}-slides/ folder (dedup if exists)
- Write script.md inside
- Download audio:
```
curl -sS -o "{slug}-slides/audio.mp3" "{audioUrl}"
```
- Present:
```
已保存到当前目录：
  {slug}-slides/
    script.md
    audio.mp3
```

After Successful Generation

Update config with the choices made this session:

NEW_CONFIG=$(echo "$CONFIG" | jq \
  --arg lang "{language}" \
  '. + {"language": $lang}')
echo "$NEW_CONFIG" > "$CONFIG_PATH"

If narration was used, also save the speaker:

NEW_CONFIG=$(echo "$CONFIG" | jq \
  --arg lang "{language}" \
  --arg speakerId "{speakerId}" \
  '. + {"language": $lang, "defaultSpeakers": (.defaultSpeakers + {($lang): [$speakerId]})}')
echo "$NEW_CONFIG" > "$CONFIG_PATH"

Estimated times:

Slides without narration: 2-4 minutes
Slides with narration: 4-8 minutes

Resources

CLI authentication: shared/cli-authentication.md
CLI patterns: shared/cli-patterns.md
Speaker query: shared/cli-speakers.md
Speaker selection guide: shared/speaker-selection.md
Config pattern: shared/config-pattern.md
Output mode: shared/output-mode.md

Composability

Invokes: speakers CLI (for speaker selection when narration enabled)
Invoked by: content-planner (Phase 3)

Example

User: "帮我做一个关于量子计算的幻灯片"

Agent workflow:

Topic: "量子计算"
Language: pre-filled from config or ask → "zh"
Narration: ask → "不需要"
Confirm and generate

listenhub slides create \
  --query "量子计算" \
  --lang zh \
  --image-size 2K \
  --aspect-ratio 16:9 \
  --timeout 600 \
  --json

Wait for CLI to return result, then present the online link.

User: "Create slides about React hooks with narration"

Agent workflow:

Topic: "React hooks"
Language: ask → "en"
Narration: ask → "需要"
Speaker: use built-in default for English
Confirm and generate

listenhub slides create \
  --query "React hooks" \
  --lang en \
  --image-size 2K \
  --aspect-ratio 16:9 \
  --no-skip-audio \
  --speaker "Mars" \
  --timeout 600 \
  --json

Wait for CLI to return result, then present the online link and audio link.

Slides

Install to Claude Code

When to Use

When NOT to Use

Purpose

Hard Constraints

Step -1: CLI Auth Check

Step 0: Config Setup

Setup Flow (user-initiated reconfigure only)

Interaction Flow

Step 1: Topic / Content

Step 2: Language

Step 3: Narration

Step 4: Speaker Selection (only if narration enabled)

Step 5: Confirm & Generate

Workflow

After Successful Generation

Resources

Composability

Example

Slides

Install to Claude Code

When to Use

When NOT to Use

Purpose

Hard Constraints

Step -1: CLI Auth Check

Step 0: Config Setup

Setup Flow (user-initiated reconfigure only)

Interaction Flow

Step 1: Topic / Content

Step 2: Language

Step 3: Narration

Step 4: Speaker Selection (only if narration enabled)

Step 5: Confirm & Generate

Workflow

After Successful Generation

Resources

Composability

Example

Recommended

Recommended