Generates slide decks from topics, URLs, or text using AI visuals. Defaults to slides only, but you can add voice narration if needed. It handles the full workflow: asks what you want slides about, picks a language, optionally configures a speaker, then runs the generation in the background and notifies you when done. Supports inline viewing or downloading the script (and audio if narrated) to your current directory with sensible filenames. The hard gate pattern means it walks you through choices one at a time before executing anything. Useful when you need a quick visual presentation without manually designing slides, or when you want to turn a URL or concept into something you can actually present.
npx -y skills add marswaveai/skills --skill slides --agent claude-codeInstalls into .claude/skills of the current project.
/explainer)/speech or /podcast)/podcast)/image-gen)Generate slide decks with AI-generated visuals from topics, URLs, or text. By default, slides are generated without audio narration. Narration can be optionally enabled. Ideal for presentations, summaries, and visual storytelling.
shared/config-pattern.md before any interactionshared/cli-patterns.md for execution modes, error handling, and interaction patternsshared/cli-authentication.md for auth checksshared/speaker-selection.md when narration is enabled~/Downloads/ or .listenhub/ — save artifacts to the current working directory with friendly topic-based names (see shared/config-pattern.md § Artifact Naming)slides — never info or story (those are for /explainer)--no-skip-audioFollow shared/cli-authentication.md. If the CLI is not installed or the user is not logged in, auto-install and auto-login — never ask the user to run commands manually.
Follow shared/config-pattern.md Step 0 (Zero-Question Boot).
If file doesn't exist — silently create with defaults and proceed:
mkdir -p ".listenhub/slides"
echo '{"outputMode":"inline","language":null,"defaultSpeakers":{}}' > ".listenhub/slides/config.json"
CONFIG_PATH=".listenhub/slides/config.json"
CONFIG=$(cat "$CONFIG_PATH")
Do NOT ask any setup questions. Proceed directly to the Interaction Flow.
If file exists — read config silently and proceed:
CONFIG_PATH=".listenhub/slides/config.json"
[ ! -f "$CONFIG_PATH" ] && CONFIG_PATH="$HOME/.listenhub/slides/config.json"
CONFIG=$(cat "$CONFIG_PATH")
Only run when the user explicitly asks to reconfigure. Display current settings:
当前配置 (slides):
输出方式:{inline / download / both}
语言偏好:{zh / en / 未设置}
默认主播:{speakerName / 使用内置默认}
Then ask:
outputMode: Follow shared/output-mode.md § Setup Flow Question.
Language (optional): "默认语言?"
nullAfter collecting answers, save immediately:
NEW_CONFIG=$(echo "$CONFIG" | jq --arg m "$OUTPUT_MODE" '. + {"outputMode": $m}')
echo "$NEW_CONFIG" > "$CONFIG_PATH"
CONFIG=$(cat "$CONFIG_PATH")
Free text input. Ask the user:
What would you like to create slides about?
Accept: topic description, text content, URL(s), or any combination.
If config.language is set, pre-fill and show in summary — skip this question.
Otherwise ask:
Question: "What language?"
Options:
- "Chinese (zh)" — Content in Mandarin Chinese
- "English (en)" — Content in English
- "Japanese (ja)" — Content in Japanese
Ask:
Question: "需要语音旁白吗?(默认否)"
Options:
- "不需要" — Slides only, no narration
- "需要" — Add voice narration to slides
Default is no narration. If the user says yes, proceed to Step 4. Otherwise skip to Step 5.
Skip this step entirely if narration is not enabled.
Follow shared/speaker-selection.md:
config.defaultSpeakers.{language} is set → use saved speaker silentlyshared/speaker-selection.md for the languageOnly 1 speaker is supported for slides narration.
Summarize all choices:
Without narration:
Ready to generate slides:
Topic: {topic}
Language: {language}
Narration: No
Proceed?
With narration:
Ready to generate slides:
Topic: {topic}
Language: {language}
Narration: Yes
Speaker: {speaker name}
Proceed?
Wait for explicit confirmation before running any CLI command.
Submit (background): Run the CLI command with run_in_background: true and timeout: 660000:
Without narration (default):
listenhub slides create \
--query "{topic}" \
--lang {en|zh|ja} \
--image-size 2K \
--aspect-ratio 16:9 \
--timeout 600 \
--json
With narration:
listenhub slides create \
--query "{topic}" \
--lang {en|zh|ja} \
--image-size 2K \
--aspect-ratio 16:9 \
--no-skip-audio \
--speaker "{name}" \
--timeout 600 \
--json
If the user provided a source URL, add --source-url "{url}".
The CLI handles polling internally and returns the final result when generation completes.
Tell the user the task is submitted and that they will be notified when it finishes.
When notified of completion, parse and present the result:
Parse the CLI JSON output for key fields:
EPISODE_ID=$(echo "$RESULT" | jq -r '.episodeId')
AUDIO_URL=$(echo "$RESULT" | jq -r '.audioUrl // empty')
CREDITS=$(echo "$RESULT" | jq -r '.credits // empty')
Read OUTPUT_MODE from config. Follow shared/output-mode.md for behavior.
Without narration:
inline or both: Present the online link.
幻灯片已生成!
在线查看:https://listenhub.ai/app/slides/{episodeId}
消耗积分:{credits}
download or both: Also save the script file. Generate a topic slug following shared/config-pattern.md § Artifact Naming.
{slug}-slides.md in cwd (dedup if exists)With narration:
inline or both: Display audio URL as a clickable link.
幻灯片已生成!
在线查看:https://listenhub.ai/app/slides/{episodeId}
音频链接:{audioUrl}
消耗积分:{credits}
download or both: Also save files. Generate a topic slug following shared/config-pattern.md § Artifact Naming.
{slug}-slides/ folder (dedup if exists)script.md insidecurl -sS -o "{slug}-slides/audio.mp3" "{audioUrl}"
已保存到当前目录:
{slug}-slides/
script.md
audio.mp3
Update config with the choices made this session:
NEW_CONFIG=$(echo "$CONFIG" | jq \
--arg lang "{language}" \
'. + {"language": $lang}')
echo "$NEW_CONFIG" > "$CONFIG_PATH"
If narration was used, also save the speaker:
NEW_CONFIG=$(echo "$CONFIG" | jq \
--arg lang "{language}" \
--arg speakerId "{speakerId}" \
'. + {"language": $lang, "defaultSpeakers": (.defaultSpeakers + {($lang): [$speakerId]})}')
echo "$NEW_CONFIG" > "$CONFIG_PATH"
Estimated times:
shared/cli-authentication.mdshared/cli-patterns.mdshared/cli-speakers.mdshared/speaker-selection.mdshared/config-pattern.mdshared/output-mode.mdUser: "帮我做一个关于量子计算的幻灯片"
Agent workflow:
listenhub slides create \
--query "量子计算" \
--lang zh \
--image-size 2K \
--aspect-ratio 16:9 \
--timeout 600 \
--json
Wait for CLI to return result, then present the online link.
User: "Create slides about React hooks with narration"
Agent workflow:
listenhub slides create \
--query "React hooks" \
--lang en \
--image-size 2K \
--aspect-ratio 16:9 \
--no-skip-audio \
--speaker "Mars" \
--timeout 600 \
--json
Wait for CLI to return result, then present the online link and audio link.
larksuite/cli
googleworkspace/cli
googleworkspace/cli
googleworkspace/cli