CAT
/MCP
SkillsMCPMarketplacesDigestToolsAdvertise

This week in Claude

Every Monday: Claude Code, Agent SDK, MCP, and the Anthropic platform moves worth your time.

Skills by Category
Frontend DevelopmentBackend & APIsTesting & QASecurityDevOps & CI/CDGit & Pull RequestsDocumentationCode Review & QualityAI & Agent BuildingSkill Development
MCP Servers by Category
Sales & MarketingWeb & Browser AutomationDatabasesAI & LLM ToolsCloud & InfrastructureCommunication & MessagingDeveloper ToolsDesign & CreativeDocuments & KnowledgeSearch & Web Crawling
Marketplaces by Category
AI Agents & OrchestrationLLM IntegrationDevelopment ToolsFrontend & UIBackend & APIsDatabasesTesting & Code QualityDevOps & CloudSecurity & ComplianceGit & Version Control

Cross AI Tools

Discover Claude Code plugins, extensions, and tools. Automatically updated directory of Anthropic Claude AI marketplaces with development tools, productivity plugins, and integrations.

Resources

  • Browse Skills
  • Browse MCP Servers
  • Browse Marketplaces
  • Plugins Reference

Community

  • About
  • Tools
  • Feedback
  • Privacy Policy
  • Advertise

Built for the Claude Code community with Claude Code by @mertduzgun

Independent project, not affiliated with Anthropic

Supertone TTS

supertone-inc/supertone-mcp
2authSTDIOregistry active
Summary

Wraps the Supertone TTS API so you can synthesize speech, clone voices, and browse a multilingual catalog directly from Claude or any MCP client. Exposes text_to_speech with pitch shift and speed controls, predict_duration for cost estimation before synthesis, search_voice to filter by language/gender/age/use case, and clone_voice to create custom voices from local audio files. Supports Korean, English, Japanese, and 20+ other languages. Output modes let you save files to disk, return inline audio resources, or both. Includes full CRUD for managing cloned voices. Useful when you need programmatic TTS without leaving your LLM workflow or want to test voice samples before committing credits.

CodeRabbit
CodeRabbit
AI writes the code. CodeRabbit catches the slop.
Try For Free →
Keep your Mac awake
Keep your Mac awake
Keep your Mac awake while Claude Code and 40+ AI agents run. Sleeps when they're idle.
One time payment $9 →
Context.devContext.dev
Context.dev
Integrate web data into your AI product. One API to scrape website & brand data.
Get API Key Now →
Make your agent a DeFi expert
Make your agent a DeFi expert
Agent, run crypto. Access onchain data & trade routes via 1inch.
Install now →
Make money from your Skills
Make money from your Skills
On Capafy, your Skill runs online 24/7 as an agent product, and you get paid every time someone uses it.
Start earning →
AppSignal
AppSignal
Monitor with ease. Code with confidence.
Start Free Trial →
CodeRabbit
CodeRabbit
AI writes the code. CodeRabbit catches the slop.
Try For Free →
Keep your Mac awake
Keep your Mac awake
Keep your Mac awake while Claude Code and 40+ AI agents run. Sleeps when they're idle.
One time payment $9 →
Context.devContext.dev
Context.dev
Integrate web data into your AI product. One API to scrape website & brand data.
Get API Key Now →
Make your agent a DeFi expert
Make your agent a DeFi expert
Agent, run crypto. Access onchain data & trade routes via 1inch.
Install now →
Make money from your Skills
Make money from your Skills
On Capafy, your Skill runs online 24/7 as an agent product, and you get paid every time someone uses it.
Start earning →
AppSignal
AppSignal
Monitor with ease. Code with confidence.
Start Free Trial →

supertone-mcp

A composable MCP toolkit for the Supertone TTS API. Rather than a single "speak this text" command, it exposes Supertone's SDK as a set of building-block tools — synthesis, voice discovery, preview, duration/credit prediction, usage tracking, and full voice-cloning CRUD — that an LLM assembles to fulfill a request. Works in Claude Desktop, Cursor, or any MCP-compatible client.

supertone-inc/supertone-mcp MCP server

Covers Korean, English, Japanese, and 31 languages total. Speed (0.5x–2.0x), pitch shift (-24 to +24 semitones), emotion styles, per-call output mode, streaming, and model selection.

Features

Synthesis

  • text_to_speech — Convert text to audio. Per-call control of output_mode (files / resources / both), autoplay, streaming, model, plus include_phonemes / normalized_text. Long text is auto-chunked by the SDK.
  • predict_duration — Estimate audio length (and credit cost) without synthesizing.

Voice discovery (preset)

  • search_voice — Filter the catalog by language, gender, age, use_case, style, model, name, or description.
  • get_voice — Full detail for one voice.
  • preview_voice — Sample audio URLs for a voice (filterable by language/style/model).

Custom voice cloning

  • clone_voice — Create a cloned voice from a local WAV/MP3 (≤3MB).
  • search_custom_voice — List/filter cloned voices.
  • get_custom_voice — Full detail for one cloned voice.
  • edit_custom_voice — Update name and/or description.
  • delete_custom_voice — Permanently delete (irreversible).

Audio assembly

  • merge_audio_files — Concatenate two or more local audio files (mp3/wav) into one via a bundled ffmpeg. Supports plain concat, silence gaps between clips (gap_ms), or crossfade blending (crossfade_ms). Output format auto-detected (mixed → mp3) or forced via output_format. No system ffmpeg required.

Usage & credits

  • get_credit_balance — Remaining credits.
  • get_usage_history — Usage over a time window.
  • get_voice_usage — Usage for a specific voice.

Breaking changes & migration (0.2.0)

0.2.0 moves behavior control out of environment variables and into per-call tool parameters — so the LLM decides per request, not the server config.

Before (env var)After (per-call parameter)Note
SUPERTONE_MCP_OUTPUT_MODE=files|resources|bothtext_to_speech(output_mode=...)Default still files
SUPERTONE_MCP_AUTOPLAY=truetext_to_speech(autoplay=...)Default changed true → false (playback is now explicit)
(always streamed)text_to_speech(streaming=...)New, default false (one-shot). streaming=true requires model="sona_speech_1"

Other changes:

  • Default model changed sona_speech_1 → sona_speech_2_flash.
  • list_voices was removed (since the discovery release) and replaced by search_voice — call it with no arguments to reproduce the old "list everything" behavior.
  • No more hard 300-character limit — longer text is auto-chunked by the SDK (credit/latency scale with length).

If you previously set SUPERTONE_MCP_OUTPUT_MODE or SUPERTONE_MCP_AUTOPLAY, remove them from your client config and pass output_mode / autoplay per call instead. (The server prints a one-time stderr notice if it sees the removed vars.)

Installation

# Using uvx (recommended)
uvx supertone-mcp

# Using pip
pip install supertone-mcp

Configuration

Claude Desktop

Add to claude_desktop_config.json:

{
  "mcpServers": {
    "supertone-tts": {
      "command": "uvx",
      "args": ["supertone-mcp"],
      "env": {
        "SUPERTONE_API_KEY": "your-api-key-here"
      }
    }
  }
}

Cursor

Add to your Cursor MCP settings (same JSON shape as above).

Environment Variables

Only authentication and stable defaults are configured via the environment — all behavior is controlled per call.

VariableRequiredDefaultDescription
SUPERTONE_API_KEYYes—Your Supertone API key
SUPERTONE_MCP_VOICE_IDNopreset voice (Aiden, multilingual)Default voice_id for text_to_speech / predict_duration (override per call)
SUPERTONE_OUTPUT_DIRNo~/supertone-tts-output/Directory where audio files are saved (used by output_mode=files/both)

Removed in 0.2.0: SUPERTONE_MCP_OUTPUT_MODE and SUPERTONE_MCP_AUTOPLAY — see Migration.

Output modes (text_to_speech output_mode)

ModeReturnsUse when
files (default)Plain text with the saved file path + metadataYou want the file on disk
resourcesMCP AudioContent + TextContent (no file written)The client renders audio inline (e.g., Claude.ai chat)
bothFile on disk and AudioContent/TextContentYou want both — preview inline, keep the file

Usage Examples

The MCP client routes natural-language requests across these tools — the value of the toolkit is composition: the LLM chains several tools to satisfy one request.

Example 1 — Discover → preview → estimate cost → synthesize

"Find a calm Korean female voice, let me hear a sample, check the cost, then make this announcement as an mp3."

The LLM assembles:

search_voice(language="ko", gender="female", style="neutral")   # find candidates
  → preview_voice(voice_id)                                       # sample URLs to confirm the voice
  → predict_duration(text, voice_id) + get_credit_balance()       # gauge cost before spending
  → text_to_speech(text, voice_id, output_format="mp3",
                   output_mode="files")                           # synthesize

Example 2 — Clone my voice → use it right away

"Make a cloned voice from ~/recordings/sample.wav named MyVoice, then read this greeting with it and play it for me."

The LLM assembles:

clone_voice(name="MyVoice", audio_path="~/recordings/sample.wav")   # create the cloned voice
  → get_custom_voice(voice_id)                                       # confirm it was created
  → text_to_speech(text, voice_id=<cloned>, autoplay=true)           # synthesize, then play immediately

autoplay is a per-call parameter (default false), so playback happens only when explicitly requested.

Tool Parameters

text_to_speech

ParameterTypeRequiredDefaultDescription
textstringYes—Text to convert (long text is auto-chunked by the SDK)
voice_idstringNoenv or presetVoice identifier (browse via search_voice)
languagestringNokoLanguage code — one of 31 (ko, en, ja, …)
output_formatstringNomp3mp3 or wav
modelstringNosona_speech_2_flashsona_speech_1, sona_speech_2, sona_speech_2_flash, sona_speech_2t, sona_speech_3t, supertonic_api_1, supertonic_api_3
speedfloatNo1.00.5–2.0
pitch_shiftintNo0-24 to +24 semitones
stylestringNo—Emotion style (varies by voice)
output_modestringNofilesfiles, resources, or both (see Output modes)
autoplayboolNofalsePlay the audio locally after synthesis (macOS afplay)
streamingboolNofalseStream synthesis. Only supported by model="sona_speech_1"
include_phonemesboolNofalseReturn phoneme timing data alongside the audio
normalized_textstringNo—Pre-normalized text (only used by sona_speech_2 / sona_speech_2_flash)

predict_duration

Same core parameter schema as text_to_speech (long text auto-chunked). Returns "Predicted duration: 2.34s (credit usage is proportional to duration).".

search_voice

All parameters optional. With no filters → full catalog. With any filter → first response line is Filters applied: ....

ParameterTypeDescription
languagestringe.g., ko, en, ja
genderstringe.g., male, female
agestringe.g., young_adult, child
use_casestringe.g., narration, advertisement
stylestringe.g., neutral, happy
modelstringe.g., sona_speech_2_flash
namestringpartial match
descriptionstringpartial match

get_voice / preview_voice

ToolRequiredOptional
get_voicevoice_id—
preview_voicevoice_idlanguage, style, model (filter samples)

clone_voice

ParameterTypeRequiredDescription
namestringYesDisplay name (non-empty)
audio_pathstringYesLocal WAV or MP3 path (≤3MB). Supports ~ expansion
descriptionstringNoOptional note

Custom voice CRUD

ToolRequiredOptional
search_custom_voice—name, description (partial match)
get_custom_voicevoice_id—
edit_custom_voicevoice_idname, description (at least one required)
delete_custom_voicevoice_id— (IRREVERSIBLE)

Usage & credits

ToolRequiredOptional
get_credit_balance——
get_usage_history—— (reports a recent default window)
get_voice_usagevoice_id—

merge_audio_files

ParameterTypeRequiredDescription
input_pathsstring[]YesTwo or more local mp3/wav paths (~ expansion supported). A single file is returned as-is
gap_msintNoSilence (ms) inserted at each junction. Default 0. Mutually exclusive with crossfade_ms
crossfade_msintNoCrossfade blend (ms) at each junction. Default 0. Mutually exclusive with gap_ms
output_formatstringNoForce mp3 or wav. If omitted: all-same-ext → that ext; mixed → mp3

Development

# Clone and install
git clone https://github.com/supertone-inc/supertone-mcp.git
cd supertone-mcp
uv sync

# Run tests
uv run pytest -q

# Run with coverage
uv run pytest --cov=src --cov-report=term-missing

License

MIT

Featured
CodeRabbit
CodeRabbit
AI writes the code. CodeRabbit catches the slop.
Try For Free →
Keep your Mac awake
Keep your Mac awake
Keep your Mac awake while Claude Code and 40+ AI agents run. Sleeps when they're idle.
One time payment $9 →
Context.devContext.dev
Context.dev
Integrate web data into your AI product. One API to scrape website & brand data.
Get API Key Now →
Make your agent a DeFi expert
Make your agent a DeFi expert
Agent, run crypto. Access onchain data & trade routes via 1inch.
Install now →
Make money from your Skills
Make money from your Skills
On Capafy, your Skill runs online 24/7 as an agent product, and you get paid every time someone uses it.
Start earning →
AppSignal
AppSignal
Monitor with ease. Code with confidence.
Start Free Trial →

Configuration

SUPERTONE_API_KEY*secret

Supertone API key

SUPERTONE_MCP_VOICE_ID

Default voice_id for synthesis (falls back to a preset voice)

SUPERTONE_OUTPUT_DIR

Directory where audio files are saved (default ~/supertone-tts-output/)

Categories
Search & Web Crawling
Registryactive
Packagesupertone-mcp
TransportSTDIO
AuthRequired
UpdatedJun 5, 2026
View on GitHub

Related Search & Web Crawling MCP Servers

View all →
Google Search

com.mcparmory/google-search

Scrape Google search results with SERP data, ads, and knowledge panels
25
Brave Search

io.github.pipeworx-io/brave-search

Brave Search MCP — independent web index (no Google/Bing dependency)
Serper Search and Scrape

marcopesani/mcp-server-serper

Serper MCP Server supporting search and webpage scraping
154
Brave Search Mcp Server

brave/brave-search-mcp-server

Brave Search MCP Server: web results, images, videos, rich results, AI summaries, and more.
1.2k
Google Search Console

com.mcparmory/google-search-console

Query search analytics, manage sitemaps, and inspect site URLs and status
25
Google Search Console

acamolese/google-search-console-mcp

Google Search Console MCP server: SEO audits, performance queries, URL inspection, indexing checks.
3