Multi-provider media generation — images, video, audio, and transcription via a unified interface
npx -y @r16t/multimodal-mcp
{
  "OPENAI_API_KEY": "YOUR_SECRET_VALUE",
  "XAI_API_KEY": "YOUR_SECRET_VALUE",
  "GEMINI_API_KEY": "YOUR_SECRET_VALUE",
  "ELEVENLABS_API_KEY": "YOUR_SECRET_VALUE",
  "BFL_API_KEY": "YOUR_SECRET_VALUE",
  "MEDIA_OUTPUT_DIR": "YOUR_VALUE_HERE"
}
Add this server entry to the mcpServers object in your Claude Desktop config, then restart the app.
{
  "mcpServers": {
    "io-github-rsmdt-multimodal": {
      "command": "npx",
      "args": [
        "-y",
        "@r16t/multimodal-mcp"
      ],
      "env": {
        "OPENAI_API_KEY": "YOUR_SECRET_VALUE",
        "XAI_API_KEY": "YOUR_SECRET_VALUE",
        "GEMINI_API_KEY": "YOUR_SECRET_VALUE",
        "ELEVENLABS_API_KEY": "YOUR_SECRET_VALUE",
        "BFL_API_KEY": "YOUR_SECRET_VALUE",
        "MEDIA_OUTPUT_DIR": "YOUR_VALUE_HERE"
      }
    }
  }
}
macOS: ~/Library/Application Support/Claude/claude_desktop_config.json
Windows: %APPDATA%\Claude\claude_desktop_config.json
No remote HTTP endpoint is advertised. Use the package or stdio setup shown in Install.
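If the Claude Desktop config already lists other servers, the entry has to be merged into the existing mcpServers object rather than pasted over it. The following is a minimal sketch of that merge, not part of the package: merge_server_entry and update_config_file are hypothetical helper names, and the placeholder values are copied unchanged from the listing.

```python
import json
from pathlib import Path

# Server entry copied from the listing. The placeholder values are kept
# as-is and must be replaced with real keys before the config is used.
SERVER_NAME = "io-github-rsmdt-multimodal"
SERVER_ENTRY = {
    "command": "npx",
    "args": ["-y", "@r16t/multimodal-mcp"],
    "env": {
        "OPENAI_API_KEY": "YOUR_SECRET_VALUE",
        "XAI_API_KEY": "YOUR_SECRET_VALUE",
        "GEMINI_API_KEY": "YOUR_SECRET_VALUE",
        "ELEVENLABS_API_KEY": "YOUR_SECRET_VALUE",
        "BFL_API_KEY": "YOUR_SECRET_VALUE",
        "MEDIA_OUTPUT_DIR": "YOUR_VALUE_HERE",
    },
}


def merge_server_entry(config: dict, name: str, entry: dict) -> dict:
    """Return a copy of config with entry stored under mcpServers[name].

    Other servers and unrelated top-level keys are preserved, and the
    input dictionary is not modified.
    """
    merged = dict(config)
    servers = dict(merged.get("mcpServers", {}))
    servers[name] = entry
    merged["mcpServers"] = servers
    return merged


def update_config_file(path: Path) -> None:
    """Hypothetical helper: read the config, merge the entry, write it back."""
    config = json.loads(path.read_text(encoding="utf-8")) if path.exists() else {}
    merged = merge_server_entry(config, SERVER_NAME, SERVER_ENTRY)
    path.write_text(json.dumps(merged, indent=2) + "\n", encoding="utf-8")
```

Calling update_config_file with the path to claude_desktop_config.json for your platform would apply the merge. Back up the file first, and restart Claude Desktop afterwards.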
multimodal is an MCP server for multi-provider media generation: images, video, audio, and transcription via a unified interface. It supports STDIO transport.
Use the generated config in Install. This server runs with npx -y @r16t/multimodal-mcp; add any required environment variables before starting your client.
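The listing names six environment variables but does not say which are strictly required, so it can help to check what is set before launching a client. The sketch below only reports status. The helper names env_status and unset_vars are hypothetical, and treating an empty string as "not set" is an assumption.

```python
import os

# Variable names taken from the listing's Inputs column.
ENV_VARS = [
    "OPENAI_API_KEY",
    "XAI_API_KEY",
    "GEMINI_API_KEY",
    "ELEVENLABS_API_KEY",
    "BFL_API_KEY",
    "MEDIA_OUTPUT_DIR",
]


def env_status(environ) -> dict:
    """Map each listed variable to True if it has a non-empty value."""
    return {name: bool(environ.get(name)) for name in ENV_VARS}


def unset_vars(environ) -> list:
    """Return the listed variables that are missing or empty, in listing order."""
    return [name for name, is_set in env_status(environ).items() if not is_set]
```

For example, unset_vars(os.environ) returns the names that still need values in the current shell. Whether the server can run with only a subset of provider keys is not stated in the listing.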
Choose the Claude Desktop tab in Install and copy the config for npx -y @r16t/multimodal-mcp. Add required environment variables before starting Claude Desktop.
Choose the Claude Code tab in Install and copy the config for npx -y @r16t/multimodal-mcp. Add required environment variables before starting Claude Code.
Choose the Codex tab in Install and copy the config for npx -y @r16t/multimodal-mcp. Add required environment variables before starting Codex.
Choose the Cursor or VS Code tab in Install and copy the config for npx -y @r16t/multimodal-mcp. Add required environment variables before starting Cursor or VS Code.
multimodal uses STDIO transport. Use the package or command config in Install.
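With STDIO transport, the client starts the server as a child process and exchanges JSON-RPC 2.0 messages over its stdin and stdout, one JSON object per line. The sketch below shows the message framing and an initialize request. It is an illustration under assumptions, not this server's documented behavior: the protocol version string and the clientInfo values are placeholders, and the helper names are hypothetical.

```python
import json

# Launch command from the listing; a client would run this as a subprocess.
SERVER_COMMAND = ["npx", "-y", "@r16t/multimodal-mcp"]


def build_initialize_request(request_id: int = 1) -> dict:
    """Build the first request a client sends after starting the server."""
    return {
        "jsonrpc": "2.0",
        "id": request_id,
        "method": "initialize",
        "params": {
            # Assumption: the version this server negotiates is not stated
            # in the listing; this is one published MCP revision.
            "protocolVersion": "2024-11-05",
            "capabilities": {},
            # Placeholder client identity, for illustration only.
            "clientInfo": {"name": "example-client", "version": "0.0.1"},
        },
    }


def encode_message(message: dict) -> bytes:
    """Serialize one message as a single newline-terminated line of UTF-8 JSON."""
    return (json.dumps(message, separators=(",", ":")) + "\n").encode("utf-8")


def decode_message(line: bytes) -> dict:
    """Parse one line read from the server's stdout."""
    return json.loads(line.decode("utf-8"))
```

A client would start SERVER_COMMAND with piped stdin and stdout (for instance with subprocess.Popen), write encode_message(build_initialize_request()) to stdin, and read one line from stdout as the response. In practice an MCP client such as Claude Desktop handles this exchange for you.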
The multimodal inventory is listed only when the MCP endpoint exposes tools, resources, or prompts. Some servers require authentication before they expose them.
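The inventory mentioned here corresponds to three MCP list requests that a client can send once the initialize handshake has completed. The sketch below builds those requests and counts the items in whatever responses come back. The helper names are hypothetical, and which of the three methods this server actually implements is not stated in the listing.

```python
# MCP methods that enumerate what a server exposes.
DISCOVERY_METHODS = ["tools/list", "resources/list", "prompts/list"]


def build_discovery_requests(first_id: int = 2) -> list:
    """Build one JSON-RPC request per discovery method, with sequential ids."""
    return [
        {"jsonrpc": "2.0", "id": first_id + offset, "method": method}
        for offset, method in enumerate(DISCOVERY_METHODS)
    ]


def summarize_inventory(responses: list) -> dict:
    """Count the items in each successful list response.

    Error responses (for example, a method the server does not implement)
    carry no result and are skipped.
    """
    counts = {}
    for response in responses:
        result = response.get("result", {})
        for key in ("tools", "resources", "prompts"):
            if key in result:
                counts[key] = len(result[key])
    return counts
```

If a category is absent from the summary, the server either does not support that method or declined the request, which matches the caveat above about servers that need auth first.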
multimodal does not advertise a verified auth requirement. If discovery fails, it may still need provider login, an API key, a bearer token, or a session header.
| Package | Registry | Version | Inputs |
|---|---|---|---|
| @r16t/multimodal-mcp (stdio) | npm | 1.3.1 | Env: OPENAI_API_KEY (secret), XAI_API_KEY (secret), GEMINI_API_KEY (secret), ELEVENLABS_API_KEY (secret), BFL_API_KEY (secret), MEDIA_OUTPUT_DIR |