PixVerse
STDIO: Access PixVerse video generation models via MCP
A comprehensive tool that allows you to access PixVerse's latest video generation models via applications that support the Model Context Protocol (MCP), such as Claude or Cursor. Generate videos from text, animate images, create transitions, add lip sync, sound effects, and much more!
https://github.com/user-attachments/assets/08ce90b7-2591-4256-aff2-9cc51e156d00
PixVerse MCP is a powerful tool that enables you to access PixVerse's latest video generation models through applications that support the Model Context Protocol (MCP). This integration allows you to generate high-quality videos with advanced features including text-to-video, image-to-video, video extensions, transitions, lip sync, sound effects, and more.
The system consists of two main components:
Python:
UV/UVX:
curl -LsSf https://astral.sh/uv/install.sh | sh
powershell -ExecutionPolicy ByPass -c "irm https://astral.sh/uv/install.ps1 | iex"
{
  "mcpServers": {
    "PixVerse": {
      "command": "uvx",
      "args": ["pixverse-mcp"],
      "env": {
        "PIXVERSE_API_KEY": "your-api-key-here"
      }
    }
  }
}
"PIXVERSE_API_KEY": "xxxx"
Use natural language prompts via Claude or Cursor to generate videos.
Basic Example:
Generate a video of a sunset over the ocean. Golden sunlight reflects on the water as waves gently hit the shore.
Advanced Example with Parameters:
Generate a night cityscape video with the following parameters:
Content: Skyscraper lights twinkling under the night sky, with car lights forming streaks on the road
Aspect Ratio: 16:9
Quality: 540p
Duration: 5 seconds
Motion Mode: normal
Negative Prompts: blur, shaking, text
Supported Parameters:
Use detailed scene descriptions or shot lists to create more structured videos.
Scene Description Example:
Scene: A beach in the early morning.
The sun is rising, casting golden reflections on the sea.
Footprints stretch across the sand.
Gentle waves leave white foam as they retreat.
A small boat slowly sails across the calm sea in the distance.
Aspect Ratio: 16:9, Quality: 540p, Duration: 5 seconds.
Shot-by-Shot Example:
Generate a video based on this storyboard:
- Start: Top-down shot of a coffee cup with steam rising
- Close-up: Ripples and texture on the coffee surface
- Transition: Stirring creates a vortex
- End: An open book and glasses next to the cup
Format: 1:1 square, Quality: 540p, Motion: fast
Quickly generate videos of specific themes or styles without detailed descriptions.
Theme Example:
Generate a video with a futuristic technology theme, including neon lights and holographic projections.
Style Example:
Generate a watercolor-style video of blooming flowers with bright, dreamy colors.
Combine AI's creativity with video generation.
Style Transfer Example:
This is a photo of a cityscape. Reinterpret it with a retro style and provide a video prompt.
Story Prompt Example:
If this street photo is the opening scene of a movie, what happens next? Provide a short video concept.
Emotional Scene Example:
Look at this forest path photo and design a short video concept, either a micro-story or a scene with emotional progression.
Generate a sunset ocean video with golden sunlight reflecting on the water
Example with parameters:
Prompt: "A majestic eagle soaring over mountain peaks at sunrise"
Quality: 720p
Duration: 5s
Model: v5
Aspect Ratio: 16:9
Parameters: Quality (360p-1080p), Duration (5s/8s), Aspect Ratio (16:9/1:1/9:16), Model (v4.5/v5)
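A minimal Python sketch for checking the parameters listed above before sending a prompt. The class name `TextToVideoRequest` and its field names are hypothetical, not part of the PixVerse MCP server, and the intermediate quality steps (540p, 720p) are assumed from the examples in this document rather than taken from an official list.

```python
from dataclasses import dataclass

# Allowed values taken from the parameter summary above.
# The 540p and 720p steps are an assumption based on the examples.
ALLOWED_QUALITY = ("360p", "540p", "720p", "1080p")
ALLOWED_DURATION = (5, 8)  # seconds
ALLOWED_ASPECT_RATIO = ("16:9", "1:1", "9:16")
ALLOWED_MODEL = ("v4.5", "v5")


@dataclass(frozen=True)
class TextToVideoRequest:
    """Hypothetical container for text-to-video parameters."""
    prompt: str
    quality: str
    duration: int
    aspect_ratio: str
    model: str

    def validate(self) -> list:
        """Return a list of human-readable problems (empty if valid)."""
        errors = []
        if not self.prompt.strip():
            errors.append("prompt must not be empty")
        if self.quality not in ALLOWED_QUALITY:
            errors.append(f"quality must be one of {ALLOWED_QUALITY}")
        if self.duration not in ALLOWED_DURATION:
            errors.append(f"duration must be one of {ALLOWED_DURATION}")
        if self.aspect_ratio not in ALLOWED_ASPECT_RATIO:
            errors.append(f"aspect_ratio must be one of {ALLOWED_ASPECT_RATIO}")
        if self.model not in ALLOWED_MODEL:
            errors.append(f"model must be one of {ALLOWED_MODEL}")
        return errors
```

Checking values locally like this can catch a typo such as `4k` or `4:3` before a generation request is made.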
1. Upload image → Get img_id
2. Use img_id to generate an animated video
Example with parameters:
Prompt: "The character walks through a magical forest with glowing trees"
img_id: 12345
Quality: 720p
Duration: 5s
Model: v5
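The two-step flow above (upload first, then generate with the returned img_id) can be sketched in Python as follows. The callables `upload_image` and `generate_video` are hypothetical stand-ins for whatever your MCP client exposes. They are injected as arguments so the sketch makes no claim about the real tool names or signatures.

```python
from typing import Callable


def animate_image(
    image_path: str,
    prompt: str,
    upload_image: Callable[[str], int],      # hypothetical: returns an img_id
    generate_video: Callable[[dict], int],   # hypothetical: returns a video_id
    quality: str = "720p",
    duration: int = 5,
    model: str = "v5",
) -> int:
    """Upload an image, then request an animated video that uses it."""
    # Step 1: upload the image and keep the id that comes back.
    img_id = upload_image(image_path)

    # Step 2: pass that id along with the prompt and settings.
    request = {
        "prompt": prompt,
        "img_id": img_id,
        "quality": quality,
        "duration": duration,
        "model": model,
    }
    return generate_video(request)
```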
Use source_video_id to extend an existing video
Example with parameters:
Prompt: "The scene continues with the character discovering a hidden cave"
source_video_id: 67890
Duration: 5s
Quality: 720p
Model: v5
Upload two images to create a smooth morphing animation
Example with parameters:
Prompt: "Transform from sunny beach to stormy night sky"
first_frame_img: 11111
last_frame_img: 22222
Duration: 5s
Quality: 720p
Model: v5
Video: 
TTS: Choose speaker + input text
Audio: Upload audio file + video
Example with parameters:
# Method 1: Generated Video + TTS
source_video_id: 33333
lip_sync_tts_speaker_id: "speaker_001"
lip_sync_tts_content: "Welcome to our amazing video tutorial"
# Method 2: Generated Video + Custom Audio
source_video_id: 33333
audio_media_id: 44444
# Method 3: Uploaded Video + TTS
video_media_id: 55555  # Upload your video first
lip_sync_tts_speaker_id: "speaker_002"
lip_sync_tts_content: "This is a custom narration"
# Method 4: Uploaded Video + Custom Audio
video_media_id: 55555  # Upload your video first
audio_media_id: 44444  # Upload your audio first
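The four methods above pair one video source with one audio source. A small Python sketch of that pairing rule follows. The function `build_lip_sync_params` is hypothetical, and the "exactly one of each" rule is inferred from the four examples rather than documented behavior. The parameter names are the ones used in the examples.

```python
from typing import Optional


def build_lip_sync_params(
    *,
    source_video_id: Optional[int] = None,   # a previously generated video
    video_media_id: Optional[int] = None,    # an uploaded video
    audio_media_id: Optional[int] = None,    # an uploaded audio file
    lip_sync_tts_speaker_id: Optional[str] = None,
    lip_sync_tts_content: Optional[str] = None,
) -> dict:
    """Return a parameter dict for one of the four lip sync methods."""
    # Exactly one video source: generated or uploaded.
    if (source_video_id is None) == (video_media_id is None):
        raise ValueError("provide exactly one of source_video_id or video_media_id")

    has_tts = lip_sync_tts_speaker_id is not None or lip_sync_tts_content is not None
    has_audio = audio_media_id is not None
    # Exactly one audio source: TTS or an uploaded file.
    if has_tts == has_audio:
        raise ValueError("provide either the TTS fields or audio_media_id")
    if has_tts and (lip_sync_tts_speaker_id is None or not lip_sync_tts_content):
        raise ValueError("TTS needs both a speaker id and text content")

    candidates = {
        "source_video_id": source_video_id,
        "video_media_id": video_media_id,
        "audio_media_id": audio_media_id,
        "lip_sync_tts_speaker_id": lip_sync_tts_speaker_id,
        "lip_sync_tts_content": lip_sync_tts_content,
    }
    # Keep only the fields that were actually supplied.
    return {key: value for key, value in candidates.items() if value is not None}
```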
Describe effects: "Ocean waves, seagull calls, gentle wind"
Example with parameters:
# Method 1: Generated Video + Sound Effects
sound_effect_content: "Gentle ocean waves, seagull calls, soft wind"
source_video_id: 55555
original_sound_switch: true  # Keep original audio
# Method 2: Uploaded Video + Sound Effects
sound_effect_content: "Urban traffic, footsteps, city ambiance"
video_media_id: 66666  # Upload your video first
original_sound_switch: false  # Replace original audio
# Method 3: Replace Audio Completely
sound_effect_content: "Epic orchestral music, thunder, dramatic tension"
video_media_id: 77777  # Upload your video first
original_sound_switch: false  # Replace with new audio
Upload multiple images, use @ref_name references
Example: @person standing in front of @city with @drone flying overhead
Example with parameters:
Prompt: "@hero standing in front of @city with @drone flying overhead"
image_references: [
  {type: "subject", img_id: 66666, ref_name: "hero"},
  {type: "background", img_id: 77777, ref_name: "city"},
  {type: "subject", img_id: 88888, ref_name: "drone"}
]
Duration: 5s
Model: v4.5
Quality: 720p
Aspect Ratio: 16:9
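Because each @ref_name in the prompt has to match an entry in image_references, a quick consistency check can catch mismatches early. This Python sketch assumes reference names consist of letters, digits, and underscores. The helper `check_references` is hypothetical.

```python
import re

# Assumption: a reference is "@" followed by word characters.
REF_PATTERN = re.compile(r"@(\w+)")


def check_references(prompt: str, image_references: list) -> dict:
    """Compare @names used in the prompt with the declared ref_name values."""
    used = set(REF_PATTERN.findall(prompt))
    defined = {ref["ref_name"] for ref in image_references}
    return {
        "undefined": sorted(used - defined),  # used in prompt, never declared
        "unused": sorted(defined - used),     # declared, never used in prompt
    }


if __name__ == "__main__":
    refs = [
        {"type": "subject", "img_id": 66666, "ref_name": "hero"},
        {"type": "background", "img_id": 77777, "ref_name": "city"},
        {"type": "subject", "img_id": 88888, "ref_name": "drone"},
    ]
    prompt = "@hero standing in front of @city with @drone flying overhead"
    print(check_references(prompt, refs))
```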
Check video_id status every 6 seconds until completion
Example with parameters:
video_id: 99999
# Check every 6 seconds until status becomes "completed" or "failed"
# Typical generation time: 60-120 seconds
Status: pending → in_progress → completed/failed
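The polling loop described above can be sketched in Python as follows. The `get_status` callable is a hypothetical stand-in for the actual status lookup, and the 600-second timeout is an assumption added for safety. Only the 6-second interval and the status names come from this document.

```python
import time
from typing import Callable

TERMINAL_STATUSES = {"completed", "failed"}


def wait_for_video(
    video_id: int,
    get_status: Callable[[int], str],        # hypothetical status lookup
    interval: float = 6.0,                   # seconds between checks
    timeout: float = 600.0,                  # assumed upper bound, not documented
    sleep: Callable[[float], None] = time.sleep,
) -> str:
    """Poll until the video reaches "completed" or "failed"."""
    waited = 0.0
    while True:
        status = get_status(video_id)
        if status in TERMINAL_STATUSES:
            return status
        if waited >= timeout:
            raise TimeoutError(f"video {video_id} still {status!r} after {waited:.0f}s")
        sleep(interval)
        waited += interval
```

Passing `sleep` as a parameter keeps the loop testable without real delays.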
How do I get a PixVerse API key?
What should I do if the server doesn't respond?
Does MCP support image-to-video or keyframe features?
How do I obtain credits?
What video formats and sizes are supported?
Where can I find the generated video?
How long does video generation take?
What should I do if I encounter a spawn uvx ENOENT error?
For Mac/Linux:
sudo cp ./uvx /usr/local/bin
For Windows:
where uvx
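The `spawn uvx ENOENT` error means the client could not find the `uvx` executable when launching the server. A small Python sketch for locating it is shown below. Placing the resulting absolute path in the `command` field of the MCP configuration is a common workaround, offered here as an assumption rather than official guidance. The helper names are hypothetical.

```python
import shutil
from typing import Optional


def resolve_command(name: str = "uvx") -> Optional[str]:
    """Return the absolute path of an executable on PATH, or None."""
    return shutil.which(name)


def config_command(name: str = "uvx") -> str:
    """Prefer the absolute path; fall back to the bare name if not found."""
    return resolve_command(name) or name


if __name__ == "__main__":
    path = resolve_command("uvx")
    if path is None:
        print("uvx was not found on PATH; install uv first or check your PATH")
    else:
        print(f'Use "command": "{path}" in the MCP configuration')
```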
v2.0.0 (Latest)
v1.0.0