Video Generation

Create short video clips from text prompts, animate still images, or produce talking-head videos with lipsync. All results are saved to your Pixa workspace.

Text-to-video

CLI

Generate a video from a text description:

pixa run prompt "a slow pan across a sunlit kitchen counter with coffee steam rising" \
  --model kling-v2.6-pro \
  --aspect-ratio 16:9 --json

MCP

Use the generate_media tool with media_type: "video":

Generate a 5-second product reveal video of sneakers rotating on a white background.

Image-to-video

Attach a start frame and describe the motion you want:

pixa run prompt "slow zoom in with subtle background motion" \
  --attachment start-frame.png --json

The model uses your image as the opening frame and animates from there.

Talking video

Attach a face image and provide a script in the prompt to generate a talking-head video:

pixa run prompt "Hi everyone, welcome to our product launch. Today we are introducing something special." \
  --attachment face.png --json

The model animates the face to speak the provided text with natural lip movement and expression.

Lipsync

Attach both a face image and an audio file to sync lip movement to existing audio:

pixa run prompt "lipsync this face to the audio" \
  --attachment face.png \
  --attachment voiceover.mp3 --json

The --attachment flag is repeatable. For lipsync, provide the face image and audio file as separate attachments.

Video settings

CLI flags

Flag	Description	Example
`--model`	Video model ID	`kling-v2.6-pro`
`--aspect-ratio`	Output aspect ratio	`16:9`, `9:16`, `1:1`
`--output-format`	File format	`mp4`, `webm`
`--num-variations`	Number of variations	`1`-`4`

MCP parameters

The generate_media tool accepts: prompt, model, aspect_ratio, media_type ("video"), output_format, num_variations, and attachments.

Batch video generation

Generate multiple video variations:

pixa run prompt "product reveal animation" \
  --attachment product.png \
  --num-variations 3 --json

To produce the same video in different aspect ratios for different platforms, run separate commands:

# Landscape for YouTube
pixa run prompt "product demo video" \
  --attachment product.png --aspect-ratio 16:9 --json

# Portrait for TikTok / Reels
pixa run prompt "product demo video" \
  --attachment product.png --aspect-ratio 9:16 --json

Tips

For talking videos, write the script as natural speech. Avoid stage directions or formatting — just write what the person should say.

Start frame matters. For image-to-video, the quality and composition of your start frame directly affects the output.
Keep prompts focused. Describe one clear motion or scene rather than a complex sequence.
Use --json in agent workflows so the next step can parse the result from the output.
Async jobs. Video generation can take several minutes. Use pixa jobs follow <id> to poll, or in MCP use get_job_status with sync: true.

Video Generation (User Guide) — in-app video workflow
CLI Command Reference — full flag and command reference

Getting started

Tools & workflows

Reference

Text-to-video

CLI

MCP

Image-to-video

Talking video

Lipsync

Video settings

CLI flags

MCP parameters

Batch video generation

Tips

Getting started

Tools & workflows

Reference

​Text-to-video

​CLI

​MCP

​Image-to-video

​Talking video

​Lipsync

​Video settings

​CLI flags

​MCP parameters

​Batch video generation

​Tips

​Related

Text-to-video

CLI

MCP

Image-to-video

Talking video

Lipsync

Video settings

CLI flags

MCP parameters

Batch video generation

Tips

Related