AI Image and Video API Platform

Happy Horse 1.0 Text-to-Video

/models/happyhorse-1.0-t2v/run

Text-to-video with 720p/1080p output and 2-15 second durations

Happy Horse 1.0 Image-to-Video

/models/happyhorse-1.0-i2v/run

Image-to-video animation with 720p/1080p output and 2-15 second durations

Happy Horse 1.0 Video Edit

/models/happyhorse-1.0-video-edit/run

Alibaba Happy Horse 1.0 video edit — apply style transfer or local replacement to a source video using text prompts and optional reference images. 720p / 1080p, 3-15 second output.

Heygen Avatar

/models/heygen-avatar/run

Heygen Avatar 4 via fal.ai. Animate a portrait with prompt-driven speech or an audio track, with optional background and captions.

/models/kling-motion-control/run

Kling Motion Control v3 Standard

3 credits/sec

Kling Video v3 Standard motion control endpoint

Kling Motion Control v3 Pro

/models/kling-motion-control-pro/run

Kling Video v3 Pro motion control endpoint

Kling Reference to Video

/models/kling-reference-to-video/run

Kling O3 reference-driven video generation. Image or video references, Standard or Pro tier.

image-to-videovideo-extend

Kling 2.6 Pro

Kling Video v2.6 Pro (fal.ai). Text-to-video or image-to-video, 5 or 10 seconds, with audio generation.

/models/kling-v2-6/run

/models/kling-video-v3-standard-text/run

Kling Video v3 Standard (Text)

6 credits/sec

Standard text-to-video with native audio

/models/kling-video-v3-standard-image/run

Kling Video v3 Standard (Image)

6 credits/sec

Standard image-to-video with native audio

/models/kling-video-v3-pro-text/run

Kling Video v3 Pro (Text)

8 credits/sec

Pro text-to-video with cinematic quality and native audio

/models/kling-video-v3-pro-image/run

Kling Video v3 Pro (Image)

8 credits/sec

Pro image-to-video with cinematic quality and native audio

/models/kling-video-edit/run

Kling Video Edit

40 credits

Kling O3 video-to-video edit. Standard or Pro, with optional reference images and audio preservation.

Lip Sync

5 credits

Replicate sync/lipsync-2. Align mouth movements in a video to a separate audio track.

/models/lip-sync/run

audio-to-video

LTX 2.3 Fast Text-to-Video

/models/ltx-2-fast-t2v/run

Fast text-to-video generation (6-20s, 1080p-2160p).

LTX 2.3 Fast Image-to-Video

/models/ltx-2-fast-i2v/run

Fast image-to-video generation (6-20s, 1080p-2160p).

LTX 2.3 Pro Text-to-Video

/models/ltx-2-pro-t2v/run

Higher quality text-to-video generation (6-10s, 1080p-2160p).

LTX 2.3 Pro Image-to-Video

/models/ltx-2-pro-i2v/run

Higher quality image-to-video generation (6-10s, 1080p-2160p).

LTX 2.3 Pro Extend Video

/models/ltx-2-pro-extend/run

Extend an existing video clip from the start or end (1-20s, Pro tier only).

audio-to-videoimage-to-video

OmniHuman 1.5

45 credits

ByteDance OmniHuman 1.5 via Replicate. Audio-driven talking-head video with lip sync.

/models/omnihuman/run

text-to-videoimage-to-videoaudio-to-video

P-Video

0.5 credits/sec

Pruna P-Video — video generation with text/image/audio conditioning, draft mode, and 720p/1080p outputs.

/models/p-video/run

/models/p-video-avatar/run

P Video Avatar

1 credit/sec

Pruna P Video Avatar — animate a portrait into a talking avatar from a script or an audio file. 30 voices, 10 languages, 720p / 1080p.

Pixverse v5.6

7.5 credits

Pixverse v5.6 video generation via Replicate — text-to-video or image-to-video with optional audio, at 360p–1080p.

/models/pixverse/run

Pixverse V6

/models/pixverse-v6/run

Pixverse V6 video generation via Runware. Text-to-video, image-to-video (start frame), or multi-clip (start + end frame).

Runway Gen-4.5 Video

/models/runway-gen4-video/run

Runway Gen-4.5 video generation. Text-to-video or image-to-video, 5 or 10 seconds.

Runway

/models/runway-video/run

Canonical version-agnostic Runway video API ID.

Runway Gen-4 (Legacy API ID)

/models/runway-gen4/run

Legacy alias for clients pinned to runway-gen4; maps to the current Runway model.

/models/seedance-1.5/run

Seedance 1

8 credits

ByteDance Seedance 1 video generation. Text-to-video or image-to-video with optional end frame.

Seedance 2 High

/models/seedance-2-high/run

Higher-quality Seedance 2.0 video generation (supports 1080p)

/models/seedance-2-reference/run

Seedance 2 Reference to Video

20 credits

Seedance 2.0 multimodal reference-to-video. Combine up to 9 images, 3 video clips, and 3 audio tracks to guide characters, motion, and sound.

/models/seedance-video-edit/run

Seedance 2 Video Edit

25 credits

Edit source videos with Seedance 2.0 using prompted changes, optional reference images, and 480p, 720p, or 1080p output.

/models/veo-3.1-fast/run

VEO 3.1 Fast

3 credits/sec

Faster generation at 3 credits per second

/models/veo-3.1-standard/run

VEO 3.1 Standard

8 credits/sec

Higher quality at 8 credits per second

VEO 3.1 Lite

/models/veo-3.1-lite/run

Runware-powered Lite variant at 1.5 credits/sec for 720p and 2 credits/sec for 1080p. No reference images, no audio generation, no 1:1 aspect ratio.

/models/video-autocaption/run

Video Autocaption

5 credits

TikTok-style auto-captioning via Replicate.

/models/video-reframe/run

Video Reframe

8 credits

Luma Reframe Video via Replicate. Change a video's aspect ratio intelligently.

/models/video-to-sound/run

Video to Sound

2 credits

ThinkSound via Replicate. Generate a sound effect track from a video.

/models/video-transform/run

Video Transform

7 credits/sec

Runway Aleph 2.0 via Replicate. Transform up to 30 seconds of video with a prompt.

Video Upscaler

/models/video-upscaler/run

Topaz Labs Video Upscale via Replicate. Upscale video resolution and FPS.

/models/wan-2.2-standard/run

WAN 2.2 Standard

3 credits

Premium quality with enhanced detail

WAN 2.2 Plus

/models/wan-2.2-plus/run

Official Alibaba model with 1080p support

/models/wan-2.2-extended/run

WAN 2.2 Extended

1.2 credits/sec

fal.ai WAN 2.2 with up to 10-second videos and dual LoRA support

/models/wan-2.2-animate/run

WAN 2.2 Animate

2 credits

WAN 2.2 video animation. Drive a character image with a motion reference video.

WAN 2.2 Spicy Image-to-Video

/models/wan-2.2-i2v-spicy/run

Image-to-video with WAN 2.2 Spicy. Animate a starting image. 480p or 720p, 5s or 8s clips.

/models/wan-2.2-replace/run

WAN 2.2 Replace

2 credits

WAN 2.2 character replacement. Swap a character in a source video while preserving scene and motion.

/models/wan-2.6-standard/run

WAN 2.6 Standard

2.5 credits/sec

Higher quality, 720p/1080p support

/models/wan-2.6-flash/run

WAN 2.6 Flash

1 credit/sec

Fast and affordable image-to-video

/models/wan-2.7-i2v-spicy/run

WAN 2.7 Spicy Image-to-Video

20 credits

Image-to-video with WAN 2.7 Spicy. Animate a starting image with optional driving audio. 720p or 1080p, 2–15 second clips.

/models/wan-2.7-t2v/run

WAN 2.7 Text-to-Video

2.5 credits/sec

Text-to-video with audio sync, 720p/1080p output, and 2-15 second durations

/models/wan-2.7-i2v/run

WAN 2.7 Image-to-Video

2.5 credits/sec

Image-to-video and video continuation with optional last-frame control and audio sync

image-to-videovideo-extend

/models/wan-reference-to-video/run

WAN Reference to Video

4 credits

Alibaba WAN reference-to-video. Up to 5 image/video references with multi-shot support.

/models/wan-video-character-swap/run

WAN Video Character Swap

20 credits

Alibaba WAN character swap. Combine a character image with a reference video to produce a new clip.

/models/wan-video-edit/run

WAN 2.7 Video Edit

6 credits

Alibaba WAN 2.7 video editing. Modify an existing clip via prompt with optional reference images.

Grok Imagine Video

xAI Grok Imagine video. Text-to-video or image-to-video, 1-15 seconds at 480p or 720p. Image-to-video can use the Grok Imagine 1.5 backbone for natively-synchronized audio.

/models/xai-video/run

Grok Video Edit

/models/xai-video-edit/run

xAI Grok Imagine Video edit. Transform short clips via Replicate.