AI Image and Video API Platform
Compare models, inspect endpoints, and access docs in one place.
API Reference
Public docs
Endpoints, schemas, and examples.
OpenAPI 3.1
Typed integrations
SDK generation and machine-readable specs.
LLM Docs
Agent-ready docs
`llm.txt` and agent integration help.
API Keys
Authentication
Create keys for apps and agents.
Usage Dashboard
Requests and credits
Monitor volume, logs, and balance.
Buy Credits
Prepaid capacity
Top up for image and video usage.
Available Models
137 AI models for image and video generation behind one async control plane
Consistent Characters
Generate consistent character variations with FLUX Kontext, Nano Banana Pro/2, Flux 2 Dev, or Qwen Image 2 Pro.
/models/consistent-characters/runFace Enhance
Crystal Upscaler via Replicate. Face-detail preserving upscale, cost scales with output megapixels.
/models/face-enhance/runFLUX
FLUX family on Replicate. Schnell, Dev, Pro, Kontext, Ultra, and LoRA remix variants in one entrypoint.
/models/flux/runFlux Edit (Kontext)
Black Forest Labs FLUX.1 Kontext for text-driven image editing. Dev (open-weight), Pro (state-of-the-art), and Max (premium typography).
/models/flux-edit/runFlux Dev
High-quality development model with configurable steps, guidance, and LoRA support.
/models/flux-dev/runFlux Krea Dev
Photorealistic generation that avoids the oversaturated AI look. LoRA compatible.
/models/flux-krea-dev/runFlux Dev Multi LoRA
Supports multiple custom LoRAs simultaneously for complex style combinations.
/models/flux-dev-multi-lora/runFlux 1.1 Pro
Latest pro model with enhanced quality and strong prompt adherence.
/models/flux-1.1-pro/runFlux 1.1 Pro Ultra
Highest quality Flux model with raw mode for natural-looking images.
/models/flux-1.1-pro-ultra/runFlux Kontext Pro
Advanced model with state-of-the-art performance for both generation and editing.
/models/flux-kontext-pro/runFlux Kontext Max
Premium model with maximum performance and improved typography for generation and editing.
/models/flux-kontext-max/runNano Banana Edit
Google Nano Banana image editing. Multi-image fusion + edit instruction with Standard/Pro/Pro-fal tiers and 1K/2K/4K resolution.
/models/google-nano-banana/runGPT Image 2
OpenAI GPT Image 2 via fal.ai — next-generation image model with 4K rendering and sharper text fidelity.
/models/gpt-image-2/runHeygen Avatar
Heygen Avatar 4 via fal.ai. Animate a portrait with prompt-driven speech or an audio track, with optional background and captions.
/models/heygen-avatar/runIdeogram Character
Generate consistent characters from a single reference image in many styles.
/models/ideogram-character/runKling Reference to Video
Kling O3 reference-driven video generation. Image or video references, Standard or Pro tier.
/models/kling-reference-to-video/runLip Sync
Replicate sync/lipsync-2. Align mouth movements in a video to a separate audio track.
/models/lip-sync/runOmniHuman 1.5
ByteDance OmniHuman 1.5 via Replicate. Audio-driven talking-head video with lip sync.
/models/omnihuman/runWAN 2.2 Animate
WAN 2.2 video animation. Drive a character image with a motion reference video.
/models/wan-2.2-animate/runWAN Reference to Video
Alibaba WAN reference-to-video. Up to 5 image/video references with multi-shot support.
/models/wan-reference-to-video/runQuick start:
https://pixeldojo.ai/api/v1