AI Image and Video API Platform
Compare models, inspect endpoints, and access docs in one place.
API Reference
Public docs
Endpoints, schemas, and examples.
OpenAPI 3.1
Typed integrations
SDK generation and machine-readable specs.
LLM Docs
Agent-ready docs
`llm.txt` and agent integration help.
API Keys
Authentication
Create keys for apps and agents.
Usage Dashboard
Requests and credits
Monitor volume, logs, and balance.
Buy Credits
Prepaid capacity
Top up for image and video usage.
Available Models
135 AI models for image and video generation behind one async control plane
Boogu Image
Boogu Image — bilingual (EN/ZH) text-to-image generation with crisp detail and 2K output.
/models/boogu-image/runBoogu Image Edit
Boogu Image instruction-based editing. Provide a source image and an edit instruction.
/models/boogu-image-edit/runBria 3.2
Bria 3.2 — text-to-image with 9 aspect ratio presets at 1K resolution, optional image and prompt enhancement, and photography/art medium hints.
/models/bria-3-2/runChange Camera Angle
Camera-aware editing via fal.ai Qwen Image Edit 2511 with multi-angle LoRA. 360° orbit, tilt, and zoom.
/models/change-camera-angle/runConsistent Characters
Generate consistent character variations with FLUX Kontext, Nano Banana Pro/2, Flux 2 Dev, or Qwen Image 2 Pro.
/models/consistent-characters/runCreative Upscale
Clarity Upscaler (creative upscale) via Replicate. Boost detail with stable-diffusion refinement.
/models/creative-upscale/runErnie
Baidu Ernie text-to-image (fal.ai). Multilingual prompts and built-in prompt expansion.
/models/ernie/runFace Enhance
Crystal Upscaler via Replicate. Face-detail preserving upscale, cost scales with output megapixels.
/models/face-enhance/runFLUX
FLUX family on Replicate. Schnell, Dev, Pro, Kontext, Ultra, and LoRA remix variants in one entrypoint.
/models/flux/runFlux 2 Flex
Max-quality with up to 10 reference images
/models/flux-2-flex/runFlux 2 Klein 4B
Very fast generation and editing with up to 5 reference images
/models/flux-2-klein-4b/runFlux 2 Klein 9B
4-step distilled FLUX.2 [klein] foundation model for flexible control
/models/flux-2-klein-9b/runFlux 2 Pro
High-quality with up to 8 reference images
/models/flux-2-pro/runFlux 2 Max
The highest fidelity image model from Black Forest Labs
/models/flux-2-max/runFlux 2 Dev
Fast quality with up to 4 reference images
/models/flux-2-dev/runFlux 2 Dev + LoRA
Dev model with custom LoRA support
/models/flux-2-lora/runFlux Edit (Kontext)
Black Forest Labs FLUX.1 Kontext for text-driven image editing. Dev (open-weight), Pro (state-of-the-art), and Max (premium typography).
/models/flux-edit/runFlux Dev
High-quality development model with configurable steps, guidance, and LoRA support.
/models/flux-dev/runFlux Krea Dev
Photorealistic generation that avoids the oversaturated AI look. LoRA compatible.
/models/flux-krea-dev/runFlux Dev Multi LoRA
Supports multiple custom LoRAs simultaneously for complex style combinations.
/models/flux-dev-multi-lora/runFlux 1.1 Pro
Latest pro model with enhanced quality and strong prompt adherence.
/models/flux-1.1-pro/runFlux 1.1 Pro Ultra
Highest quality Flux model with raw mode for natural-looking images.
/models/flux-1.1-pro-ultra/runFlux Kontext Pro
Advanced model with state-of-the-art performance for both generation and editing.
/models/flux-kontext-pro/runFlux Kontext Max
Premium model with maximum performance and improved typography for generation and editing.
/models/flux-kontext-max/runGoogle Gemini Flash
Fast generation with Gemini 2.5 Flash
/models/gemini-flash/runGoogle Nano Banana Pro
SOTA with accurate typography and reasoning
/models/nano-banana-pro/runGoogle Nano Banana 2
Next-generation SOTA model with stronger consistency
/models/nano-banana-2/runNano Banana Edit
Google Nano Banana image editing. Multi-image fusion + edit instruction with Standard/Pro/Pro-fal tiers and 1K/2K/4K resolution.
/models/google-nano-banana/runGPT-Image 1.5 Low
Fast, lower detail generation
/models/gpt-image-low/runGPT-Image 1.5 Medium
Balanced quality and speed
/models/gpt-image-medium/runGPT-Image 1.5 High
Maximum detail and quality
/models/gpt-image-high/runGPT-Image 1.5 Edit
OpenAI GPT-Image 1.5 image editing — supply 1-8 reference images plus an edit instruction. Optional transparent background and high-fidelity input mode.
/models/gpt-image-1-5-edit/runGPT Image 2
OpenAI GPT Image 2 via fal.ai — next-generation image model with 4K rendering and sharper text fidelity.
/models/gpt-image-2/runGPT Image 2 Edit
OpenAI GPT Image 2 image editing — supply 1-8 reference images plus an edit instruction. Optional mask for inpainting. 4K-capable; pricing varies by quality + size.
/models/gpt-image-2-edit/runHiDream Edit
HiDream O1 image-conditioned editing. Provide a source image and an instruction.
/models/hidream-edit/runIdeogram 4 Turbo
Fastest and cheapest Ideogram 4.0. Same stunning realism, creative designs, and text rendering — tuned for speed and iteration.
/models/ideogram-v4-turbo/runIdeogram 4 Balanced
The sweet spot. Balances speed, quality, and cost — a great default for most graphic design, marketing, and poster work.
/models/ideogram-v4-balanced/runIdeogram 4 Quality
The highest-quality Ideogram 4.0. Slowest but best for hero images, print-ready work, and detailed text-heavy layouts.
/models/ideogram-v4-quality/runIdeogram Character
Generate consistent characters from a single reference image in many styles.
/models/ideogram-character/runCharacter Stylist
One-shot FLUX Kontext variants — filters, cartoonify, iconic locations, haircut swap, headshots, renaissance, face-to-many, and more.
/models/image-editor/runImage Relighting
Relight images with Magic Lighting, Nano Banana Pro/2, or Qwen Image Edit — multi-provider routing with per-model credit rates.
/models/image-relighting/runFlux Image to Image
FLUX Dev LoRA image-to-image on Replicate. Prompt + source image + optional LoRA weights.
/models/image-to-image-flux/runImagineArt
ImagineArt family — 1.0 (Mixture-of-Experts photorealism), 1.5, 1.5 Pro, and the 2.0 preview.
/models/imagineart/runKling Image V3
Kling Image V3 (fal.ai). High-quality text-to-image with flexible aspect ratios.
/models/kling-image/runKling Image Edit
Kling Image V3 (fal.ai) image-to-image editing with a text instruction.
/models/kling-image-edit/runMagnific Upscaler
Freepik Magnific upscaler. Creative or precision mode, up to 16x.
/models/magnific-upscaler/runOutpaint
fal.ai Image Apps V2 outpainting. Expand an image beyond its original edges.
/models/outpaint/runP-Image
Pruna P-Image. Sub-second text-to-image with optional custom dimensions.
/models/p-image/runP-Image Edit
Pruna P-Image Edit. Fast image editing with up to 5 reference images.
/models/p-image-edit/runPony Realism
Pony Realism - Stylized anime generation
/models/ponyxl-ponyrealism-v23/runPony NAI
Pony NAI - Stylized anime generation
/models/ponyxl-tponynai3-v7/runWai ANI
Wai ANI - Stylized anime generation
/models/ponyxl-waianinsfwponyxl-v140/runQWEN Image Plus
Fast generation with excellent quality
/models/qwen-image-plus/runQWEN Image Max
Highest quality output
/models/qwen-image-max/runQWEN Image 2.0
Fast, balanced image generation and editing
/models/qwen-image-2.0/runQWEN Image 2.0 Pro
Enhanced text rendering, realistic textures, and semantic adherence
/models/qwen-image-2.0-pro/runQwen Image 2 Edit
Alibaba DashScope Qwen Image 2 edit — supply 1-3 reference images plus an edit instruction. Standard and Pro variants.
/models/qwen-image-2-edit/runQwen Image Edit
Alibaba DashScope Qwen Image edit — supply 1-3 reference images plus an edit instruction. Plus and Max model variants.
/models/qwen-image-edit/runQwen Image Edit Spicy
Qwen Image Edit Spicy. Add, remove, or modify elements in an existing image with text guidance.
/models/qwen-image-edit-spicy/runFlux Redux
Black Forest Labs Flux Redux image variations — feed a source image, get stylistic riffs.
/models/redux-flux/runSeedream 4.5
ByteDance Seedream 4.5 — new-generation image creation with superior aesthetics, text rendering, and up to 4K resolution.
/models/seedream-4/runSeedream 5 Lite
ByteDance Seedream 5.0 Lite — fast, high-quality image generation and editing with strong aesthetics and text rendering.
/models/seedream-5-lite/runWAN 2.6 Image
Alibaba WAN 2.6 text-to-image with prompt enhancement and multi-image output.
/models/wan-2.6-image/runWAN 2.6 Image Edit
Alibaba WAN 2.6 image editing. Up to 4 reference images.
/models/wan-2.6-image-edit/runWAN 2.7 Standard
Faster Wan 2.7 image generation and editing
/models/wan-2.7-image/runWAN 2.7 Pro
Higher quality Wan 2.7 tier with 4K support for text-to-image
/models/wan-2.7-image-pro/runWAN 2.7 Image Edit
Alibaba WAN 2.7 image editing. Standard and Pro tiers, supports up to 9 input images for fusion edits.
/models/wan-2.7-image-edit/runWAN 2.2 Image
Fast cinematic image generation (3-6 seconds) with up to 4MP output and optional LoRA support.
/models/wan-image/runGrok Imagine
xAI Grok Imagine. Fast tier for quick iteration, Quality tier for higher fidelity at 1k or 2k.
/models/xai-image/runGrok Image Edit
xAI Grok image editing. Sync response (no polling). Provide an image URL and a text edit instruction. Optional quality tier for 1k/2k high-fidelity edits.
/models/xai-image-edit/runZ Image Spicy
Z Image Spicy text-to-image. Square / portrait / landscape compositions, 256–1536px on each side.
/models/z-image-spicy/runZ Image Turbo
Super-fast 6B parameter text-to-image with great text rendering and LoRA support.
/models/z-image-turbo/runQuick start:
https://pixeldojo.ai/api/v1