# PixelDojo API (Async Jobs) > Optimized for LLM consumption. Copy into your AI assistant. > This document is dynamically generated from live model data. ## Overview PixelDojo provides an async job-based API for AI image and video generation. **Base URL:** `https://pixeldojo.ai/api/v1` ### Core Endpoints - `POST /api/v1/models/{apiId}/run` — Submit a generation job - `GET /api/v1/jobs/{jobId}` — Poll job status and outputs - `GET /api/v1/models` — List available models (with credit costs) - `GET /api/v1/models/{apiId}` — Model details + input parameters ## Authentication All requests require an API key: ``` Authorization: Bearer YOUR_API_KEY ``` Get your API key: https://pixeldojo.ai/api-platform/api-keys ## Requirements - API key - Sufficient credits (per-image for image models, per-second for video models) ## Image Models (71 available) ### Change Camera Angle (`change-camera-angle`) Camera-aware editing via fal.ai Qwen Image Edit 2511 with multi-angle LoRA. 360° orbit, tilt, and zoom. - Cost: 1 credit(s) - Use cases: editing ### Consistent Characters (`consistent-characters`) Generate consistent character variations with FLUX Kontext, Nano Banana Pro/2, Flux 2 Dev, or Qwen Image 2 Pro. - Cost: 1 credit(s) - Use cases: character ### Creative Upscale (`creative-upscale`) Clarity Upscaler (creative upscale) via Replicate. Boost detail with stable-diffusion refinement. - Cost: 0.5 credit(s) - Features: LoRA - Use cases: upscale ### Dreamina 3.1 (`dreamina`) ByteDance Dreamina 3.1. 4MP cinematic text-to-image with precise style control. - Cost: 1 credit(s) ### Ernie (`ernie`) Baidu Ernie text-to-image (fal.ai). Multilingual prompts and built-in prompt expansion. - Cost: 1 credit(s) - Use cases: photorealism, marketing ### Face Enhance (`face-enhance`) Crystal Upscaler via Replicate. Face-detail preserving upscale, cost scales with output megapixels. - Cost: 2 credit(s) - Use cases: upscale, character ### FLUX (`flux`) FLUX family on Replicate. Schnell, Dev, Pro, Kontext, Ultra, and LoRA remix variants in one entrypoint. - Cost: 1 credit(s) - Features: LoRA - Use cases: photorealism, character, cinematic ### Flux 2 Flex (`flux-2-flex`) Max-quality with up to 10 reference images - Cost: 1.5 credit(s) (by resolution: 0.5 MP=1, 1 MP=1.5, 2 MP=3, 4 MP=5) - Features: LoRA, Editing - Use cases: photorealism, cinematic ### Flux 2 Klein 4B (`flux-2-klein-4b`) Very fast generation and editing with up to 5 reference images - Cost: 0.1 credit(s) - Features: LoRA, Editing - Use cases: photorealism, cinematic ### Flux 2 Klein 9B (`flux-2-klein-9b`) 4-step distilled FLUX.2 [klein] foundation model for flexible control - Cost: 0.5 credit(s) - Features: LoRA, Editing - Use cases: photorealism, cinematic ### Flux 2 Pro (`flux-2-pro`) High-quality with up to 8 reference images - Cost: 1.5 credit(s) (by resolution: 0.5 MP=1, 1 MP=1.5, 2 MP=2, 4 MP=2) - Features: LoRA, Editing - Use cases: photorealism, cinematic ### Flux 2 Max (`flux-2-max`) The highest fidelity image model from Black Forest Labs - Cost: 2 credit(s) (by resolution: 0.5 MP=1, 1 MP=2, 2 MP=3, 4 MP=4) - Features: LoRA, Editing - Use cases: photorealism, cinematic ### Flux 2 Dev (`flux-2-dev`) Fast quality with up to 4 reference images - Cost: 1 credit(s) - Features: LoRA, Editing - Use cases: photorealism, cinematic ### Flux 2 Dev + LoRA (`flux-2-lora`) Dev model with custom LoRA support - Cost: 1 credit(s) (by resolution: 0.5 MP=1, 1 MP=1, 2 MP=1, 4 MP=2) - Features: LoRA, Editing - Use cases: photorealism, cinematic ### Flux Edit (Kontext) (`flux-edit`) Black Forest Labs FLUX.1 Kontext for text-driven image editing. Dev (open-weight), Pro (state-of-the-art), and Max (premium typography). - Cost: 1 credit(s) - Use cases: editing, character ### Flux Dev (`flux-dev`) High-quality development model with configurable steps, guidance, and LoRA support. - Cost: 1 credit(s) - Features: LoRA, Editing - Use cases: photorealism, character ### Flux Krea Dev (`flux-krea-dev`) Photorealistic generation that avoids the oversaturated AI look. LoRA compatible. - Cost: 1 credit(s) - Features: LoRA, Editing - Use cases: photorealism, character ### Flux Dev Multi LoRA (`flux-dev-multi-lora`) Supports multiple custom LoRAs simultaneously for complex style combinations. - Cost: 1 credit(s) - Features: LoRA, Editing - Use cases: photorealism, character ### Flux 1.1 Pro (`flux-1.1-pro`) Latest pro model with enhanced quality and strong prompt adherence. - Cost: 1 credit(s) - Features: LoRA, Editing - Use cases: photorealism, character ### Flux 1.1 Pro Ultra (`flux-1.1-pro-ultra`) Highest quality Flux model with raw mode for natural-looking images. - Cost: 1.5 credit(s) - Features: LoRA, Editing - Use cases: photorealism, character ### Flux Kontext Pro (`flux-kontext-pro`) Advanced model with state-of-the-art performance for both generation and editing. - Cost: 1 credit(s) - Features: LoRA, Editing - Use cases: photorealism, character ### Flux Kontext Max (`flux-kontext-max`) Premium model with maximum performance and improved typography for generation and editing. - Cost: 2 credit(s) - Features: LoRA, Editing - Use cases: photorealism, character ### Google Gemini Flash (`gemini-flash`) Fast generation with Gemini 2.5 Flash - Cost: 1 credit(s) - Features: Editing - Use cases: photorealism, marketing ### Google Nano Banana Pro (`nano-banana-pro`) SOTA with accurate typography and reasoning - Cost: 3 credit(s) (by resolution: 1K=3, 2K=3, 4K=6) - Features: Editing - Use cases: photorealism, marketing ### Google Nano Banana 2 (`nano-banana-2`) Next-generation SOTA model with stronger consistency - Cost: 3 credit(s) (by resolution: 1K=2, 2K=3, 4K=4) - Features: Editing - Use cases: photorealism, marketing ### Nano Banana Edit (`google-nano-banana`) Google Nano Banana image editing. Multi-image fusion + edit instruction with Standard/Pro/Pro-fal tiers and 1K/2K/4K resolution. - Cost: 3 credit(s) - Use cases: photorealism, character, marketing ### GPT-Image 1.5 Low (`gpt-image-low`) Fast, lower detail generation - Cost: 1 credit(s) - Features: Editing - Use cases: marketing, editing ### GPT-Image 1.5 Medium (`gpt-image-medium`) Balanced quality and speed - Cost: 1 credit(s) - Features: Editing - Use cases: marketing, editing ### GPT-Image 1.5 High (`gpt-image-high`) Maximum detail and quality - Cost: 4 credit(s) - Features: Editing - Use cases: marketing, editing ### GPT-Image 1.5 Edit (`gpt-image-1-5-edit`) OpenAI GPT-Image 1.5 image editing — supply 1-8 reference images plus an edit instruction. Optional transparent background and high-fidelity input mode. - Cost: 4 credit(s) - Features: Editing ### GPT Image 2 (`gpt-image-2`) OpenAI GPT Image 2 via fal.ai — next-generation image model with 4K rendering and sharper text fidelity. - Cost: 5 credit(s) - Features: Editing - Use cases: editing, character, marketing ### GPT Image 2 Edit (`gpt-image-2-edit`) OpenAI GPT Image 2 image editing — supply 1-8 reference images plus an edit instruction. Optional mask for inpainting. 4K-capable; pricing varies by quality + size. - Cost: 5 credit(s) - Features: Editing - Use cases: editing ### HiDream Edit (`hidream-edit`) HiDream O1 image-conditioned editing. Provide a source image and an instruction. - Cost: 1 credit(s) - Features: Editing - Use cases: editing ### Hunyuan 3D (`hunyuan-3d`) Tencent Hunyuan 3D 3.1. Generate 3D meshes from a text prompt or a single image. - Cost: 4 credit(s) ### Ideogram Character (`ideogram-character`) Generate consistent characters from a single reference image in many styles. - Cost: 5 credit(s) - Use cases: character, text-rendering ### Character Stylist (`image-editor`) One-shot FLUX Kontext variants — filters, cartoonify, iconic locations, haircut swap, headshots, renaissance, face-to-many, and more. - Cost: 1 credit(s) ### Image Relighting (`image-relighting`) Relight images with Magic Lighting, Nano Banana Pro/2, or Qwen Image Edit — multi-provider routing with per-model credit rates. - Cost: 1 credit(s) - Use cases: editing ### Flux Image to Image (`image-to-image-flux`) FLUX Dev LoRA image-to-image on Replicate. Prompt + source image + optional LoRA weights. - Cost: 1 credit(s) ### ImagineArt (`imagineart`) ImagineArt family — 1.0 (Mixture-of-Experts photorealism), 1.5, 1.5 Pro, and the 2.0 preview. - Cost: 1.5 credit(s) ### Kling Image V3 (`kling-image`) Kling Image V3 (fal.ai). High-quality text-to-image with flexible aspect ratios. - Cost: 1 credit(s) - Features: Editing - Use cases: marketing, cinematic ### Kling Image Edit (`kling-image-edit`) Kling Image V3 (fal.ai) image-to-image editing with a text instruction. - Cost: 1 credit(s) - Use cases: editing ### Magnific Upscaler (`magnific-upscaler`) Freepik Magnific upscaler. Creative or precision mode, up to 16x. - Cost: 3 credit(s) - Use cases: upscale ### OpenAI Image 1 (`openai-image-1`) OpenAI GPT Image 1 Mini. Text-to-image via Replicate. - Cost: 1 credit(s) ### OpenAI Image 1 Edit (`openai-image-1-edit`) OpenAI GPT Image 1 Mini image editing — combine 1-8 reference images with a text edit instruction. Supports transparent or opaque backgrounds. - Cost: 1 credit(s) - Features: Editing ### Outpaint (`outpaint`) fal.ai Image Apps V2 outpainting. Expand an image beyond its original edges. - Cost: 1 credit(s) - Use cases: editing ### P-Image (`p-image`) Pruna P-Image. Sub-second text-to-image with optional custom dimensions. - Cost: 0.1 credit(s) - Features: Editing ### P-Image Edit (`p-image-edit`) Pruna P-Image Edit. Fast image editing with up to 5 reference images. - Cost: 0.25 credit(s) - Features: Editing ### Pony Realism (`ponyxl-ponyrealism-v23`) Pony Realism - Stylized anime generation - Cost: 1 credit(s) - Features: LoRA ### Pony NAI (`ponyxl-tponynai3-v7`) Pony NAI - Stylized anime generation - Cost: 1 credit(s) - Features: LoRA ### Wai ANI (`ponyxl-waianinsfwponyxl-v140`) Wai ANI - Stylized anime generation - Cost: 1 credit(s) - Features: LoRA ### QWEN Image Plus (`qwen-image-plus`) Fast generation with excellent quality - Cost: 1 credit(s) - Features: LoRA, Editing - Use cases: text-rendering, editing ### QWEN Image Max (`qwen-image-max`) Highest quality output - Cost: 2 credit(s) - Features: LoRA, Editing - Use cases: text-rendering, editing ### QWEN Image 2.0 (`qwen-image-2.0`) Fast, balanced image generation and editing - Cost: 1 credit(s) - Features: Editing - Use cases: text-rendering, editing ### QWEN Image 2.0 Pro (`qwen-image-2.0-pro`) Enhanced text rendering, realistic textures, and semantic adherence - Cost: 2 credit(s) - Features: Editing - Use cases: text-rendering, editing ### Qwen Image 2 Edit (`qwen-image-2-edit`) Alibaba DashScope Qwen Image 2 edit — supply 1-3 reference images plus an edit instruction. Standard and Pro variants. - Cost: 1 credit(s) - Features: Editing ### Qwen Image Edit (`qwen-image-edit`) Alibaba DashScope Qwen Image edit — supply 1-3 reference images plus an edit instruction. Plus and Max model variants. - Cost: 1 credit(s) - Features: Editing - Use cases: editing ### Qwen Image Edit Spicy (`qwen-image-edit-spicy`) Qwen Image Edit Spicy. Add, remove, or modify elements in an existing image with text guidance. - Cost: 1 credit(s) - Use cases: editing ### Flux Redux (`redux-flux`) Black Forest Labs Flux Redux image variations — feed a source image, get stylistic riffs. - Cost: 1 credit(s) ### Seedream 3 (`seedream-3`) ByteDance Seedream 3 text-to-image via Replicate. - Cost: 1 credit(s) - Use cases: photorealism, marketing ### Seedream 4.5 (`seedream-4`) ByteDance Seedream 4.5 — new-generation image creation with superior aesthetics, text rendering, and up to 4K resolution. - Cost: 1 credit(s) - Features: Editing - Use cases: photorealism, marketing ### Seedream 5 Lite (`seedream-5-lite`) ByteDance Seedream 5.0 Lite — fast, high-quality image generation and editing with strong aesthetics and text rendering. - Cost: 1 credit(s) - Features: Editing - Use cases: photorealism, marketing ### WAN 2.6 Image (`wan-2.6-image`) Alibaba WAN 2.6 text-to-image with prompt enhancement and multi-image output. - Cost: 1 credit(s) - Features: Editing - Use cases: marketing, cinematic ### WAN 2.6 Image Edit (`wan-2.6-image-edit`) Alibaba WAN 2.6 image editing. Up to 4 reference images. - Cost: 1 credit(s) - Use cases: editing ### WAN 2.7 Standard (`wan-2.7-image`) Faster Wan 2.7 image generation and editing - Cost: 1 credit(s) - Features: Editing - Use cases: marketing, cinematic ### WAN 2.7 Pro (`wan-2.7-image-pro`) Higher quality Wan 2.7 tier with 4K support for text-to-image - Cost: 2 credit(s) - Features: Editing - Use cases: marketing, cinematic ### WAN 2.7 Image Edit (`wan-2.7-image-edit`) Alibaba WAN 2.7 image editing. Standard and Pro tiers, supports 1-4 input images for fusion edits. - Cost: 1 credit(s) - Use cases: editing ### WAN 2.2 Image (`wan-image`) Fast cinematic image generation (3-6 seconds) with up to 4MP output and optional LoRA support. - Cost: 1 credit(s) - Features: LoRA ### Grok Imagine (`xai-image`) xAI Grok Imagine. Fast tier for quick iteration, Quality tier for higher fidelity at 1k or 2k. - Cost: 1 credit(s) - Features: Editing - Use cases: photorealism, marketing, text-rendering ### Grok Image Edit (`xai-image-edit`) xAI Grok image editing. Sync response (no polling). Provide an image URL and a text edit instruction. Optional quality tier for 1k/2k high-fidelity edits. - Cost: 1 credit(s) ### Z Image Spicy (`z-image-spicy`) Z Image Spicy text-to-image. Square / portrait / landscape compositions, 256–1536px on each side. - Cost: 1 credit(s) - Use cases: photorealism, artistic ### Z Image Turbo (`z-image-turbo`) Super-fast 6B parameter text-to-image with great text rendering and LoRA support. - Cost: 0.5 credit(s) - Features: LoRA ## Video Models (61 available) ### Grok Imagine R2V (`grok-r2v`) xAI Grok Imagine reference-to-video via Replicate. 1 to 7 reference images plus prompt for 1 to 10 second clips at 480p or 720p. - Cost: 10 credit(s) ### Grok Video Extend (`grok-video-extend`) xAI Grok Imagine video extension. Continue an existing MP4 with a prompt-directed extension (2 to 10 seconds). - Cost: 12 credit(s) ### Hailuo Standard (`hailuo-standard`) Premium quality text-to-video and image-to-video - Cost: 8 credit(s) (768p: 6s=8/10s=12; 1080p: 6s=12/10s=20) - Use cases: video, marketing ### Hailuo Fast (`hailuo-fast`) Fast image-to-video generation - Cost: 4 credit(s) (768p: 6s=4/10s=7; 1080p: 6s=7) - Use cases: video, marketing ### Happy Horse 1.0 Reference to Video (`happyhorse-1.0-r2v`) Alibaba Happy Horse 1.0 reference-to-video — multi-reference image input that preserves subject characters, driven by a text prompt. 720p / 1080p, 3-15 second clips. - Cost: 4 credits/sec (by resolution: 720p=4, 1080p=6 /sec) ### Happy Horse 1.0 Text-to-Video (`happyhorse-1.0-t2v`) Text-to-video with 720p/1080p output and 2-15 second durations - Cost: 4 credits/sec (by resolution: 720p=4, 1080p=6 /sec) ### Happy Horse 1.0 Image-to-Video (`happyhorse-1.0-i2v`) Image-to-video animation with 720p/1080p output and 2-15 second durations - Cost: 4 credits/sec (by resolution: 720p=4, 1080p=6 /sec) ### Happy Horse 1.0 Video Edit (`happyhorse-1.0-video-edit`) Alibaba Happy Horse 1.0 video edit — apply style transfer or local replacement to a source video using text prompts and optional reference images. 720p / 1080p, 3-15 second output. - Cost: 4 credits/sec (by resolution: 720p=4, 1080p=6 /sec) ### Heygen Avatar (`heygen-avatar`) Heygen Avatar 4 via fal.ai. Animate a portrait with prompt-driven speech or an audio track, with optional background and captions. - Cost: 2 credits/sec - Features: Audio - Use cases: video, character ### Kling Motion Control v3 Standard (`kling-motion-control`) Kling Video v3 Standard motion control endpoint - Cost: 3 credits/sec - Features: Audio ### Kling Motion Control v3 Pro (`kling-motion-control-pro`) Kling Video v3 Pro motion control endpoint - Cost: 4 credits/sec - Features: Audio ### Kling Reference to Video (`kling-reference-to-video`) Kling O3 reference-driven video generation. Image or video references, Standard or Pro tier. - Cost: 15 credit(s) - Use cases: video, character ### Kling 2.6 Pro (`kling-v2-6`) Kling Video v2.6 Pro (fal.ai). Text-to-video or image-to-video, 5 or 10 seconds, with audio generation. - Cost: 15 credit(s) - Use cases: video, cinematic ### Kling Video v3 Standard (Text) (`kling-video-v3-standard-text`) Standard text-to-video with native audio - Cost: 6 credits/sec - Features: Audio - Use cases: video, cinematic ### Kling Video v3 Standard (Image) (`kling-video-v3-standard-image`) Standard image-to-video with native audio - Cost: 6 credits/sec - Features: Audio - Use cases: video, cinematic ### Kling Video v3 Pro (Text) (`kling-video-v3-pro-text`) Pro text-to-video with cinematic quality and native audio - Cost: 8 credits/sec - Features: Audio - Use cases: video, cinematic ### Kling Video v3 Pro (Image) (`kling-video-v3-pro-image`) Pro image-to-video with cinematic quality and native audio - Cost: 8 credits/sec - Features: Audio - Use cases: video, cinematic ### Kling Video Edit (`kling-video-edit`) Kling O3 video-to-video edit. Standard or Pro, with optional reference images and audio preservation. - Cost: 40 credit(s) ### Lip Sync (`lip-sync`) Replicate sync/lipsync-2. Align mouth movements in a video to a separate audio track. - Cost: 5 credit(s) - Use cases: video, character ### LTX 2.3 Fast Text-to-Video (`ltx-2-fast-t2v`) Fast text-to-video generation (6-20s, 1080p-2160p). - Cost: 2 credits/sec (by resolution: 1080p=2, 1440p=3, 2160p=6 /sec) - Features: Audio ### LTX 2.3 Fast Image-to-Video (`ltx-2-fast-i2v`) Fast image-to-video generation (6-20s, 1080p-2160p). - Cost: 2 credits/sec (by resolution: 1080p=2, 1440p=3, 2160p=6 /sec) - Features: Audio ### LTX 2.3 Pro Text-to-Video (`ltx-2-pro-t2v`) Higher quality text-to-video generation (6-10s, 1080p-2160p). - Cost: 2 credits/sec (by resolution: 1080p=2, 1440p=4, 2160p=8 /sec) - Features: Audio ### LTX 2.3 Pro Image-to-Video (`ltx-2-pro-i2v`) Higher quality image-to-video generation (6-10s, 1080p-2160p). - Cost: 2 credits/sec (by resolution: 1080p=2, 1440p=4, 2160p=8 /sec) - Features: Audio ### LTX 2.3 Pro Extend Video (`ltx-2-pro-extend`) Extend an existing video clip from the start or end (1-20s, Pro tier only). - Cost: 2 credits/sec - Features: Audio ### OmniHuman 1.5 (`omnihuman`) ByteDance OmniHuman 1.5 via Replicate. Audio-driven talking-head video with lip sync. - Cost: 45 credit(s) - Features: Audio - Use cases: video, character ### P-Video (`p-video`) Pruna P-Video — video generation with text/image/audio conditioning, draft mode, and 720p/1080p outputs. - Cost: 0.5 credits/sec (by resolution: 720p=0.5, 1080p=1 /sec) - Features: Audio ### P Video Avatar (`p-video-avatar`) Pruna P Video Avatar — animate a portrait into a talking avatar from a script or an audio file. 30 voices, 10 languages, 720p / 1080p. - Cost: 1 credits/sec (by resolution: 720p=1, 1080p=2 /sec) - Features: Audio ### Pixverse v5.6 (`pixverse`) Pixverse v5.6 video generation via Replicate — text-to-video or image-to-video with optional audio, at 360p–1080p. - Cost: 7.5 credit(s) - Features: Audio ### Pixverse V6 (`pixverse-v6`) Pixverse V6 video generation via Runware. Text-to-video, image-to-video (start frame), or multi-clip (start + end frame). - Cost: 10 credit(s) ### Runway Gen-4.5 Video (`runway-gen4-video`) Runway Gen-4.5 video generation. Text-to-video or image-to-video, 5 or 10 seconds. - Cost: 15 credit(s) - Use cases: video, cinematic ### Runway (`runway-video`) Canonical version-agnostic Runway video API ID. - Cost: 15 credit(s) - Use cases: video, cinematic ### Runway Gen-4 (Legacy API ID) (`runway-gen4`) Legacy alias for clients pinned to runway-gen4; maps to the current Runway model. - Cost: 15 credit(s) - Use cases: video, cinematic ### Seedance 1 (`seedance-1.5`) ByteDance Seedance 1 video generation. Text-to-video or image-to-video with optional end frame. - Cost: 8 credit(s) - Features: Audio - Use cases: video, marketing ### Seedance 2 High (`seedance-2-high`) Higher-quality Seedance 2.0 video generation (supports 1080p) - Cost: 4 credits/sec (by resolution: 480p=3, 720p=4, 1080p=10 /sec) - Features: Audio - Use cases: video, marketing ### Seedance 2 Reference to Video (`seedance-2-reference`) Seedance 2.0 multimodal reference-to-video. Combine up to 9 images, 3 video clips, and 3 audio tracks to guide characters, motion, and sound. - Cost: 20 credit(s) ### Seedance 2 Video Edit (`seedance-video-edit`) Edit source videos with Seedance 2.0 using prompted changes, optional reference images, and 480p, 720p, or 1080p output. - Cost: 25 credit(s) ### Text to Music (`text-to-music`) ElevenLabs Music via Replicate. Generate music from a text prompt. - Cost: 2 credit(s) ### VEO 3.1 Fast (`veo-3.1-fast`) Faster generation at 3 credits per second - Cost: 3 credits/sec - Features: Audio - Use cases: video, cinematic ### VEO 3.1 Standard (`veo-3.1-standard`) Higher quality at 8 credits per second - Cost: 8 credits/sec - Features: Audio - Use cases: video, cinematic ### VEO 3.1 Lite (`veo-3.1-lite`) Runware-powered Lite variant at 1.5 credits/sec for 720p and 2 credits/sec for 1080p. No reference images, no audio generation, no 1:1 aspect ratio. - Cost: 1.5 credits/sec (by resolution: 720p=1.5, 1080p=2 /sec) - Features: Audio - Use cases: video, cinematic ### Video Autocaption (`video-autocaption`) TikTok-style auto-captioning via Replicate. - Cost: 5 credit(s) ### Video Reframe (`video-reframe`) Luma Reframe Video via Replicate. Change a video's aspect ratio intelligently. - Cost: 8 credit(s) ### Video to Sound (`video-to-sound`) ThinkSound via Replicate. Generate a sound effect track from a video. - Cost: 2 credit(s) ### Video Transform (`video-transform`) Runway Gen4 Aleph via Replicate. Transform the first 5 seconds of a video with a prompt. - Cost: 20 credit(s) ### Video Upscaler (`video-upscaler`) Topaz Labs Video Upscale via Replicate. Upscale video resolution and FPS. - Cost: 10 credit(s) ### WAN 2.2 Standard (`wan-2.2-standard`) Premium quality with enhanced detail - Cost: 3 credit(s) - Features: LoRA - Use cases: video, cinematic ### WAN 2.2 Plus (`wan-2.2-plus`) Official Alibaba model with 1080p support - Cost: 10 credit(s) - Features: LoRA - Use cases: video, cinematic ### WAN 2.2 Extended (`wan-2.2-extended`) fal.ai WAN 2.2 with up to 10-second videos and dual LoRA support - Cost: 1.2 credits/sec - Features: LoRA - Use cases: video, cinematic ### WAN 2.2 Animate (`wan-2.2-animate`) WAN 2.2 video animation. Drive a character image with a motion reference video. - Cost: 2 credit(s) - Use cases: video, character ### WAN 2.2 Spicy Image-to-Video (`wan-2.2-i2v-spicy`) Image-to-video with WAN 2.2 Spicy. Animate a starting image. 480p or 720p, 5s or 8s clips. - Cost: 15 credit(s) - Use cases: video, cinematic ### WAN 2.2 Replace (`wan-2.2-replace`) WAN 2.2 character replacement. Swap a character in a source video while preserving scene and motion. - Cost: 2 credit(s) ### WAN 2.6 Standard (`wan-2.6-standard`) Higher quality, 720p/1080p support - Cost: 2.5 credits/sec (by resolution: 720p=2.5, 1080p=3 /sec) - Features: Audio - Use cases: video, cinematic ### WAN 2.6 Flash (`wan-2.6-flash`) Fast and affordable image-to-video - Cost: 1 credits/sec (by resolution: 720p=1, 1080p=1.5 /sec) - Features: Audio - Use cases: video, cinematic ### WAN 2.7 Spicy Image-to-Video (`wan-2.7-i2v-spicy`) Image-to-video with WAN 2.7 Spicy. Animate a starting image with optional driving audio. 720p or 1080p, 2–15 second clips. - Cost: 20 credit(s) - Features: Audio - Use cases: video, cinematic ### WAN 2.7 Text-to-Video (`wan-2.7-t2v`) Text-to-video with audio sync, 720p/1080p output, and 2-15 second durations - Cost: 2.5 credits/sec (by resolution: 720p=2.5, 1080p=3 /sec) - Features: Audio - Use cases: video, cinematic ### WAN 2.7 Image-to-Video (`wan-2.7-i2v`) Image-to-video and video continuation with optional last-frame control and audio sync - Cost: 2.5 credits/sec (by resolution: 720p=2.5, 1080p=3 /sec) - Features: Audio - Use cases: video, cinematic ### WAN Reference to Video (`wan-reference-to-video`) Alibaba WAN reference-to-video. Up to 5 image/video references with multi-shot support. - Cost: 4 credit(s) - Use cases: video, character ### WAN Video Character Swap (`wan-video-character-swap`) Alibaba WAN character swap. Combine a character image with a reference video to produce a new clip. - Cost: 20 credit(s) ### WAN 2.7 Video Edit (`wan-video-edit`) Alibaba WAN 2.7 video editing. Modify an existing clip via prompt with optional reference images. - Cost: 6 credit(s) ### Grok Imagine Video (`xai-video`) xAI Grok Imagine video. Text-to-video or image-to-video, 1-15 seconds at 480p or 720p. - Cost: 10 credit(s) ### Grok Video Edit (`xai-video-edit`) xAI Grok Imagine Video edit. Transform short clips via Replicate. - Cost: 15 credit(s) > Full list with parameters: `GET /api/v1/models` ## Response Format ### Submit Response ```json { "jobId": "job_abc123", "status": "pending", "statusUrl": "https://pixeldojo.ai/api/v1/jobs/job_abc123", "creditCost": 1, "creditsRemaining": 99 } ``` ### Completed Response ```json { "jobId": "job_abc123", "status": "completed", "output": { "images": ["https://temp.pixeldojo.ai/...png"] }, "creditCost": 1, "expiresAt": "2025-01-23T12:00:00Z" } ``` ## Error Codes | Code | Status | Description | |------|--------|-------------| | `unauthorized` | 401 | Missing or invalid API key | | `invalid_json` | 400 | Invalid JSON in request body | | `validation_error` | 400 | Input validation failed | | `not_found` | 404 | Model or job not found | | `insufficient_credits` | 402 | Insufficient credits | | `credit_error` | 500 | Failed to deduct credits | | `submission_failed` | 500 | Failed to submit job | | `expired` | 410 | Job has expired | | `rate_limit_exceeded` | 429 | Rate limit exceeded | | `internal_error` | 500 | Internal server error | ## Rate Limits 60 requests per minute across all endpoints. ## Examples ### cURL ```bash # Submit a job curl -X POST "https://pixeldojo.ai/api/v1/models/flux-1.1-pro/run" \ -H "Authorization: Bearer YOUR_API_KEY" \ -H "Content-Type: application/json" \ -d '{"prompt": "A sunset", "aspect_ratio": "1:1", "webhook_url": "https://example.com/webhook"}' # Poll for results curl "https://pixeldojo.ai/api/v1/jobs/job_abc123" \ -H "Authorization: Bearer YOUR_API_KEY" # List recent jobs curl "https://pixeldojo.ai/api/v1/jobs?limit=10" \ -H "Authorization: Bearer YOUR_API_KEY" # Replay a terminal webhook curl -X POST "https://pixeldojo.ai/api/v1/jobs/job_abc123/webhook" \ -H "Authorization: Bearer YOUR_API_KEY" ``` ### Python ```python import requests import time API_KEY = "your_api_key" BASE_URL = "https://pixeldojo.ai/api/v1" model_schema = requests.get( f"{BASE_URL}/models/flux-1.1-pro/schema" ).json() submit_response = requests.post( f"{BASE_URL}/models/flux-1.1-pro/run", headers={ "Authorization": f"Bearer {API_KEY}", "Content-Type": "application/json", }, json={"prompt": "A sunset", "aspect_ratio": "1:1", "webhook_url": "https://example.com/webhook"}, ) job = submit_response.json() while True: status = requests.get( job["statusUrl"], headers={"Authorization": f"Bearer {API_KEY}"} ).json() if status["status"] in {"completed", "failed"}: print(status) break time.sleep(2) ``` ### JavaScript ```javascript const schema = await fetch("https://pixeldojo.ai/api/v1/models/flux-1.1-pro/schema"); const requestSchema = await schema.json(); const submit = await fetch("https://pixeldojo.ai/api/v1/models/flux-1.1-pro/run", { method: "POST", headers: { "Authorization": "Bearer YOUR_API_KEY", "Content-Type": "application/json" }, body: JSON.stringify({ prompt: "A sunset", aspect_ratio: "1:1", webhook_url: "https://example.com/webhook" }) }); const job = await submit.json(); let result; do { const status = await fetch(job.statusUrl, { headers: { "Authorization": "Bearer YOUR_API_KEY" } }); result = await status.json(); if (result.status === "pending" || result.status === "processing") { await new Promise((resolve) => setTimeout(resolve, 2000)); } } while (result.status === "pending" || result.status === "processing"); console.log(requestSchema.schema, result.assets, result.webhook); ``` ## Best Practices 1. Store API keys securely - never in client-side code 2. Poll job status with exponential backoff (start at 2s, max 30s) 3. Download outputs promptly (24h expiry) 4. Use seeds for reproducibility 5. Use webhook_url instead of polling for production workloads ## Links - API Platform: https://pixeldojo.ai/api-platform - API Keys: https://pixeldojo.ai/api-platform/api-keys - Documentation: https://pixeldojo.ai/api-platform/documentation - OpenAPI Spec: https://pixeldojo.ai/api/openapi - llms.txt: https://pixeldojo.ai/llms.txt