Image & Video tools Claude Code can actually call.
Install once. Use any MCP client or hit REST. 133+ image and video models, async by default.
# 1. Install in your Claude Code / Cursor / OpenClaw project
npx @pixeldojo/mcp init
# 2. Set your API key
export PIXELDOJO_API_KEY=pd_your_api_key
# 3. Restart your agent. It now has these tools:
#   pixeldojo:campaign         URL or product -> hero + lifestyle + video
#   pixeldojo:campaign_status  Poll a campaign by id
#   pixeldojo:from_url         Paste a URL, get product profile
#   pixeldojo:generate         Any prompt -> image or video (with preset)
#   pixeldojo:edit             Edit an image with a text instruction
#   pixeldojo:character        Consistent characters across shots
#   pixeldojo:storyboard       Multi-shot scenes from one brief
#   pixeldojo:upscale          Enhance any image
#   pixeldojo:status           Poll a long-running job
# Get your key: https://pixeldojo.ai/api-platform/api-keys
Quick Start
Install
Claude Code · Cursor · Codex
One command. Restart your editor. The PixelDojo tools appear.
npx @pixeldojo/mcp init
Then set PIXELDOJO_API_KEY in your environment.
Claude Desktop
Edit one JSON file, restart the app.
{
"mcpServers": {
"pixeldojo": {
"command": "npx",
"args": ["-y", "@pixeldojo/mcp"],
"env": { "PIXELDOJO_API_KEY": "pd_..." }
}
}
}
File: ~/Library/Application Support/Claude/claude_desktop_config.json
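If the config file already lists other MCP servers, the entry has to be merged in rather than pasted over them. A minimal Python sketch of that merge, using only the JSON shown above; the helper names `add_pixeldojo_server` and `update_config_file` are illustrative and not part of PixelDojo or Claude Desktop:

```python
import json
from pathlib import Path

# macOS location of the Claude Desktop config, as given above.
CONFIG_PATH = Path.home() / "Library/Application Support/Claude/claude_desktop_config.json"


def add_pixeldojo_server(config: dict, api_key: str) -> dict:
    """Return a copy of `config` with the pixeldojo entry added under mcpServers.

    Existing servers and unrelated top-level settings are kept as they are.
    """
    merged = dict(config)
    servers = dict(merged.get("mcpServers", {}))
    servers["pixeldojo"] = {
        "command": "npx",
        "args": ["-y", "@pixeldojo/mcp"],
        "env": {"PIXELDOJO_API_KEY": api_key},
    }
    merged["mcpServers"] = servers
    return merged


def update_config_file(path: Path, api_key: str) -> None:
    """Read the config (or start from an empty one), merge the entry, write it back."""
    existing = json.loads(path.read_text()) if path.exists() else {}
    path.write_text(json.dumps(add_pixeldojo_server(existing, api_key), indent=2))
```

Calling `update_config_file(CONFIG_PATH, "pd_...")` with your real key and then restarting the app should have the same effect as editing the file by hand.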
Cowork
Drag-and-drop plugin. No JSON to edit.
1. Open the archive and paste your PIXELDOJO_API_KEY.
2. Drag the file into Cowork.
3. The PixelDojo tools appear.
Named Skills
One install, four skills. Your LLM picks the right one per task.
pixeldojo:generate
Any prompt.
Best model, automatically.
Your agent describes what it needs in plain English. PixelDojo routes to the right model (photorealism, text rendering, video) and hands back a URL.
- ✓ 130+ models, one skill to call
- ✓ Images, video, editing. Same call shape
- ✓ Credits deducted only on success
>_ Generate a cinematic portrait, Tokyo rain, neon reflections
PixelDojo
Routing to flux-1.1-pro...
Job queued: job_k9mXpQ2r
✓ output: https://pixeldojo.ai/r/…/portrait.png
✓ 1024×1024 PNG · 1 credit
>_ Alex presenting a new phone, marble desk, soft studio light
PixelDojo
Loading ref: alex_character.png...
Routing to flux-kontext...
Job queued: job_3vNaL8wK
✓ output: https://pixeldojo.ai/r/…/alex-desk.png
✓ Consistency preserved · 2 credits
pixeldojo:character
Same character.
Any scene.
Pass a reference image once. Your agent reuses the character across any number of scenes (different backgrounds, poses, lighting) while preserving their face and features.
- ✓ Prompt evolves the scene; the character stays locked
- ✓ Works with Flux Kontext, Ideogram Character, and more
- ✓ No LoRA training required. Just a reference image URL
pixeldojo:storyboard
One brief.
N shots, planned and generated.
Your agent writes the brief. PixelDojo breaks it into shots, decides which are images and which are video, generates them in parallel, and returns an ordered array of output URLs.
- ✓ Mix image and video shots in the same storyboard
- ✓ Shot planning included. No need to prompt each frame individually
- ✓ Returns an ordered array your agent can pass to an editor or exporter
>_ 60s product reveal: teaser, unboxing, close-up, lifestyle
PixelDojo
Planning 4 shots...
Shot 1/4 ✓ teaser (image)
Shot 2/4 ✓ unboxing (video)
Shot 3/4 ✓ close-up (image)
Shot 4/4 ✓ lifestyle (video)
✓ outputs: [4 URLs] · 6 credits
>_ Upscale this product photo to 4K, enhance detail
PixelDojo
Analyzing: 1024×1024 → 4096×4096
Routing to magnific-upscaler...
Job queued: job_8tHjR5mN
✓ output: https://pixeldojo.ai/r/…/upscaled.png
✓ 4096×4096 PNG · 2 credits
pixeldojo:upscale
Any image.
Up to 16× sharper.
Pass any image URL. Your agent gets back a high-res version. No upload step, no format conversion. Conservative mode preserves the original; creative mode can enhance textures and fine detail.
- ✓ 2× to 16× magnification depending on model
- ✓ Works on any image URL. No upload required
- ✓ Conservative and creative upscale tiers
Agentic Skills
One call, whole campaign
Higher-level skills that compose the named tools above. Drop a URL, get back a full launch package. No prompt-engineering, no chaining by hand.
pixeldojo:campaign
Campaign
One URL or product profile, one MCP call. Returns a hero image, N lifestyle variants, and an optional vertical video. Submits in parallel, polls under one budget.
pixeldojo:campaign({
  productUrl: "https://shop.example/atomic"
})
pixeldojo:from_url
From URL
Paste a product page, get back { name, description, images } extracted via JSON-LD, OpenGraph, or heuristic fallback. The cold-start fix for any agentic flow.
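To make the fallback order concrete, here is a rough Python sketch of the same idea: try a JSON-LD `Product` object first, then OpenGraph meta tags, then a simple heuristic. It is not PixelDojo's implementation. The function name `extract_profile`, the use of the page `<title>` as the heuristic, and the handling of `image` as a string or list of strings are all assumptions made for illustration.

```python
import json
from html.parser import HTMLParser


class _ProfileParser(HTMLParser):
    """Collects JSON-LD script bodies, og:* meta tags, and the <title> text."""

    def __init__(self):
        super().__init__()
        self.json_ld = []     # raw text of each application/ld+json script
        self.og = {}          # og:* property -> content
        self.title = ""
        self._capture = None  # "ld" or "title" while inside that element

    def handle_starttag(self, tag, attrs):
        a = dict(attrs)
        if tag == "script" and a.get("type") == "application/ld+json":
            self._capture = "ld"
            self.json_ld.append("")
        elif tag == "title":
            self._capture = "title"
        elif tag == "meta" and (a.get("property") or "").startswith("og:"):
            self.og[a["property"]] = a.get("content") or ""

    def handle_endtag(self, tag):
        if tag in ("script", "title"):
            self._capture = None

    def handle_data(self, data):
        if self._capture == "ld":
            self.json_ld[-1] += data
        elif self._capture == "title":
            self.title += data


def _as_url_list(value):
    """Normalize a string or list of strings to a list of URLs."""
    if isinstance(value, str):
        return [value]
    if isinstance(value, list):
        return [v for v in value if isinstance(v, str)]
    return []


def extract_profile(html: str) -> dict:
    parser = _ProfileParser()
    parser.feed(html)
    parser.close()

    # 1. JSON-LD: first object whose @type is "Product".
    for raw in parser.json_ld:
        try:
            data = json.loads(raw)
        except json.JSONDecodeError:
            continue
        for item in data if isinstance(data, list) else [data]:
            if isinstance(item, dict) and item.get("@type") == "Product":
                return {
                    "name": item.get("name", ""),
                    "description": item.get("description", ""),
                    "images": _as_url_list(item.get("image")),
                }

    # 2. OpenGraph meta tags.
    if "og:title" in parser.og:
        return {
            "name": parser.og["og:title"],
            "description": parser.og.get("og:description", ""),
            "images": _as_url_list(parser.og.get("og:image") or None),
        }

    # 3. Heuristic fallback (assumed here to be the page title only).
    return {"name": parser.title.strip(), "description": "", "images": []}
```

Each branch returns the same `{ name, description, images }` shape, so the caller does not need to know which source produced the profile.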
pixeldojo:from_url({
  url: "https://shop.example/atomic"
})
pixeldojo:campaign_status
Campaign status
Poll a campaign by ID. Returns assets when every sub-job is terminal, or a handoff describing what is still in flight. Mirrors the per-job pixeldojo:status pattern.
pixeldojo:campaign_status({
  campaignId: "campaign_abc123"
})
Preset Library
Try a preset, no prompt-writing required
Curated starting points across 60 workflows. Refresh for a different mix.
Character 3-up
Dance loop
Cinematic drone
Cyberpunk street
Minimal poster
Startup hero graphic
Children storybook
App icon
Desert fashion
Food overhead
Product hero shot
Editorial coffee shop
Badge design
Creative 4x
Fashion runway
Wellness flatlay
Cinematic portrait
Flat lay tools
Product poster
Data viz poster
Lifestyle vlog
Travel postcard
Magazine cover
Food still life
API Design
Built for automation
Every detail is designed for machines that call APIs, not humans clicking buttons.
image, video, upscale, edit
Submit a job, get a job ID. Poll the status URL or register a webhook. Every model has a JSON schema endpoint, so your agent knows the request shape before calling. No headless browsers, no UI scraping, no screenshots. Credits are deducted on success, not before.
- Async + webhook
- JSON schema per model
- llm.txt + OpenAPI 3.1
- Credit-based pricing
- One auth
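The submit-then-poll flow can be sketched as a small loop with backoff and a time budget. This is a hedged illustration, not PixelDojo client code: the `status` field name and the terminal values `"succeeded"` and `"failed"` are assumptions, and `get_status` stands in for whatever call fetches the job (it is injected so the loop can be exercised without a network).

```python
import time
from typing import Callable

# Assumed terminal status values; check the API documentation for the real ones.
TERMINAL = {"succeeded", "failed"}


def wait_for_job(
    get_status: Callable[[str], dict],
    job_id: str,
    timeout_s: float = 300.0,
    initial_delay_s: float = 1.0,
    max_delay_s: float = 15.0,
    sleep: Callable[[float], None] = time.sleep,
) -> dict:
    """Poll until the job reports a terminal status or the time budget runs out."""
    waited = 0.0
    delay = initial_delay_s
    while True:
        job = get_status(job_id)
        if job.get("status") in TERMINAL:
            return job
        if waited + delay > timeout_s:
            raise TimeoutError(f"job {job_id} not finished after {waited:.0f}s")
        sleep(delay)
        waited += delay
        # Double the wait each round, capped, so long jobs are not hammered.
        delay = min(delay * 2, max_delay_s)
```

When a webhook is registered instead, this loop is unnecessary; polling is the simpler option for short scripts and agents without a public callback URL.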
REST API
Endpoint reference
| Method | Endpoint | Description |
|---|---|---|
| GET | /api/v1/models | List all available models |
| GET | /api/v1/models/{apiId}/schema | Get the JSON schema for a model |
| POST | /api/v1/models/{apiId}/run | Submit a generation job |
| GET | /api/v1/jobs/{jobId} | Check job status and get output URLs |
| POST | /api/v1/jobs/{jobId}/webhook | Register a webhook for completion |
Full reference: API Documentation · OpenAPI Spec · llm.txt
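As a rough illustration of the endpoint pattern in the table, the Python sketch below builds (but does not send) requests for three of the routes using only the standard library. The paths come from the table; the base URL `https://pixeldojo.ai`, the `Authorization: Bearer` header, and the `prompt` payload field are assumptions, so check the API documentation or the model's schema endpoint before relying on them.

```python
import json
import urllib.request
from typing import Optional

BASE_URL = "https://pixeldojo.ai"  # assumption: same host as the API key page


def build_request(method: str, path: str, api_key: str,
                  body: Optional[dict] = None) -> urllib.request.Request:
    """Build an authenticated JSON request without sending it."""
    # Bearer auth is an assumed scheme, not confirmed by this page.
    headers = {"Authorization": f"Bearer {api_key}", "Accept": "application/json"}
    data = None
    if body is not None:
        data = json.dumps(body).encode("utf-8")
        headers["Content-Type"] = "application/json"
    return urllib.request.Request(BASE_URL + path, data=data, headers=headers, method=method)


def schema_request(api_id: str, api_key: str) -> urllib.request.Request:
    """GET the JSON schema for a model."""
    return build_request("GET", f"/api/v1/models/{api_id}/schema", api_key)


def run_request(api_id: str, api_key: str, payload: dict) -> urllib.request.Request:
    """POST a generation job for a model."""
    return build_request("POST", f"/api/v1/models/{api_id}/run", api_key, payload)


def job_request(job_id: str, api_key: str) -> urllib.request.Request:
    """GET the status and outputs of a job."""
    return build_request("GET", f"/api/v1/jobs/{job_id}", api_key)
```

A request built this way can be sent with `urllib.request.urlopen(req)`, for example `run_request("flux", key, {"prompt": "Tokyo rain"})`, where the payload shape should be taken from the model's schema rather than from this sketch.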
133+ models, one API
Same endpoint pattern for every model. Your agent picks the model, we handle the rest.
Change Camera Angle
Camera-aware editing via fal.ai Qwen Image Edit 2511 with multi-angle LoRA. 360° orbit, tilt, and zoom.
/models/change-camera-angle/run
Consistent Characters
Generate consistent character variations with FLUX Kontext, Nano Banana Pro/2, Flux 2 Dev, or Qwen Image 2 Pro.
/models/consistent-characters/run
Creative Upscale
Clarity Upscaler (creative upscale) via Replicate. Boost detail with stable-diffusion refinement.
/models/creative-upscale/run
Dreamina 3.1
ByteDance Dreamina 3.1. 4MP cinematic text-to-image with precise style control.
/models/dreamina/run
Ernie
Baidu Ernie text-to-image (fal.ai). Multilingual prompts and built-in prompt expansion.
/models/ernie/run
FLUX
FLUX family on Replicate. Schnell, Dev, Pro, Kontext, Ultra, and LoRA remix variants in one entrypoint.
/models/flux/run
Flux 2 Flex
Max-quality with up to 10 reference images
/models/flux-2-flex/run
Flux 2 Klein 4B
Very fast generation and editing with up to 5 reference images
/models/flux-2-klein-4b/run
Grok Imagine R2V
xAI Grok Imagine reference-to-video via Replicate. 1 to 7 reference images plus prompt for 1 to 10 second clips at 480p or 720p.
/models/grok-r2v/run
Grok Video Extend
xAI Grok Imagine video extension. Continue an existing MP4 with a prompt-directed extension (2 to 10 seconds).
/models/grok-video-extend/run
Hailuo Standard
Premium quality text-to-video and image-to-video
/models/hailuo-standard/run
Hailuo Fast
Fast image-to-video generation
/models/hailuo-fast/run
Happy Horse 1.0 Text-to-Video
Text-to-video with 720p/1080p output and 2-15 second durations
/models/happyhorse-1.0-t2v/run
Happy Horse 1.0 Image-to-Video
Image-to-video animation with 720p/1080p output and 2-15 second durations
/models/happyhorse-1.0-i2v/run
Works with
