Steadier Toggles in Video Tools
- Fixed
Turning audio generation off on Seedance 2 Reference no longer crashes the page. The same fix covers the Keep Audio, Captions, and prompt toggles on Kling Video Edit and the avatar tools.
Changelog
New models, features, and improvements shipped to PixelDojo.
Reality Polisher now scores your image with multiple independent AI checks instead of one. Obvious flaws like an extra finger no longer slip through with a perfect score.
Each polishing pass now fixes the exact flaws found in your image instead of applying generic corrections. Results land noticeably harder.
Pick your image's medium, from photo to anime to watercolor and 40 more, or let auto-detect lock it in. Either way it stays fixed for the whole run so your art style never drifts toward realism.
Polishing rounds are faster, scores are fairer for images without people, and the best result so far is always the starting point for the next round.
Refreshing mid-run no longer loses finished iterations, and stopping a run no longer shows a false failure.
When a generation gets blocked by content moderation, you now see the actual guidance instead of a generic failure message.
The home page got a tune-up. The Start Creating button now greets you right at the top on every device, and the page loads lighter than before.
On phones, the Sign Up button is now always in reach in the top bar.
The home page is also much shorter and easier to scan. Model deep dives now live on the AI Images, AI Videos, and LoRA Training pages where they belong.
More speed-ups under the hood. The home page sends less code to your browser, the FAQ and footer appear instantly instead of loading in, and the video showcase on the AI Videos page now switches clips without a blank flash.
Continuing a saved video on WAN 2.2 now brings its LoRAs along. Click First frame or Last frame on a saved video and the LoRAs you made it with are re-applied automatically, with the same strengths. No more re-picking them by hand.
Character Studio is your new home for character work: keep one character consistent across images, build multi-view turnaround sheets, try outfits on them, animate them into talking avatars, and train a custom model of them. Find it in the top menu.
Open any image or video in My Media and tap Recreate to jump straight into the tool that made it, with the prompt already filled in. Creations made with a LoRA bring the LoRA along too, so you can remix in one tap.
The image detail view got a cleaner, calmer look that gets out of the way of your art.
The main menu now always opens to the Home tab, so you land in the same familiar spot every time. Quick Access also moved up above Recent, putting your go-to destinations right at the top.
On video tools, hover any saved or community video and you will see First frame and Last frame buttons. Click one to pull that frame into the composer as your start image, then keep the video going from there. The easiest way to pick up a clip right where it left off.
The new Character Sheets tool turns one or more reference photos into a full multi-view turnaround: front, 3/4, side, and back views of the same character in one image. Add several references for even stronger consistency. Perfect for animation reference, model sheets, and building training sets.
Choose photorealistic or anime, and full-body or portrait framing. Pick from Nano Banana Pro, Nano Banana 2, WAN 2.7, or Qwen Image 2 Pro to match the look and cost you want.
Save an image or video made with a LoRA and My Media now remembers which LoRAs you used, showing them right on the saved creation.
Consistent Characters now includes Grok Imagine, Flux Kontext Dev, WAN 2.7, Qwen Edit Spicy, and Seedream 4.5. Pick the look and speed you want while keeping the same face across every variation.
Some of the new character models support Spicy Mode for uncensored results. Flip it on in your settings.
Consistent Characters now lets you set the aspect ratio of your portraits. Choose square, portrait, landscape, wide, or tall so your character sets come out in the shape you need. Available on every model that supports it.
Every image and video in the gallery now has a Create your own button. Tap it to jump straight into the exact tool that made it, with the prompt already filled in and ready to tweak.
The leaderboards got a full redesign: a cleaner look with a top-three podium and a clear ranked list. Every entry now shows its likes, reuses, and saves at a glance.
Popular now ranks by a combined score: likes, plus how often a creation is reused, plus how often it's saved. The rankings reflect everything the community loves, not just views.
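As a rough illustration of a combined score, the sketch below simply adds the three counts. The equal weighting, the `Creation` record, and the function names are assumptions made for this example; the entry above only says that likes, reuses, and saves are combined.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class Creation:
    # Hypothetical record; field names are assumptions, not PixelDojo's schema.
    title: str
    likes: int
    reuses: int
    saves: int


def popular_score(creation: Creation) -> int:
    # Assumption: equal weights for the three signals.
    return creation.likes + creation.reuses + creation.saves


def rank_popular(creations: List[Creation]) -> List[Creation]:
    # Highest combined score first.
    return sorted(creations, key=popular_score, reverse=True)
```

With equal weights, a creation with 4 likes, 4 reuses, and 4 saves would outrank one with 10 likes and nothing else.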
A new Most Saved board highlights the creations people bookmark to their favorites the most.
Google Gemini Omni Flash is here. Turn a text idea, a photo, or even an existing video into a short clip with built-in sound, no separate audio step. Pick your length and shape and go.
Bring a still photo to life, or remix a clip you already have. Drop in an image to animate it, or hand it a video and describe the change you want.
Love a clip but want a tweak? Hit Continue on any Omni Flash result and just describe the change. It builds a fresh take on that clip, no re-uploading.
Nano Banana 2 Lite gives you a faster, lower-cost way to create images at 2 credits each, sitting right next to the full Nano Banana 2 model.
In a tool's Saved tab, you can now make any creation public with a single tap. It's submitted to the community gallery through the same quick review as the My Media page, then appears for everyone to discover.
The Saved and Community tabs now link straight to your full My Media library and the full community gallery, so you can jump to the big view in one click.
The history, saved, and community tabs in each tool now do more than reuse a prompt. On tools that edit or take a reference, tap any creation to drop its image or video straight into your input, so you can build on a result in one click.
Star the creations you love and turn on the Favorites filter to see just those in your tabs. Your go-to references and best results stay one tap away.
Leave a comment on community creations right from the tool you're working in, without heading over to the main gallery.
Set up a public profile and your gallery posts show your name instead of appearing anonymous. A quick reminder now appears on the gallery so you can add yours in seconds, and people browsing the community can discover more of your work.
The account menu is clearer about which is which. 'Manage Login' covers your sign-in details, and 'Manage Profile' takes you straight to your public profile and gallery settings.
Your negative prompt is where you tell the AI what to leave out. Listing things to avoid there could sometimes trip a content warning and stop your generation before it started. Now it doesn't, so you can spell out what to skip freely and your work runs right away.
WAN 2.2 Image now only offers the resolutions it can actually produce. Picking a higher one used to fail right at the start. The choices match what works, so your image generates the first time.
WAN Video Character Swap now works with saved and shared videos, not just fresh uploads. It reads the clip length for you, so you no longer hit a confusing error when you pick a video from your library.
If you build with our API, you can now check your remaining credit balance with a single request. Your apps and agents can confirm there's enough to run before they start, and prompt a top-up when it's getting low.
The Community tab on each tool now shows the full set of example creations, with no repeats. Some examples were being skipped or shown twice, so you couldn't always see the full range of what a tool can do.
Every tool now shares the same streamlined layout. Your prompt and settings sit in one tidy bar at the bottom, and your results take center stage above it, so the whole site feels consistent no matter which tool you open. This now covers the video and avatar tools too, including the talking-avatar tools that take a voice clip.
Adding an image now works the same on every tool. One button opens a single window where you can drag in a file or pick from your media, instead of two separate buttons doing two different things. The same clean picker shows up across the whole site, including tools that take a video too.
The Spicy models now run in Canvas. Animating an image or creating with a Spicy tool inside Canvas could fail to start. They work normally again.
When a generation can't start because of something you can fix, like a prompt that runs too long, you now get a clear message that says exactly what to change and the limit to stay under, instead of a vague 'validation failed.' Same helpful wording everywhere, including Canvas.
When a generation gets turned down for content reasons, you now get a nudge toward the Spicy tools, which are far less restricted. Look for the flame in the menu. No more hitting a wall with no idea what to try next. This now shows up in Canvas too, not just on the tool pages.
The Canvas model picker now shows a quick icon next to each model, the same ones you see in the main menu. A pencil means the model can edit images, a flame means it is less restricted, and a speaker means it makes sound. Spot the right model at a glance.
The menu on your phone now opens on the first tap. Picking a tool from the menu sometimes needed two taps. One tap takes you straight there.
More tools stay responsive while they work. Across the image tools, the Generate button no longer locks up, so you can line up your next image without waiting on the last one to finish.
Generations are steadier when a model is briefly busy. If the backbone has a quick hiccup, we now retry for you behind the scenes instead of bouncing your request back, so fewer creations end with a 'please try again.'
Big reference photos just work in Seedream 4.5. A very high resolution image used to get turned away for being too large. Now it is automatically sized to fit, so your upload goes straight through.
Generating more than one image at a time with Seedream 4.5 works again. Choosing 2, 3, or 4 images had quietly stopped the Generate button from doing anything. Pick any count and create away.
When a generation can't start yet, for example a prompt that needs adjusting, the tool now keeps you on your settings instead of jumping to an empty results view. You stay right where you can fix it and try again.
Your Agent can now make talking videos. Give it a portrait and a script and it brings the face to life, lip-synced to your words. Try it with /avatar, or just ask it to make someone talk.
Add a voiceover without leaving the chat. Type /voice and what you want said, and the Agent returns natural-sounding narration you can play, download, and drop into your videos.
Need a soundtrack? /music turns a quick description into a short, original track. Set the mood, pick a length, and play it right in the chat.
Videos in the community gallery now play correctly when you open them. Clips from our newer video models were showing a still frame instead of the player. They play right away again.
Canvas has a cleaner, more focused look. Your work now fills the screen front and center, and everything you need sits in one simple bar at the bottom: the step you're on, the model, settings, and Generate. A slim row of your steps sits just above it, so you can jump back to any result with a tap.
Save and download now live right on each result, so finishing a piece is one tap. The same simple bar works on any screen, so creating on your phone feels just like desktop.
Starting a fresh canvas is now a clear, visual choice: pick Generate Image, Edit, Upscale, or Create Video to jump straight in, each shown with an example of what it makes.
Rodin is here, a new way to turn images into real 3D models. Add one to five reference photos of the same subject and Rodin builds a textured 3D mesh you can spin, zoom, and download. Great for game assets, product visualization, and 3D printing.
Choose Fast mode for quick prototyping or Standard mode for higher production quality, and switch on HighPack for sharper 4K textures when you want extra detail.
Krea Image is here, a flagship aesthetic image model that art-directs itself. It turns a short prompt into a polished, stylish image in seconds. It comes in three tiers from one tool: Turbo for fast, cheap generation, plus Medium and Large for higher fidelity with a creativity dial and style reference images.
Krea Image works with Spicy Mode for fewer restrictions when you need them.
The dashboard and the tools menu now lead with our two easiest starting points: the all-in-one AI Canvas and the AI Agent. Jump straight into either, or pick a specific tool from the menu. New here? Start with one of these and you can make something in seconds.
The PixelDojo Agent has a fresh start screen with a gallery of real looks you can make. Tap one to load it and make your own, or jump straight into editing, video, and product photoshoots. Just describe what you want and it makes it.
The start screen now leads with what the Agent can make: Create Image, Edit, Make Video, and Ads and Marketing. Each is shown with a real example, and the video card actually plays. One tap drops you straight into that kind of project.
The Agent is much nicer on phones. Bigger tap targets, a cleaner input that stays clear of the keyboard, and generated results you can save or download right where they appear.
A fresh Canvas now shows you how it works at a glance. Generate an image, then edit, upscale, and animate it, with each result flowing into the next tool. Tap a step to jump straight to it, or pick a one-click starting prompt with a model already loaded, so you can make something in seconds.
Canvas can now chain more of your favorite models. Qwen Image 2 Edit and Qwen Image Edit for instruction-based edits, Spicy image-to-video (WAN 2.7 and WAN 2.2) for animating a still, and Z Image Spicy for generation, all available right inside a Canvas session.
Canvas now opens with the most-used model for each step already selected, so you can just type a prompt and generate. The model picker lists models by how popular they are, so your top choice sits right at the top, marked Popular, instead of alphabetically.
Canvas works much better on phones. The Generate button stays visible when the keyboard is open, and the model picker opens as a tall, scrollable sheet so you can see far more models at a glance.
Seedance 2 image-to-video can now take both a start frame and an end frame. Add both and your clip animates from the first image to the last, perfect for precise transitions and morphs. The end frame is optional; drop in just a start frame for regular image-to-video.
Seedance 2 can now generate true 4K video (3840×2160). Pick 4K on the High quality tier for text-to-video, image-to-video, and reference-to-video, and on the Standard tier in Seedance 2 Video Edit. Fast still tops out at 720p.
4K runs at a higher rate than 1080p: about 23 credits per second, or 27 when you include a reference video, so you can budget before you generate.
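For budgeting, the arithmetic can be sketched as below. The rates are the approximate figures quoted above; the function name is hypothetical, and how billing rounds partial seconds is not stated, so treat the result as an estimate.

```python
# Approximate 4K rates quoted in the changelog (credits per second of output).
RATE_4K = 23
RATE_4K_WITH_REFERENCE_VIDEO = 27


def estimate_4k_credits(seconds: float, has_reference_video: bool = False) -> float:
    """Rough credit estimate for a Seedance 2 clip at 4K (hypothetical helper)."""
    if seconds <= 0:
        raise ValueError("seconds must be positive")
    rate = RATE_4K_WITH_REFERENCE_VIDEO if has_reference_video else RATE_4K
    return seconds * rate
```

Under these figures, a 5-second 4K clip comes to about 115 credits, or about 135 with a reference video.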
Extending a clip with WAN 2.7 Video now picks a valid output length for you and keeps it longer than your source, so the whole clip is included. If your source clip is over the 10-second limit, you're told up front, instead of the generation running, failing, and refunding partway through.
Seedance 2 reference-to-video now checks your reference video's length before you're charged and shows the 15-second limit clearly, rather than failing after submission.
Uploading an unsupported image format (HEIC, AVIF, or GIF) now shows a clear message asking for JPG, PNG, or WebP right away, instead of letting a generation run, fail, and refund.
When a video you add to a Seedance video tool can't be used, the error now tells you the real reason and limit (frame size up to 1080p, 24–60 fps, MP4 or MOV) and what to change. Before, a rejected video was mislabeled as an image with the wrong size rule.
Reference image errors now show your file's actual dimensions, so you can see exactly how far outside the 300–6000px-per-side range you are before resizing.
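If you want to pre-check a reference image before uploading, a check along these lines would mirror the limit described above. This is a minimal sketch: the function name and message wording are made up for illustration, and it assumes the 300 and 6000 pixel bounds are inclusive.

```python
from typing import Optional

MIN_SIDE_PX = 300   # lower bound per side, from the changelog
MAX_SIDE_PX = 6000  # upper bound per side, from the changelog


def check_reference_image(width: int, height: int) -> Optional[str]:
    """Return None if both sides fit, else a message that includes the actual size."""
    problems = []
    for name, value in (("width", width), ("height", height)):
        if value < MIN_SIDE_PX:
            problems.append(f"{name} {value}px is below {MIN_SIDE_PX}px")
        elif value > MAX_SIDE_PX:
            problems.append(f"{name} {value}px is above {MAX_SIDE_PX}px")
    if not problems:
        return None
    return f"Image is {width}x{height}px: " + "; ".join(problems)
```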
The API model catalog is easier to browse: filter by output type (Image, Video, Audio, 3D) and by what each model does (text-to-image, image editing, text-to-video, image-to-video, upscaling) to find the right model fast.
Audio model previews now play. Text to Speech and Text to Music examples were showing a blank video player on their model pages; they now appear as a proper audio player you can listen to.
More models now show real example outputs on their API pages, so you can see what they produce before you integrate.
Happy Horse 1.1 is here. On the Happy Horse video and reference-to-video tools, a new Model selector lets you switch between 1.0 and the newer 1.1 engine for text-to-video, image-to-video, and reference-to-video, at the same 720p/1080p resolutions and 2–15 second durations.
Happy Horse 1.1 costs less to run: 3 credits per second at 720p and 4 credits per second at 1080p, down from 4 and 6 on 1.0. Pick 1.1 from the Model selector to pay the lower rate.
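To compare the two engines for a given clip, the per-second rates above can be put in a small lookup. The table values come from this changelog; the function name and the version and resolution keys are illustrative assumptions.

```python
# Credits per second, as listed in the changelog.
HAPPY_HORSE_RATES = {
    ("1.0", "720p"): 4,
    ("1.0", "1080p"): 6,
    ("1.1", "720p"): 3,
    ("1.1", "1080p"): 4,
}


def happy_horse_credits(version: str, resolution: str, seconds: int) -> int:
    """Estimated credits for one clip (hypothetical helper)."""
    if not 2 <= seconds <= 15:
        raise ValueError("duration must be between 2 and 15 seconds")
    return HAPPY_HORSE_RATES[(version, resolution)] * seconds
```

At these rates, a 10-second 1080p clip drops from 60 credits on 1.0 to 40 on 1.1.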
Boogu Image is here: a bilingual (English & Chinese) text-to-image model with crisp detail and 2K output. Generate up to 4 images at once.
Boogu Image also does instruction-based editing: give it a source image and describe the change you want, right from the same page.
When you upload several images at once into an editing or reference tool, all of them now appear as cards in your workspace. Previously only the first showed up even though the rest were still attached to your edit. Fixed across the image editors: Flux 2 Flex, Flux Studio, Google Nano Banana, GPT Image (1.5 and 2), HiDream, Luma, Qwen Image 2, Seedream 4, and WAN 2.6 / 2.7 Image.
When a WAN 2.7 Video generation can't run because of your input (an image that's too large, an audio clip that's too long, or an unsupported size), you now get a plain-English message telling you exactly what to fix, instead of a generic error code. If a generation fails, your credits are still refunded automatically.
When you call a specific model variant through the API by its own name, it now charges that variant's own credit price and generates with that exact variant, instead of quietly falling back to the family's default. Fixed across FLUX 2 (Klein 4B/9B, Pro, Max, Dev, Dev + LoRA), Qwen Image and Qwen Image 2, Recraft V4.1 (Standard/Pro in raster, SVG, and utility), Hailuo (Standard/Fast), Google Gemini and Nano Banana, Kling Motion Control (Standard/Pro), WAN 2.6 and WAN 2.2, and GPT-Image 1.5 (Low/Medium/High). For example, FLUX 2 Klein 4B now charges 0.1 credits instead of 1.5. You'll pay the right amount and get the model you picked.
PixelDojo has a new logo: say hello to our dragon. You'll spot the refreshed mark across the site and in your browser tab.
Every tool that takes multiple reference images now lets you pick several from your library in one go: open the picker, check the ones you want, and hit "Add", instead of adding them one at a time and reopening the picker for each. This covers the reference-to-video tools (Seedance 2 Reference, Kling Reference to Video, Grok Imagine R2V, Happy Horse R2V, WAN Reference to Video, Veo 3.1), the video editors (Seedance 2 Video Edit, Kling Video Edit, WAN Video Edit), and the image editors (Seedream 4, GPT Image 2, Google Nano Banana, Qwen Image 2, WAN 2.7/2.6 Image, and more). Dragging in multiple files at once works too.
Seedance 2 Reference lets you batch-select multiple reference videos the same way.
When editing an image with WAN 2.7, choosing the 1:1 aspect ratio now returns a true square. Previously it kept the source image's dimensions instead of cropping to square.
Image editing now has an "Original" aspect-ratio option that keeps your input image's proportions, and it's the default for edits, so your results match your source unless you pick a specific ratio.
Seedance 2, Seedance 2 Reference, and Seedance 2 Video Edit now work with photos of real people. Use a real face as your start frame or in your reference images and your video generates instead of getting rejected, which is great for character work, cameos, and bringing a specific person into a scene. Your uploads are still checked for safety; we've just removed the overly strict block that was turning away ordinary photos of people.
All three Seedance 2 video tools are now marked 'less restrictive' in the menu, so you can spot the ones with fewer content limits at a glance.
In the Recent generations panel, each item's remove button is now a red trash icon, clearly separate from closing the panel, so reaching for the corner to dismiss it no longer wipes out your latest result by mistake. The close button at the top right is now the obvious way out. Removing an item still only clears it from this list; your saved files are never touched.
A few tools in the navigation menu were missing their icons; they now display correctly.
Ideogram 4 is here: text-to-image with best-in-class text rendering, stunning realism, and bold creative designs. It's the one to reach for when the words have to be crisp and correct: posters, logos, branding, advertising, book covers, and product mockups. Pick the tier that fits your job: Turbo (1 credit) for fast iteration, Balanced (2 credits) as an everyday default, or Quality (3 credits) for hero, print-ready work. Generate at high resolutions across square, portrait, landscape, and ultrawide shapes, or let Ideogram choose the best fit for your prompt. Find it under Generate Images.
Ideogram 4 is built into Canvas too, so you can chain it with edit, upscale, and animate in a single session, and all three tiers are available on the developer API.
MAI Image is here: a new text-to-image model with standout prompt adherence and natural lighting. It renders exactly what you describe with clean, photoreal detail, making it a dependable all-rounder for product shots, editorial portraits, food, interiors, and concept art. Choose from 11 aspect ratios (1:1 through 21:9, or 'auto' to let the model pick the best fit), generate up to 4 at once, and export PNG, JPEG, or WebP. 1.5 credits per image. Find it under Generate Images.
A full MAI Image prompting guide with real example galleries (product, portrait, cinematic landscape, food, and concept art) is live under Guides.
WAN 2.6 Image now generates at its full, correct resolutions with a wider set of aspect ratios: square, portrait, and landscape, plus a true 21:9 ultrawide. Pick the exact shape you need and get properly sized output.
Editing with WAN 2.6 now keeps your original image's proportions by default, so your edits come back the right shape instead of being cropped or stretched.
Added a seed option to WAN 2.6 Image for repeatable results: reuse a seed to get consistent variations of the same prompt.
Grok Imagine Video now has a 1.5 model for image-to-video. Animate any photo and it creates matching audio (music, sound effects, and ambient sound) in the same pass, no separate editing needed. Just switch to image-to-video mode and pick it from the Model option. Still 2 credits per second.
The Grok Imagine prompting guide has a new audio section for 1.5: how to cue music, sound effects, ambient tone, and short dialogue, plus the AUDIO: block syntax, real example clips you can play with sound, and copy-ready templates. Find it under Guides.
Credits now settle accurately on instant-finishing jobs. A successful generation no longer triggers a mistaken refund, and jobs that return multiple results won't over-refund. Your balance reflects exactly what you used.
Qwen image generation and editing now finish reliably. Your work no longer gets stuck spinning if we ship an update while an image is being created. Every job lands as a finished result, or refunds automatically if something goes wrong.
Image tools are now far easier to use on mobile: pick your model, aspect ratio, and number of images with tap-friendly controls, see the credit cost right on the Generate button, and on phones the controls slide up in a tidy composer drawer so the screen stays focused on your work.
This rollout reached a dozen-plus image generators, upscalers, and editors, so they all look and behave the same way no matter which one you open. No more relearning the controls from tool to tool.
P Video Animate: motion transfer for still images. Upload a reference video and a character image, and it copies the video's motion, timing, and camera moves onto your character. Under Generate Videos. 1 credit/sec at 720p, 2 credits/sec at 1080p.
Flux Pro Virtual Try-On: premium, high-fidelity try-on under Characters & Faces. Dress a person photo in any garment, with an optional styling note ("tucked in", "open over a tee"). Preserves pose, lighting, and background. 2 credits per result.
Krea V2 (Medium + Large): text-to-image with a creativity dial (raw to high) and style reference images. Medium at 1 credit for fast iteration, Large at 2 credits for the highest fidelity, across eight aspect ratios.
Riverflow 2.0 now generates cleanly on every aspect ratio at 1K, 2K, and 4K. The ratios that used to error out are fixed.
Mobile tool pages: the Generate button stays visible and tappable, new generations scroll into view automatically, and recent generations moved into the My Stuff menu to keep the screen clear for creating.
Fresh prompting guides with real example galleries for Krea V2, Flux Pro Virtual Try-On, Riverflow, Luma UNI 1, and P Video Animate, all under Guides.
Riverflow 2.0: Fast + Pro in one tool. Fast at 1 credit (1K/2K) for cheap iteration. Pro at 3 credits (1K/2K) or 7 credits for full 4K, the only model on the site shipping 4K output right now. Text-to-image with optional reference image to steer style and composition (the model writes a fresh frame from your prompt; references aren't modified). Find it under Image Generation.
Luma UNI 1: Standard + MAX in one tool. Standard at 1 credit for fast iteration, MAX at 3 credits for the quality-first variant. ~3K resolution across 7 aspect ratios, up to 8 reference images per edit, optional manga style preset. Also under Image Generation.
P-Image Upscale: sub-second upscaling up to 128 megapixels. From 0.1 credit (4 MP) up to 3 credits (128 MP). Toggles for detail boost and an AI-image realism pass that cleans up plastic-looking outputs. Find it under Upscale & Enhance.
Searching the Prompt Library and then clicking the Image or Video filter chip was kicking users to an error page. The filter now applies cleanly on top of any search: pick a query, narrow by type, and the matching prompts show up the way they should. Same fix covers tag and model combinations along the way.
Sourceful Riverflow 2.0 is now live in the Image Generation section. One tool, two quality tiers: Fast is 1 credit per image (1K or 2K). Pro is 3 credits per image at 1K/2K and 7 credits at 4K, the only tier on Pixel Dojo right now that ships full 4K output.
Click any image in your history to use as a reference. Riverflow uses it to steer style and composition while generating a fresh frame from your prompt. (It's reference-guided generation, not inpainting/editing; the source image isn't modified.) Ten aspect ratios available: 1:1, 4:3, 3:4, 16:9, 9:16, 3:2, 2:3, 5:4, 4:5, and 7:3 (ultra-wide).
The resolution picker is smart about the model you're on: 4K is unavailable while Fast is selected (Fast doesn't ship 4K), and switching from Pro 4K back to Fast snaps you to 2K automatically instead of leaving an invalid combo. Cost preview updates live as you change model or tier so there's no math required.
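The picker behavior described above can be sketched roughly as follows. The tier keys, function names, and the "fall back to the highest supported resolution" rule are assumptions made for illustration; the prices and the Pro 4K to Fast 2K snap are taken from the Riverflow entries.

```python
# Resolutions each Riverflow 2.0 tier offers, lowest to highest (from the changelog).
SUPPORTED_RESOLUTIONS = {
    "fast": ("1K", "2K"),
    "pro": ("1K", "2K", "4K"),
}

# Credits per image (from the changelog).
CREDITS_PER_IMAGE = {
    ("fast", "1K"): 1,
    ("fast", "2K"): 1,
    ("pro", "1K"): 3,
    ("pro", "2K"): 3,
    ("pro", "4K"): 7,
}


def resolve_resolution(tier: str, current: str) -> str:
    """Keep the current resolution if the tier supports it, else snap down."""
    allowed = SUPPORTED_RESOLUTIONS[tier]
    if current in allowed:
        return current
    # Assumption: snap to the tier's highest resolution (4K becomes 2K on Fast).
    return allowed[-1]


def cost_preview(tier: str, current: str) -> int:
    """Credits for one image after resolving an invalid tier/resolution combo."""
    return CREDITS_PER_IMAGE[(tier, resolve_resolution(tier, current))]
```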
Luma UNI 1 is now live in the Image Generation section. One tool, two quality tiers: Standard is fast and clean at 1 credit per image, MAX is the quality-first variant at 3 credits per image. Pick the tier in the form; the cost updates live as you change it.
Both text-to-image and image editing in the same page. Click any image in your history to add it as a reference, then describe the edit. Luma supports up to 8 reference images per generation. Ships at ~3K resolution across 1:1, 16:9, 9:16, 3:2, 2:3, 2:1, and 1:2. Optional 'manga' style preset for the look you'd expect.
The in-app 'What's New' modal was clipping the last item or two behind the Got it footer on shorter screens: laptops, smaller windows, and phones in particular. The modal now keeps comfortable breathing room above and below regardless of viewport size, the footer always stays in view, and mobile browser chrome (Safari's address bar in particular) no longer pushes anything off-screen.
When there's more content below the visible area, a bouncing chevron appears at the bottom of the modal alongside a soft fade, so there's no more guessing whether you've seen everything. The cue disappears once you've scrolled to the last item.
The modal was rendering two close buttons stacked in the corner. Now there's one.
P-Image Upscale is now live in the Upscale & Enhance section. Sub-second image upscaling to any target size up to 128 megapixels. Pricing scales with output size: 0.1 credit for 4 MP, 0.5 credit for 16 MP (≈4K), 2 credits for 64 MP, 3 credits for 128 MP. Toggles for Enhance Details (sharper textures) and Enhance Realism (cleans up AI-generated images that look too plastic).
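The listed price points can be captured in a simple lookup. This sketch covers only the four sizes the entry names; pricing for sizes in between is not stated, so the hypothetical helper below rejects them rather than guessing.

```python
# Credits by target size in megapixels, as listed in the changelog.
PRICE_BY_MEGAPIXELS = {4: 0.1, 16: 0.5, 64: 2.0, 128: 3.0}


def upscale_credits(target_megapixels: int) -> float:
    """Look up the listed price for a target size (hypothetical helper)."""
    try:
        return PRICE_BY_MEGAPIXELS[target_megapixels]
    except KeyError:
        # In-between sizes are not priced in the changelog, so do not interpolate.
        raise ValueError(f"no listed price for {target_megapixels} MP") from None
```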
The Upscale & Enhance lineup now has three clear lanes: P-Image for fast and cheap bulk work, Clarity Pro for photorealistic with identity preservation, and Magnific for creative/premium. Pick the right one for the job instead of paying premium prices on every upscale.
When we ship a new version of Pixel Dojo while you have it open, the old page used to silently reload itself a second or two after you next clicked something, which sometimes ate the video or image you just started generating. It now shows a quiet 'A new version of Pixel Dojo is available' banner instead. You choose when to refresh, so nothing in flight gets dropped.
The app now checks for new versions in the background and surfaces the upgrade banner the moment a deploy lands, instead of waiting until you click something that tries to load a missing chunk. Less surprise, no broken interactions.
WAN 2.7 Video now reserves a slot in your history the instant you hit Generate, before the network call lands. If anything interrupts the dispatch (a reload, a network blip), the job is no longer 'lost'; it shows up as 'Preparing…' and either resolves or clears itself cleanly. Other generation tools get the same protection as we roll them over.
Qwen Image, Qwen Image 2, and Kling Image no longer deselect your source images the moment you hit Generate. The images you picked stay selected so you can keep iterating on the same shot: tweak the prompt and hit Generate again without re-picking everything. Use the Exit edit mode button (or Clear history) when you actually want to start fresh. Matches how the other edit-capable models already behave.
The looping hero video at the top of the home page no longer hangs on a loading spinner; it loads and plays as soon as the page is ready. Underneath, we also hardened the player so any future bad video URL falls through to a Play button instead of silently spinning.
The Agent now reads what your prompt is actually about (product shot, portrait, logo, cinematic, anime, character) and picks a model that's good at that flavor. Before, every text-to-image prompt routed to whatever was most popular overall, which meant a logo request and a product photo both landed on the same generic default. Now a logo goes to a text-rendering-strong model, a product shot goes to a product-photography model, and so on.
Model recommendations now use site-wide usage data (every surface: chat, Canvas, dashboard, API, MCP) instead of Canvas-only data. The top models reflect what creators across the whole site converge on, not just one surface. A single heavy API caller can't dominate the rank either: each user's contribution is capped at 5% of total signal.
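One way to picture the per-user cap is the sketch below. It is an illustration under stated assumptions, not the ranking code: it treats "signal" as raw generation counts, computes the 5% cap against the uncapped total in a single pass, and scales a heavy user's counts down proportionally. The function and variable names are hypothetical.

```python
from collections import defaultdict
from typing import Dict, Iterable, Tuple

CAP_FRACTION = 0.05  # each user's share of total signal, per the changelog


def capped_model_scores(usage: Iterable[Tuple[str, str, int]]) -> Dict[str, float]:
    """usage: (user_id, model, generation_count) rows. Returns a score per model."""
    rows = list(usage)
    total = sum(count for _, _, count in rows)
    if total == 0:
        return {}
    cap = CAP_FRACTION * total  # assumption: cap measured against the raw total

    per_user = defaultdict(int)
    for user, _, count in rows:
        per_user[user] += count

    scores = defaultdict(float)
    for user, model, count in rows:
        # Scale down every row from a user whose total exceeds the cap.
        scale = min(1.0, cap / per_user[user])
        scores[model] += count * scale
    return dict(scores)
```

With one caller making 1,000 requests to one model and 100 users each making a single request to another, the heavy caller's contribution is clipped to 55 while the other model keeps its full 100.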
NSFW prompts now route to spicy-capable models from the start instead of bouncing off the safety gate of a non-spicy default. If no spicy model is available for the request type, the Agent says so clearly and points you at the dedicated Spicy tools in the sidebar.
When the Agent picks a model for you, you'll see WHY in the response: 'used seedream-4 because it's tuned for product photography; also good: nano-banana-pro, gpt-image-2'. No more black-box defaults.
Every Agent result now shows a 'Try next' chip strip (Animate, Add effect, Photoshoot, Upscale, Edit again) that one-click chains the next skill onto your current image. It pre-fills the composer, attaches the asset for you, and pops the inline mode/effect picker automatically. No more remembering what / commands exist or what step comes next.
Brand-new Agent chats now show a categorized starter grid (Image / Edit / Video / Marketing) with 8 ready-to-send prompts spanning every skill, including the newer /photoshoot, /effect, and /hooks paths that were easy to miss. The /hooks card carries a 'Free' badge so you can experiment with paid-social hooks without thinking about credits.
The Campaign tool card was showing a fake 'Prediction not found' error on campaigns that were actually still rendering. A wrong job-tracker path was sending the campaign ID to the image-prediction lookup, which 404'd and surfaced as a red error. Campaigns now poll their own status endpoint, show partial progress as assets land ('Campaign running – 3 of 5 assets ready'), and only flip to failed if every sub-job genuinely fails.
If you just generated a video, then asked the Agent to 'edit this' or 'animate it', the .mp4 URL was getting passed as a still-image reference, which silently 422'd at the model with 'cannot read image'. The Agent now walks back to the most recent IMAGE asset in your history, skipping videos, since every edit / storyboard / upscale / character / photoshoot / effect skill consumes a still.
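The campaign status rule can be sketched roughly as follows. The sub-job state names ('pending', 'ready', 'failed'), the function name, and the wording of the non-running messages are assumptions for illustration.

```python
def campaign_status(sub_jobs):
    """Summarize a campaign from its sub-job states (hypothetical shape).

    sub_jobs is a list of 'pending', 'ready', or 'failed' strings.
    Returns a (state, message) pair.
    """
    total = len(sub_jobs)
    ready = sub_jobs.count("ready")
    failed = sub_jobs.count("failed")

    # Only flip to failed when every sub-job has genuinely failed.
    if total and failed == total:
        return "failed", "Campaign failed"
    # Everything has settled and at least one asset landed.
    if total and ready + failed == total:
        return "done", f"{ready} of {total} assets ready"
    # Otherwise keep polling and show partial progress.
    return "running", f"Campaign running – {ready} of {total} assets ready"
```

A campaign with three ready assets and two still pending stays in the running state and reports 3 of 5.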
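The walk-back can be sketched like this. The asset dictionaries with 'type' and 'url' keys are a hypothetical shape chosen for the example, not the Agent's real history format.

```python
def latest_image_asset(history):
    """Return the most recent image asset, skipping videos.

    history is a list of dicts ordered oldest first, each with a
    'type' of 'image' or 'video' and a 'url' (assumed shape).
    """
    for asset in reversed(history):
        if asset["type"] == "image":
            return asset
    return None  # nothing usable as a still reference
```

Given a history of an image followed by a video, a follow-up like 'edit this' would resolve to the image, not the .mp4.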
WAN 2.2 (Standard, Plus, Extended) was rejecting public-API calls that didn't explicitly include a 'mode' field with 'mode: Required'. Now infers the mode from your input: pass an image_url and it picks image-to-video, otherwise text-to-video. Same convention every other multi-mode video model uses.
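The inference rule is small enough to sketch. The function name and the decision to honor an explicit 'mode' when one is supplied are assumptions; only the image_url rule comes from the entry above.

```python
def infer_mode(payload):
    """Pick a WAN 2.2 mode from the request payload (illustrative sketch)."""
    # Assumption: an explicitly supplied mode still wins.
    if payload.get("mode"):
        return payload["mode"]
    # An image_url means image-to-video, otherwise text-to-video.
    return "image-to-video" if payload.get("image_url") else "text-to-video"
```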
The public /pricing page now shows three tiers side-by-side: Lite ($10/mo, 160 credits) for casual creators, Pro ($25/mo, 400 credits) for working creators, and Pro Max ($50/mo, 800 credits) for power creators. Every tool, every model, and every new launch is included on every plan. Pick the volume that fits how much you create.
Pro Max is here for the creators who keep buying $50 credit packs one at a time. Same 800 credits, but as a subscription: predictable monthly billing, no more 'I should top up before tomorrow's deadline'.
Every plan unlocks the whole studio. Lite gets the same 70+ models, LoRA training, Canvas, and developer API as Pro and Pro Max; you're choosing how much you create per month, not which tools you get. Pro keeps its 6-month and annual billing options behind a small disclosure below the card so they don't compete with the tier comparison.
Mobile /pricing layout stacks the three cards with Pro on top to preserve the anchor: same hierarchy you see on desktop, just vertical. No squishing the 'Most Popular' card off-screen.
SEO + page meta on /pricing now lead with 'Plans from $10/mo'. Schema.org markup includes all three tier offers so search engines can show the right starting price.
PixelDojo Lite is here: $10/month for 160 credits. Same per-credit price as Pro ($0.0625/credit), just a smaller plan. Currently surfaced as a save option when you go to cancel and pick 'It's too expensive'; coming to the public pricing page soon.
Cancel flow now leads with real save options when you pick 'It's too expensive'. Instead of just a 'Manage plan' button that punted you to a portal, you now see a clear sapphire-glow downgrade card showing $25 → $10 with side-by-side credit counts (160 vs 400), then Pause as the secondary option, then a smaller 'Manage plan' link. Hierarchy you can scan in two seconds.
Downgrade now has a proper email flow. Click 'Switch to Lite' and you get an immediate confirmation showing exactly when your current plan ends, plus a reminder email at that date with a one-click link to start Lite. Your trained models, LoRAs, and media library stay intact through the transition.
The modal stays open with a confirmation state after you schedule a downgrade ('Your Pro stays active until [date]') instead of closing instantly with a fading toast. Easier to absorb what just happened and what comes next.
Pause and Downgrade are now mutually exclusive so you can't accidentally do both. If you're already paused, the downgrade card shows 'Resume to downgrade'; if a downgrade is scheduled, the pause card shows 'Cancel pending downgrade first'. The blocked option stays visible so you know it exists.
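The mutual exclusion can be sketched as a small state function. The flag names and the 'available' label are assumptions for illustration; the two blocked labels are the ones quoted above.

```python
def save_option_labels(paused, downgrade_scheduled):
    """Return the label shown on each save-option card (hypothetical helper)."""
    return {
        # A paused plan must be resumed before it can be downgraded.
        "downgrade": "Resume to downgrade" if paused else "available",
        # A scheduled downgrade must be cancelled before pausing.
        "pause": "Cancel pending downgrade first" if downgrade_scheduled else "available",
    }
```

Both cards are always returned, which mirrors the point that the blocked option stays visible.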
Click the credit number in the top bar to open a quick view of your balance, this month's credits used, and your recent refunds. No more digging through menus to see where credits went or whether a failed generation actually got refunded.
A separate Buy Credits button now sits next to your balance, so topping up is always one click away, even when you're deep inside a tool or Canvas session.
A small green dot lights up on the credit pill whenever refunds have landed that you haven't seen yet. When a generation fails or times out we put the credits back, and now you actually know it happened.
Dashboard now shows a monthly usage card right after Recent Creations: credits used, credits refunded, your top tools by spend, and a one-click link to the full report.
Refund notifications now fire as soon as a generation fails, so you see the credits come right back in real time. Multiple refunds landing at once bundle into a single notification so you don't get spammed.
MCP skills handle missing inputs more gracefully. The character skill now returns a clear message when called without a reference image on edit-style models (instead of looping with a cryptic backend error), and the generate skill correctly maps parameters for grok-video-extend.
Magnific / Creative Upscale at very high scale factors now returns a clear, actionable error when the output exceeds size limits, telling you exactly what to try (lower scale factor or smaller source image) instead of a raw parse error.
WAN 2.2 Spicy Image-to-Video is cheaper across the board. 480p drops from 1.5 to 1 credit per second (5s clip: 7.5 → 5cr, 8s: 12 → 8cr). 720p drops from 3 to 2 credits per second (5s clip: 15 → 10cr, 8s: 24 → 16cr). Same model, same speed, just less expensive to iterate.
Happy Horse Reference now shows the flame badge in the menu, the visual cue that flags a model as less restrictive. Easier to spot at a glance when you need character-locked, multi-reference video with fewer content guardrails.
API access now works with credits alone; no active subscription required. If you ever saw 'An active subscription is required to use the API' from /api/v1/generate even though you had plenty of credits (or paid via a credit pack instead of a recurring plan), that error is gone. Behavior now matches what /api-platform/documentation has always said: signed-in users can create an API key and use it as long as they have credits.
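The clip prices above follow directly from the per-second rates. A minimal sketch, with the table and function names invented for the example:

```python
# Credits per second, taken from the entry above.
OLD_RATES = {"480p": 1.5, "720p": 3}
NEW_RATES = {"480p": 1, "720p": 2}

def clip_cost(resolution, seconds, rates=NEW_RATES):
    """Credits for one clip at the given resolution and length."""
    return rates[resolution] * seconds

for resolution in ("480p", "720p"):
    for seconds in (5, 8):
        old = clip_cost(resolution, seconds, OLD_RATES)
        new = clip_cost(resolution, seconds)
        print(f"{resolution} {seconds}s: {old} -> {new} credits")
```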
Twenty-one tools picked up an in-tool prompting-guide banner today: open one and you'll see a small 'Read the guide' card at the top of the settings sidebar (or in the empty state) linking straight to that model's dedicated prompting guide. Video side: Runway Gen-4.5, LTX 2.3, Pixverse V6, Pixverse v5.6, Seedance 1, WAN 2.5, P-Video. Image side: Hunyuan Image 3, Dreamina 3.1, Ernie, PonyXL, WAN Image, ImagineArt, Qwen Image, P-Image, WAN 2.7 Image, WAN 2.6 Image, FLUX Studio, GPT-Image 1.5, Z Image Turbo, FLUX 2 Flex. The guides have been live at /guides/* for a while; they're just one click away from the tool page now.
FLUX Studio prompting guide at /guides/flux-studio-prompting-guide: documents the every-FLUX-1-variant workspace (Schnell, Dev, Krea Dev, FLUX Pro, 1.1 Pro / Pro Ultra, Kontext Pro / Max). Four worked examples plus a tier-selection guide for matching variants to use cases.
WAN Image prompting guide at /guides/wan-image-prompting-guide: the fast Replicate-backed WAN image variant. Documents the optional LoRA fal-lora variant and how it pairs with WAN 2.7 Image for hero shots. Four examples covering portrait, landscape, product, and stylized.
WAN 2.6 Image prompting guide at /guides/wan-2-6-image-prompting-guide: intermediate-gen WAN with the distinctive fixed pixel-dimension preset system (1280×1280, 1280×720, 768×1024, etc.) vs free aspect ratios. Useful when you need exact downstream output dimensions.
P-Video prompting guide at /guides/p-video-prompting-guide: Pruna's three-mode video model (text-to-video, image-to-video, audio-to-video). Documents the cost-efficient tier choice and the audio-to-video mode for music-driven motion and lip-sync work.
Ernie Image prompting guide at /guides/ernie-prompting-guide: Baidu's image flagship with multilingual prompt support (English, Chinese, Japanese) and native CJK text rendering. Four worked examples spanning a Chinese tea master portrait, Shanghai Pudong skyline, Chinese calligraphy sign, and traditional garden pavilion.
FLUX 2 Flex prompting guide at /guides/flux-2-flex-prompting-guide: the resolution-flexible FLUX 2 variant where you pick 0.5/1/2/4 MP per generation. Documents how to iterate cheap at 0.5 MP and finalize at higher MP, plus how it relates to FLUX 2 Pro / Max / Dev.
Pixverse v5.6 prompting guide at /guides/pixverse-prompting-guide: the dependable previous-generation Pixverse workhorse. Quality tiers from 360p to 1080p, optional audio, predictable motion. Four examples covering glassy lake sailboat, beach character walk, fountain pen signature, and skater handrail grind.
WAN 2.5 prompting guide at /guides/wan-2-5-prompting-guide: fast image-to-video flash variant in the WAN family. Documents how the speed-optimized I2V tier compares to the premium WAN 2.7. Four worked examples spanning rice terrace drone, maple forest character walk, ink-drop macro, and sunset surfer vertical.
PonyXL prompting guide at /guides/ponyxl-prompting-guide: documents the score-tag quality system (score_9, score_8_up, score_7_up that you must lead every prompt with) and booru-style tag prompting conventions. Four worked examples across anime portrait, fantasy knight, chibi cat wizard, and anime mecha.
Z Image Turbo prompting guide at /guides/z-image-turbo-prompting-guide: covers what's distinctive about Z Image Turbo's synchronous response (URL returns inline, no polling round-trip needed), and why that matters for high-volume programmatic workflows. Four production examples.
WAN 2.7 Image prompting guide at /guides/wan-2-7-image-prompting-guide: the image-side sibling of WAN 2.7 Video. Documents Standard/Pro tier choice and how the same prompt language carries between WAN Image and WAN Video for consistent aesthetics across modalities.
P-Image prompting guide at /guides/p-image-prompting-guide: at 0.1 credits per image it's the cheapest image option on the platform. Documents the moodboard / high-volume use case where you generate 30-50 variations for ~3-5 credits total, then upscale the keepers via a premium tier.
Hunyuan Image 3 prompting guide at /guides/hunyuan-image-3-prompting-guide: Tencent's 80-billion-parameter image model. Four worked examples spanning painterly fantasy, photoreal cultural portrait, Brutalist architecture, and mechanical watch macro. Covers the Fast/Quality toggle and the unrestricted safety-off behavior.
Google Gemini Image prompting guide at /guides/google-gemini-image-prompting-guide: Google's image flagship with three tiers (Standard at 1 credit, Pro at 3-6) and unique native 4K support. Examples for cinematic 21:9 landscape, documentary portrait, food editorial at 4:5 Instagram-spec, and stylized illustration.
ImagineArt prompting guide at /guides/imagineart-prompting-guide: three model tiers (2.0 preview, 1.5 Pro, 1.5) in one workspace, nine aspect ratios including the unusual extremes 1:3 (ultra-tall mural) and 3:1 (ultra-wide banner). Four worked examples cover cinematic, fashion editorial, pop art, and ultra-tall mural composition.
Grok Imagine Video prompting guide at /guides/xai-video-prompting-guide: xAI's video model. Four examples (Mojave Mustang, Irish cliff character, chocolate truffle macro, snowboarder vertical) covering cinematic motion, atmospheric scene composition, and 480p/720p tier choice.
Qwen Image prompting guide at /guides/qwen-image-prompting-guide: covers the Plus / Max tier choice, snapshot pinning for campaign consistency, and clean text rendering in Latin and CJK scripts. Four worked examples including a multi-line bookshop sign with proper kerning.
Dreamina 3.1 prompting guide at /guides/dreamina-prompting-guide: 4MP cinematic image generation with atmospheric defaults, flat pricing across 1K and 2K, and custom dimensions (256–2048 each axis) for spec-size output. Four worked examples covering architecture, food editorial, fashion portrait, and atmospheric fantasy.
Pixverse V6 prompting guide at /guides/pixverse-v6-prompting-guide: documents the unique multi-clip mode (first frame + last frame generates the transition between them), the six built-in style presets (anime, 3D animation, clay, comic, cyberpunk, none), and the moderation quirks worth knowing.
Seedance 1 prompting guide at /guides/seedance-1-prompting-guide: smooth natural motion, six aspect ratios including the standout 21:9 ultra-wide cinematic, native audio with sound-cue steering, and the end-frame interpolation mode for motion between two anchor images.
GPT-Image 1.5 prompting guide at /guides/gpt-image-1-5-prompting-guide: covers literal prompt adherence, in-image text rendering with quoted strings, transparent-background PNGs (real alpha channel, no cutout pass needed), and when to reach for it over FLUX or Seedream. Five worked examples spanning chalkboard typography, editorial illustration, macro photoreal, transparent logo, and cinematic landscape.
Ideogram Character prompting guide at /guides/ideogram-character-prompting-guide: how to get consistent character variations from a single reference image, no LoRA training required. Includes a worked example of one character pushed across fantasy armor, anime, film noir, and sci-fi astronaut styles. Reference-image quality rules included.
Runway Gen-4.5 Video prompting guide at /guides/runway-gen4-video-prompting-guide: cinematic motion, camera-language fluency, six aspect ratios from 21:9 cinematic through 9:16 vertical. Four worked examples (aerial drone push, character tracking shot, product macro, vertical reel) plus the gotcha that 1:1 aspect requires image-to-video mode.
LTX 2.3 prompting guide at /guides/ltx-2-prompting-guide: the broadest-surface video model on the site. Covers the Fast vs Pro tradeoff, durations up to 20 seconds, resolutions through 4K, optional native audio, and three modes (text-to-video, image-to-video, video extension for 30+ second pieces). Four worked examples and a full settings table.
Eight fresh prompting guides shipped today: FLUX, Recraft V4.1, Seedream 4.5, HiDream O1, Hailuo 2.3, Kling Image v3, Kling 2.6 Pro, and WAN 2.7 Video. Each one is built from a shared format with real generated examples (not stock clips), a copy-ready prompt library, settings reference, and a model-specific FAQ. Browse them all at /guides.
Three of the eight are video model guides: Hailuo 2.3, Kling 2.6 Pro, and WAN 2.7 Video. Each ships with embedded sample clips you can scrub through before writing your own prompt. The shared template will speed up every video guide we publish from here.
Four image model guides for FLUX, Recraft V4.1, Seedream 4.5, and HiDream O1 cover style ranges, aspect ratios, the model's prompt sensitivities, and what to ask for vs what to leave out. Useful whether you're brand-new to the model or you've hit a wall trying to nail a specific look.
Seedream 3 has been fully retired. The model was no longer reachable from the dashboard, but a few public API and Canvas surfaces still advertised it, which meant API calls to /api/v1/models/seedream-3/run were returning service-unavailable errors. Cleaned up the catalog so what you see is what you can actually generate with. If you were using Seedream 3, move over to Seedream 4.5 or Seedream 5 Lite; both are live and stronger across the board.
Hunyuan Image 3 moved to a new backbone so we can ship it with the safety checker fully disabled at the provider, so content blocks should no longer interrupt your generations. Pricing changes accordingly: 2 credits per image (was 1).
Hunyuan Image 3 settings simplified: replaced the Steps + CFG sliders with a single Fast / Quality mode toggle (Fast is the default, flip it off when you want max fidelity). Aspect ratio presets expanded from 3 to 7 (added 16:9, 9:16, 4:3, 3:4, 3:2, 2:3).
Hunyuan Image 3 is now available: Tencent's 80-billion-parameter text-to-image model with high-fidelity output and fine-grained control. 1 credit per image. Generate up to 4 images at once.
Hunyuan Image 3 exposes Steps (10–50) and Guidance / CFG (1.5–15) controls: adjust denoising iterations and how literally the model follows your prompt. Default values (20 / 7) work for most cases; nudge up CFG when you want the output closer to the prompt.
Hunyuan Image 3 ships with three aspect ratio presets: 1:1 (square), 3:1 (ultra-wide cinematic), 1:3 (ultra-tall portrait). More aspect ratios to follow once we lock down the supported pixel mappings.
Bria 3.2 prompting guide is live at /guides/bria-3-2-prompting-guide: five example prompts spanning Photography and Art mediums, plus six prompting tips for matching medium to language, anchoring quality, and getting clean product shots.
Bria 3.2's 14:17 and 17:14 aspect presets were sending unsupported pixel dimensions and erroring on submit. The pixel mappings have been corrected, and those aspects now generate cleanly.
Bria 3.2 is now available: text-to-image at 1K resolution across 9 aspect ratio presets (1:1, 3:2, 2:3, 7:4, 4:7, 9:7, 7:9, 17:14, 14:17). 1 credit per image. Generate up to 4 images per prompt and pick the output format (PNG, JPG, or WEBP) before you hit go.
Bria 3.2 medium toggle: bias outputs toward Photography or Art with a single click. Useful when prompts read ambiguously and you want to nudge the result one direction or the other without rewriting the prompt.
Bria 3.2 image and prompt enhancement: optional toggles for richer detail / sharper textures and for letting the model expand your prompt with descriptive variations.
Vidu Q3 is now available: text-to-video and image-to-video across four resolution tiers (360p, 540p, 720p, 1080p) and five aspect ratios (16:9, 4:3, 1:1, 3:4, 9:16). Pick exactly the frame and resolution you need before generating instead of upscaling or reformatting later. Credit cost scales with resolution: 1 credit/second at 360p and 540p, 2 at 720p, 3 at 1080p.
Optional synchronized audio on Vidu Q3: off by default for clean tracks you'll post-score, one toggle on when you want ambient and diegetic sound layered in with the visuals. End your prompt with a 'Sounds of …' line to steer the soundscape.
Vidu Q3 image-to-video: upload one source still and Vidu writes motion onto it from your prompt. Source aspect drives the output frame automatically, so portrait photos animate as portrait videos without reformatting.
Vidu Q3 prompting guide at /guides/vidu-q3-prompting-guide: six example prompts spanning cinematic aerials, character close-ups, product spots, multi-shot action, atmospheric wides, and vertical I2V. Plus a settings reference table and a six-item FAQ.
Image-to-video on WAN 2.7 was silently dropping the source frame in some flows: you'd attach a photo, ask it to animate, and the result would be a completely different subject. The variant didn't pin its mode internally, so a default kicked in that took the text-only branch and ignored the image. Now pinned correctly: your starting frame is honored every time, whether you call it from chat, Canvas, the dashboard, or the public API.
Audited every multi-mode video model for the same bug pattern. Same fix applied to Happy Horse 1.0 (both text-to-video and image-to-video variants) and Kling Video v3 (all four: Standard Text, Standard Image, Pro Text, Pro Image). Worst case caught: Kling Pro Image was doubly broken; it would have been billed at Standard pricing AND ignored the image. All fixed.
If you've ever pasted a long, detailed prompt and gotten hit with "Prompt must be 2,500 characters or less" or similar, that's gone. Every generation tool now lets your full prompt through. If a specific model genuinely can't handle the length you'll see that model's own error message (which is more accurate than a guessed cap), but the upfront client-side rejection is no longer in the way.
Same prompt-honor fix from earlier today extended to Dreamina: its 'Enhance Prompt' toggle was on by default and silently rewriting your wording before generation. Now off by default like the rest.
Hover-info icons added to the prompt-rewrite toggles on Dreamina, Hailuo, and WAN 2.2 too. Every "enhance / optimize / expand" checkbox across the site now tells you upfront what it actually does.
Audited every generation tool for silent prompt truncation. No tool ever shortens what you typed before sending it to the model. If your prompt is too long for a particular model you'll see a visible error so you can trim, never a silent edit.
If you ever felt like WAN 2.7 Spicy (or one of the other WAN, Qwen, or Z Image tools) wasn't following your prompt, you were right. Every generation was silently running your wording through an upstream auto-rewriter that expanded and stylized it before the model saw it. That's now off by default across all of them. What you type is what the model gets. You can still turn it back on if you want the AI to embellish short prompts.
Every "Enable prompt expansion" toggle now has a hover-info icon that explains exactly what it does (the model rewrites your wording) so the trade-off is visible at decision time.
Cleaned up the default negative prompt on video tools. Removed "excessive motion" and "flickering"; those were silently fighting action-heavy prompts. Only always-harmful negatives (anatomy defects, watermarks, compression noise) remain. If you want stillness, you can still add it to your own negative prompt.
Images you generated with the SDXL LoRA tool weren't recording which LoRA produced them, so when you opened the details popover in My Media on one of those images, the 'LoRAs Used' section stayed blank even when you'd clearly picked one. The save flow was dropping the LoRA metadata on the floor. Every new save from the LoRA Image Creator now correctly stores the LoRA name and strength, and they show up in the details panel where they belong. Older saves can't be retroactively tagged, but everything going forward will be.
A new 'My Stuff' button sits next to 'All Tools' in the top bar and opens a dropdown with everything personal: Dashboard, Canvas, Agent, My Media, My LoRAs, My Prompts, Community Gallery, and your Usage Report. One click from any tool page. These destinations used to live two clicks deep inside the All Tools drawer (or buried in the avatar menu).
Your full pinned tools list and the last five tools you visited also show up inside My Stuff, so you can jump between everywhere you actually work without leaving the page you're on. The three-icon pinned strip in the top bar stays exactly where it was.
Yesterday's 'Save All' truthful-toast fix only covered the image tools. Now every video tool (Kling Video, Veo 3.1, Hailuo, Seedance, WAN, Grok Video, LTX, Pixverse, Runway, P Video, Omnihuman, and the rest) uses the same single-round-trip save with per-item results. Select 10 videos, hit Save, you get an accurate count back. Mobile-safe with keepalive enabled.
Same fix applied to LoRA Image Creator. Bulk-save reliability now matches every other tool on the site.
If you ever clicked 'Save All' on a batch of generated images and saw the success toast but didn't find them in My Media later, that was a real bug, not a glitch. The save flow has been rebuilt around a single server round-trip with truthful per-image results: the toast now says 'Saved 7 of 10 – 3 failed' if any of them didn't land, and the failed ones stay selected so you can retry without re-picking them.
Mobile (especially iPhone) was where this hit hardest. iOS Safari was killing in-flight save requests whenever you switched apps or locked your screen mid-batch. Saves now run keepalive-true and the server-side fan-out keeps everything going even if you background the tab, so no more silent drops.
Recraft is now V4.1: better photorealism, smoother gradients, and noticeably better text rendering vs V4. Same prompt understanding and design taste, just sharper across the board. Available at /recraft.
Three output variants under the same tool: Raster (standard image), Vector SVG (editable, layered), and a brand-new Utility variant tuned for product mockups, with flat lighting, front-facing composition, and clean scenes. Perfect for packaging, icons, and e-commerce hero shots.
Pricing simplified and cheaper across the board. All Standard generations are 1 credit (was 1–2). All Pro generations are 5 credits (was 6–8). Pick the variant you need without doing pricing math first.
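The truthful toast can be sketched from a per-item result map. The result shape, the function name, and the wording of the all-succeeded toast are assumptions; only the partial-failure wording comes from the entry above.

```python
def save_all_summary(results):
    """Build the toast text and retry list from per-item save results.

    results maps an item id to True (saved) or False (failed); this
    shape is hypothetical.
    """
    failed = [item for item, ok in results.items() if not ok]
    total = len(results)
    saved = total - len(failed)
    if failed:
        toast = f"Saved {saved} of {total} – {len(failed)} failed"
    else:
        toast = f"Saved {saved} of {total}"  # assumed success wording
    # The failed ids are what would stay selected for a retry.
    return toast, failed
```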
Aspect ratio menu expanded from 8 to 14 options. Added 2:1, 1:2, 5:4, 14:10, 10:14, and 6:10 alongside the standard square/portrait/landscape set.
Kling Avatar used to charge the full 60-second rate up front regardless of how long your audio actually was: a 2-second clip cost 120 credits at Standard. Now the cost panel reads your audio file the moment it uploads and charges only for the real duration. That same 2-second clip is 4 credits now (or 6 on Pro).
Kling Avatar V2 is live. Upload a portrait and an audio file and get a lip-synced talking video. It works on realistic humans, stylized characters, cartoons, and animals. Great for talking-head explainers, narrated product demos, and giving any character a voice without filming.
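The before-and-after arithmetic looks like this. The per-second rates (2 on Standard, 3 on Pro) are the ones listed for Kling Avatar V2; the function names are made up for the example, and how fractional seconds are rounded is not stated in the changelog, so the sketch does not round.

```python
RATES = {"standard": 2, "pro": 3}  # credits per second
OLD_FLAT_SECONDS = 60              # previously billed up front

def old_cost(tier):
    """Old behavior: always bill the full 60 seconds."""
    return RATES[tier] * OLD_FLAT_SECONDS

def new_cost(tier, audio_seconds):
    """New behavior: bill only for the real audio duration."""
    return RATES[tier] * audio_seconds

print(old_cost("standard"), new_cost("standard", 2), new_cost("pro", 2))
```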
Two quality tiers: Standard for fast turnarounds (2 credits/second) and Pro for sharper facial detail and smoother motion (3 credits/second). Video length matches your audio automatically.
Optional prompt slot lets you guide actions, emotions, and camera movement: the audio still drives the mouth and timing, the prompt adds direction on top ('warm enthusiasm', 'subtle nods', 'wide shot').
Public API calls to MuleRouter-backed video tools (WAN 2.2 / 2.7 Spicy Image-to-Video, Z-Image Spicy) now surface the same helpful error messages the in-app tools already do. When the model is briefly offline upstream, you'll see 'This video model is briefly offline while we top up our backbone. Your credits were refunded – please try again in a few minutes.' instead of the previous internal architecture message. Applies broadly: any tool with a server-side override now bubbles its real error (size limits, format issues, transient outages) through /api/v1, not just MuleRouter models.
Type /photoshoot or /effect and a picker pops up with the named choices: hero / lifestyle / with-model / levitating / restyle for photoshoots, and dolly-zoom / 360-spin / fpv / crane-up / whip-pan / parallax / vertigo / bullet-time for effects. Each entry has a short description so you can see what it does before picking. Tab or Enter to select; the value splices into your prompt and you keep typing the scene.
Typing something that isn't a valid mode/effect after /photoshoot or /effect (like pasting a URL) used to fail with a cryptic 'invalid enum value' error after sending. The picker now shows 'No match' inline as you type so the mistake is obvious before you waste the send.
/photoshoot turns any product image into a brand-quality shot. Five modes: hero (clean studio), lifestyle (real-world setting), with-model (person interacting), levitating (dynamic floating with splashes), and restyle (same product, fresh aesthetic). Each mode bakes in the prompt scaffolding a product photographer would write; you just pick the mode and add direction.
/effect adds a named cinematic effect to a video shot: dolly-zoom (Vertigo), 360-spin, fpv (first-person drone), crane-up, whip-pan, parallax, vertigo, or bullet-time. Pass the effect name and what the camera is looking at; the camera-move prompt is handled for you.
/hooks generates 10 paid-social ad copy hooks for a product or concept, ready for TikTok / Reels / Shorts. Pure text, no image generation, zero credit cost. Mixes curiosity, social proof, transformation, contrarian, and FOMO angles so you can A/B test scroll-stoppers.
/model picker now works inline anywhere in your message, not just at the start. Type /model mid-prompt to pop a filterable list of every available model. It narrows as you type, and picks splice into your text without wiping what you'd written.
The + button gained a 'Pick a model' shortcut and an 'Insert skill' submenu. Both insert at your cursor instead of replacing your prompt, so you can write half a sentence, click +, choose a skill or model, and keep typing.
Follow-up skills (/edit, /character, /storyboard, /upscale, /photoshoot, /effect) now automatically use your last generated image as the reference. 'animate the dragon' just animates the dragon: no re-attaching, no re-picking.
'Just make me an image' was sometimes routing to an image-edit model by mistake, which would error because no reference image was attached. The Agent now correctly defaults to a true text-to-image model when you don't supply one.
Slash commands in the Agent. Type / and pick a skill directly (/generate, /edit, /character, /storyboard, /upscale, /from_url, /campaign) to skip straight to the tool without going through the chat picker. Tab or Enter to confirm.
Pin a specific model with /model <apiId> <prompt>, e.g. /model wan-2.7-i2v animate her smiling. Great when you know exactly which model you want and don't want the Agent to decide for you.
Attach reference images two ways: 'Upload from computer' or 'From My Media', which lets you pick any image you've already generated on PixelDojo without leaving the chat.
Attached images now render as a real thumbnail inside your chat bubble instead of a clunky '[attached image: URL]' line. Looks like a proper chat, and follow-ups like 'animate this' thread the image through automatically.
Pasting a product URL (/from_url) now renders a rich preview card (hero image, product name, description, and source) instead of a wall of plain text. Much easier to confirm at a glance that the Agent grabbed the right product before you tell it to make hero shots.
The Agent's default text-to-image pick was Dreamina even when WAN and Nano-Banana are actually more popular. Defaults now follow real Canvas usage rank instead of alphabetical order, so 'just make me an image' lands on what creators actually keep, not whichever model came first in the alphabet.
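The command shape /model <apiId> <prompt> splits cleanly on whitespace. A minimal parsing sketch, assuming the command sits at the start of the message; the function name and return shape are hypothetical, and this is not the Agent's actual parser:

```python
def parse_model_command(message):
    """Split '/model <apiId> <prompt>' into its parts.

    Returns None when the message is not a complete /model command.
    """
    parts = message.strip().split(maxsplit=2)
    if len(parts) < 3 or parts[0] != "/model":
        return None
    return {"api_id": parts[1], "prompt": parts[2]}

print(parse_model_command("/model wan-2.7-i2v animate her smiling"))
```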
Reading a product page from a URL no longer renders the success card with a red error border. It's a successful action, so it now looks like one.
When you ask the Agent to 'animate this' on an image you uploaded or just generated, it now picks WAN 2.7 i2v by default, which is far more faithful to the source subject than the more creative video models, which sometimes reinterpret the image entirely.
Every Agent generation card now shows the source image it used, so you can confirm at a glance that 'edit it' / 'upscale that' picked up the right reference.
The Agent now offers the same skill set as the public /skills page: edit, product extraction from URL, and full campaign generation (hero + lifestyle + video pack from one product link) are all available in chat.
Reference images now thread through reliably when you say 'edit this' or 'animate it' as a follow-up, with no more dropping the image silently. Credits also refresh in the top bar the moment they're deducted, and conversation history persists cleanly when you leave and come back.
Asking the Agent to 'make a video' with WAN 2.2 no longer errors with 'mode: required'; text-to-video vs image-to-video is auto-detected from your prompt.
Qwen Image, Qwen Image 2, and their Edit variants now accept your full prompt. We removed the 800-character soft cap and the 'we trimmed your prompt' warning. Long, detailed prompts go through unmodified. If you regularly write past 800 chars for fine-grained control, this is for you.
Meet the Agent. Describe what you want ('cinematic samurai portrait', 'edit this to add neon rain', 'make a 6-second video of waves') and it picks the right model from across the 70+ tools and runs it. Live at /agent from the sidebar, available on subscribed plans. Generations cost the same credits as the tools they call; the chat itself is free.
The Agent auto-tunes to real PixelDojo usage. Defaults land on the models creators actually keep for each task (text-to-image, edits, video, upscales, characters), so 'just make me an image' produces what's working today, not yesterday's pick.
Six built-in skills cover the common asks: generate, edit, character (consistent identity across shots), storyboard (video), upscale, and a status checker for long-running jobs.
Videos render in the background and appear in chat the moment they're ready, using the same auto-polling pattern the video tool pages use. No need to ask 'is it done yet?'.
Drag in a reference image to anchor edits, character generations, or upscales. Each completed asset shows the model that made it with Save / Download / Open full size on hover. Conversation history saves so you can resume any chat.
Right-sized the Prompt Library headline so the featured strip stays visible without scrolling on a standard desktop. Still bold and editorial, just no longer eating the whole viewport.
Removed a redundant 'AI-powered' badge from the search bar. The sparkle icon already says it, and the extra chrome was competing with the Create button.
When semantic search has no real matches for your query, the empty state now says so honestly and points you to popular prompts instead of leaving you stranded with a generic 'not found' message.
Semantic search on the Prompt Library now only returns prompts that actually relate to your query. Weak matches are filtered out before they reach you: search for 'tesla' and you'll see Tesla-themed prompts (or an honest 'no semantic matches' when the library doesn't have any), not a random mix of motivational and business prompts pretending to be relevant.
Newly created prompts are now searchable the moment you save them. Previously, only prompts that were part of a periodic backfill were findable by semantic search; anything saved between backfills was invisible until the next sweep. Embedding now runs at save time, so brand-new prompts show up in relevant search results immediately.
The Prompt Library landing page got a real makeover: bigger editorial typography, a massive widescreen hero showcase autoplaying the standout video prompt of the week, and a row of supporting picks underneath. Great prompts now feel like featured work, not search results.
A 'Create Prompt' shortcut now lives right in the library hero so you can jump straight into the builder from anywhere in the library.
Semantic search now labels its results as 'Top N matches' instead of a misleading total. The AI search surfaces the most relevant prompts, not every possible match, and the UI now reflects that clearly.
A brief render error took the Prompt Library landing page offline for a window today. It is now resolved.
The Prompt Library now uses AI-powered semantic search. Type what you're looking for in plain English and it finds prompts by meaning, not just keyword matching. 'Cinematic portrait at dusk' finds dusk-mood prompts even if they don't use those exact words.
Every prompt detail page now shows up to six semantically related prompts below the main one, so you can discover variations and adjacent styles without starting a new search.
A featured strip at the top of the library showcases curated prompts with rendered video and image previews. Hover to see them come to life before you click through.
Outpaint no longer fails with a bare "Unprocessable Entity" error when you expand from the edges. The submit path now drops zero-valued sliders before the request reaches fal.ai, so an "edges" expansion with one or two active sides goes through cleanly instead of being rejected as an empty payload.
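For a sense of what "dropping zero-valued sliders" can look like, here is a minimal sketch. The side names (left, right, top, bottom) and the payload shape are assumptions for illustration, not the actual Outpaint request format.

```python
def drop_zero_sides(expansion: dict) -> dict:
    """Keep only the sides whose slider is above zero.

    The keys used here are hypothetical; the real request fields may differ.
    """
    return {side: pixels for side, pixels in expansion.items() if pixels > 0}


# An "edges" expansion with two active sides and two sliders left at zero.
request = drop_zero_sides({"left": 0, "right": 256, "top": 0, "bottom": 128})
print(request)
```

Only the active sides remain in the request, so it no longer looks like an empty payload.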
When fal.ai does reject an outpaint request, you now see the actual validation reason (which field or value was wrong) instead of just "Unprocessable Entity." Same translation we already use for Smart Resize and other fal-backed tools, now wired into Outpaint.
Click Enhance now works with model-specific examples on every PixelDojo tool, not just the top 10. Niche image and video models now pull from their own track record instead of guessing from generic data, a real precision boost for less-mainstream tools.
The pool of prompts the enhancer learns from just got dramatically larger. Every thoughtful prompt now counts toward retrieval, not just the most-liked or admin-approved ones. Those are still weighted higher, but they are no longer the only ones in the running. Many more candidates per request.
Deleting a single image or video from My Media now fully cleans up the underlying file. Before, the gallery entry would disappear right away but the file lingered in storage.
Click Enhance now learns from real prompts that have produced great generations on PixelDojo. Instead of using a generic template, the enhancer pulls a handful of high-quality examples that actually worked on your chosen model and uses them to shape the rewrite. Think of it as having access to the prompt notebooks of every successful generation on the platform: 40,000+ curated entries plus everything in the community gallery.
Per-model styling. The enhancer now knows that what works for Flux is different from what works for Veo 3, and pulls relevant examples from each model's track record. Your enhanced prompts are tuned to the model you're using, not a generic image template.
Gets better the more you use it. Every time you save, download, or upscale a generation, that signal feeds back into the enhancer. Prompts that produce things you keep get weighted higher next time someone enhances a similar idea.
Drop-in upgrade. If retrieval can't find anything useful for your specific prompt, you get the same enhancement you've always had: no regression, no errors. The new system is additive.
The Canvas co-pilot now recommends models based on what the community is actually using, not what its training data thinks is famous. Top models per chain type (generate, edit, upscale, animate) are pulled live from real Canvas usage and refreshed every hour. As new models climb the leaderboard, the co-pilot picks them up automatically. You can still name any model in your prompt ("use Flux," "with Seedream") and it will honor your pick.
WAN 2.7 Spicy history cards now have Use first frame and Use last frame buttons, matching WAN 2.2 standard. One click grabs a frame from any past video and drops it into the start-image slot for your next generation.
Clarity Pro Upscaler is here: photorealistic upscaling up to 16× with a creativity dial that goes from strict fidelity all the way to expressive re-imagining. Realistic skin, preserved identity, and output up to 64 megapixels (~8K). Find it in Upscale & Enhance.
The community gallery and all user profiles are now fully public: no account required to browse, discover, and get inspired. Share any gallery link with anyone.
Gallery image descriptions are now generated the moment you open a piece. Open any image in the gallery and a smart description appears automatically, giving every creation more context.
The gallery detail page got a layout refresh: likes, views, share, and vote are now in a single compact row so the image takes center stage.
Kling Video multi-shot mode was silently blocking generation in some cases due to a form validation issue. It now submits correctly every time.
Multi-Shot Mode on Kling Video 3.0: write a separate prompt for each shot (up to 6) and Kling stitches them into one seamless clip. Toggle it on, describe your scenes, choose Custom or Auto pacing, and generate. Great for short narratives, product demos, or any video that needs distinct beats.
Creator Studio has been retired. If you had it bookmarked, the link now takes you straight to Canvas, the better tool for multi-model workflows.
A small welcome card on the dashboard now leads with the most important thing to know: "Generations live here for 24 hours." Save what you want to keep, otherwise it's gone tomorrow. Inline links to your last-24-hour Recent pill, My Media, and Buy Credits. Dismiss it once and it's gone for your account across every browser and device.
Tool empty states (the screen you see before you generate anything on most tools) now include the same 24-hour notice. Belt-and-suspenders so you see the message before you spend credits, not after you wonder what happened to your generation.
The NSFW Image-to-Video guide now covers all three NSFW-capable image-to-video models on PixelDojo (WAN 2.7 Spicy, Happy Horse 1.0, and WAN 2.2) with a side-by-side model picker, per-model pricing tiers, and a quick-decision matrix so you know exactly which one to reach for. WAN 2.7 Spicy for the sharpest motion, Happy Horse 1.0 for physics-aware intimate scenes, WAN 2.2 when you need a character LoRA or cheap iteration.
The Recent floating pill (bottom-left of the dashboard) just got a real job. Every row now has three icon buttons on the right: Save, Download, and Remove. Save promotes that generation into your permanent My Media library; Download grabs the file directly to your computer; Remove hides the row from Recent (it never deletes the file or anything in your gallery).
Recovery: if you accidentally delete a generation from My Media, Recent still has it for 24 hours. Hit Save in Recent and it goes right back into your library.
Saved items now show a green checkmark in Recent so you can see at a glance which generations are already in your permanent library and which are still just "in flight."
ImagineArt 2.0 (preview) is here. The next-gen entry in the ImagineArt family lands with sharper realism, stronger composition, and the most faithful prompt following yet. 1 credit per image.
The /imagineart page now hosts the whole ImagineArt lineup (1.5, 1.5 Pro, and 2.0) in one workspace. Pick the model from the panel on the right, no more page-hopping. The menu link and page title are now just "ImagineArt" instead of "ImagineArt 1.5."
Pasting a really long prompt into Qwen Image (or Qwen Image 2, or either of their Edit variants) used to bounce with a validation error: the model only reads 800 characters and we were rejecting anything longer outright. Now we trim the prompt to fit and pop a one-time warning toast telling you it happened, so the generation just runs. Same treatment for negative prompts (which cap at 500). If you're calling the API directly, the response now carries a `warnings` array describing what got trimmed.
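The trim-and-warn behavior in this entry can be sketched roughly as follows. The 800 and 500 character limits come from the entry itself; the function name, the return shape, and the wording of each warning are assumptions, and the real `warnings` array may be structured differently.

```python
PROMPT_LIMIT = 800           # characters the model reads, per this entry
NEGATIVE_PROMPT_LIMIT = 500  # negative prompt cap, per this entry


def trim_prompts(prompt: str, negative_prompt: str = "") -> dict:
    """Trim over-long prompts instead of rejecting them, and record what was cut."""
    warnings = []
    if len(prompt) > PROMPT_LIMIT:
        # Hypothetical warning text; only the existence of the array is documented.
        warnings.append(f"prompt trimmed from {len(prompt)} to {PROMPT_LIMIT} characters")
        prompt = prompt[:PROMPT_LIMIT]
    if len(negative_prompt) > NEGATIVE_PROMPT_LIMIT:
        warnings.append(
            f"negative prompt trimmed from {len(negative_prompt)} "
            f"to {NEGATIVE_PROMPT_LIMIT} characters"
        )
        negative_prompt = negative_prompt[:NEGATIVE_PROMPT_LIMIT]
    return {"prompt": prompt, "negative_prompt": negative_prompt, "warnings": warnings}


result = trim_prompts("a" * 1000)
print(len(result["prompt"]), result["warnings"])
```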
Search in All Tools is much smarter. Multi-word queries now AND across tokens (typing "edit face" actually returns face-editing tools instead of zero results), category names are part of the searchable text (typing "face" surfaces everything in Characters & Faces, not just tools with "face" in the title), and a small synonym map covers common variations: "edit" → inpaint / modify / fix, "upscale" → enhance / sharpen, "video" → clip / animate, etc.
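Here is a rough sketch of how AND-across-tokens matching with a synonym map can work. The synonym entries are the ones listed in this entry; the tool records, field names, and substring matching are assumptions made for the example.

```python
SYNONYMS = {
    "edit": {"inpaint", "modify", "fix"},
    "upscale": {"enhance", "sharpen"},
    "video": {"clip", "animate"},
}


def matches(query: str, tool: dict) -> bool:
    """Return True when every query token (or one of its synonyms) appears
    in the tool's title or category name."""
    haystack = f"{tool['title']} {tool['category']}".lower()
    for token in query.lower().split():
        variants = {token} | SYNONYMS.get(token, set())
        if not any(variant in haystack for variant in variants):
            return False  # AND semantics: one missing token rejects the tool
    return True


# Hypothetical catalog entries.
tools = [
    {"title": "Inpaint Face", "category": "Characters & Faces"},
    {"title": "Video Upscaler", "category": "Upscale & Enhance"},
]
print([t["title"] for t in tools if matches("edit face", t)])
```

Because the category name is part of the searched text, a query like "face" also reaches tools whose titles never mention it.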
All Tools sidebar feels calmer. The 11 category icons used to all show in full color at rest, a parade of pink / red / blue / purple / etc. that fought for attention with no clear hierarchy. Icons now render at half opacity at rest with a much subtler colored chip behind them, then snap to full color on hover or when their category is active. Same color identity, way easier to scan.
Reverted the mobile All Tools Pinned and Recent sections from two-column to single-column. The two-column was harder to read because tool labels and descriptions got tight at narrow widths.
The search box is now a permanent affordance in the top bar: click it (or press ⌘K) to instantly search across every tool, style, and model. Replaces the static Dashboard / Canvas / My Media / My LoRAs / My Prompts / Gallery / API row that used to sit there. Those destinations live one click away in All Tools' Quick Access section, and on the dashboard cards. Search is the right primary affordance when you have 100+ tools.
Up to 3 of your pinned tools now show as compact icon buttons in the top bar (on screens md and wider). Pin from any tool card and it appears in your bar; unpin to remove. Pin order is stable so your bar doesn't shuffle. The icon for the tool you're currently on lights up, so your bar tells you where you are.
Theme toggle moved out of the bar and into the user menu (click your avatar). Theme is rarely changed once set, and freeing the bar slot leaves more room for what you actually use day to day.
Mobile All Tools is way less to scroll. Browse Categories now shows up first (instead of being buried under Pinned, Recent, Quick Access, and Featured Tools). Pinned + Recent fit two-per-row instead of stacked. Featured Tools stays on the dashboard hero where you can already see them. Net result: about 60% less vertical scroll before you reach the catalog.
Removed the duplicate "Account" category from All Tools (it had Usage Report, Profile, Subscription, and Feedback, all of which live in the user menu next to your avatar, where you'd expect to find account stuff).
All top-bar buttons are now 40px tall (was 36px). Counting the padding around the icon, that clears the roughly 44px touch-target threshold. Easier to tap on iPad and laptop trackpads.
Open All Tools and the Home tab is now structured as a real launchpad: your Pinned tools at the top, your Recent generations next, then a Quick Access strip with Dashboard / Canvas / My Media / My LoRAs / My Prompts / Gallery / API, then a curated Featured Tools section. Pinning your first tool no longer makes the Featured suggestions disappear; both coexist.
The drawer's left-rail tab is now "Home" instead of "Quick Start," with a calmer Home icon. The "Featured Tools" hero on /dashboard and the "Featured Tools" section in the drawer now share the same curated list, so when we promote a model, both surfaces update.
When you start typing in the drawer's search box, the left rail no longer keeps the previously-selected category lit up. Search results are global, so the highlight matched nothing. Sidebar entries dim while a search is active.
The Recent floating pill (bottom-left of the dashboard) now self-heals. If a generation it remembers has been deleted from your gallery or the temp preview expired, the broken thumbnail row auto-hides and the count updates instead of showing an empty box.
Drawer chrome cleanup: section labels are sentence case (Pinned / Recent / Featured Tools / Found N tools) instead of WIDELY-TRACKED UPPERCASE; duplicate Account / Usage / Credits links removed from the drawer footer (they live in the user menu and the credits pill); the small Editor / Spicy legend at the sidebar bottom is gone (the badge tooltips on each card already explain themselves).
Removed the two large pulsing blue / sapphire ambient circles that animated behind every dashboard page. They were 5%-opacity decoration; the dashboard now feels a touch calmer and more tool-focused without them.
A floating Recent button appears bottom-left of every dashboard page once you've generated something. Click it to see every successful generation from the last 24 hours across every tool (image, video, edit, spicy, the lot) with a thumbnail, timestamp, and one-click jump back to the tool that made it. Built for the all-too-common moment of "I refreshed and now I can't find my video."
WAN 2.7 Standard and WAN 2.7 Spicy now share their history grid: a Spicy generation shows up on the Standard page and vice versa. Same treatment for WAN 2.2 Standard and WAN 2.2 Spicy. Previously each "tab" was a separate route with its own history, so jumping between Standard and Spicy could make a generation look lost.
Qwen Image edits no longer ignore your aspect-ratio selection. Picking 1:1, 16:9, 4:3 (or any other ratio) now resizes the output instead of silently mirroring the source image's dimensions. Caught and reported by jazze#77. Thank you. Note: Spicy edits still inherit source dimensions; that's a backbone limitation, not a UI bug.
If your connection drops mid-generation (common on mobile or shaky Wi-Fi), you used to see a bare "Failed to fetch" toast with no context, leaving you wondering whether your credits had been silently spent. The toast now reads "Connection lost – your credits are safe" and explains that if a generation actually started, credits refund automatically within 2 hours. Check your generation history before retrying. Applies across every tool page (every browser: Chrome, Safari, Firefox, Edge).
The public API for /api/v1/models/wan-image/run now matches its own documented default: you can omit numberOfImages and get 1 image, instead of bouncing with "validation_error: numberOfImages: Required." The docs always said default: 1 but the schema was requiring the field; both sides now agree.
WAN 2.7 Spicy Image-to-Video is live. Upload a starting image, write a motion prompt, and animate it at 720p or 1080p for 2–15 seconds. Optional driving audio controls lip-sync or motion. Click the Spicy tab on the WAN 2.7 Video page to switch in.
WAN 2.2 Spicy Image-to-Video joins the lineup as a faster, lighter sibling. 480p at 1.5 credits/sec and 720p at 3 credits/sec, with 5- or 8-second clips. Click the Spicy tab on the WAN 2.2 page to flip in.
Qwen Image Edit Spicy is now a one-click toggle inside the existing Qwen Image edit flow. Click an image, enter Edit Mode, then click the Spicy chip: no second page, no re-uploading. Flat 1 credit per edit, single image output.
Z Image Spicy lives alongside Z Image Turbo with the exact same layout and controls. Flip between Standard and Spicy at the top of the composer and your prompt carries over. Same aspect-ratio presets, same prompt grammar, same masonry history grid.
Switching between Standard and Spicy variants preserves your prompt and source image across the swap. Drop a prompt into the standard tab, click Spicy, keep going.
Generations that fail validation now show what went wrong instead of silently doing nothing. If your prompt is too long, your image too large, or a value out of range, a clear toast tells you exactly which field and what to change. Previously the Generate button just returned with no feedback, a long-standing papercut across Qwen Image, Qwen Image 2, WAN Image, ImagineArt, Z-Image Turbo, Dreamina, Ernie, and Ideogram Character.
Running out of credits now shows "Insufficient credits" with a link to top up, instead of a generic "generation.failed" toast. ~50 tool pages were checking the credit-error response in a way that quietly missed the real shape; they all detect it correctly now.
Error toasts no longer leak internal codes like "INPUT.INVALID_PARAMETER" or "CREDITS.INSUFFICIENT" as the title. The friendly message that was supposed to render now actually renders.
Every parameter on every API model docs page now has a description. Hundreds of fields used to render as just a name and a type with no explanation; they now carry plain-English context: what the field does, what the valid values mean, when to use it. Affects /api-platform pages, the OpenAPI spec, the LLM-readable docs at /llm, and Canvas's "More Settings" panel. All of these surfaces auto-update from a single source, so future tools inherit the descriptions for free.
Edit tools (Qwen Image Edit, WAN 2.7 Image Edit, GPT-Image 2 Edit, P-Image Edit, OpenAI Image 1 Edit, HiDream Edit, and a few others) now ship a multi-image example alongside the single-image curl. The capability was always there, but the docs only showed one URL; it was easy to miss that you can compose up to 3–9 reference images (the cap varies by tool) in a single call. The new examples make it obvious.
HiDream is now HiDream O1, a brand-new image backbone replacing the previous L1 family. Two variants live behind one tool: Full (the production quality target) and Dev (a faster, distilled version). Pick between them with a one-click pill right above the prompt.
Pricing simplified to a flat 1 credit per generation and 1 credit per edit. No more dual-credit tiers: every HiDream call costs the same regardless of variant.
Aspect ratio moved inline with the composer. Pick Square HD, Square, Landscape 4:3 / 16:9, or Portrait 3:4 / 9:16 right next to the Generate button, with no more buried Settings modal. The picker uses real aspect-ratio icons so the layout you'll get is visible at a glance.
HiDream O1 honors the account-wide Spicy mode toggle. Flip Spicy on in Settings → Content Preferences and HiDream generations + edits stop applying the post-hoc safety check. The /skills tools page now shows the flame badge so you can spot which tools are spicy-capable.
Image editing got the same upgrade. The Edit endpoint now uses HiDream O1's image-conditioned mode (Full and Dev variants), still 1 credit per call, and accepts up to four reference images for subject-driven edits if you call the public API directly.
Cleaner workspace: the standalone Settings modal and the post-generation auto color correction option are gone. Both were holdovers from the old engine that didn't have a home in the new flow.
Public API users: /api/v1/models/hidream/run and /api/v1/models/hidream-edit/run are live with the new model + image_size schema. The OpenAPI spec and LLM docs picked it up automatically. The legacy /api/hidream and /api/hidream/edit endpoints continue to work for existing integrations.
Qwen Image 2 now defaults to 2048×2048 (was 1024×1024). If you weren't picking a size manually, every new generation gets 4× the pixels, noticeably sharper out of the box. The previous default was using a quarter of what the model can actually produce.
Qwen Image 2 unlocks a high-fidelity "Max" resolution tier: 2048×2048, 2688×1536, 1536×2688, 2368×1728, and 1728×2368. The 2K and 1K tiers stay as alternatives for faster generations.
Qwen Image Plus and Max gained three controls that were previously hardcoded: smart prompt rewriting (the model adds detail to short prompts), watermark on/off, and a custom negative prompt. All available in the public API and Canvas More Settings.
Qwen Image Edit (both the legacy and 2.0 variants) now respects a watermark toggle. Output sizes are also tightened to a documented set, so invalid combinations get a clean error instead of a confusing 500.
Sending more than 1 output to Qwen Image Plus or Max used to bounce with an unhelpful upstream error, because the underlying model is fixed at 1 image. You now get a clean message at request time telling you to use LoRA mode if you want multiple outputs in a single call.
Power users on the public API can now pin to dated model snapshots (e.g. `qwen-image-2.0-pro-2026-04-22`, `qwen-image-plus-2026-01-09`) for reproducible behavior across model rev bumps. The rolling names like `qwen-image-2.0-pro` keep tracking the latest.
Polished the look of the Qwen Image and Qwen Image 2 pages in light mode: Generate and Subscribe buttons now use the sapphire site CTA color, and the resolution picker reads cleaner. The new "Max" tier shows what it actually represents on hover (about 4 megapixels, the model's documented sweet spot).
WAN 2.7 Pro now generates true 4K images. Switch to Pro on /wan-2.7-image and set resolution to 4K: output jumps from 2048² up to 4096². Text-to-image only (the model caps editing at 2K).
Resolution and aspect ratio are now independent controls. Pick your size (1K / 2K / 4K) and your shape (1:1, 16:9, 9:16, 4:3, 3:4, 5:4, 4:5) separately, instead of hunting through a combined "size" dropdown with twelve mixed values. Standard model gets 1K and 2K; Pro adds 4K.
Edit Mode accepts up to 9 reference images at once, up from 4. Bring your whole mood board.
Interactive edit: in Edit Mode, toggle on "Interactive edit (bounding boxes)" and drag rectangles directly on each reference thumbnail to restrict changes to those regions. Up to 2 regions per image. Useful for outfit swaps, sign edits, or selective restyles without touching the rest of the image.
Custom color palettes in the Settings menu: bias a generation toward 3–10 specific colors with weighted ratios. Hex picker, percentage sliders, and an auto-balance button that distributes ratios to a clean 100%. Good for brand work or hitting a specific mood.
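One way an auto-balance step could reach a clean 100% is the largest-remainder method, sketched below. This is an assumption about the approach, not a description of the actual button; the function name and the whole-percentage rounding are illustrative.

```python
def auto_balance(weights: list) -> list:
    """Scale raw weights to whole percentages that sum to exactly 100."""
    total = sum(weights)
    if total <= 0:
        raise ValueError("at least one weight must be positive")
    exact = [w * 100 / total for w in weights]
    shares = [int(x) for x in exact]  # round each share down first
    # Hand the leftover points to the entries with the largest fractional parts.
    leftover = 100 - sum(shares)
    by_fraction = sorted(
        range(len(weights)), key=lambda i: exact[i] - shares[i], reverse=True
    )
    for i in by_fraction[:leftover]:
        shares[i] += 1
    return shares


print(auto_balance([2, 1, 1]))
print(sum(auto_balance([1, 1, 1])))
```

Rounding each share independently can land on 99 or 101; distributing the leftover points keeps the total at exactly 100.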
Seed control in the Settings menu. Pin a number for reproducible runs; leave blank for random.
All of the above is also live on the public API, the OpenAPI spec, and the LLM docs, so scripts and agents pick everything up automatically.
The public API for Kling 2.6 Pro and Seedance 1.5 now accepts the video duration as a plain number (e.g. "duration": 5) as well as a string ("5"). Previously, sending the number (which most HTTP clients do by default, and which our own Seedance curl example showed) bounced back with "duration: Expected '5' | '10', received number." Both shapes work now.
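Accepting both shapes usually comes down to normalizing the value before validating it. A minimal sketch, assuming the allowed values are the 5 and 10 named in the old error message; the function name and error text are hypothetical.

```python
from typing import Union

ALLOWED_DURATIONS = {5, 10}  # taken from the old error message quoted above


def normalize_duration(value: Union[int, str]) -> int:
    """Accept 5 or "5" (and 10 or "10") and return the duration as an int."""
    if isinstance(value, str) and value.strip().isdigit():
        value = int(value.strip())
    # bool is a subclass of int in Python, so exclude it explicitly.
    if isinstance(value, int) and not isinstance(value, bool) and value in ALLOWED_DURATIONS:
        return value
    raise ValueError(f"duration must be 5 or 10, received {value!r}")


print(normalize_duration(5), normalize_duration("10"))
```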
Seedream 4.5 custom output sizes that fall outside the model's supported range used to fail with a cryptic provider error ("image size must be at least 3686400 pixels. Request id: 0217…"). You now get a clear message with your actual width × height, the real limit (about 3.7–16.8 megapixels), and a concrete suggestion like "increase to 2560×1440".
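A pre-flight size check along these lines can be sketched as below. The 3,686,400-pixel minimum comes from the provider error quoted in this entry; the 16,777,216 maximum (4096 x 4096) is an assumption that matches "about 16.8 megapixels", and the message wording is illustrative.

```python
from typing import Optional

MIN_PIXELS = 3_686_400   # from the provider error quoted in this entry
MAX_PIXELS = 16_777_216  # assumption: 4096 x 4096, i.e. about 16.8 megapixels


def check_size(width: int, height: int) -> Optional[str]:
    """Return a readable error for out-of-range sizes, or None when the size is fine."""
    pixels = width * height
    if MIN_PIXELS <= pixels <= MAX_PIXELS:
        return None
    return (
        f"{width}x{height} is {pixels / 1_000_000:.1f} megapixels; "
        "the supported range is about 3.7 to 16.8 megapixels"
    )


print(check_size(1920, 1080))
print(check_size(2560, 1440))
```

Note that 2560 x 1440 is exactly 3,686,400 pixels, which is why it works as the suggested minimum.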
When the generation service behind P-Image, P-Image Edit, P-Video, and P-Video Avatar is briefly overloaded, you now see a plain "the generation service is busy right now – please wait a moment and try again" instead of a bare technical error like "PRUNA API error: 504."
When Canvas Co-pilot's planner refuses your request because the underlying model's safety filter flagged it, you now see "Co-pilot can't plan that – your request was flagged by the safety filter. Please rephrase your goal …" instead of the generic "Co-pilot couldn't draft a plan. Try again." The previous wording asked you to retry an identical prompt that would keep failing.
Nano Banana reference-image dimension errors used to surface as "Runware: Invalid reference image width. The reference image width must be an integer between 1 and 12288.", which leaked the provider brand name and the internal parameter name. You now see a clean message that explains each side must be between 1 and 12288 pixels and recommends 4000px on the long edge as a safe target. Same cleanup applies to prompt-length rejections (the message now says exactly which range is supported, no brand prefix). Centralized in the Runware client so every Runware-backed tool (Image Relighting, Google Gemini Image, Pixverse v6, Consistent Characters, Veo 3.1, and Nano Banana) gets the cleaner messages.
WAN 2.5 in Canvas now accepts an empty prompt when you're animating an image. Previously, dropping an image into a WAN 2.5 step and hitting run without typing a description bounced back with a "Prompt is required" error from the UI itself, even though the underlying model is happy to animate the image with neutral motion. The schema now treats the prompt as optional in image-to-video mode (text-to-video still requires one).
If your card got declined and you fixed it by starting a fresh subscription (instead of updating the card on the failing one), your account could end up marked as expired even though the new payment had gone through and the next renewal was already on the calendar. The two billing events were arriving seconds apart and the older one was overwriting the newer one. Subscriptions now stay active in this case, and a small number of customers we found in the bad state have already been put back to active automatically. No action needed on your end.
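A common guard against this kind of race is to compare event timestamps and ignore anything older than the state already stored. The sketch below illustrates that general idea only; it is not necessarily how the fix was implemented, and the class, field names, and timestamps are hypothetical.

```python
from dataclasses import dataclass


@dataclass
class SubscriptionState:
    status: str
    updated_at: int  # creation time of the event that set this state (epoch seconds)


def apply_event(current: SubscriptionState, status: str, created_at: int) -> SubscriptionState:
    """Apply a billing event unless it is older than the state already stored."""
    if created_at < current.updated_at:
        return current  # stale event: keep the newer state
    return SubscriptionState(status=status, updated_at=created_at)


# The new subscription's "active" event was created at t=200 and processed first;
# the old subscription's "expired" event was created at t=195 but arrives later.
state = SubscriptionState(status="active", updated_at=200)
state = apply_event(state, "expired", created_at=195)
print(state.status)
```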
Canvas now has an AI co-pilot. Press / on the canvas (or click the spark icon on the right edge) and ask it for anything: "make 4 lifestyle variants of my active shot," "build a launch from shop.example.com/runner," "animate this into a 6-second reel." It drafts a plan, shows you the cost upfront, and one click runs the whole chain. Parallel-safe too: 4 lifestyle gens fire at the same time instead of waiting in a queue.
Plans collapse to a clean title + cost + Run button. Tap "Show steps" to see the numbered plan, or "Show graph" for a tiny DAG view of how steps depend on each other. Power-user detail is one click away when you want it; out of sight when you don't.
100-credit hard cap per plan, enforced both at the AI planner and on the server. If a plan would cost more, you get a clean rejection with a hint to ask for something smaller. No surprise bills.
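The server-side half of that cap can be as simple as summing step costs before anything runs. A minimal sketch, assuming a plan is a list of per-step credit costs; the function name and message wording are hypothetical.

```python
PLAN_CREDIT_CAP = 100  # hard cap per plan, per this entry


def check_plan(step_costs: list) -> tuple:
    """Return (allowed, message) for a plan given its per-step credit costs."""
    total = sum(step_costs)
    if total > PLAN_CREDIT_CAP:
        return (
            False,
            f"This plan would cost {total} credits, over the {PLAN_CREDIT_CAP}-credit cap. "
            "Try asking for something smaller.",
        )
    return True, f"Plan cost: {total} credits."


print(check_plan([3, 3, 3, 3, 40]))
print(check_plan([60, 60]))
```

Checking on the server as well as in the planner means a plan that slips past the planner still cannot exceed the cap.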
Co-pilot runs use the same execution path as a manual generate: same credits, same automatic refunds on failure, same job poller, same telemetry. Failed nodes can be retried without re-running the ones that already succeeded.
Greenfield keyboard map: / and ⌘K both open the co-pilot. ⌘M opens the model picker. Esc collapses. The right-edge icon rail is 56px when collapsed so the existing single-generate workflow looks and feels exactly the same, and power users don't pay for a feature they didn't ask for.
Grok Imagine now has a Quality tier alongside the original Fast tier. Pick Quality for noticeably sharper detail, more accurate compositions, and stronger text rendering, ideal for thumbnails, hero images, ad creative, or any final visual you'd ship to a client.
Choose 1K (1024px on the long edge) or 2K (2048px on the long edge) output on the Quality tier. 1K is faster and cheaper; 2K is the high-resolution deliverable. The default is 2K.
The Quality tier supports editing too: upload an image, describe the change, and get a high-fidelity edit at 1K or 2K. Works the same way the Fast tier edits do, just with crisper output.
Pricing stays simple: Fast tier is still 1 credit per image. Quality is 2 credits at 1K or 3 credits at 2K. The credit chip on the Generate button updates live as you switch tiers and resolution.
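The pricing above fits in a small lookup. The credit values come from this entry and the 2K default from the Quality tier notes; the function name and the tier and resolution strings are assumptions for illustration.

```python
QUALITY_CREDITS = {"1K": 2, "2K": 3}  # Quality tier prices from this entry


def grok_credits(tier: str, resolution: str = "2K") -> int:
    """Credits per image for a given tier; resolution only matters on Quality."""
    if tier == "fast":
        return 1
    if tier == "quality":
        if resolution not in QUALITY_CREDITS:
            raise ValueError(f"unknown resolution: {resolution!r}")
        return QUALITY_CREDITS[resolution]
    raise ValueError(f"unknown tier: {tier!r}")


print(grok_credits("fast"), grok_credits("quality", "1K"), grok_credits("quality"))
```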
Canvas now offers Grok Image Quality as its own tile in the model picker. Drop it into any chain when you want a cleaner final pass without leaving the canvas.
When Grok Image Edit can't generate an output (usually because xAI's safety filter quietly refused the edit, or the source image is in an unsupported format), you now see a message that names the likely cause and tells you what to do (rephrase the edit instruction, or re-upload the source image as a JPEG or PNG, then retry). Previously it surfaced a bare "Prediction failed:" with nothing after the colon, which gave you zero clue what went wrong.
pixeldojo:campaign. Drop a product URL into your AI agent (Claude Code, Cursor, Codex, Cline, Windsurf, Zed, and any other MCP client) and get back a hero image, four lifestyle variants, and a vertical video, all from one call. About 50 credits, two minutes end to end. Perfect for a quick-turn ad pack or a social drop.
pixeldojo:from_url. Paste any product page URL (Shopify, WooCommerce, BigCommerce, or anything with structured data) and get back the product name, description, and images. Used as the first step inside Campaign, but also handy on its own when an agent needs structured product info without copy/paste.
pixeldojo:generate now accepts a preset slug. Pass preset: "cinematic-portrait" or preset: "product-floating-clean" and skip the prompt-writing step entirely. The 60 expert-tuned presets in our library now work the same way through your agent as they do on the website.
Job status checks now show how long a job has been running. After 5 minutes your agent gets a "this is unusually long" warning; after 12 minutes it gets told to start a fresh request instead of polling forever. Fewer agents stuck in endless poll loops on the rare hung job.
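The two thresholds above can be pictured as a simple classification of job age. This is only an illustrative sketch: the advice labels and the function name are hypothetical, and treating the boundaries as inclusive is an assumption.

```typescript
type PollAdvice = "keep_polling" | "unusually_long" | "start_fresh_request";

const WARN_AFTER_MS = 5 * 60 * 1000;     // 5 minutes, from the changelog entry
const GIVE_UP_AFTER_MS = 12 * 60 * 1000; // 12 minutes, from the changelog entry

// Maps how long a job has been running to the guidance an agent would receive.
// Inclusive boundaries (>=) are an assumption.
function adviceForJobAge(elapsedMs: number): PollAdvice {
  if (elapsedMs >= GIVE_UP_AFTER_MS) return "start_fresh_request";
  if (elapsedMs >= WARN_AFTER_MS) return "unusually_long";
  return "keep_polling";
}
```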
The Skills page at pixeldojo.ai/skills now shows the new agentic tools front-and-center, with code snippets for each. New shortcut: paste pixeldojo.ai/skills.md into your AI agent and say "install this". The page has a built-in directive that tells your agent exactly which command to run.
Nano Banana jobs that appeared stuck on "still processing", even after the actual generation had completed, now resolve correctly. The status check was looking up jobs through the wrong backbone for the Pro and Pro v2 tiers, so it never saw the completion. Old stuck jobs unblock immediately if you retry; new jobs return URLs as expected. If you generated a Nano Banana image in the last day and it never came back, retry it now.
When fal.ai rejects a Smart Resize request (status 422), you now see the actual validation reason (for example, which target size or input field was wrong) instead of the bare "Unprocessable Entity" status text. This is the same translation we already use for the other fal-backed tools, now wired into the Smart Resize submit path so you can fix the input on the first retry.
The rotating agent chip on the Skills page now shows the actual brand mark for each integration (Claude Code, Cursor, Codex, Cline, Windsurf, and Zed) instead of two-letter initials. The chip also sits properly inline with the headline now, lined up with the cap height of the surrounding letters instead of dropping below the baseline.
The install steps under the hero are tidier: Step 01's title fits on one line so all three columns line up, and the agent list across the page (hero subtitle, install card, and "Works with" pills) is consistent end-to-end.
The hero headline on the Skills page now cycles through the agents that support PixelDojo (Claude Code, Cursor, OpenClaw, Codex, Hermes, NemoClaw, and Cowork), each with a small color-coded chip so you can immediately spot your setup.
The install section is now three numbered steps (01 / 02 / 03) instead of icon cards, making the path from zero to working agent quicker to skim.
The new /presets page lets you browse all 60 workflow presets in one place, from product photography and fashion to storyboards and text rendering. Every card links directly to the matching tool with the preset loaded.
The preset library grew from 24 to 59 entries, all with real generated example images. Each thumbnail was produced with that exact preset so you know what output to expect before you click.
The Skills page now has a live terminal demo showing exactly what the agent conversation looks like: you describe what you want in plain English, it routes to the right model, and you get back an output URL. Makes the workflow concrete instead of abstract.
The hero on the Skills page now cycles through real agent use cases, such as "Build a 4-shot product storyboard", so it's immediately clear what you can hand off to your LLM.
A "Build with PixelDojo" card is now on the main dashboard, linking developers straight to the Skills page and API Platform from day one.
Seedream 4 and Seedream 5 Lite now translate ByteDance's "The request failed because the image format is not supported by the API. Request id: 0217..." into a clean message that names the likely cause (HEIC, AVIF, GIF, or a corrupted file) and tells you what to do (re-upload as JPEG, PNG, or WebP). The internal request id no longer leaks to users.
Reference-image size errors on Nano Banana and Consistent Characters are clearer. The new message fixes the unit casing of the previous one ("Please re-upload an image between 5kb and 20MB") and explains both failure modes: if your image is too large, compress or resize it (under 4000px on the long edge usually works); if it's smaller than 5KB, the file may be corrupted or didn't finish uploading. It also lists the supported formats (JPEG, PNG, WebP) so you know exactly what to send back.
When Seedream rejects a prompt for sensitive content or copyright reasons, you now see a clean "Generated content was blocked by the model provider - try a different prompt or input media" instead of the raw "The request failed because the input text may contain sensitive information. Request id: 0217..." envelope. Credits are refunded the same way; the change is just the message you see.
If you ever saw the orange "You're out of credits" banner while your header still showed plenty of credits, that's gone. The banner could latch on after a brief subscription-status hiccup or a top-up and stay visible until you dismissed it manually, even with hundreds of credits in your account. It now re-checks the live state on every render and hides itself the moment your balance is healthy again.
@pixeldojo/mcp ships on npm. Run `npx @pixeldojo/mcp init` for copy-paste install snippets, set PIXELDOJO_API_KEY, restart your agent, and you've got five named tools your LLM can call: pixeldojo:generate (any prompt → image or video), pixeldojo:character (consistent characters across shots), pixeldojo:storyboard (multi-shot scenes from one brief), pixeldojo:upscale (enhance any image), and pixeldojo:status (check on long jobs). Works with Claude Code, Cursor, OpenClaw, and any other Model Context Protocol host.
Smart polling for long jobs. Fast generations come back in the same call. Jobs that take longer than 30 seconds (LoRA training, long-form video) return a job ID and a status URL, and your agent calls pixeldojo:status to check on them. No more 5-minute MCP timeouts that look like failures. Polling uses exponential backoff (1s → 2s → 4s → 8s) so we don't hammer your account's rate limit.
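The backoff schedule described above can be sketched as follows. This is an illustration under stated assumptions, not the MCP server's code: `backoffDelayMs`, `pollUntilDone`, and the `JobState` shape are hypothetical, and holding the delay at 8s after the fourth step is an assumption, since the entry only lists the first four delays.

```typescript
// Delay before the nth retry (0-based): 1s, 2s, 4s, 8s, then held at the cap.
// The 8s cap is an assumption; the changelog only lists the first four steps.
function backoffDelayMs(attempt: number, baseMs: number = 1000, capMs: number = 8000): number {
  return Math.min(baseMs * Math.pow(2, attempt), capMs);
}

// Hypothetical job shape, for illustration only.
type JobState = { done: boolean; outputUrl?: string };

// Polls a caller-supplied status function until the job reports done,
// sleeping for the backoff delay between checks.
async function pollUntilDone(
  checkStatus: () => Promise<JobState>,
  maxAttempts: number = 10,
  sleep: (ms: number) => Promise<void> = (ms) => new Promise<void>((resolve) => setTimeout(resolve, ms)),
): Promise<JobState> {
  for (let attempt = 0; attempt < maxAttempts; attempt++) {
    const state = await checkStatus();
    if (state.done) return state;
    await sleep(backoffDelayMs(attempt));
  }
  throw new Error("Job did not finish within the polling budget");
}
```

Passing `sleep` as a parameter keeps the loop testable without real waiting.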
Your API key never appears in your agent's logs. The MCP server strips any pd_xxx token from every log message before it leaves the process, even debug output. Same for error messages: a 401 surfaces as "Set PIXELDOJO_API_KEY" with a link to the keys page, never as a stack trace with your token in it.
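A redaction step like the one described above might look like the sketch below. The exact key format is not documented here, so the pattern (the `pd_` prefix followed by letters, digits, underscores, or hyphens) and the placeholder text are assumptions, and `redactApiKeys` is a hypothetical name.

```typescript
// Assumed key shape: "pd_" followed by letters, digits, "_" or "-".
const API_KEY_PATTERN = /\bpd_[A-Za-z0-9_-]+/g;

// Replaces anything that looks like an API key before a message is logged.
function redactApiKeys(message: string): string {
  return message.replace(API_KEY_PATTERN, "pd_[redacted]");
}
```

Because the placeholder itself does not match the pattern, running the function twice on the same message leaves it unchanged.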
Recraft tool page now respects the ?preset=<slug> URL param. Click a preset thumbnail anywhere on the site and the recraft form opens with the prompt and aspect ratio already filled in. The other preset-having tools follow next.
PixelDojo Skills replaces the developer-agent landing at /agents. The new /skills page leads with a one-line install for AI-agent tools (Claude Code, Cursor, OpenClaw), covers four named skills your agent can call (generate, character, storyboard, upscale), and shows real preset thumbnails you can try without writing a prompt. Old /agents and /agents.md links keep working: they redirect to the new home.
Try-a-preset on every tool docs page. Visit any model's page on the API Platform and you'll see a row of curated starting points: cinematic portraits, product-on-marble shots, magazine covers, and more. Click one and we'll open the tool with those parameters loaded, so you can play before you write a single line of prompt. Twenty-four presets ship today across the most-used image and video models, with more on the way.
Use-case filter chips on the API Platform. Browse 130+ models by what you actually want to make: marketing, character, cinematic, fashion, photorealism, text-in-image, upscale, video, and more. The filters are deep-linkable: share /api-platform?useCase=character with a teammate and they land on the right shortlist. Empty results now ship with a one-click clear-filters button instead of a dead-end message.
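Deep-linkable filters of this kind usually round-trip through the query string. A minimal sketch using the standard `URLSearchParams` API follows; the helper names are hypothetical, and only the `/api-platform` path and the `useCase` parameter come from the entry above.

```typescript
// Builds a shareable link for a given use-case filter.
function useCaseLink(useCase: string): string {
  const params = new URLSearchParams({ useCase });
  return `/api-platform?${params.toString()}`;
}

// Reads the filter back out of a link; returns null when no filter is set.
function useCaseFromLink(link: string): string | null {
  const query = link.split("?")[1] ?? "";
  return new URLSearchParams(query).get("useCase");
}
```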
The Skills page tells the truth about scale. The hero now reads 130+ models, matching what's actually on the API Platform, instead of the old 40+ figure. The model showcase pulls live counts from the registry, so the number you see is the number you can call.
Sapphire-aligned design across the developer landing. The fuchsia-violet-cyan gradient look is gone, replaced with the sapphire palette that already drives the rest of the product. Cleaner hero, cleaner CTAs, less visual noise. The page feels like a workspace tool, not a marketing template.
If a renewal payment failed and your subscription ended, the Subscribe page could keep telling you "You're Already Subscribed", sometimes for an hour or more, and the Subscribe button stayed locked out. The page now reflects billing changes within a minute, so re-subscribing after a card decline works the way you'd expect.
Submitting WAN 2.2 with an unsupported quality_mode + resolution combo (e.g. quality_mode "plus" with resolution "720p"; Plus only supports 480p and 1080p) used to fail opaquely as "credit_error: Invalid credit amount" after the request was already accepted, sounding like an account issue. You now get a schema validation error upfront that names the exact resolutions your chosen quality mode supports, e.g. "Resolution '720p' isn't available on the plus quality mode - pick one of: 480p, 1080p."
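An upfront cross-field check like this can be sketched as a lookup from quality mode to its allowed resolutions. Only the "plus" row is taken from the entry above; the table name, the function name, and the choice to pass unknown modes through are assumptions for illustration.

```typescript
// Only the "plus" row comes from the changelog; other modes are omitted
// because their supported resolutions are not stated there.
const SUPPORTED_RESOLUTIONS: Record<string, readonly string[]> = {
  plus: ["480p", "1080p"],
};

// Returns an error message, or null when the combination is allowed
// (or when no rule is known for the mode).
function checkResolution(qualityMode: string, resolution: string): string | null {
  const allowed = SUPPORTED_RESOLUTIONS[qualityMode];
  if (!allowed || allowed.indexOf(resolution) !== -1) return null;
  return `Resolution '${resolution}' isn't available on the ${qualityMode} quality mode - pick one of: ${allowed.join(", ")}.`;
}
```

Running the check before any credits are touched is what turns the old late "credit_error" into an early validation error.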
Switching models in Canvas no longer leaks settings from the previous tool. Picking Qwen Image 2 with the Pro variant and then switching to WAN 2.7, WAN, or ImagineArt used to send the previous tool's `model` value to the new one, failing with "Invalid enum value, expected 'wan-2.7-standard' | 'wan-2.7-pro', received 'qwen-image-2.0-pro'". Same root cause for sizes (1280×1280 / 1536×1536) and durations (numeric vs. string) leaking across model switches. Canvas now resets all advanced settings whenever the active model changes; re-selecting the same model preserves your current picks.
Seedream 5 Lite custom-size mode now works end-to-end. Picking the "Custom" image size with your own width and height used to be rejected at the schema gate as "Invalid enum value, expected 'auto_2K' | 'auto_3K' | 'square_1_1' | … received 'custom'" even though the server already supported it. Custom dimensions now flow through cleanly, matching how Seedream 4.5 already worked.
Sending more than 14 reference images to Nano Banana used to surface the raw provider error "Runware: Invalid number of elements for 'referenceImages' parameter", with confusing brand and parameter names and no guidance. You now get a clean message that includes your actual count and tells you exactly how many to remove (e.g. "You sent 20 reference images. Nano Banana supports up to 14 - please remove 6 and try again.").
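The count-aware message above boils down to one subtraction. A minimal sketch, with a hypothetical function name and message wording modeled on the example in the entry:

```typescript
const MAX_REFERENCE_IMAGES = 14; // limit stated in the changelog entry

// Returns a user-facing message when too many images were sent, else null.
function referenceImageError(count: number): string | null {
  if (count <= MAX_REFERENCE_IMAGES) return null;
  const excess = count - MAX_REFERENCE_IMAGES;
  return `You sent ${count} reference images. Nano Banana supports up to ${MAX_REFERENCE_IMAGES} - please remove ${excess} and try again.`;
}
```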
WAN 2.5 in Canvas no longer offers 3s and 4s durations that the model rejects, and now lets you go up to 15s. The duration slider used to span 3-10s, but the model's actual range is 5-15s, so picking 3 or 4 failed with "Number must be greater than or equal to 5" and the 11-15s range was hidden. The slider now matches the model exactly, and stale durations from sibling tools (WAN 2.2, Hailuo, LTX) auto-clamp into range.
WAN Reference to Video in Canvas no longer fails when you switch in from a sibling tool with a different resolution or aspect-ratio picker. Stale '480p' values from WAN 2.2 and unsupported aspects like '3:2' used to slip through and trigger "Invalid enum value"; Canvas now defensively falls back to 720p / 16:9 when an incompatible value is in flight.
P Video Avatar is live. Upload a portrait, write a short script (or paste an audio URL for lip-sync), and get a talking-head video. 30 named voices, 10 languages: English (US/UK), Spanish, French, German, Italian, Portuguese, Japanese, Korean, Hindi. 1 credit per second at 720p, 2 credits per second at 1080p. Find it under Characters & Faces in the All Tools menu.
P Video Avatar runs unrestricted, so your spicy mode setting carries through the same way it does on the rest of the spicy-friendly tools.
Switching to Hailuo 02 in Canvas after a model with a different resolution picker (WAN 2.5, PixVerse, etc.) used to fail with "resolution: Invalid enum value, expected '768p' | '1080p', received '720p'" because Canvas carried over the previous model's value. Canvas now exposes Hailuo's exact resolutions and falls back to 768p when an incompatible value is in flight.
LTX 2 in Canvas no longer rejects odd-numbered durations. The duration slider used to step by 1 (so picking 7s or 11s failed with "Duration must be one of: 6, 8, 10, 12, 14, 16, 18, 20 seconds"); it's now an explicit list of valid lengths. Picking longer than 10s on the Fast tier also auto-locks 25fps + 1080p so the cross-tier requirement is always satisfied.
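The two rules in this entry (a fixed list of durations, and a Fast-tier lock for clips longer than 10s) could be combined in one normalization step. This is a sketch under assumptions: the settings shape, the tier names, and the function name are hypothetical; the duration list and the 25fps + 1080p lock come from the entry.

```typescript
const LTX2_DURATIONS = [6, 8, 10, 12, 14, 16, 18, 20]; // from the error message quoted above

interface Ltx2Settings {
  tier: "fast" | "pro"; // tier names are assumptions
  duration: number;     // seconds
  fps: number;
  resolution: string;
}

// Rejects durations outside the list, then applies the Fast-tier lock
// for clips longer than 10 seconds.
function normalizeLtx2(settings: Ltx2Settings): Ltx2Settings {
  if (LTX2_DURATIONS.indexOf(settings.duration) === -1) {
    throw new Error(`Duration must be one of: ${LTX2_DURATIONS.join(", ")} seconds`);
  }
  if (settings.tier === "fast" && settings.duration > 10) {
    return { ...settings, fps: 25, resolution: "1080p" };
  }
  return settings;
}
```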
Runway Gen-4.5 in Canvas now constrains duration to 5s or 10s the same way the standalone Runway Video tool does, so stale 6s/8s values from sibling models no longer leak through and fail with "duration: Invalid input".
Happy Horse Reference to Video is here. Upload up to 9 reference images (characters, props, costumes), describe your scene with character1 / character2 tokens, and the model generates a video with each subject locked to its image. 720p or 1080p, 3-15 second clips, 4 credits per second at 720p and 6 at 1080p.
Happy Horse Video Edit lets you transform existing footage with a prompt: style transfer ("make this look like a Studio Ghibli film") or local replacement ("swap the sweater for the one in the reference image"). Pair a source clip with up to 5 reference images, and choose to keep your original audio or have new audio generated. Same pricing as the rest of the Happy Horse family: 4 credits per output second at 720p, 6 at 1080p. Find both in Generate Videos under WAN Family.
Happy Horse 1.0 is now live. Generate fast, expressive video from text or a starting image at 720p or 1080p, with clip lengths from 2 to 15 seconds. 4 credits per second at 720p, 6 credits per second at 1080p. Find it under Generate Videos in the All Tools menu.
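Per-second pricing makes clip cost easy to estimate ahead of time. The sketch below uses the rates and the 2 to 15 second range from this entry; the function name is hypothetical, and it assumes cost scales linearly with the requested length.

```typescript
type HappyHorseResolution = "720p" | "1080p";

// Credits per second, from the changelog entry.
const CREDITS_PER_SECOND: Record<HappyHorseResolution, number> = {
  "720p": 4,
  "1080p": 6,
};

// Estimates the cost of a clip; assumes linear per-second billing.
function happyHorseCredits(seconds: number, resolution: HappyHorseResolution): number {
  if (seconds < 2 || seconds > 15) {
    throw new RangeError("Clip length must be between 2 and 15 seconds");
  }
  return seconds * CREDITS_PER_SECOND[resolution];
}
```

A 10-second clip at 1080p would come to `happyHorseCredits(10, "1080p")`, which is 60 credits.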
Happy Horse runs unrestricted out of the gate, alongside the rest of the spicy-friendly WAN family.
Public API calls to qwen-image-2.0, qwen-image-2.0-pro, qwen-image (Plus/Max), the Qwen Image Edit model in image-relighting, and the Qwen Image 2 Pro model in consistent-characters were hanging indefinitely. The dashboard worked fine because it polls a different endpoint, but the public API's polling didn't recognize the internal job ids these tools were returning. They now return image URLs directly when called via the API: the first GET to /api/v1/jobs/{id} serves the result. Dashboard / Canvas / Creator Studio behavior is unchanged.
The Qwen Image Edit, Qwen Image 2 Edit, and HiDream Edit endpoints are now reachable via the public API at /api/v1/models/{qwen-image-edit, qwen-image-2-edit, hidream-edit}/run. The edit endpoints, the sister set to generation, now match every dashboard tool 1:1: every customer-facing capability has a public apiId.
Each /api-platform/{apiId} doc page now shows a "See also" section linking related apiIds (gen/edit pairs across 16 tools: Flux, GPT Image, Nano Banana, Kling, P-Image, Qwen, Seedance, WAN, XAI, HiDream). Customers building integrations can discover companion endpoints without leaving the docs.
Canvas's "More Settings" panel now auto-populates for any new tool, derived from the tool's schema enums + inputParameters. Add a tool.config.ts and the tool gets a working settings panel for free, with no manual registration in the Canvas params table.
Two new CI gates so /api-platform docs and Canvas can't drift from the schemas: canvas-param-schemas-parity (every Canvas option must be schema-acceptable) and no-passthrough-schemas (schemas reject unknown fields, customers see clean 400s instead of silent drops).
CLAUDE.md now formally documents the single-source-of-truth principle: tool.config.ts is canonical. Adding a tool means writing one file; every doc surface, the public API, Canvas, Creator Studio, and OpenAPI all derive from it automatically.
Closed the rest of the schema-vs-docs drift across the API platform: 11 more tools (Hailuo 02, Image Relighting, Pony XL, Qwen Image 2, Recraft, Seedream 4.5, Seedream 5 Lite, WAN 2.6 Image, WAN 2.6 Image Edit, WAN 2.7 Image, WAN 2.7 Image Edit) had documented enum lists for parameters like resolution, image_size, scheduler, output_format, and aspect_ratio while the schema accepted any string. The schema now enforces the same set the docs advertise, so bad values fail fast with a clear 400 instead of bouncing off the provider after credits are deducted.
Added a reverse-direction parity check to CI (tests/contracts/tighten-loose-string-audit.test.ts): every documented enum value must be enforced by the schema. Combined with the existing forward check (every advertised value must be acceptable), the docs and schemas can't drift apart again.
Aspect-ratio and resolution fields on the public API now reject invalid values up-front instead of being passed through to the provider for a confusing 422. ~16 tools (Flux, Flux 2, Flux Edit, Google Gemini Image, Nano Banana, Dreamina, Seedream 3, P-Image, P-Image Edit, Qwen Image, WAN 2.2 / 2.5 / 2.7 video, WAN Image, Ideogram Character, HiDream, Image Editor, Video Reframe, Video Transform, WAN 2.2 Animate) had loose `z.string()` validation that let unsupported values through silently; they now use the same enum the dashboard offers.
WAN 2.5 / 2.7 video API now accepts the documented `720p`/`1080p` form (the dashboard's value). Previously the schema only accepted uppercase `720P`/`1080P` even though every doc said lowercase. The API now matches the docs and continues to send the upstream-correct uppercase form to DashScope on your behalf.
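Accepting one spelling at the API edge while sending another upstream is a small translation step. A minimal sketch follows; the function name is hypothetical, and accepting both cases on input is an assumption (the entry only confirms that the lowercase form is now accepted).

```typescript
type UpstreamResolution = "720P" | "1080P";

// Maps a caller-supplied resolution to the uppercase form sent upstream.
// Accepting either case on input is an assumption.
function toUpstreamResolution(input: string): UpstreamResolution {
  const upper = input.toUpperCase();
  if (upper === "720P") return "720P";
  if (upper === "1080P") return "1080P";
  throw new Error(`Unsupported resolution: ${input}`);
}
```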
HiDream resolution is now a proper enum on the API (`1024x1024`, `1360x768`, etc.). The dashboard's human-friendly form (`1024 × 1024 (Square)`) still works, so you can use either.
Runway Gen-4.5 was 500'ing on every Canvas chain because the duration picker emitted "5" or "10" as text but the schema expected a number. Both formats now work, so animating a Canvas frame with Runway goes through on the first try.
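The number-as-text mismatch described here is typically fixed by coercing before validating. The sketch below shows the idea in plain TypeScript rather than the project's actual schema code; `coerceRunwayDuration` is a hypothetical name, and the allowed values (5 and 10) are taken from the Runway entries in this changelog.

```typescript
const RUNWAY_DURATIONS = [5, 10]; // seconds, per the Runway entries in this changelog

// Accepts either 5 / 10 or "5" / "10" and always returns a number.
function coerceRunwayDuration(value: number | string): number {
  const n = typeof value === "string" ? Number(value.trim()) : value;
  if (RUNWAY_DURATIONS.indexOf(n) === -1) {
    throw new Error("duration: Invalid input");
  }
  return n;
}
```

Non-numeric text becomes `NaN` under `Number(...)`, which is not in the list, so it is rejected by the same check.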
LTX-2 was occasionally rejecting Canvas runs with "FPS must be one of: 24, 25, 48, 50" when the FPS dropdown was set, caused by the same number-as-text mismatch. The value is now coerced at the schema, so any FPS option you pick works. Also added a guard that drops stale resolution values (like 480p left over from a previous WAN step) and falls back to 1080p instead of failing the run.
Consistent Characters and Google Nano Banana now show "Your reference image is outside the allowed size range. Please re-upload an image between 5KB and 20MB and try again." or "We couldn't fetch your reference image - the link may be unreachable or the format unsupported. Please re-upload (JPEG or PNG) and try again." instead of a raw Runware JSON envelope with internal taskUUIDs and documentation URLs.
Image-edit endpoints for GPT Image 2, GPT-Image 1.5, and OpenAI Image 1 are now available via the public API: POST /api/v1/models/gpt-image-2-edit/run, /gpt-image-1-5-edit/run, and /openai-image-1-edit/run. Send a prompt plus image_urls (or input_images) and the same edit pipeline the dashboard uses runs end-to-end, image URL sanitization included. Existing /api/v1/models/{wan-2.7-image-edit, wan-2.6-image-edit, kling-image-edit, p-image-edit, xai-image-edit}/run already worked the same way; this fills in the orphans.
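A call to one of these edit endpoints might be assembled as in the sketch below. Only the path and the `prompt` / `image_urls` fields come from the entry above. The base URL, the Bearer authorization header, and the helper name are assumptions, so check the /api-platform docs for the real values.

```typescript
interface EditRequest {
  url: string;
  init: { method: string; headers: Record<string, string>; body: string };
}

// Builds (but does not send) a request for an edit endpoint.
// baseUrl and the Authorization header format are assumptions.
function buildEditRequest(
  apiId: string,
  prompt: string,
  imageUrls: string[],
  apiKey: string,
  baseUrl: string = "https://pixeldojo.ai",
): EditRequest {
  return {
    url: `${baseUrl}/api/v1/models/${apiId}/run`,
    init: {
      method: "POST",
      headers: {
        "Content-Type": "application/json",
        Authorization: `Bearer ${apiKey}`, // assumed auth scheme
      },
      body: JSON.stringify({ prompt, image_urls: imageUrls }),
    },
  };
}
```

The result can be passed to `fetch(request.url, request.init)`; keeping construction separate from sending makes the payload easy to inspect first.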
Each new edit apiId is documented on /api-platform with example curls and shows up in /llm/{apiId}.txt for AI-assistant-driven integrations.
API documentation now matches the schemas the public API actually validates against. Previously several tools' /api-platform pages and /llm/{model}.txt docs advertised parameters the schema would silently drop. For example, `num_outputs` was listed on Kling, Qwen, WAN, Seedream, Imagineart, P-Image, and others when the underlying schemas use `num_images`, `numberOfImages`, or `n` (or only generate one image per call). Renames and removals were applied across ~15 tools so the names you read are the names the API accepts.
Google Nano Banana 2 and Nano Banana Pro (text-to-image variants under /api/v1/models/google-gemini-image/run) advertised `reference_images` for guided generation but the schema didn't accept the field. Reference images now work end-to-end on those variants, with the same image URL sanitization and Runware backbone the dashboard uses.
/llm/{model}.txt now derives parameters from each tool's Zod schema when no hand-written list exists, so reference-image fields, custom dimensions, and other inputs always show up in the LLM-friendly docs even for tools we haven't manually documented yet. This matches the fallback /api/v1/models/{model} already uses.
Heygen Avatar API parameters renamed to match the schema: `expression` → `expressionHappy`, `background.type` → `backgroundType`, `background.value` → `backgroundValue`. The dashboard already used these names; the API docs now agree.
Deleted ~600 lines of stale hand-rolled API model entries that had drifted from each tool's actual schema (e.g. a static `nano-banana` apiId that competed with `google-nano-banana`, a `seedream-4` entry advertising `reference_images` while the live schema accepted `image_urls`). Every public apiId now comes from a single source of truth: the tool's own tool.config.ts.
If you upload an unsupported image format (HEIC, AVIF) or a corrupted file to Seedream or Magnific, you now see "Could not read your reference image - the file format may be unsupported (HEIC, AVIF) or the image may be corrupted. Please re-upload as JPEG or PNG." instead of a raw provider JSON dump or Python traceback.
When a referenced image URL returns 404 / 403 / 401 (link expired or access denied), Grok Imagine Edit and other image-editing tools now say "We couldn't load your image - the link returned 404 (not found). The image may have expired or moved. Please re-upload and try again." rather than "Failed to fetch image: 404".
Canvas was offering a 480p option for WAN 2.5 that the backend doesn't support (silently failed every run). Removed it from the Resolution dropdown. WAN 2.7 Video and Seedance 2 also now defensively normalize stale resolution values from previous Canvas steps so chains like "Kling Image → Seedance 2 video" no longer carry over the wrong setting.
WAN family + Qwen models running on Alibaba DashScope now show "Alibaba DashScope is temporarily unreachable. Please try again in a moment." instead of a raw "fetch failed" error when the upstream service is briefly down. Same treatment for ByteDance / Seedream when its servers are overloaded; credits are still refunded automatically.
WAN 2.2 image-to-video was occasionally rejecting valid runs with "sample_shift: Must be greater than or equal to 1" from Replicate. The schema now bounds the field correctly and clamps any out-of-range value, so the run goes through cleanly.
Reference-image fields on public API endpoints (Nano Banana's image_input, Seedream's image_urls, Veo 3.1's reference_images, and every other tool that accepts source or reference URLs) were being silently dropped because the public API bypassed the per-tool payload shaping that tool pages, Canvas, and Creator Studio already used. Public API calls now run through the same submission pipeline (including image URL sanitization, multi-tier routing such as Veo 3.1 Lite correctly using its own backbone, and field-name translation), so a curl to /api/v1/models/{id}/run behaves identically to the same model on the site.
Synchronous image models (Seedream 4.5, Grok Imagine, Z-Image Turbo, Recraft sync, Nano Banana Pro) now return completed output on the first /api/v1/jobs/{id} poll, with no more spinning while waiting for a provider job that never existed.
API calls to LTX-2 sub-models like /api/v1/models/ltx-2-pro-i2v/run were silently being validated as text-to-video instead of the mode the endpoint name implies, so callers got confusing errors about "text-to-video" rules while posting to an image-to-video endpoint. Each variant's mode is now applied automatically from the endpoint you call: a minimal {prompt, image_url} call to ltx-2-pro-i2v just works.
LTX-2 fps validation used to return "fps: Invalid input" with no list of valid values. It now says "FPS must be one of: 24, 25, 48, 50." Aspect ratio errors similarly spell out the supported values and when each applies.
The Community Gallery and user profile galleries are now behind sign-in. Browsing generations, exploring model pages, clicking into individual images, and viewing other creators' profiles all require a PixelDojo account going forward, the same account you use for the dashboard. Signed-out visitors get bounced to sign-in and routed back to the page they wanted.
Video Upscaler and Video Transform were throwing "Prediction ID is missing in the response" on every submit after a recent behind-the-scenes change. Both tools are working again: upload your clip and go.
If a P-API model (P-Image, P-Video) can't reach its upstream service, you now see "Pruna AI service is temporarily unreachable. Please try again in a moment." instead of the raw "fetch failed" error. Credits are still refunded automatically.
Canvas was failing silently on Flux Image-to-Image and Face Enhance when you tweaked More Settings like Guidance Scale, Prompt Strength, or Scale Factor: the inputs emit text and the schema expected numbers. Both now accept either, so your Canvas runs go through on the first try.
Canvas was offering a 21:9 aspect ratio for Grok Imagine that the backend doesn't support, so that pick always 500'd. Canvas now only shows the 7 aspect ratios Grok actually accepts.
If you pick Flux Pro or Flux 1.1 Pro with 21:9 or 9:21, you now see "Flux 1.1 Pro does not support 21:9 - try 16:9, or switch to Flux Schnell or Flux Dev for ultra-wide ratios." instead of a raw 422 dump from Replicate. No credits are spent when the pre-check rejects.
Creative Upscale was occasionally 500'ing from Canvas when Creativity or Resemblance sliders landed outside the provider's range. Values are now clamped to what Clarity Upscaler accepts (Creativity 0–1, Resemblance 0–3) so the run goes through cleanly.
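Clamping of this kind is a one-liner per field. The sketch below uses the ranges given in this entry; the settings shape and function names are hypothetical.

```typescript
// Limits a value to the inclusive range [min, max].
function clamp(value: number, min: number, max: number): number {
  return Math.min(Math.max(value, min), max);
}

interface UpscaleSettings {
  creativity: number;  // provider range 0 to 1, per the entry above
  resemblance: number; // provider range 0 to 3, per the entry above
}

// Pulls out-of-range slider values back into what the provider accepts.
function clampUpscaleSettings(settings: UpscaleSettings): UpscaleSettings {
  return {
    creativity: clamp(settings.creativity, 0, 1),
    resemblance: clamp(settings.resemblance, 0, 3),
  };
}
```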
Prompt safety checks now run server-side on every tool, Canvas, Creator Studio, the public API, and LoRA training fields, not just the dashboard pages that had the client-side check. Three layers: a public-figure name list, a deterministic phrase scanner, and an AI moderation backstop. Anything flagged returns a clear, actionable message about what to change.
The image safety check that previously ran only on a couple of tools now applies to every tool that can produce explicit imagery, and uses a more reliable classifier than before. Photos that depict children are rejected with a specific message. Other tools (upscalers, background removers, analyzers) are unchanged.
Every image stored on PixelDojo is now eligible for CSAM hash scanning at the storage layer. All upload paths (generation outputs, reference images, and LoRA training datasets) route through the same scanning coverage.
When an uploaded image is over our 12 MB cap, we now show your file's actual size and tell you to resize or compress, so there's no more guessing why the upload failed. Applies to every tool that takes an image.
On Seedream 4, if your reference image is bigger than the 36-megapixel limit or the host is slow to respond, you now get a clear message with the actual dimensions and next steps instead of a raw technical error.
You can now paste Civitai model URLs from the new civitai.red domain anywhere we accept Civitai links: LoRA training, model imports, and HuggingFace uploads all recognize it automatically.
Z Image Turbo, the sub-second photorealistic model, now has the sticky mobile composer. Describe, pick aspect ratio, add an optional LoRA, and generate up to four at a time. Eighteen tools on the new shell.
ImagineArt 1.5 now has the sticky mobile composer. Describe your image, pick aspect ratio and model tier, and generate up to four at a time from a single pull-up panel. Seventeen tools on the new shell.
Portrait Series Generator (Consistent Characters) now has the sticky mobile composer. Pick a source portrait, choose a model, set a background, and generate up to 12 portraits, all from a single pull-up panel. Sixteen tools on the new shell.
Lip Sync, Video Autocaption, and Video to Sound all got the sticky mobile composer. Fifteen tools on the new shell now. The interior UI on these three is still pre-design-system (custom headers and buttons); a polish pass will line them up with the rest.
Text To Music now has the sticky mobile composer. Describe the music, drag the length slider, tap an example prompt, and generate, all from a single pull-up panel. Eleven tools on the new shell.
Hunyuan 3D now has the sticky mobile composer. Pick text or image input, tweak advanced settings, and generate, all from a single pull-up panel. Ten tools on the new shell.
Subject Control, PuLID Characters, and Image to 3D have all been removed. Subject Control overlapped with Virtual Try-On and Style Transfer. PuLID and Image to 3D were both on disk but never made it into the sidebar menu, so nobody could navigate to them. Consistent Characters covers identity preservation, Hunyuan 3D covers 3D generation.
Dreamina 3.1 in Canvas now only offers 1K and 2K for resolution, the two values the model actually accepts. If your Canvas session had 4K or HD carried over from another model, Dreamina runs were failing at submit; stale state is now politely remapped to 2K instead of erroring.
Magic Lighting now has the sticky mobile composer. Upload, pick a model, tap a lighting preset, and generate, all from a single pull-up panel. Eight image tools on the new shell.
Style Transfer now has the sticky mobile composer. Source, style, prompt, and advanced settings all live in one pull-up panel. Seven image tools on the new shell now.
Change Camera Angle now has the sticky mobile composer. The orbit visualizer, sliders, presets, and generation options all live in one panel you can slide up from any scroll position. Six image tools on the new shell now.
Virtual Try-On now has the sticky mobile composer. Upload your garment, pick your model, and generate from any scroll position. Five image tools on the new shell now.
Portrait Upscaler now has the sticky mobile composer. Four image tools on the new shell so far: upload, set your scale factor, and generate from any scroll position on mobile.
OmniHuman's 'Generate from Text' button works again. A plumbing change a week ago had disconnected the text-to-speech response from the page: you'd click Generate Audio, nothing would appear, no error. Now it polls to completion and shows the audio ready to use. Caught while sweeping other tools for the same class of bug as Background Remover.
Background Remover now actually shows your finished image again. A recent plumbing change had disconnected the final result from the page: you'd click Remove Background, wait, and get sent back to the empty state with no image. Fixed, verified end-to-end.
Extract Frame and Image Analyzer now have the same sticky mobile composer as Background Remover. Tap the bar at the bottom to open controls, tap again to see your result. Three image tools on the new shell now; more following on the same cadence.
Canvas signing now rotates on a 15-minute window, matching Creator Studio. If a signed link ever leaked from the page, its useful life is minutes, not hours. Zero visible change to how Canvas works for you.
On mobile, Background Remover now has a sticky composer at the bottom of the screen: tap to open the upload controls from any scroll position, tap again to collapse and see your result. Same pattern video tools have used for a while. First image tool to get it; the rest will follow.
Under the hood, image and video tools now share one shell. Three shared components (ToolWorkspaceShell, ToolPageHeader, ToolEmptyState) replace six parallel ones (VideoToolShell, VideoPageHeader, VideoEmptyState + image siblings). Every page you touch lives on the same plumbing now, so future polish lands everywhere at once.
Every Creator Studio frame and video generation now flows through the same reliability backbone as tool pages and Canvas. If a model has an issue, we spot it and recover the same way whether you hit it from a tool page, Canvas, or Creator Studio. Zero visible change to your workflow, just one shared plumbing instead of two.
Flux LoRA trainers, all six of them, now use the proper Flux purple gradient on their Generate buttons and chrome. Orange-red and yellow-orange drifts are gone. Every Flux-family tool, training or generating, speaks the same color.
Softer corners across marketing and landing pages: hero cards, pricing tiles, community gallery, model showcases, and tutorial blocks now all share the same corner radius as the rest of the product. 42 files touched. The dashboard-to-landing transition is visibly one product now.
Image tools and video tools now share one empty-state component. Video tools keep their icon pills and info tips (like "Lite supports 720p or 1080p in 16:9 and 9:16"), image tools keep their colored-dot pills, and the visual style is consistent across both. Under the hood this eliminated a whole duplicate component tree.
The shared component library had a longstanding bug where every 'purple' accent was silently rendering as sapphire (the primary brand color) instead of actual purple. Across the Flux family (Flux Studio, Flux 2 Flex, HiDream, PonyXL, LoRA Image Creator, Feedback) that meant empty states, prompt boxes, aspect ratio pickers, credit cost displays, and mode toggles were all speaking the wrong color. Fixed at the source: 16 shared components across tool-components, video-components, history-grid, pricing-plans, admin, and my-images. Every purple-accented surface now renders actual purple.
Kling Image and Text to Music Generate buttons moved to amber, the proper Kling-family and audio-category color. Matches the rest of the Kling and audio surfaces across the product.
xAI Image's Generate button cleaned up from a sapphire-to-rose mix to pure rose, consistent with xAI Video and the rest of the xAI family.
Magnific Upscaler, Creative Upscale, and Google Gemini Image are now the category-correct blue on Generate and chrome, instead of sapphire, yellow-amber, or blue-purple mixes.
Ernie and P-Image are now pure cyan, matching Qwen, Recraft, ImagineArt, and the other cyan-family tools. No more stray rose or sapphire on their chrome.
LoRA Image Creator's cost display and Generate button are now the Flux-family purple gradient throughout instead of drifting back to sapphire.
The Subscribe CTA on the Qwen Image 2 page moved from a mixed emerald-teal gradient to solid sapphire, matching the product-wide rule that site-level actions like Subscribe are solid, while per-tool Generate buttons carry the gradient.
Video utility tools (Video Reframe, Video Transform, Video Upscaler, Lip Sync, Merge Videos) now share a consistent green across their Generate buttons and chrome, matching the category identity.
Subject Control, a character tool, now uses rose, the character-tool family color, instead of sapphire.
The Subscribe page CTA moved from a sapphire-to-blue gradient to solid sapphire. Site-level actions like Subscribe are solid sapphire across the product now; gradients are reserved for the single Generate button inside each tool.
Flux 2 Flex and HiDream Generate buttons are now the same purple as Flux Studio and PonyXL, matching the documented Flux-family color. Sub-states (editing, upload, etc.) all agree now instead of drifting between blue, sapphire, and green.
OpenAI Image 1's Generate button moved from a blue/sapphire gradient to emerald, the proper OpenAI-family color, consistent with GPT Image 2 and GPT Image 1.5.
Video to Sound's Generate button now uses amber, the audio-tool family color, instead of blue. Matches Text to Music and other audio surfaces.
Softer corners on cards, sheets, and empty-state tiles across 17 tool pages, from Flux Studio to Seedance 2 to Purchase Credits. The whole product sits at the same corner radius now instead of a handful of pages looking bubble-rounded.
Flux Studio's Generate button now uses the proper Flux-family purple gradient instead of blue, which matches PonyXL and restores the family visual identity. The Subscribe-to-create CTA was cleaned up to solid sapphire (site-level actions are sapphire across the product now).
Removed a shimmer sweep animation from Generate buttons on Flux Studio and Seedream 4, plus the focus glow behind their prompt boxes. Tool surfaces are meant to be quiet; your prompt and your outputs hold focus now.
Toned down all-caps-shouting button labels on Billing, Settings, My Prompts, Merge Videos, and Usage Report. Labels are readable at a glance now instead of broadcasting at you.
Creator Studio's chrome now matches the rest of PixelDojo. Buttons and labels across the project header (Save, Characters, Scene, Shot Settings) use the same quiet treatment you see on every tool page. Your project title reads in title case instead of all caps.
Softer corners on the pre-project picker cards and the shot settings sheet. Same radius as the rest of the product, so Creator Studio sits next to Canvas and your tool pages without looking like a different app.
The in-project workspace is calmer. Removed a decorative background glow so your storyboard holds focus. Ambient motion still lives on the pre-project picker where it belongs.
Seedance 2, Seedance 2 Reference, and Seedance 2 Video Edit have moved to a new backbone. Same models, faster and more reliable queuing, with no more jobs getting stuck on "queued."
Most Seedance 2 generations now cost fewer credits. Fast 480p is down to 2 credits/sec and Fast 720p to 4 credits/sec. High 720p is 4 credits/sec (5 with a reference video), and High 1080p is 10 credits/sec (12 with a reference video). The quality popovers and Canvas cost displays reflect the new rates automatically.
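As a worked example of the per-second rates above, here is a small estimator. The function and table names are hypothetical. The entry only states reference-video rates for the High tier, so the sketch assumes the Fast tier charges its base rate either way; treat that as a guess, not a documented rule.

```python
# Credits per second, copied from the entry above.
BASE_RATES = {
    ("fast", "480p"): 2,
    ("fast", "720p"): 4,
    ("high", "720p"): 4,
    ("high", "1080p"): 10,
}
# Rates when a reference video is attached (only stated for the High tier).
REFERENCE_VIDEO_RATES = {
    ("high", "720p"): 5,
    ("high", "1080p"): 12,
}


def seedance2_cost(tier: str, resolution: str, seconds: int,
                   reference_video: bool = False) -> int:
    key = (tier, resolution)
    rate = BASE_RATES[key]  # raises KeyError for combinations not listed above
    if reference_video:
        # Assumption: fall back to the base rate where no separate rate is stated.
        rate = REFERENCE_VIDEO_RATES.get(key, rate)
    return rate * seconds
```

For instance, a 10-second High 1080p clip with a reference video comes to 12 × 10 = 120 credits, while a 5-second Fast 480p clip is 2 × 5 = 10.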
Seedance 2 Video Edit keeps Standard tier and full 480p / 720p / 1080p output; reference images and reference videos work the same way you're used to.
Your dashboard now recognizes whether you've generated before. If you're a returning creator, the top of the page becomes a compact welcome strip with Create Image and Create Video ready to click, and your Recent Creations move up to become the visual hero. First-time visitors still see the full onboarding intro so they know where to start.
The home page buttons are now solid sapphire across the board: no more yellow Buy Credits pill, no yellow hover flash on Create Image, no gradient Subscribe button. Cleaner, more consistent, and it lets your art stay the hero.
Featured Tools on the dashboard now show honest category labels (Image, Video, Editing) instead of marketing claims like "Best Quality" or "Most Flexible." You can see at a glance what each tool is for.
Video tutorials got their own home at /tutorials. The Master AI Creation video library moved off the main dashboard to a dedicated page, so your workspace stays focused on getting work done. You can jump to them anytime via the new Videos link in the Prompting Guides section.
The returning-visitor welcome strip keeps a Buy Credits (or Subscribe for free accounts) button right alongside Create Image and Create Video, styled as a subtle sapphire-accented pill so it's always one click away without competing with your creative actions.
GPT Image 2 Prompting Guide is live at /guides/gpt-image-2-prompting-guide: a hands-on playbook with seven generated examples covering near-perfect text rendering, dense infographic layouts, multilingual signage (Latin + CJK), world-aware photorealism, product mockups, character consistency, and UI mockups. Every prompt is copyable.
Every prompting guide now has a clean breadcrumb trail (Home › Prompting Guides › <Title>) so you always know where you are and can jump back to the index with one click.
The site footer has a new "Prompting Guides" column led by an "All Prompting Guides" link to the index, plus direct links to every major guide including GPT Image 2, Nano Banana Pro, Flux 2, Qwen Image 2, Seedream 5, Veo 3.1, Kling 3, and Seedance 2.
The /guides index now lists every guide we've published, including the WAN NSFW I2V guide under a new Adult (18+) category.
The AI Image landing page now leads with GPT Image 2 across its hero, FAQ, and structured data, and features a new Prompting Guides section so you can browse model-specific playbooks while choosing a generator.
Uploading an unusually large reference image to Nano Banana Edit (Pro tier) used to return a raw provider JSON blob. You'll now get a short, actionable message telling you the image dimensions are outside the supported range and to resize to within 1 to 12288 pixels per side before retrying.
GPT Image 2 is live at /gpt-image-2: OpenAI's next-generation image model, with native 4K UHD rendering and noticeably sharper typography. Great for hero art, editorial covers, and print-ready marketing.
Six explicit render sizes in the size picker: 1024×768, 1024×1024, 1024×1536, Full HD 1920×1080, QHD 2560×1440, and 4K UHD 3840×2160. Each 4K tile wears a small amber "2×" chip so you know when you're on the premium tier.
Tiered credit pricing that tracks compute honestly. Low quality is a flat 0.5 credits per image at any size. Medium is 1.5 credits (3 at 4K UHD). High is 5 credits (10 at 4K UHD). The quality popover shows the live per-image cost for the size you picked, so there are no surprises.
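The tiers above reduce to a small lookup. The sketch below is illustrative only: the function name, the quality keys, and the `"3840x2160"` size string are assumptions, while the credit values are the ones listed in the entry.

```python
# (standard price, 4K UHD price) in credits per image, from the entry above.
PRICES = {
    "low": (0.5, 0.5),     # flat at any size
    "medium": (1.5, 3.0),
    "high": (5.0, 10.0),
}
UHD_4K = "3840x2160"  # hypothetical identifier for the 4K UHD size


def gpt_image_2_cost(quality: str, size: str, images: int = 1) -> float:
    standard, uhd = PRICES[quality]
    per_image = uhd if size == UHD_4K else standard
    return per_image * images
```

Two High-quality images at 4K UHD would therefore cost 10 × 2 = 20 credits, and a Low-quality image costs 0.5 credits whatever the size.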
GPT Image 2 also supports edit workflows: upload or pick reference images and describe your change. Multi-image refs, optional edit masks, and the same 4K rendering carry through.
Available in Canvas as a full generate step with size, quality, and format controls in More Settings, and exposed in the public API as the gpt-image-2 model ID.
The Seedance 2 duration control in Canvas now shows the three durations the model actually supports (5s, 10s, 15s) as discrete chips instead of a free-range slider. The slider let you pick 6 or 7 and then the model rejected the job; those nudges are no longer possible.
PixelDojo is now discoverable by AI agents and LLM-powered coding tools out of the box. The homepage and API reference advertise their machine-readable documentation (llms.txt, LLM-optimized reference, OpenAPI spec, ai-plugin manifest) through standard HTTP Link headers, so agents like Claude Code, Cursor, and ChatGPT can find the API contract on first visit without any extra setup on your side.
The API reference at /api-docs now speaks both HTML and Markdown from the same URL. Visit it in a browser and you get the rich HTML page; have an AI client fetch it with Accept: text/markdown and you get the LLM-optimized reference directly, now with the correct text/markdown content type so strict agents accept it on first try.
Added an API catalog at /.well-known/api-catalog (RFC 9727): a single machine-readable index pointing to the OpenAPI spec, HTML and Markdown documentation, AI plugin manifest, and health endpoint. Agents that follow the well-known convention can now find everything they need in one request.
Published an Agent Skills index at /.well-known/agent-skills/index.json exposing three tool-worthy skills (generate-image, generate-video, and list-models), each backed by a markdown manifest with SHA-256 integrity. An agent runtime that speaks the Agent Skills Discovery RFC can now call PixelDojo image and video generation as first-class skills.
Added OAuth 2.0 Protected Resource Metadata at /.well-known/oauth-protected-resource (RFC 9728) with supported scopes and a pointer to the API key page. Agents can auto-discover how to get credentials for the PixelDojo API instead of needing bespoke setup docs.
Homepage structured data now identifies PixelDojo as a SoftwareApplication with a feature list, category, and offer details alongside the existing Organization and WebSite blocks. Cleaner ingestion by search engines and AI crawlers, and more accurate knowledge cards in answers about what PixelDojo actually is.
Declared our AI content preferences. The site robots.txt publishes Content Signals indicating the marketing site and documentation are open for AI training and inclusion in AI answers. User-generated gallery content and profile pages opt out of AI training via noai/noimageai. Your posted creations are crawled for search and inspiration, not harvested for training.
Agents can now auto-discover how to sign in. Added OpenID Connect discovery at /.well-known/openid-configuration and the matching OAuth 2.0 authorization server metadata at /.well-known/oauth-authorization-server, mirrored from our Clerk-backed auth. Agents that speak either flavor can find the issuer, endpoints, and supported scopes without any bespoke config.
Markdown negotiation now works site-wide on every agent-facing surface: homepage, /agents, /api-platform, and /api-docs. Fetch any of them with Accept: text/markdown and you get the LLM-optimized reference; browsers keep getting the HTML pages. Moved the routing into middleware so it runs ahead of static-page caching, which the previous version got stuck behind.
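The negotiation rule described above can be sketched as a pure function. This is a simplified illustration, not the actual middleware: the function names are hypothetical, and the sketch ignores q-values, which a full Accept parser would weigh.

```python
def wants_markdown(accept_header: str) -> bool:
    """True when the client explicitly lists text/markdown in its Accept header."""
    for item in accept_header.split(","):
        # Drop parameters such as ";q=0.9" and normalize case and whitespace.
        media_type = item.split(";", 1)[0].strip().lower()
        if media_type == "text/markdown":
            return True
    return False


def choose_content_type(accept_header: str) -> str:
    # Browsers and wildcard clients keep getting HTML.
    return "text/markdown" if wants_markdown(accept_header) else "text/html"
```

With this rule, `Accept: text/markdown` selects the Markdown reference, while a typical browser header or a bare `*/*` falls through to HTML.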
Homepage Link headers are now served on every response, not just HTML ones. The original guard only emitted them when Accept: text/html was present, which meant agent crawlers using Accept: */* saw nothing. They now see the full RFC 8288 link graph (api-catalog, agent-skills, oauth-protected-resource, openid-configuration, service-doc, service-desc, and more) on any request to /.
Credit deductions are now fully transactional. Every debit and its ledger entry commit together as a single atomic unit, or not at all, so even if the database blinks mid-generation, your balance and our record of where it went stay perfectly in sync. Eliminates a whole class of "my credits got deducted but the job never ran and I never got a refund" edge cases.
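To illustrate what "commit together or not at all" means, here is a minimal sketch using SQLite from the Python standard library. The table names, columns, and function names are invented for the example; the production database and schema are not described in this changelog.

```python
import sqlite3


def make_schema(conn: sqlite3.Connection) -> None:
    # Hypothetical tables, for this sketch only.
    conn.execute(
        "CREATE TABLE balances (user_id TEXT PRIMARY KEY, credits REAL NOT NULL)"
    )
    conn.execute(
        "CREATE TABLE ledger (user_id TEXT NOT NULL, delta REAL NOT NULL, "
        "reason TEXT NOT NULL)"
    )
    conn.commit()


def debit(conn: sqlite3.Connection, user_id: str, amount: float, reason: str) -> None:
    # "with conn" commits if the block succeeds and rolls back on any exception,
    # so the balance update and the ledger row land together or not at all.
    with conn:
        cur = conn.execute(
            "UPDATE balances SET credits = credits - ? "
            "WHERE user_id = ? AND credits >= ?",
            (amount, user_id, amount),
        )
        if cur.rowcount != 1:
            raise ValueError("unknown user or insufficient credits")
        conn.execute(
            "INSERT INTO ledger (user_id, delta, reason) VALUES (?, ?, ?)",
            (user_id, -amount, reason),
        )
```

If the ledger insert fails for any reason, the balance update is rolled back with it, so there is never a deduction without a matching record.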
API keys are now stored as one-way hashes, not the raw key. The full secret lives only in your hands: we cannot display it again after creation, and a database leak couldn't expose it either. Existing keys keep working seamlessly: they auto-upgrade to hashed storage the next time you use them. No rotation needed unless you've misplaced your copy.
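A rough sketch of hashed key storage with upgrade-on-use follows. Everything specific here is an assumption made for illustration: the `pd_` prefix, the choice of SHA-256, and the record layout with `raw_key` and `key_hash` fields are not taken from the PixelDojo implementation.

```python
import hashlib
import hmac
import secrets


def hash_key(raw_key: str) -> str:
    # One-way hash; the raw key cannot be recovered from this value.
    return hashlib.sha256(raw_key.encode()).hexdigest()


def create_api_key() -> tuple[str, str]:
    raw = "pd_" + secrets.token_urlsafe(32)  # hypothetical prefix
    return raw, hash_key(raw)  # show `raw` once, store only the hash


def authenticate(presented: str, record: dict) -> bool:
    """`record` holds either a legacy 'raw_key' or a 'key_hash' (hypothetical layout)."""
    if "key_hash" in record:
        return hmac.compare_digest(hash_key(presented), record["key_hash"])
    legacy = record.get("raw_key", "")
    if legacy and hmac.compare_digest(presented.encode(), legacy.encode()):
        # Upgrade on first use: replace the raw key with its hash.
        record["key_hash"] = hash_key(presented)
        del record["raw_key"]
        return True
    return False
```

A plain fast hash is reasonable in this sketch only because the keys are long random strings; it would not be appropriate for human-chosen passwords.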
Webhook HMAC signatures now use a dedicated signing secret separate from your API key. The signing secret is generated per-key and kept isolated, meaning webhook verification keeps working even if you rotate the API key, and vice versa.
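If you verify webhooks yourself, the check generally looks like the sketch below. The exact scheme is an assumption: this changelog does not state the hash algorithm, the signature encoding, or the header that carries it, so the example uses HMAC-SHA256 over the raw request body with a hex digest. Check the API reference for the real details.

```python
import hashlib
import hmac


def sign_payload(body: bytes, signing_secret: bytes) -> str:
    # Assumed scheme: HMAC-SHA256 over the raw body, hex-encoded.
    return hmac.new(signing_secret, body, hashlib.sha256).hexdigest()


def verify_webhook(body: bytes, received_signature: str, signing_secret: bytes) -> bool:
    expected = sign_payload(body, signing_secret)
    # Constant-time comparison; bytes are used so odd input cannot raise.
    return hmac.compare_digest(expected.encode(), received_signature.encode())
```

Because the check depends only on the signing secret, rotating the API key does not affect it, which is the point of keeping the two secrets separate.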
Tool pages feel more like a professional studio and less like a consumer toy. Image and video output cards have subtler shadows and a lighter hover lift, so the art stays the hero and the chrome gets out of the way.
First-run delight: the first image, first video, and first Canvas chain step you generate in a session gets a small scale-in, then the animation goes quiet so it doesn't fire on every click.
This page (What's New) got an editorial redesign. Bigger display headlines, typewriter-style dates, magazine spacing. Designed to be read, not skimmed.
Per-second credit pricing on Seedance 2 Reference to Video and Seedance 2 Video Edit has been adjusted to reflect their higher compute costs. Seedance 2 Reference: Fast 5 cr/s at 480p and 9.5 at 720p; High 8 at 480p, 16 at 720p, 24 at 1080p. Seedance 2 Video Edit: Fast 6.5 cr/s flat; Standard 5 at 480p, 9.5 at 720p, 16 at 1080p. Plain Seedance 2 (text/image-to-video) pricing is unchanged.
1080p output is now available on Seedance 2, Seedance 2 Reference to Video, and Seedance 2 Video Edit, on the High / Standard quality tiers. Fast tiers stay at 480p / 720p.
Seedance 2 Video Edit brings back its Standard quality tier, with higher fidelity for prompted video edits, especially at 720p and 1080p.
Pricing on all three Seedance 2 tools recalibrated to cleaner whole numbers. Seedance 2 / Reference: Fast 3 cr/s at 480p and 6 at 720p, High 5 at 480p, 10 at 720p, and 15 at 1080p. Seedance 2 Video Edit: Fast 4 cr/s flat; Standard 3 at 480p, 6 at 720p, 10 at 1080p.
Seedance 2 generations no longer hang on "queued" forever after the backbone migration. The status endpoint URL namespace was wrong in the first cut. New jobs now poll correctly.
Picking a video from "My Videos" in Seedance 2 Reference to Video no longer fails with "Failed to fetch". The library picker was re-validating CDN-hosted videos and tripping browser CORS. Library videos skip the re-fetch now (they were already validated at upload time).
PixelDojo now respects your system-level "reduce motion" preference. If you have motion sensitivity settings turned on in your OS, decorative shimmer, pulse, and spin animations now pause, and transitions shorten. No setting to toggle in the app: your OS choice just works everywhere.
Seedance 2, Seedance 2 Reference to Video, and Seedance 2 Video Edit now run on a faster backbone: same prompts, same credit prices, faster and more reliable queuing. Expect fewer stalled jobs during peak traffic.
WAN 2.7 Video now accepts prompts up to 8,000 characters (previously 5,000). Long cinematic prompts and prompt-enhance expansions no longer trip the character cap.
Seedream 4 now retries transient ByteDance "service encountered an unexpected internal error" responses automatically with backoff. Generations that would previously fail after a provider hiccup now recover on their own.
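Retry with backoff, as described above, usually follows the pattern sketched here. The exception class, attempt count, and delays are illustrative assumptions; the changelog does not say how many retries Seedream 4 performs or how long it waits between them.

```python
import time


class TransientProviderError(Exception):
    """Hypothetical marker for errors that are worth retrying."""


def retry_with_backoff(call, max_attempts=4, base_delay=0.5, sleep=time.sleep):
    for attempt in range(max_attempts):
        try:
            return call()
        except TransientProviderError:
            if attempt == max_attempts - 1:
                raise  # out of attempts, surface the error
            # Exponential backoff: 0.5s, 1s, 2s, ... with the defaults.
            sleep(base_delay * 2 ** attempt)
```

Passing `sleep` as a parameter keeps the helper easy to test without real waiting. Production code would typically add random jitter and cap the total delay.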
Ideogram Character's resolution picker now lists only sizes the model actually accepts. The old options (512×512, 768×768, 1280×720, 1920×1080, 1024×768, 768×1024) were silently rejected by Replicate. Replaced with nine provider-valid resolutions covering square, 4:3, 3:4, 3:2, 2:3, 16:9, 9:16, and ultra-wide/ultra-tall.
Veo 3.1 Standard no longer offers 1:1 as an aspect ratio. Google's API only accepts 16:9 and 9:16, so picking square was guaranteed to fail. The dropdown now matches what actually works.
Grok Image (xAI) now accepts prompts up to 5,000 characters (previously 2,000), matching P-Image, WAN 2.5, WAN 2.7 Video, and Seedance 2.
Canvas resolution picker for WAN 2.5, WAN 2.7 Video, and WAN Reference to Video now shows a proper 720p / 1080p chip selector instead of a free-text field, so no more "Expected '720P' received '720p'" surprises. Stale sessions with lowercase values still submit successfully.
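Accepting stale lowercase values most likely comes down to normalizing before validating. A minimal sketch, assuming the uppercase `720P` / `1080P` forms implied by the error message above; the function name is hypothetical:

```python
ALLOWED_RESOLUTIONS = ("720P", "1080P")


def normalize_resolution(value: str) -> str:
    # Trim and uppercase first so "720p" from an old session still passes.
    candidate = value.strip().upper()
    if candidate not in ALLOWED_RESOLUTIONS:
        raise ValueError(
            f"Expected one of {ALLOWED_RESOLUTIONS}, received {value!r}"
        )
    return candidate
```

Normalizing on the server side means older clients keep working while the chip selector prevents new invalid values at the source.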
Prompt length limits unified across every generation tool. Modern image and video models accept 65K to 131K tokens (hundreds of thousands of characters), so our per-tool caps of 1K to 8K chars have been lifted to a single generous 250K-character ceiling. Detailed multilingual prompts, prompt-enhance expansions, and long cinematic descriptions no longer hit arbitrary limits on Ernie, Kling, Grok, HiDream, FLUX Edit, WAN, Seedance, or any other tool.
P-Image Edit now accepts prompts up to 5,000 characters (previously capped at 2,000), matching its sibling P-Image tool. Detailed edit instructions and longer Canvas chains no longer hit a surprise validation error.
WAN 2.2's public API schema now documents all LoRA parameters. Standard mode supports two LoRAs (lora_weights_transformer + lora_weights_transformer_2 with matching scale fields), and Extended mode supports full dual-LoRA with four slots: 1 and 3 target the high-noise transformer, 2 and 4 target the low-noise transformer. Scales accept 0 to 4.
Seedance 2 now accepts prompts up to 5,000 characters (previously capped at 2,000), matching the longer-prompt tools like WAN 2.5 and WAN 2.7.
The API Platform pricing page now shows real credit costs for 17 more models that were stuck displaying "0 credits": Consistent Characters, FLUX, Flux Edit, Nano Banana Edit, Grok Imagine R2V, Ideogram Character, Kling Image V3, Kling Image Edit, Kling 2.6 Pro, Lip Sync, OmniHuman 1.5, Flux Redux, Seedance 1, Seedance 2 Reference, WAN 2.7 Image Edit, WAN Reference to Video, and WAN 2.7 Video Edit. Every variant now surfaces the correct per-request price.
When an image upload fails in an image-to-image tool or Canvas, you'll now see the real reason (like a session or subscription issue) instead of a generic "Failed to upload" message that made it look like the file itself was the problem.
Seedance 2 Reference to Video is live. Guide Seedance 2.0 with up to 9 reference images, 3 video clips, and 3 audio tracks. Tag them in your prompt as [Image1], [Video1], [Audio1] to lock characters, motion, and sound in a single generation. Fast (3/5 credits/sec at 480p/720p) and High (3/6 credits/sec) tiers, native audio sync, 3 to 15 second clips.
Seedance 2 Reference in Canvas now shows proper Quality, Resolution, Duration, Aspect Ratio, and Generate Audio dropdowns instead of raw text inputs that could silently fail validation.
P-Image, WAN 2.5, and WAN 2.7 Video now accept longer prompts (up to 5,000 characters). Previously anything over 2,000 characters was rejected with a validation error.
GPT-Image 1.5, Qwen Image 2, and Ideogram Character now show proper dropdowns in Canvas More Settings. Size, Quality, Model, Speed, and Style controls replace raw text fields that sometimes caused silent failures.
Seedream 4 edit mode no longer rejects requests that pass a single reference URL instead of an array. The API now accepts both forms gracefully.
Pose Control (FLUX) works again. Submissions without a LoRA were being rejected upstream because the empty strength value was sent as a string instead of a number.
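Accepting both forms typically means coercing the input to a list before validation. A minimal sketch with a hypothetical helper name; the actual request field name is not given in this changelog:

```python
def as_url_list(value) -> list[str]:
    # A bare string is treated as a one-item list; any other iterable is copied.
    if isinstance(value, str):
        return [value]
    return list(value)
```

The explicit string check matters because a Python string is itself iterable, so calling `list()` on it directly would split the URL into single characters.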
Grok Imagine Video in Canvas now has proper Duration, Resolution, and Aspect Ratio dropdowns instead of raw text fields that silently failed validation.
Earlier validation on Veo 3.1, WAN 2.6 Image Edit, and Seedream edit. Veo 3.1 (standard) now catches 1:1 aspect ratio with an input image upfront and asks for 16:9 or 9:16. WAN 2.6 Image Edit caps reference images at 4 when selecting. Seedream 4 and Seedream 5 Lite edit cap reference images at 14 when selecting. All three avoid confusing post-submit errors.
Ernie, Baidu's multilingual text-to-image model, is live. Write prompts in English, Chinese, or Japanese with built-in prompt expansion. HD is 1 credit, UHD is 3 credits, and the Turbo variant is 1 credit flat.
"Continue in Canvas" actually sends your image into Canvas now. If you'd ever used Canvas before, the button used to take you there empty-handed.
Seedream 4 and Seedream 5 Lite edit mode accept pasted or newly-uploaded images again. Base64 uploads were timing out before reaching the backend.
Veo 3.1 Lite videos finish and display correctly. Jobs were completing on the provider but never got picked up on our side.
Pose Control (FLUX) with multiple LoRAs works now. It had a 0% success rate after the multi-LoRA update.
SeedEdit 3 is back online after a provider API change broke every request.
LTX 2 in Canvas now has proper Tier, Duration, Resolution, and FPS dropdowns instead of raw text inputs.
WAN Reference to Video in Canvas gets a proper Duration slider plus Resolution, Aspect Ratio, Model, and Shot Type dropdowns, so no more typing values that silently fail.
WAN 2.6 Image Edit in Canvas now has Size, Prompt Enhancement, and Negative Prompt as proper form controls. Previously free-text fields were tripping validation.
Magnific Upscaler now catches oversized images upfront with a clear "Image is too large to upscale (max 10 MB)" message instead of a cryptic upstream error.
Nano Banana edits with "Match input image" aspect ratio no longer 400 on the upstream API. The request always sends explicit width and height now.
Seedance 2 image-to-video no longer complains that you need an image after you already uploaded one. Mode and image state stay in sync.
OmniHuman's Fearful, Disgusted, and Surprised emotions produce audio now (the tool was sending the wrong values to the voice engine).
Flux Edit's guidance slider in Canvas now stays inside the range the model accepts, so no more mid-drag validation errors.
Empty-prompt submissions across 30+ tools now show a clean error. Advanced JSON mode, Enter-key submits, and rapid clicks all get caught client-side instead of hitting the API.
Content-moderation rejections across the Seedream and Seedance families are correctly labeled as content issues instead of provider outages, a cleaner signal when something is actually broken.
Public API calls to HeyGen Avatar, Flux variants, Kling, Veo, Seedream, Seedance, WAN, Recraft, Ernie, and ~60 other models no longer return a spurious "Invalid credit amount" error. Per-request pricing for these models now flows through to the API correctly.
The /api/v1/models listing and API Platform docs pages show real credit costs for dynamically-priced models instead of "0 credits".
Your Usage report loads noticeably faster. We added indexes that make per-user lookups across credits and refunds much quicker, especially for longer time ranges.
Canvas's model picker now shows every model tier as its own option. Pick Nano Banana Standard, Pro, or 2 directly instead of hunting through settings.
Multi-image editing in Canvas: models like Nano Banana Edit, P-Image Edit, and WAN 2.6 Image Edit accept up to 5 reference images. Click '+ Add reference' next to your source to attach more.
Canvas's More Settings panel is way friendlier. Aspect ratio and resolution are dropdowns. Image slots have a proper picker. Fields you've already filled from a canvas step get hidden so you're not duplicating work.
When you pick an image source for a video tool, mode auto-flips to image-to-video and the image slot is pre-filled. No more empty fields asking what to do.
P-Image Edit is now available in both the public API and Canvas: fast image editing with up to 5 reference images.
Fixed the bug that silently sent edits to the wrong endpoint. Canvas edits now land on the correct model every time.
Admin Tool Health dashboard: per-tool success rate, error breakdown, and latency for the last 7 days.
Light mode contrast fixes on the Canvas source indicator and temporary-output warning, readable in both themes now.
WAN 2.5 Image-to-Video now disables Generate until you've uploaded an image, so no more silent failures.
WAN 2.7 Video Extend mode waits for your source clip's duration to finish processing before letting you Generate.
Nano Banana Pro and Nano Banana 2 work correctly via the public API again. Requests now route to the right backend.
Image editing tools added to WAN 2.7: paint over areas and regenerate with AI.
Usage history now shows which tool and provider were used for each generation.
All API error messages are now clearer and more consistent across every endpoint.
Strengthened security across the platform. All endpoints now require proper authentication.
Homepage loads 40% faster. Font loading improved by 97%.
Removed 90+ unused files and 48 unnecessary dependencies for a leaner, faster app.
Credits are now deducted upfront for LoRA training jobs, so no more surprise charges.
Automatic refunds are more reliable. Failed jobs always return your credits.
Brand-new Sapphire color scheme across the entire platform for a cleaner, more modern look.
Updated accent colors for each tool family. Image, video, and editing tools now have distinct visual identities.
New typography with Satoshi for headlines and Geist for body text, easier to read everywhere.
Credit refunds section added to your usage report so you can track every refund.
Refund tracking is now more transparent. See exactly when and why credits were returned.
Resolved edge cases where timed-out jobs weren't automatically refunding credits.
Stuck jobs are now automatically detected and refunded every 10 minutes.
Admin dashboard now has better tools for monitoring job health across the platform.
WAN 2.7 Video is here: next-generation video creation with improved motion and detail.
WAN 2.7 Image generation added with updated model configurations.
Video continuation support: extend your WAN 2.7 videos with last-frame extraction.
Better resolution handling for video models with smarter aspect ratio defaults.
Streamlined credit purchase flow, with fewer steps to top up your account.
Better error messages during checkout so you always know what happened.
WAN 2.6 Image model added to the lineup.
Reference-to-video now supports WAN 2.7 with better subject consistency.
Auto-resubmit for failed video downloads, so no more lost generations.
Prompt enhancement defaults updated for Qwen Image 2 and Flux 2 for better results out of the box.
HeyGen avatar tool now has smarter credit calculations based on video length.
Seedance 2 is live: create videos with realistic motion and native audio awareness.
Updated AI model roster with the latest versions and pricing.
Video prompting guides added to the homepage for quick access.
Seedream 4 now supports custom image sizes and aspect ratio selection.
Sticky composer toggle makes it easier to create on mobile devices.
Funnel analytics now include date range selection and device breakdown.
Expanded API documentation with model discovery and job control endpoints.
AI agent discovery endpoint added. Build smarter integrations with PixelDojo.
API credit costs now adjust based on resolution for fairer pricing.
LTX 2.3 video model with revised pricing and UI improvements.
Kling Video upgraded to v3 with updated credit pricing and new motion controls.
Better subscription management: cancel at period end without losing remaining access.
Pruna AI integration for faster image and video generation.
Extended quality mode for WAN 2.2 video: longer, higher-quality outputs.
LoRA support added to WAN 2.2 for custom-trained style videos.
Runway Video generation is now available with the latest model.
Text-to-music pricing updated with better form validation for smoother creation.
Improved error handling across video and image generation endpoints.
Nano Banana Pro image generation on a faster backbone for blazing-fast results.
Consistent character generation now works with Nano Banana 2 models.
Smarter error messages with friendly descriptions when something goes wrong.
Storyboard export with transition controls. Plan your video projects visually.
Per-clip audio controls in the video merger tool.
Choose your LLM model for storyboard generation.
Nano Banana 2 models added with updated configurations across all tools.
Seedream 5 Lite and Seedream 5 Lite Edit models are now available.
New prompting guides added for popular models.
Qwen Image 2 model with smart aspect ratio and resolution tier selection.
xAI video and image tools now available.
Flux seed management for reproducible image generation. Get the same result every time.
Referral tracking is live. Earn 100 credits for every friend you invite.
Attribution touch tracking for better understanding of how users find PixelDojo.
SendGrid email lifecycle integration for account notifications.
AI-powered overview and funnel insights added to the admin dashboard.
Funnel analytics with period selection and device breakdown.
Flux 2 Klein 9B model added with updated credit costs and parameters.
Kling Image model now available for high-quality AI image generation.
Flux edit mode upgraded for faster, more iterative image refinement.
Optional end frames in Seedance video generation for more control over your clips.
Better SEO across all category pages with video support and improved metadata.
Smarter aspect ratio handling across all generation tools.
Video prompting guide with shot structure, camera language, and prompt templates.
Nano Banana Pro prompting guide with comparison examples and best practices.
Prompting guides section added directly to the dashboard homepage.
Hailuo video model upgraded to v2.3 with better motion quality.
Sora 2 now supports reference images. Guide your video generations with a starting photo.
Faster image loading across the platform with optimized loaders.
Seedance 1 video model with updated pricing and generation options.
LTX-2 video model added to the platform.
Automatic credit refunds for failed LTX-2 video generations.
One-click prompt enhancement: improve your prompts with AI before generating.
Image moderation for uploads to keep the community safe.
Optional prompt field added to more tools for greater creative control.
Smoother cancellation experience with clear feedback and options.
Updated privacy policy and terms of service with clearer language.
Enhanced content safety policies across the platform.
Seedream upgraded to 4.5 with enhanced features and better output quality.
Image relighting tool: change the lighting in any image with AI.
Consistent character generation with better prompt handling and job management.
Creator Studio is here. Plan and produce multi-scene video projects with storyboards.
Multiple video and image model options inside Creator Studio.
Media folder management and project organization built in.
Character attribute editor for consistent characters across scenes.
Consistent character generation: keep the same character across multiple images.
Flux 2 Dev model support added for consistent character workflows.
Save generated assets directly to CDN from Creator Studio.
Z-Image Turbo for ultra-fast image generation with LoRA support.
LoRA training forms upgraded with better FAQs and multi-image upload support.
Zip file upload for LoRA training: upload all your training images at once.
Seedream 4 edit endpoint: modify existing images with AI-powered editing.
Multi-image blending in Seedream 4 for creative composites.
Retry logic for more reliable generations on Gemini and other models.
Redesigned dashboard homepage with recent creations and featured tools front and center.
Favorites: save your best images and videos for quick access.
Refreshed button styles and tooltips across all tools for a more polished look.
Flux 2 Flex with Klein 4B model support and improved documentation.
Google Gemini Image generation with enhanced job tracking.
Better credit cost calculations with per-model pricing.
WAN 2.2 Plus mode for higher quality video output.
Frame extraction tool: pull still frames from any video.
Interpolation output option for smoother WAN 2.2 videos.
Expanded API platform with comprehensive model registry and documentation.
API key management with subscription and rate limit checks.
In-app FAQ system with searchable help articles.
Kling 2.6 Pro video model now available in Creator Studio.
Image source picker for video generation: start from any image.
Smarter credit calculations across all video models.
Multiple new AI models added with competitive pricing.
Admin dashboard with better user data display and management tools.
Enhanced subscription handling and billing cycle management.
Automated news publishing: platform updates now post automatically.
New Flux models with enhanced credit management.
Magnific upscaler with improved credit cost calculations.
WAN 2.5 video generation with image-to-video and text-to-video modes.
Audio URL support in WAN 2.5 for video-with-sound generation.
Video caching to R2 for faster replay of generated videos.
Face swap tool: swap faces in any image with AI.
WAN 2.6 Image editing with output size controls.
Use Previous Frame feature in Creator Studio for seamless video continuity.
Select multiple images or videos at once for batch actions.
Batch download, delete, and organize your generations.
Better mobile responsiveness across all generation tools.
Custom duration options for LTX-2 video generation.
Video merge tool with better upload support.
Enhanced image display and layout consistency across all tools.
Redesigned usage report page with sorting and better UI.
Centralized image validation for more consistent uploads across all tools.
Better video history layout with improved responsiveness.
Veo 3.1 video generation with premium cinematic controls.
Reality Polisher: detect and fix AI artifacts in generated images.
Model visibility toggle for the community gallery.
ImagineArt model with multiple output count options.
Updated featured tools on the homepage with fresh highlights.
Better layout responsiveness across Creator Studio and dashboard.
New payment provider for smoother subscription management.
More reliable webhook handling for subscription changes.
Credit pack purchases now process faster and more reliably.
Authorize.Net integration for credit card top-ups.
Split-provider payment flow: choose your preferred payment method.
Autoplay video showcase on the landing page with deferred loading for faster page loads.
Seedance upgraded to 1.5 with better pricing and documentation.
Prompt manager with search and dynamic loading for faster prompt browsing.
Streamlined dashboard navigation and optimized training data handling.
New AI-powered image editing tools.
LoRA Image Creator with modal selection and better model version handling.
Google Gemini Image with better job tracking and filtering.
Cost comparison tool on the landing page: see how PixelDojo pricing stacks up.
Refreshed landing page with Flux 2 features and better model showcases.
Updated credit cost structure for Flux models.