Ernie Image is Baidu's text-to-image model. Its defining traits: multilingual prompt support (you can write prompts in English, Chinese, or Japanese), native CJK text rendering (Chinese, Japanese, Korean characters render with proper letterforms), and strong photoreal output on East Asian subjects, architecture, and signage.
Two variants via the model field. ernie-image is the quality tier — 1 credit at HD (square / landscape / portrait HD) or 3 credits at Square UHD. ernie-image/turbo is the fast tier — 1 credit flat. Both support negative prompts and seed pinning for reproducibility.
Multi
EN / ZH / JA prompts