Text to Image V2
The Text-to-Image V2 API is deprecated and will be removed in the future. Please migrate to Text-to-Image V3.
POST Text to Image V2
The text-to-image endpoint will return only a task_id
. You should use the task_id
to call the /v2/progress API endpoint to retrieve the image generation results. We will gradually phase out the V2 endpoints. It is recommended to use the V3 endpoints to generate images.
Request Headers
Request Body
Positive prompt words, separated by ,
. If you want to use LoRA, you can call the /v3/model
endpoint with the parameter filter.types=lora
to retrieve the sd_name_in_api
field as the model_name
. Remember that the format for LoRA models is <lora:$sd_name:$weight>
.
Negative prompt words, separated by ,
.
This denoising process is called sampling because Stable Diffusion generates a new sample image at each step.
Enum: DPM++ 2M Karras
, DPM++ SDE Karras
, DPM++ 2M SDE Exponential
, DPM++ 2M SDE Karras
, Euler a
, Euler
, LMS
, Heun
, DPM2
, DPM2 a
, DPM++ 2S a
, DPM++ 2M
, DPM++ SDE
, DPM++ 2M SDE
, DPM++ 2M SDE Heun
, DPM++ 2M SDE Heun Karras
, DPM++ 2M SDE Heun Exponential
, DPM++ 3M SDE
, DPM++ 3M SDE Karras
, DPM++ 3M SDE Exponential
, DPM fast
, DPM adaptive
, LMS Karras
, DPM2 Karras
, DPM2 a Karras
, DPM++ 2S a Karras
, Restart
, DDIM
, PLMS
, UniPC
The number of images generated in one single generation. Range: [0, 8]
The number of generations. Range: [0, 8]
Think of steps as iterations in the image creation process. Range: (0, 50]
This setting determines how closely Stable Diffusion will adhere to your prompt. Range: (0, 30]
A seed is a number from which Stable Diffusion generates noise.
Height of the image. Range: (0, 2048]
Width of the image. Range: (0, 2048]
Name of the Stable Diffusion model. You can call the /v3/model
endpoint with the parameter filter.types=checkpoint
to retrieve the sd_name_in_api
field as the model_name
.
Enable the Stable Diffusion face restoration plugin.
VAE (Variational Auto Encoder). sd_vae
can be accessed in the API /v3/model with query parameters filter.types=vae
to retrieve the sd_name
field as the sd_vae
.
This parameter indicates the number of layers to stop from the bottom during optimization, so clip_skip on 2 would mean, that in SD1.x model where the CLIP has 12 layers, you would stop at 10th layer.
Hires.fix function switch.
Upscalers model names. AI upscalers are models trained with massive amounts of data.
Enum: Latent
, ESRGAN_4x
, R-ESRGAN 4x+
, R-ESRGAN 4x+ Anime6B
The magnification factor of the image, if params hr_resize_x and hr_resize_y are set, this parameter will be ignored.
Enum: 1
, 2
The target image width, the maximum image size is 2048, only take effect when parameters hr_scale=1.
The target image hight, the maximum image size is 2048, only take effect when parameters hr_scale=1.
Image storage time (seconds). Range [0, 604800]
Refiner infos to enhances the image details.
ControlNet.
Response
Example
request
response
Was this page helpful?