Text to Image V2
Deprecated
POST Text to Image V2
The text-to-image endpoint returns only a task_id. Use that task_id to call the /v2/progress API endpoint and retrieve the image generation results. The V2 endpoints are being phased out gradually; use the V3 endpoints to generate images instead.
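The two-step flow described above (submit the request, then poll /v2/progress with the returned task_id) might be sketched as follows. The HTTP calls are injected as plain callables, so the sketch makes no claim about the request path, the authentication headers, or the response schema. The names submit, fetch_progress, and is_finished are hypothetical helpers for this illustration, not part of the API.

```python
import time
from typing import Any, Callable, Dict


def generate_and_wait(
    submit: Callable[[Dict[str, Any]], str],
    fetch_progress: Callable[[str], Dict[str, Any]],
    is_finished: Callable[[Dict[str, Any]], bool],
    payload: Dict[str, Any],
    poll_interval: float = 1.0,
    max_polls: int = 60,
) -> Dict[str, Any]:
    """Submit a text-to-image request, then poll until the task finishes.

    submit         -- hypothetical: sends the payload and returns the task_id
    fetch_progress -- hypothetical: calls /v2/progress for a given task_id
    is_finished    -- hypothetical: decides whether a progress response is final
    """
    task_id = submit(payload)
    for _ in range(max_polls):
        progress = fetch_progress(task_id)
        if is_finished(progress):
            return progress
        # Wait before asking again so the progress endpoint is not hammered.
        time.sleep(poll_interval)
    raise TimeoutError(f"task {task_id} did not finish after {max_polls} polls")
```

Bounding the loop with max_polls keeps a stuck task from blocking the caller forever; the interval and limit shown are arbitrary defaults.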
Request Headers
Request Body
Positive prompt words, separated by commas (,). To use LoRA, call the /v3/model endpoint with the parameter filter.types=lora and use the returned sd_name_in_api field as the model_name. The format for LoRA models is <lora:$sd_name:$weight>.
Negative prompt words, separated by commas (,).
Sampling method. The denoising process is called sampling because Stable Diffusion generates a new sample image at each step.
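As an illustration of the comma-separated prompt and the <lora:$sd_name:$weight> format, here is a minimal sketch of a prompt builder. It assumes the LoRA tag is placed inside the positive prompt text; build_prompt and lora_tag are hypothetical helpers, and example_lora is a placeholder rather than a real model name.

```python
from typing import Iterable, Tuple


def lora_tag(sd_name: str, weight: float) -> str:
    """Format one LoRA reference as <lora:$sd_name:$weight>."""
    return f"<lora:{sd_name}:{weight}>"


def build_prompt(words: Iterable[str], loras: Iterable[Tuple[str, float]] = ()) -> str:
    """Join prompt words (and optional LoRA tags) with commas."""
    # Drop empty entries and surrounding whitespace before joining.
    parts = [w.strip() for w in words if w.strip()]
    parts += [lora_tag(name, weight) for name, weight in loras]
    return ", ".join(parts)


prompt = build_prompt(["a cat", "studio light"], [("example_lora", 0.8)])
```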
Enum: DPM++ 2M Karras, DPM++ SDE Karras, DPM++ 2M SDE Exponential, DPM++ 2M SDE Karras, Euler a, Euler, LMS, Heun, DPM2, DPM2 a, DPM++ 2S a, DPM++ 2M, DPM++ SDE, DPM++ 2M SDE, DPM++ 2M SDE Heun, DPM++ 2M SDE Heun Karras, DPM++ 2M SDE Heun Exponential, DPM++ 3M SDE, DPM++ 3M SDE Karras, DPM++ 3M SDE Exponential, DPM fast, DPM adaptive, LMS Karras, DPM2 Karras, DPM2 a Karras, DPM++ 2S a Karras, Restart, DDIM, PLMS, UniPC
The number of images generated in a single generation. Range: [0, 8]
The number of generations. Range: [0, 8]
Think of steps as iterations in the image creation process. Range: (0, 50]
This setting determines how closely Stable Diffusion will adhere to your prompt. Range: (0, 30]
A seed is a number from which Stable Diffusion generates noise.
Height of the image. Range: (0, 2048]
Width of the image. Range: (0, 2048]
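The numeric ranges listed above mix inclusive and exclusive lower bounds, which is easy to get wrong on the client side. A minimal sketch of a pre-flight check follows. The dictionary keys are descriptive placeholders chosen for this example, not confirmed request field names, and range_errors is a hypothetical helper.

```python
from typing import Dict, List, Tuple

# (low, high, low_inclusive) per documented range; every upper bound is inclusive.
# Key names are placeholders for this sketch, not confirmed API field names.
RANGES: Dict[str, Tuple[float, float, bool]] = {
    "images_per_generation": (0, 8, True),   # [0, 8]
    "generations": (0, 8, True),             # [0, 8]
    "steps": (0, 50, False),                 # (0, 50]
    "guidance": (0, 30, False),              # (0, 30]
    "height": (0, 2048, False),              # (0, 2048]
    "width": (0, 2048, False),               # (0, 2048]
}


def range_errors(params: Dict[str, float]) -> List[str]:
    """Return one message per value that falls outside its documented range."""
    errors = []
    for key, (low, high, low_inclusive) in RANGES.items():
        if key not in params:
            continue
        value = params[key]
        above_low = value >= low if low_inclusive else value > low
        if not (above_low and value <= high):
            left = "[" if low_inclusive else "("
            errors.append(f"{key}={value} is outside {left}{low}, {high}]")
    return errors
```

For example, range_errors({"steps": 0}) reports an error because the steps range excludes 0, while range_errors({"generations": 0}) does not.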
Name of the Stable Diffusion model. You can call the /v3/model endpoint with the parameter filter.types=checkpoint to retrieve the sd_name_in_api field as the model_name.
Enable the Stable Diffusion face restoration plugin.
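The model lookups mentioned on this page all use the /v3/model endpoint with a filter.types query parameter (lora, checkpoint, or vae). A minimal sketch of building that URL is shown below. The base URL is passed in by the caller because the API host is not stated here; https://api.example.com is a placeholder, and model_list_url is a hypothetical helper.

```python
from urllib.parse import urlencode


def model_list_url(base_url: str, model_type: str) -> str:
    """Build the /v3/model lookup URL for one model type.

    base_url is supplied by the caller; the real API host is not assumed here.
    """
    query = urlencode({"filter.types": model_type})
    return f"{base_url.rstrip('/')}/v3/model?{query}"


# Placeholder host, for illustration only.
checkpoint_url = model_list_url("https://api.example.com", "checkpoint")
```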
VAE (Variational Auto Encoder). To find a value for sd_vae, call the /v3/model endpoint with the query parameter filter.types=vae and use the returned sd_name field as the sd_vae.
This parameter indicates the number of layers to stop from the bottom during optimization. For example, a clip_skip of 2 means that in an SD1.x model, where CLIP has 12 layers, processing stops at the 10th layer.
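The clip_skip arithmetic in the description above can be written out directly. This is a sketch that follows the text literally (stop layer = total layers minus clip_skip); clip_stop_layer is a hypothetical helper, and the 12-layer default applies only to the SD1.x case mentioned.

```python
def clip_stop_layer(clip_skip: int, total_layers: int = 12) -> int:
    """Return the CLIP layer at which processing stops, per the description above."""
    if not 0 <= clip_skip < total_layers:
        raise ValueError(f"clip_skip must be in [0, {total_layers - 1}], got {clip_skip}")
    return total_layers - clip_skip


stop_layer = clip_stop_layer(2)  # 12 - 2 = 10
```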
Hires.fix function switch.
Upscalers model names. AI upscalers are models trained with massive amounts of data.
Enum: Latent, ESRGAN_4x, R-ESRGAN 4x+, R-ESRGAN 4x+ Anime6B
The magnification factor of the image. If hr_resize_x and hr_resize_y are set, this parameter is ignored.
Enum: 1, 2
The target image width. The maximum image size is 2048. Only takes effect when hr_scale=1.
The target image height. The maximum image size is 2048. Only takes effect when hr_scale=1.
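One way to read the interaction between hr_scale, hr_resize_x, and hr_resize_y is sketched below: when both explicit target dimensions are set they take priority and hr_scale is ignored, otherwise the base size is multiplied by hr_scale. This is an interpretation of the descriptions above, not confirmed server behavior, and hires_target_size is a hypothetical helper.

```python
from typing import Optional, Tuple

MAX_TARGET = 2048  # documented maximum for hr_resize_x and hr_resize_y


def hires_target_size(
    width: int,
    height: int,
    hr_scale: int = 1,
    hr_resize_x: Optional[int] = None,
    hr_resize_y: Optional[int] = None,
) -> Tuple[int, int]:
    """Estimate the Hires.fix output size under the assumptions stated above."""
    if hr_resize_x and hr_resize_y:
        # Explicit target size wins; hr_scale is ignored in this case.
        if max(hr_resize_x, hr_resize_y) > MAX_TARGET:
            raise ValueError(f"target size must not exceed {MAX_TARGET}")
        return (hr_resize_x, hr_resize_y)
    # No explicit target: scale the base dimensions by the magnification factor.
    return (width * hr_scale, height * hr_scale)
```

Under this reading, a 512x768 image with hr_scale=2 yields 1024x1536, and setting both resize values overrides the scale entirely.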
Image storage time in seconds. Range: [0, 604800] (up to 7 days)
Refiner settings used to enhance image details.
ControlNet.