# Rate limits - Documentation

> For the complete documentation index, see [llms.txt](/llms.txt). Markdown is available with `Accept: text/markdown` and `.md` URL variants.

Source: /docs/guides/model-apis-rate-limits

# Rate limits

##

[​](#1-understanding-rate-limits)

1. Understanding Rate Limits

Rate limiting refers to the rules that govern how frequently a user’s API may access our platform services within a given time window. Its goals are:

- Prevent API abuse and misuse: By setting an upper bound on request frequency, we limit abnormal or unintended traffic.

- Ensure fair resource allocation: We prevent a small number of users from monopolizing system resources, so everyone can access the service on equal terms.

- Maintain API performance and reliability: By smoothing out traffic spikes, we stabilize response times and reduce failure rates due to overload, improving overall service quality.

- Protect system stability: Rate limits help absorb sudden bursts of traffic, preventing high-concurrency spikes from overwhelming our infrastructure.

##

[​](#2-rate-limiting-metrics)

2. Rate Limiting Metrics

For our image and video models, we use two primary rate-limiting metrics:

- IPM (Images Per Minute): The number of images an image model is allowed to generate per minute.

- RPM (Requests Per Minute): The number of API requests a video model is allowed to handle per minute.

##

[​](#3-default-rate-limits)

3. Default Rate Limits

For different models, our platform applies differentiated rate-limiting strategies based on each model’s computational resource consumption.

###

[​](#ipm)

IPM

The IPM (Images Per Minute) limit controls the number of images that can be generated per minute. The default IPM values for each model are listed in the table below.

Resource/ServiceModel APIUser Default SettingsText to Image`txt2img_v3`20Image to Image`img2img_v3`10Remove Background`remove_background`10Replace Background`replace_background`10Remove Text`remove_text`10Inpainting`inpainting`10Cleanup`cleanup`10Merge Face`merge_face`10FLUX.1 [schnell] Text to Image`flux-1-schnell`10Upscale`upscale_v3`20

###

[​](#rpm)

RPM

The RPM (Requests Per Minute) limit controls the number of API requests allowed per minute. The default RPM values for each model are provided in the table below.

Resource/ServiceModel APIUser Default SettingsVideo Merge Face`video_merge_face`10Text to Video`txt2video`2Image to Video`img2video`2Wan 2.1 Text to Video`wan_txt_to_video`20Wan 2.1 Image to Video`wan i2v`20Hunyuan Video Fast`hunyuan_video_fast`20KLING V1.6 Image to Video`Kling i2v`20KLING V1.6 Text to Video`Kling t2v`20Minimax Video-01`Minimax`20

##

[​](#4-handling-rate-limits)

4. Handling Rate Limits

###

[​](#how-to-monitor-rate-limits)

How to Monitor Rate Limits?

If your API requests exceed the allowed rate, the API will return:

- HTTP Status Code: 429 Too Many Requests

- Response Body: A message indicating that the rate limit has been exceeded.

###

[​](#best-practices)

Best Practices

To avoid triggering rate limits, you can:

- Implement client-side request throttling: Respect the platform’s rate limits by controlling your application’s request rate, ensuring you don’t send too many calls in a short period.

- Use exponential backoff on retries: When you receive a rate-limit response (e.g., HTTP 429), wait progressively longer between retry attempts instead of retrying immediately, reducing load on the service.

- Monitor your API usage: Continuously track and log your request counts, frequencies, and any error responses so you can adjust your usage patterns proactively.

###

[​](#when-you-hit-rate-limits)

When You Hit Rate Limits

If you receive an HTTP 429 (“Too Many Requests”) response, you can:

- Retry later: Wait a short period before sending your request again.

- Optimize your requests: Reduce the frequency of calls to stay within the platform’s rate limits.

- Request a higher rate limit: If you require a higher rate limit, please fill out the contact form at the [Quotas&Limits](https://novita.ai/quota-limits/image) to get in touch with us.

Last modified on May 12, 2025
