RPM (Requests Per Minute) limits vary based on a user’s verification level and account status. For detailed limits, please refer to the official documentation:
Yes. Novita plans to launch an RPM Upgrade Package, allowing users to manage and increase RPM limits independently, without manual approval.For further assistance, please book a call with our sales team.
How to control thinking function of Zai-org/GLM-4.5 when calling its API?
When calling API zai-org/glm-4.5, there always exists some situations where thinking function is not needed. In these cases, if you want to turn thinking function off, you can simply add one fixed sentence called:
Copy
Ask AI
"enable_thinking": false
at the bottom, for example:
Copy
Ask AI
{ "model": "zai-org/glm-4.5", "messages": [ { "role": "user", "content": "How is the weather in New York?" } ], "temperature": 0.7, "stream": false, "max_tokens": 500, "tool_choice": "auto", "enable_thinking": false}