Skip to main content
Dear Developer, We are writing to inform you of upcoming changes to our Serverless Endpoints. As part of our regular product lifecycle management to ensure high performance and resource optimization, we will be deprecating the models listed below on June 19, 2026 (UTC). You have 15 days from today to update your integration before these models are removed from the Serverless Endpoints.
Please refer to the table below to see if your application is using any of the impacted models and find their recommended alternatives:
Deprecated modelRecommended alternativeNotes
meta-llama/llama-3-70b-instructmeta-llama/llama-3.3-70b-instruct
meta-llama/llama-3-8b-instructmeta-llama/llama-3.1-8b-instruct
deepseek/deepseek-prover-v2-671bdeepseek/deepseek-v4-pro
zai-org/glm-4.5zai-org/glm-5.1

What actions should you take?

To avoid service interruption, please choose one of the following options before the deadline: Update your API code to point to the Recommended Alternative listed above. The newer models offer improved reasoning capabilities, faster inference speeds, and better cost-efficiency.

Option 2: Continue Using Legacy Models (via Dedicated Endpoints)

If your workflow strictly requires a specific version from the deprecated list (e.g., for reproducibility or specific fine-tuning), you can deploy it on your own private GPU resources using our Dedicated Endpoints.

Timeline

  • Announcement Date: June 4, 2026
  • End of Life (EOL): June 19, 2026 (UTC)
Note: After the EOL date, API requests to the deprecated models via Serverless Endpoints will result in an error. If you have any questions or need assistance with the migration, please reach out to us via our Discord community or submit a support ticket. Best regards, The Novita AI Team