August 8, 2025 Notice - Documentation

Model APIs Updates

New Model Available: Qwen-Image The Qwen-Image is a 20B MMDIT model for next-gen text-to-image generation. Especially strong at creating stunning graphic posters with native text. Key Highlights:
- SOTA text rendering — rivals GPT-4o in English, best-in-class for Chinese
- In-pixel text generation — no overlays, fully integrated
- Bilingual support, diverse fonts, complex layouts
Also excels at general image generation — from photorealistic to anime, impressionist to minimalist. A true creative powerhouse. Get started now: Qwen-Image Updated time: August 6, 2025
New Model Available: Wan 2.2 Text to Video, Wan 2.2 Image to Video Wan2.2 is the world’s first open-source MoE-architecture video generation model with cinematic control. Its Mixture-of-Experts (MoE) architecture scales model capacity without increasing computational requirements. Key Highlights:
- First-of-its-kind MoE Architecture: The first model to apply an MoE framework to video generation, drastically cutting down on resource consumption.
- 50% More Efficient: Saves approximately 50% of computational resources compared to models of a similar parameter scale.
- Enhanced Capabilities: Achieves significant improvements in generating complex motion, character interactions, and overall aesthetic quality.
Get started now: Wan 2.2 Text to Video, Wan 2.2 Image to Video Updated time: August 8, 2025

Add region selection when creating an endpoint in Serverless Novita now supports creating endpoints in selected regions. Updated time: August 5, 2025

Last modified on August 11, 2025

⌘I