Model APIs Updates
New Features
-
Novita LLMs Dedicated Endpoints
Fully Managed Deployment for Your Custom AI Models
High cost performance
- Deploy available models and LoRA adapters on customizable GPU endpoints with per-hour billing.
- Lower prices, No hard rate limits, Cheaper under high utilization, Custom base models and multiple LoRAs, Predictable performance unaffected. Learn more about Pricing and read this Blog to learn how to deploy a custom base model with Novita LLMs Dedicated Endpoints.
Agent SandBox Updates
New Features
-
Agent Sandbox Is Now Live
Agent Sandbox is a high-performance cloud runtime purpose-built for AI Agent workloads, designed to support the full lifecycle of autonomous agents.
Key capabilities include:
- Millisecond-level startup, optimized for high-concurrency tasks
- Multi-language support, including Python, JavaScript, and C++
- Enables code execution, web access, and system-level interactions
- Per-second billing for vCPU and memory — flexible, cost-efficient usage
Platform Service Updates
Improvements
-
In-App Messaging Now Live for Critical Notifications
Our new in-app messaging system is now available, allowing you to receive important platform-level updates in one place, including:
- GPU failures, service interruptions, and operational alerts
- Account-related notices, such as coupon delivery or login activity
- Promotional events and platform campaigns