Model APIs Updates

New Features

  • Novita LLMs Dedicated Endpoints: Fully Managed, Cost-Effective Deployment for Your Custom AI Models
    • Deploy available models and LoRA adapters on customizable GPU endpoints with per-hour billing.
    • Benefits: lower prices, no hard rate limits, lower cost under high utilization, support for custom base models and multiple LoRAs, and predictable performance.
    Learn more on the Pricing page, and read the blog post on how to deploy a custom base model with Novita LLMs Dedicated Endpoints.
    Updated time: July 4, 2025
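
To see why an hourly-billed endpoint can cost less under high utilization, a rough break-even estimate helps. The sketch below is illustrative only. It assumes the alternative is usage-based, per-token billing, which this changelog does not specify. The function name and both prices are hypothetical placeholders, not Novita rates.

```python
def break_even_tokens_per_hour(endpoint_price_per_hour: float,
                               price_per_million_tokens: float) -> float:
    """Return the hourly token volume above which a flat hourly price
    is cheaper than paying per token (assumed comparison model)."""
    if price_per_million_tokens <= 0:
        raise ValueError("price_per_million_tokens must be positive")
    return endpoint_price_per_hour / price_per_million_tokens * 1_000_000


# Hypothetical placeholder prices, NOT actual Novita rates.
threshold = break_even_tokens_per_hour(endpoint_price_per_hour=2.0,
                                       price_per_million_tokens=0.5)
print(f"Break-even: {threshold:,.0f} tokens/hour")
```

Under these assumed numbers, sustained traffic above the printed threshold would favor the dedicated endpoint. Substitute the real figures from the Pricing page before drawing conclusions.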

Agent Sandbox Updates

New Features

  • Agent Sandbox Is Now Live
    Agent Sandbox is a high-performance cloud runtime purpose-built for AI Agent workloads, designed to support the full lifecycle of autonomous agents. Key capabilities include:
    • Millisecond-level startup, optimized for high-concurrency tasks
    • Multi-language support, including Python, JavaScript, and C++
    • Code execution, web access, and system-level interactions
    • Per-second billing for vCPU and memory, for flexible, cost-efficient usage
    As the infrastructure foundation for the runtime era of AI Agents, Agent Sandbox empowers the next generation of generative agents with fast, scalable, and reliable execution capabilities. Visit the Documentation Center to learn more.
    Updated time: June 30, 2025
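
Per-second billing for vCPU and memory can be estimated with simple arithmetic. The following is a minimal sketch, assuming cost scales linearly with runtime, vCPU count, and memory size. The function name, parameter names, and rates are hypothetical and do not reflect actual Agent Sandbox pricing.

```python
def sandbox_cost(seconds: float, vcpus: float, memory_gib: float,
                 vcpu_rate_per_s: float, mem_rate_per_gib_s: float) -> float:
    """Estimate the cost of one sandbox run billed per second.

    Assumes a linear model: each vCPU and each GiB of memory is charged
    at its own per-second rate for the whole runtime.
    """
    if seconds < 0:
        raise ValueError("seconds must be non-negative")
    per_second = vcpus * vcpu_rate_per_s + memory_gib * mem_rate_per_gib_s
    return seconds * per_second


# Hypothetical placeholder rates, NOT actual Agent Sandbox prices.
cost = sandbox_cost(seconds=90, vcpus=2, memory_gib=4,
                    vcpu_rate_per_s=0.00001, mem_rate_per_gib_s=0.000005)
print(f"Estimated cost for a 90-second run: ${cost:.6f}")
```

Because billing stops when the sandbox stops, short-lived agent tasks pay only for the seconds they actually run. Check the Documentation Center for the real rates and any minimum charges.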

Platform Service Updates

Improvements

  • In-App Messaging Now Live for Critical Notifications
    Our new in-app messaging system is now available, allowing you to receive important platform-level updates in one place, including:
    • GPU failures, service interruptions, and operational alerts
    • Account-related notices, such as coupon delivery or login activity
    • Promotional events and platform campaigns
    All messages can be viewed in your Message Center in the console. Stay informed, effortlessly.
    Updated time: July 1, 2025