Novita AI & Browser Use: Quick Start

Browser Use is an open-source library that empowers LLMs to directly control web browsers, revolutionizing web interaction with advanced automation. By integrating Novita AI’s powerful LLMs and tools, Browser Use enables seamless browsing, content generation, and task automation for an optimized user experience.

This tutorial will show you how to integrate the Novita AI API with Browser Use to automate browser interactions.

How to Use Browser Use with Novita AI

Prerequisites

Python 3.11 or higher
A Novita AI API key

Installation

Step 1: Install Browser Use using pip:

pip install browser-use

Step 2: Install Playwright (required for browser automation):

playwright install chromium

Obtaining Novita AI LLM API Key

Create an account: Visit Novita AI’s website and sign up for an account.
Generate your API Key: After logging in, navigate to the Key Management page to generate your API key. This key is essential to connect Novita AI’s models to Cursor.

Environment Setup

Create a .env file in your project root and add your Novita API key:

NOVITA_API_KEY=your_api_key_here

Basic Implementation

Here’s a complete example of using Browser Use with Novita AI’s API:

"""
Web automation using Novita AI and Browser Use
"""

import asyncio
import os

from dotenv import load_dotenv
from langchain_openai import ChatOpenAI
from pydantic import SecretStr

from browser_use import Agent

# Load environment variables
load_dotenv()

api_key = os.getenv('NOVITA_API_KEY', '')
if not api_key:
    raise ValueError('NOVITA_API_KEY is not set')


async def run_search():
    agent = Agent(
        task=(
            '1. Go to https://www.reddit.com/r/LocalLLaMA '
            "2. Search for 'browser use' in the search bar "
            '3. Click on first result '
            '4. Return the first comment'
        ),
        llm=ChatOpenAI(
            base_url='https://api.novita.ai/v3/openai',
            model='deepseek/deepseek-v3-0324',
            api_key=SecretStr(api_key),
        ),
        use_vision=False,
    )

    await agent.run()


if __name__ == '__main__':
    asyncio.run(run_search())

Creating Your Own Tasks

You can customize the task parameter to perform a wide variety of web tasks:

task="Compare the price of gpt-4o and DeepSeek-V3"

For more complex tasks, you might want to enable vision capabilities:

agent = Agent(
    task="Find and summarize the latest news about AI on TechCrunch",
    llm=ChatOpenAI(
        base_url='https://api.novita.ai/v3/openai',
        model='deepseek/deepseek-v3-0324',
        api_key=SecretStr(api_key),
    ),
    use_vision=True,
)

Novita AI & DeepSearcher: Quick Start Novita AI & OWL Integration Guide

On this page

How to Use Browser Use with Novita AI
Prerequisites
Installation
Obtaining Novita AI LLM API Key
Environment Setup
Basic Implementation
Creating Your Own Tasks

Get started

Model APIs

Agent Sandbox

GPUs

Observability

Resources

Novita AI & Browser Use: Quick Start

How to Use Browser Use with Novita AI

Prerequisites

Installation

Obtaining Novita AI LLM API Key

Environment Setup

Basic Implementation

Creating Your Own Tasks

Get started

Model APIs

Agent Sandbox

GPUs

Observability

Resources

​How to Use Browser Use with Novita AI

​Prerequisites

​Installation

​Obtaining Novita AI LLM API Key

​Environment Setup

​Basic Implementation

​Creating Your Own Tasks

How to Use Browser Use with Novita AI

Prerequisites

Installation

Obtaining Novita AI LLM API Key

Environment Setup

Basic Implementation

Creating Your Own Tasks