Create Voice Model API

Fish Audio Voice Cloning

curl --request POST \
  --url https://api.novita.ai/v4beta/model \
  --header 'Authorization: <authorization>' \
  --header 'Content-Type: <content-type>' \
  --data '
{
  "type": {},
  "title": "<string>",
  "train_mode": {},
  "voices": [
    null
  ],
  "visibility": {},
  "description": {},
  "cover_image": {},
  "texts": [
    "<string>"
  ],
  "tags": [
    "<string>"
  ],
  "enhance_audio_quality": true
}
'

{
  "_id": "<string>",
  "type": {},
  "title": "<string>",
  "description": "<string>",
  "cover_image": "<string>",
  "state": {},
  "tags": [
    "<string>"
  ],
  "created_at": {},
  "updated_at": {},
  "visibility": {},
  "like_count": 123,
  "mark_count": 123,
  "shared_count": 123,
  "task_count": 123,
  "author": {
    "_id": "<string>",
    "nickname": "<string>",
    "avatar": "<string>"
  },
  "train_mode": {},
  "samples": [
    {
      "title": "<string>",
      "text": "<string>",
      "task_id": "<string>",
      "audio": "<string>"
    }
  ],
  "languages": [
    "<string>"
  ],
  "lock_visibility": true,
  "unliked": true,
  "liked": true,
  "marked": true
}

POST

v4beta

model

Fish Audio Voice Cloning

curl --request POST \
  --url https://api.novita.ai/v4beta/model \
  --header 'Authorization: <authorization>' \
  --header 'Content-Type: <content-type>' \
  --data '
{
  "type": {},
  "title": "<string>",
  "train_mode": {},
  "voices": [
    null
  ],
  "visibility": {},
  "description": {},
  "cover_image": {},
  "texts": [
    "<string>"
  ],
  "tags": [
    "<string>"
  ],
  "enhance_audio_quality": true
}
'

{
  "_id": "<string>",
  "type": {},
  "title": "<string>",
  "description": "<string>",
  "cover_image": "<string>",
  "state": {},
  "tags": [
    "<string>"
  ],
  "created_at": {},
  "updated_at": {},
  "visibility": {},
  "like_count": 123,
  "mark_count": 123,
  "shared_count": 123,
  "task_count": 123,
  "author": {
    "_id": "<string>",
    "nickname": "<string>",
    "avatar": "<string>"
  },
  "train_mode": {},
  "samples": [
    {
      "title": "<string>",
      "text": "<string>",
      "task_id": "<string>",
      "audio": "<string>"
    }
  ],
  "languages": [
    "<string>"
  ],
  "lock_visibility": true,
  "unliked": true,
  "liked": true,
  "marked": true
}

Fish Audio API for creating a voice model (voice cloning).

Request Headers

Content-Type

string

required

Enum: application/json

Authorization

string

required

Bearer authentication format, for example: Bearer {{API Key}}.

Request Body

type

enum<string>

required

Model type, tts is for text to speech.Available options: ttsAllowed value: "tts"

title

string

required

Model title or name.

train_mode

enum<string>

required

Model train mode, for TTS model, fast means model instantly available after creation.Available options: fastAllowed value: "fast"

voices

file[]

required

Upload voices files that will be used to tune the model.

visibility

enum<string>

default:"public"

Model visibility, public will be shown in the discovery page, unlist allows anyone with the link to access, private only be visible to the creator.Available options: public, unlist, private

description

string | null

Model description.

cover_image

file | null

Model cover image, this is required if the model is public.

texts

string[]

Texts corresponding to the voices, if unspecified, ASR will be performed on the voices.

Response

_id

string

required

Unique identifier for the created model.

type

enum<string>

required

Model type.Available options: svc, tts

title

string

required

Model title or name.

description

string

required

Model description.

cover_image

string

required

URL of the model cover image.

state

enum<string>

required

Current state of the model.Available options: created, training, trained, failed

Overview

Basic

Model APIs

GPUs

Fish Audio Voice Cloning

Request Headers

Request Body

Response

Overview

Basic

Model APIs

GPUs

​Request Headers

​Request Body

​Response

Request Headers

Request Body

Response