AI models catalog

JSON2Video integrates with several AI providers to generate assets at render time. Models are addressed by name in three element types:

image — image element with a prompt and a model.
video — video element with a prompt and a model.
voice — voice element with a text and a model (or implicit default).

Models marked featured are the recommended defaults. Models with credits per unit set to 0 do not consume credits when invoked with JSON2Video's managed key; usage may still be subject to per-provider rate limits.

When a connection is supplied on the element, the request runs against the customer's own provider account and no JSON2Video credits are deducted. When no connection is supplied, JSON2Video's managed key is used and credits are deducted according to the table below.

Image generation

Name	Credits / unit	Unit	Provider(s)	Featured
`flux-schnell`	0	file	replicate	yes
`freepik-classic`	0	file	freepik	yes
`gemini-2.5-flash`	20	file	replicate, wavespeed	yes
`nano-banana-pro`	80	file	wavespeed, replicate	no
`nano-banana-2`	40	file	wavespeed	no
`seedream-v4`	15	file	wavespeed, replicate, freepik	no
`flux-pro`	20	file	replicate, wavespeed	no

Aliases: text-to-image-basic → flux-schnell, text-to-image-pro → seedream-v4, nano-banana → gemini-2.5-flash

Voice synthesis

Name	Unit	Provider(s)	Featured
`azure`	second	azure	no
`elevenlabs-flash-v2-5`	second	elevenlabs	no
`elevenlabs-v3`	second	elevenlabs	no
`elevenlabs-v2`	second	elevenlabs	no

Aliases: elevenlabs → elevenlabs-v2

Video generation

Name	Credits / unit	Unit	Provider(s)	Featured
`seedance-v1.5-pro`	20	second	wavespeed	yes
`wan-v2.5`	30	second	wavespeed	no

Avatar / talking-head

Name	Credits / unit	Unit	Provider(s)	Featured
`infinite-talk`	20	second	wavespeed	no

Aliases: talking-head → infinite-talk

How to use a model

{
    "type": "image",
    "model": "flux-schnell",
    "prompt": "A futuristic city at sunset",
    "duration": 5
}

{
    "type": "voice",
    "model": "elevenlabs-flash-v2-5",
    "voice": "JBFqnCBsd6RMkjVDRZzb",
    "text": "Welcome to the future of automated video.",
    "connection": "your-elevenlabs-connection-id"
}

Notes

Aliases (e.g. text-to-image-basic, nano-banana, elevenlabs) redirect to a concrete model. Use them when you want JSON2Video to pick a reasonable default in your stead.
Provider-side rate limits apply on top of any JSON2Video quota. Failed AI generation surfaces as a render error; see Errors.
The credit cost shown here is what JSON2Video deducts when using its managed API keys. If you set a connection on the element, the cost is borne by your provider account instead.
The catalog above is fetched live from https://cdn.json2video.com/data/ai/models.json and cached for 15 minutes. If the upstream is unreachable, the last cached version is shown.