AI models catalog
JSON2Video integrates with several AI providers to generate assets at render time. Models are addressed by name in three element types:
image—imageelement with apromptand amodel.video—videoelement with apromptand amodel.voice—voiceelement with atextand amodel(or implicit default).
Models marked featured are the recommended defaults. Models with credits per unit set to 0 do not consume credits when invoked with JSON2Video's managed key; usage may still be subject to per-provider rate limits.
When a connection is supplied on the element, the request runs against the customer's own provider account and no JSON2Video credits are deducted. When no connection is supplied, JSON2Video's managed key is used and credits are deducted according to the table below.
Image generation
| Name | Credits / unit | Unit | Provider(s) | Featured |
|---|---|---|---|---|
flux-schnell | 0 | file | replicate | yes |
freepik-classic | 0 | file | freepik | yes |
gemini-2.5-flash | 20 | file | replicate, wavespeed | yes |
nano-banana-pro | 80 | file | wavespeed, replicate | no |
nano-banana-2 | 40 | file | wavespeed | no |
seedream-v4 | 15 | file | wavespeed, replicate, freepik | no |
flux-pro | 20 | file | replicate, wavespeed | no |
Aliases: text-to-image-basic → flux-schnell, text-to-image-pro → seedream-v4, nano-banana → gemini-2.5-flash
Voice synthesis
| Name | Credits / unit | Unit | Provider(s) | Featured |
|---|---|---|---|---|
azure | 0 | second | azure | no |
elevenlabs-flash-v2-5 | 0 | second | elevenlabs | no |
elevenlabs-v3 | 0 | second | elevenlabs | no |
elevenlabs-v2 | 0 | second | elevenlabs | no |
Aliases: elevenlabs → elevenlabs-v2
Video generation
| Name | Credits / unit | Unit | Provider(s) | Featured |
|---|---|---|---|---|
seedance-v1.5-pro | 20 | second | wavespeed | yes |
wan-v2.5 | 30 | second | wavespeed | no |
Avatar / talking-head
| Name | Credits / unit | Unit | Provider(s) | Featured |
|---|---|---|---|---|
infinite-talk | 20 | second | wavespeed | no |
Aliases: talking-head → infinite-talk
How to use a model
{
"type": "image",
"model": "flux-schnell",
"prompt": "A futuristic city at sunset",
"duration": 5
}
{
"type": "voice",
"model": "elevenlabs-flash-v2-5",
"voice": "JBFqnCBsd6RMkjVDRZzb",
"text": "Welcome to the future of automated video.",
"connection": "your-elevenlabs-connection-id"
}
Notes
- Aliases (e.g.
text-to-image-basic,nano-banana,elevenlabs) redirect to a concrete model. Use them when you want JSON2Video to pick a reasonable default in your stead. - Provider-side rate limits apply on top of any JSON2Video quota. Failed AI generation surfaces as a render error; see Errors.
- The credit cost shown here is what JSON2Video deducts when using its managed API keys. If you set a
connectionon the element, the cost is borne by your provider account instead. - The catalog above is fetched live from
https://cdn.json2video.com/data/ai/models.jsonand cached for 15 minutes. If the upstream is unreachable, the last cached version is shown.