Archived docs Get your API Key
Get started
Tutorials
Guides
Reference
Help for AI agents
πŸ€– AI Assistant

Audiograms

An audiogram is a video that visualises an audio file as an animated waveform on top of a static or animated background. It is the standard format used to share podcast snippets, voice clips, and other audio-first content on platforms that require video.

JSON2Video has a first-class audiogram element that renders a waveform synchronised to any audio source. You can combine it with images, text, and subtitles to build a complete share-ready video.

Minimum audiogram

The smallest possible audiogram needs an audio source and an audiogram element:

{
  "resolution": "instagram-portrait",
  "scenes": [
    {
      "elements": [
        { "type": "audio", "src": "https://example.com/clip.mp3" },
        {
          "type": "audiogram",
          "color": "white",
          "amplitude": 5,
          "y": 800
        }
      ]
    }
  ]
}

The audiogram element does not own an audio source β€” it visualises whatever audio is playing in the scene. Add one audio element (file, podcast clip…) or one voice element (AI-generated) and the audiogram tracks it automatically.

Common pattern: podcast share clip

A typical share clip combines:

  • A vertical 1080Γ—1920 (9:16) or square 1080Γ—1080 frame.
  • A still cover image (album art, episode artwork).
  • The host name or episode title as text.
  • The audiogram element overlaid on top.
  • Automatic subtitles for accessibility.
{
  "resolution": "instagram-portrait",
  "scenes": [
    {
      "elements": [
        { "type": "image", "src": "https://example.com/cover.jpg", "zoom": 1.1 },
        { "type": "audio", "src": "https://example.com/episode-clip.mp3" },
        {
          "type": "text",
          "text": "Episode 42 β€” Building in public",
          "y": 100,
          "style": "002",
          "settings": { "font-size": "70px", "color": "white" }
        },
        {
          "type": "audiogram",
          "color": "#FF6B00",
          "amplitude": 6,
          "y": 1500,
          "height": 200
        },
        {
          "type": "subtitles",
          "settings": { "style": "classic", "position": "bottom-center" }
        }
      ]
    }
  ]
}

Customising the look

The audiogram element accepts:

  • color β€” hex or named colour for the bars.
  • amplitude β€” how much the bars react to the audio (default ~5).
  • height β€” visual height of the waveform.
  • x, y β€” position (same coordinate system as every other element).
  • width β€” horizontal width of the waveform.

For the full property list, see the audiogram element reference.

See also