The captioning API for styled, burned-in subtitles

ZapCap renders captions through one API

Send a video, choose a style, get a finished captioned MP4, a transparent overlay, or a green-screen layer back. Not a transcript API — finished captioned video, programmatically. (Prefer a UI? The same engine powers the web editor at zapcap.ai.) Webhook-native. Multilingual. Per-minute pricing.

API access on the Pro plan · buy render credits in the dashboard
Why this API exists

Captions are the last mile — ZapCap renders them

Transcription APIs return text. Video automation APIs need you to design a template. ZapCap fills the gap in between — finished, styled captioned video, returned by a single task call.

  • Not a transcript API — we ship the rendered video, with captions baked in.
  • No template to design — no scenes, no timelines, no templates to maintain — one task call.
  • Backend-first — a clean REST API for SaaS, agencies, and ad ops — with the zapcap.ai web editor when you want a UI.
Quickstart

Three endpoints, the whole API

Upload a video, create a render task, receive a webhook. Everything else — caption layout, transcription, translation, animation — happens server-side.

POST/videos

Upload

Send a remote URL or multipart upload. Returns a videoId.

POST/videos/:id/task

Create render task

Pick a templateId, set render options, attach a webhook notification.

GET/videos/:id/task/:taskId

Poll task (optional)

Webhook-first, but polling works if your stack doesn't accept callbacks.

1# Upload a video URL
2curl https://api.zapcap.ai/videos/url \
3 -H "x-api-key: $ZAPCAP_KEY" \
4 -d '{"url":"https://cdn.acme.com/clip.mp4"}'
5
6# Create a task with a caption template
7# fetch IDs from GET /templates
8curl https://api.zapcap.ai/videos/$ID/task \
9 -H "x-api-key: $ZAPCAP_KEY" \
10 -d '{
11 "templateId": "<TEMPLATE_UUID>",
12 "language": "en",
13 "notification": {
14 "type": "webhook",
15 "notificationsFor": ["render"],
16 "recipient": "https://acme.com/hooks/zapcap"
17 }
18 }'
19
20# Webhook fires →
21# { taskId, event: "completed", notificationFor: "render", renderUrl }
Output modes

Pick the format your pipeline wants

Burned-in MP4

Captions baked into the source frames. The right answer for TikTok, Reels, Shorts, ad creative.

Transparent overlay

Alpha-channel caption layer — MOV ProRes 4444 or WebM VP9 alpha — for NLE compositing.

Green-screen layer

Caption layer with #04F404 backdrop, for editors and live tools without alpha support.

Pricing

Per-minute API credits, usage-based

Pay $0.10/min for rendered video. Volume credits and dedicated capacity are available when you outgrow the dashboard plan.