caption.extract for speech-to-text transcription, and caption.burn for overlaying subtitles onto video.
Extract captions (speech-to-text)
The caption.extract job uses AI-powered transcription (Deepgram) to convert spoken audio into timed SRT subtitles.
Parameters
| Parameter | Type | Default | Description |
|---|---|---|---|
| language | string | "en" | Language code (ISO 639-1). Supports 30+ languages. |
| model | string | "nova-3" | Deepgram model. Options: nova-3 (best quality), nova-2, base. |
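As an illustration of the parameters above, here is a minimal sketch that builds and validates a parameter dictionary before a job is submitted. The function name `build_extract_params` and the dictionary shape are assumptions made for this example, not part of the Rendobar API. Only the parameter names, defaults, and model options come from the table.

```python
# Hypothetical helper: validates caption.extract parameters locally.
# The names here are illustrative assumptions, not Rendobar API calls.
ALLOWED_MODELS = ("nova-3", "nova-2", "base")  # options listed in the table


def build_extract_params(language: str = "en", model: str = "nova-3") -> dict:
    """Return a parameter dict using the documented defaults."""
    if model not in ALLOWED_MODELS:
        raise ValueError(f"model must be one of {ALLOWED_MODELS}, got {model!r}")
    # ISO 639-1 codes are two ASCII letters, e.g. "en", "de", "ja".
    if len(language) != 2 or not (language.isascii() and language.isalpha()):
        raise ValueError(f"language must be a two-letter ISO 639-1 code, got {language!r}")
    return {"language": language.lower(), "model": model}


params = build_extract_params(language="de", model="nova-2")
```

Validating locally catches a mistyped model name before any transcription minutes are billed. The check only tests the shape of the language code, not whether the service supports that language.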
Output
The result is an SRT file containing timed captions.
Burn captions onto video
The caption.burn job overlays subtitles directly onto the video frames, producing a new video with hard-coded captions.
If you omit the subtitles input, Rendobar automatically extracts captions first and then burns them, combining both steps in a single job.
Subtitle format
Rendobar accepts SRT format. Each block contains a sequence number, a timestamp range (HH:MM:SS,mmm --> HH:MM:SS,mmm), and the caption text. Blocks are separated by a blank line.
Two-step workflow
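For maximum control, use a two-step workflow: run caption.extract to get the SRT file, review and correct it, then pass the edited file to caption.burn as the subtitles input.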
This gives you full control over caption accuracy before committing to the final video.
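The review step between the two jobs can be done locally. The sketch below shows one way to parse an SRT file, correct a transcription error, and write it back out before burning. The names `Cue`, `parse_srt`, `replace_text`, and `format_srt`, and the sample captions, are hypothetical and exist only for this example. Submitting the jobs themselves is not shown.

```python
import re
from dataclasses import dataclass


@dataclass
class Cue:
    index: int
    start: str
    end: str
    text: str


# SRT timing line, e.g. "00:00:01,000 --> 00:00:03,500"
TIMING = re.compile(r"(\d{2}:\d{2}:\d{2},\d{3}) --> (\d{2}:\d{2}:\d{2},\d{3})")


def parse_srt(srt: str) -> list[Cue]:
    """Split SRT text into cues, skipping malformed blocks."""
    cues = []
    normalized = srt.replace("\r\n", "\n").strip()
    for block in re.split(r"\n\s*\n", normalized):
        lines = block.splitlines()
        if len(lines) < 3 or not lines[0].strip().isdigit():
            continue
        match = TIMING.fullmatch(lines[1].strip())
        if match is None:
            continue
        cues.append(Cue(int(lines[0]), match.group(1), match.group(2), "\n".join(lines[2:])))
    return cues


def replace_text(cues: list[Cue], old: str, new: str) -> list[Cue]:
    """Return new cues with a transcription error corrected."""
    return [Cue(c.index, c.start, c.end, c.text.replace(old, new)) for c in cues]


def format_srt(cues: list[Cue]) -> str:
    """Serialize cues back to SRT, renumbering them from 1."""
    blocks = [f"{i}\n{c.start} --> {c.end}\n{c.text}" for i, c in enumerate(cues, start=1)]
    return "\n\n".join(blocks) + "\n"


# Made-up sample standing in for the output of caption.extract.
SAMPLE = """1
00:00:01,000 --> 00:00:03,500
Welcome to Rendo bar.

2
00:00:03,600 --> 00:00:06,000
Let's get started.
"""

corrected = format_srt(replace_text(parse_srt(SAMPLE), "Rendo bar", "Rendobar"))
```

The timestamps are kept as strings because this step only edits text. Parse them into numbers only if you also need to shift or retime captions.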
Cost
| Job Type | Cost Model |
|---|---|
| caption.extract | $0.50/min of input |
| caption.burn | $1.00/min of output |
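To budget a job ahead of time, the rates above can be turned into a small estimator. This is a rough sketch: it assumes billing is prorated by the second, which the table does not state (whole-minute rounding would give higher figures), and the function name `estimate_cost` is hypothetical.

```python
from decimal import Decimal

# Per-minute rates taken from the cost table.
RATES_PER_MINUTE = {
    "caption.extract": Decimal("0.50"),  # charged on input duration
    "caption.burn": Decimal("1.00"),     # charged on output duration
}


def estimate_cost(job_type: str, duration_seconds: float) -> Decimal:
    """Estimate cost in dollars, assuming per-second proration (an assumption)."""
    if duration_seconds < 0:
        raise ValueError("duration_seconds must be non-negative")
    minutes = Decimal(str(duration_seconds)) / 60
    return (RATES_PER_MINUTE[job_type] * minutes).quantize(Decimal("0.01"))


# A 90-second clip run through the two-step workflow.
extract_cost = estimate_cost("caption.extract", 90)  # Decimal("0.75")
burn_cost = estimate_cost("caption.burn", 90)        # Decimal("1.50")
total = extract_cost + burn_cost                     # Decimal("2.25")
```

`Decimal` is used instead of floats so that the currency amounts round predictably to cents.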