StyloDocs
Workflow blocks

Transcribe Audio

Convert audio files to text using speech-to-text AI

View as Markdown

ai/transcribe-audio — Convert audio files to text using speech-to-text AI.

Where it appears

The Transcribe Audio block lives in the AI group of the Workflow Builder.

Fields

FieldTypeRequiredDefaultDescription
audioFiletext (multi-line, supports references)YesAudio file to transcribe. Supports MP3, WAV, M4A, FLAC, OGG, WebM (max 25 MB).
languagetextNoISO-639-1 language code (e.g., en, es, fr). Leave empty for auto-detection.

Inputs and outputs

Inputs are the configurable fields above. The block produces these outputs:

Outputs

Reference an output downstream with <Transcribe Audio.field>.

NameTypeDescription
successbooleanWhether transcription succeeded
textstringTranscribed text
languagestringDetected language code
durationnumberAudio duration in seconds
segmentsjsonArray of segments with text and timestamps
errorstringError message if failed

Example

A minimal configuration for this block:

{
  "type": "ai/transcribe-audio",
  "config": {
    "audioFile": "<PreviousStep.field>"
  }
}

Limits and failure modes

  • Required field (audioFile) must be set, or the block fails validation before it runs.
  • On failure the error output is populated; downstream blocks that reference other outputs may not receive values.

On this page