Create Transcription

Supported Providers

Azure OpenAI
Fireworks AI
Groq
OpenAI

POST /v1/audio/transcriptions

Authorizations

Body

file · string (binary) · Required

The audio file object (not file name) to transcribe, in one of these formats: flac, mp3, mp4, mpeg, mpga, m4a, ogg, wav, or webm.
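As a rough illustration, a client could check the file extension against the formats listed above before uploading. This is only a sketch: the helper name `has_supported_extension` is hypothetical, and it looks at the extension only, not at the actual audio encoding.

```python
from pathlib import Path

# Formats listed in the `file` parameter description.
SUPPORTED_FORMATS = {"flac", "mp3", "mp4", "mpeg", "mpga", "m4a", "ogg", "wav", "webm"}


def has_supported_extension(path: str) -> bool:
    """Return True if the file extension matches one of the listed formats.

    Hypothetical helper: it inspects the extension only, not the file contents.
    """
    suffix = Path(path).suffix.lower().lstrip(".")
    return suffix in SUPPORTED_FORMATS
```

A pre-flight check like this can catch obvious mistakes locally, but the API remains the authority on whether a given file is accepted.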

model · any of (string, string · enum) · Required

ID of the model to use. The options are gpt-4o-transcribe, gpt-4o-mini-transcribe, and whisper-1.

Example: whisper-1

language · string · Optional

The language of the input audio. Supplying the input language in ISO-639-1 format will improve accuracy and latency.
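ISO-639-1 codes are two-letter codes such as `en` or `de`. The sketch below normalizes a user-supplied value and rejects anything that does not have that shape. The function name `normalize_language` is hypothetical, and the check covers the shape only; it does not verify that the code is actually assigned in ISO-639-1 or supported by the model.

```python
import re

# Shape of an ISO-639-1 code: exactly two lowercase ASCII letters.
_ISO_639_1_SHAPE = re.compile(r"[a-z]{2}")


def normalize_language(code: str) -> str:
    """Return the code lowercased and stripped, or raise ValueError.

    Hypothetical helper: validates the two-letter shape only.
    """
    cleaned = code.strip().lower()
    if not _ISO_639_1_SHAPE.fullmatch(cleaned):
        raise ValueError(f"expected a two-letter ISO-639-1 code, got {code!r}")
    return cleaned
```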

prompt · string · Optional

An optional text to guide the model's style or continue a previous audio segment. The prompt should match the audio language.

response_format · string · enum · Optional

The format of the transcript output, in one of these options: json, text, srt, verbose_json, or vtt.

Default: json
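Because the response body depends on `response_format`, client code usually has to branch on it. The sketch below assumes that `json` and `verbose_json` bodies are JSON objects with a top-level `text` field (this page only shows that shape for the default `json` format) and that `text`, `srt`, and `vtt` bodies are plain text to be returned unchanged. The function name `read_transcription_body` is hypothetical.

```python
import json

# Values listed for the `response_format` parameter.
RESPONSE_FORMATS = {"json", "text", "srt", "verbose_json", "vtt"}


def read_transcription_body(body: str, response_format: str = "json") -> str:
    """Return the transcript text or the raw subtitle/plain-text body.

    Assumption: JSON formats carry the transcript under a top-level "text" key;
    the other formats are treated as plain text and passed through as-is.
    """
    if response_format not in RESPONSE_FORMATS:
        raise ValueError(f"unsupported response_format: {response_format!r}")
    if response_format in {"json", "verbose_json"}:
        return json.loads(body)["text"]
    return body
```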

temperature · number · Optional

The sampling temperature, between 0 and 1. Higher values like 0.8 will make the output more random, while lower values like 0.2 will make it more focused and deterministic. If set to 0, the model will use log probability to automatically increase the temperature until certain thresholds are hit.

Default: 0
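A client may want to reject out-of-range values before sending the request. The following is a minimal sketch based on the documented range of 0 to 1 and the default of 0; the helper name `validate_temperature` is hypothetical, and the API may apply its own validation.

```python
def validate_temperature(value: float = 0.0) -> float:
    """Return the value as a float if it lies in the documented range [0, 1].

    Hypothetical client-side check; the default mirrors the documented default of 0.
    """
    if not 0.0 <= value <= 1.0:
        raise ValueError(f"temperature must be between 0 and 1, got {value!r}")
    return float(value)
```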

Responses

200

application/json

Response · one of

POST /v1/audio/transcriptions

curl https://api.portkey.ai/v1/audio/transcriptions \
  -H "x-portkey-api-key: $PORTKEY_API_KEY" \
  -H "x-portkey-virtual-key: $PORTKEY_PROVIDER_VIRTUAL_KEY" \
  -H "Content-Type: multipart/form-data" \
  -F file="@/path/to/file/audio.mp3" \
  -F model="whisper-1"
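For readers who prefer Python, the same request can be sketched with the standard library alone. This is an illustrative equivalent of the curl command, not an official client: the function name `build_transcription_request` and its parameters are hypothetical, the multipart body is assembled by hand, and the file part is labeled `application/octet-stream` as an assumption. The function only builds the request; nothing is sent until it is passed to `urllib.request.urlopen`.

```python
import urllib.request
import uuid

API_URL = "https://api.portkey.ai/v1/audio/transcriptions"


def build_transcription_request(
    filename: str,
    audio: bytes,
    api_key: str,
    virtual_key: str,
    model: str = "whisper-1",
) -> urllib.request.Request:
    """Build (but do not send) a multipart/form-data POST mirroring the curl call.

    Hypothetical helper; `filename` must not contain double quotes or newlines.
    """
    boundary = uuid.uuid4().hex  # random separator between form parts
    model_part = (
        f"--{boundary}\r\n"
        'Content-Disposition: form-data; name="model"\r\n\r\n'
        f"{model}\r\n"
    ).encode()
    file_part = (
        f"--{boundary}\r\n"
        f'Content-Disposition: form-data; name="file"; filename="{filename}"\r\n'
        "Content-Type: application/octet-stream\r\n\r\n"
    ).encode() + audio + b"\r\n"
    closing = f"--{boundary}--\r\n".encode()

    headers = {
        "x-portkey-api-key": api_key,
        "x-portkey-virtual-key": virtual_key,
        # The boundary must be declared here so the server can split the parts.
        "Content-Type": f"multipart/form-data; boundary={boundary}",
    }
    return urllib.request.Request(
        API_URL, data=model_part + file_part + closing, headers=headers, method="POST"
    )
```

To send the request, read the audio bytes from disk, build the request with keys taken from the environment, and pass it to `urllib.request.urlopen`; the response body can then be decoded and parsed according to the chosen `response_format`.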

200

{
  "text": "text"
}
