Create Transcription

Supported Providers
  • Azure OpenAI

  • Fireworks AI

  • Groq

  • OpenAI

Create Transcription

post
Authorizations
Body
filestring · binaryRequired

The audio file object (not file name) to transcribe, in one of these formats: flac, mp3, mp4, mpeg, mpga, m4a, ogg, wav, or webm.

modelany ofRequired

ID of the model to use. The options are gpt-4o-transcribe, gpt-4o-mini-transcribe, and whisper-1.

Example: whisper-1
stringOptional
or
string · enumOptionalPossible values:
languagestringOptional

The language of the input audio. Supplying the input language in ISO-639-1 format will improve accuracy and latency.

promptstringOptional

An optional text to guide the model's style or continue a previous audio segment. The prompt should match the audio language.

response_formatstring · enumOptional

The format of the transcript output, in one of these options: json, text, srt, verbose_json, or vtt.

Default: jsonPossible values:
temperaturenumberOptional

The sampling temperature, between 0 and 1. Higher values like 0.8 will make the output more random, while lower values like 0.2 will make it more focused and deterministic. If set to 0, the model will use log probability to automatically increase the temperature until certain thresholds are hit.

Default: 0
Responses
200

OK

application/json
Responseone of
or
post
curl https://api.portkey.ai/v1/audio/transcriptions \
  -H "x-portkey-api-key: $PORTKEY_API_KEY" \
  -H "x-portkey-virtual-key: $PORTKEY_PROVIDER_VIRTUAL_KEY" \
  -H "Content-Type: multipart/form-data" \
  -F file="@/path/to/file/audio.mp3" \
  -F model="whisper-1"
200

OK

{
  "text": "text"
}

Last updated

Was this helpful?