Serverless AI Audio Transcription
Convert speech to text with our advanced audio transcription service. Perfect for transcribing interviews, meetings, lectures, or any audio content. Supports multiple audio formats including MP3, WAV, OGG, and MPEG.
Endpoint
Convert audio recordings to accurate text transcriptions with our AI-powered transcription API.
POSThttps://api.xtartapp.com/audio/transcription
*Authentication
You must use a Bearer token provided by the dashboard to use for authentication. Add the token to the Authorization header.
Request body
Name | Type | Required | Max | Description |
---|---|---|---|---|
audio | string | Yes | - | An optimized audio file to transcribe. Check Audio Upload docs to upload an audio and before using this API. |
Response
Name | Type | Description |
---|---|---|
data | object | Object containing the transcription results |
data.transcription | string | The full text transcription of the audio file |
data.confidence | number | Confidence score between 0 and 1 indicating the overall transcription accuracy |
data.language | string | The detected language of the audio content |
metadata | object | Metadata about the transcription process |
metadata.cost | number | The cost of the transcription request |
Errors
{
"error": "string",
"code": "string"
}
HTTP Status | Error Code | Message |
---|---|---|
400 | INVALID_PAYLOAD | Occurs when the payload is invalid. |
400 | INVALID_AUDIO | Occurs when the audio file format is not supported. |
400 | AUDIO_TRANSCRIPTION_ERROR | Service is busy, experiencing high demand, or encountered an error processing the audio. |
400 | INSUFFICIENT_BALANCE | The user has insufficient balance. |
401 | UNAUTHORIZED | The authentication token is invalid or expired. |
429 | TOO_MANY_REQUESTS | The request limit has been reached. |