Serverless AI Audio Transcription

Convert speech to text with our advanced audio transcription service. Perfect for transcribing interviews, meetings, lectures, or any audio content. Supports multiple audio formats including MP3, WAV, OGG, and MPEG.

Endpoint

Convert audio recordings to accurate text transcriptions with our AI-powered transcription API.

POSThttps://api.xtartapp.com/audio/transcription

*Authentication

You must use a Bearer token provided by the dashboard to use for authentication. Add the token to the Authorization header.

Request body

Name	Type	Required	Max	Description
audio	string	Yes	-	An optimized audio file to transcribe. Check Audio Upload docs to upload an audio and before using this API.

Response

Name	Type	Description
data	object	Object containing the transcription results
data.transcription	string	The full text transcription of the audio file
data.confidence	number	Confidence score between 0 and 1 indicating the overall transcription accuracy
data.language	string	The detected language of the audio content
metadata	object	Metadata about the transcription process
metadata.cost	number	The cost of the transcription request

Errors

{
   "error": "string", 
   "code": "string"
}

HTTP Status	Error Code	Message
400	INVALID_PAYLOAD	Occurs when the payload is invalid.
400	INVALID_AUDIO	Occurs when the audio file format is not supported.
400	AUDIO_TRANSCRIPTION_ERROR	Service is busy, experiencing high demand, or encountered an error processing the audio.
400	INSUFFICIENT_BALANCE	The user has insufficient balance.
401	UNAUTHORIZED	The authentication token is invalid or expired.
429	TOO_MANY_REQUESTS	The request limit has been reached.