audio-transcribe
Active
Tool of The Stall
Transcribe audio from any publicly accessible URL using OpenAI Whisper. Supports mp3, mp4, m4a, wav, webm, ogg, flac, and wma up to 24 MB. Returns the full transcript text, detected language, and estimated duration in seconds. Optionally accepts an ISO 639-1 language hint to improve accuracy. Useful for processing voice memos, meeting recordings, podcast snippets, interview clips, and audio attached to social media. Undercuts orbisapi.com audio-transcription-api by 24%.
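The constraints described above (supported formats, the 24 MB limit, the optional two-letter language hint) can be checked on the client before the tool is called. The following is only a minimal sketch: the helper name build_arguments, the size_bytes parameter, the reading of "24 MB" as 24 * 1024 * 1024 bytes, and the file-extension check are illustrative assumptions, not part of the tool itself. The tool may well accept URLs that carry no file extension, so the extension check here is deliberately conservative.

```python
import re
from urllib.parse import urlparse

# Formats listed in the tool description.
SUPPORTED_EXTENSIONS = {"mp3", "mp4", "m4a", "wav", "webm", "ogg", "flac", "wma"}

# Assumption: "24 MB" is read as 24 MiB; the listing does not say which.
MAX_BYTES = 24 * 1024 * 1024


def build_arguments(url, language=None, size_bytes=None):
    """Return an arguments dict for audio-transcribe, or raise ValueError.

    Hypothetical helper: it only mirrors the documented constraints and
    does not contact the server.
    """
    parsed = urlparse(url)
    if parsed.scheme not in ("http", "https") or not parsed.netloc:
        raise ValueError("url must be an absolute http(s) URL")

    # Conservative check on the path's extension (an assumption; see lead-in).
    path = parsed.path
    ext = path.rsplit(".", 1)[-1].lower() if "." in path else ""
    if ext not in SUPPORTED_EXTENSIONS:
        raise ValueError(f"unsupported or missing file extension: {ext!r}")

    # The size is optional because the caller may not know it in advance.
    if size_bytes is not None and size_bytes > MAX_BYTES:
        raise ValueError("file exceeds the 24 MB limit")

    args = {"url": url}
    if language is not None:
        # ISO 639-1 codes are two letters; this checks shape only,
        # not whether the code is actually assigned.
        if not re.fullmatch(r"[a-z]{2}", language):
            raise ValueError("language must be a two-letter ISO 639-1 code")
        args["language"] = language
    return args


if __name__ == "__main__":
    print(build_arguments("https://example.com/memo.mp3", language="en"))
```

Omitting language leaves the key out of the dict entirely, which matches the "omit to auto-detect" behaviour in the description and avoids sending a null value.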
Parameters schema
{
  "type": "object",
  "$schema": "http://json-schema.org/draft-07/schema#",
  "properties": {
    "url": {
      "type": "string",
      "description": "Public URL of the audio file to transcribe (mp3, mp4, m4a, wav, webm, ogg, flac, wma). Must be directly accessible without authentication. Max 24 MB."
    },
    "language": {
      "type": "string",
      "description": "Optional ISO 639-1 language code hint (e.g. 'en', 'es', 'fr', 'de', 'ja'). Improves accuracy when the audio language is known. Omit to auto-detect."
    }
  },
  "additionalProperties": false
}
Parent server
The Stall
1/7 registries