cf-ai-whisper
Free speech-to-text and SRT subtitle generation using OpenAI Whisper on Cloudflare Workers AI.
skill install https://www.promptspace.in/skills/cf-ai-whisperWhat it does
The cf-ai-whisper skill provides professional-grade audio transcription by leveraging OpenAI Whisper running on Cloudflare Workers AI. It allows developers and users to convert speech to text with high accuracy without requiring an OpenAI API key or account. The skill handles diverse audio formats and can automatically generate SRT subtitle files with word-level timestamps.
Why use this skill
Most transcription services are expensive or require complex local setups. This skill offers a cost-effective alternative by utilizing Cloudflare’s free tier infrastructure. Unlike basic LLM prompting, it manages the binary data handling, audio MIME-type mapping, and complex timestamp calculations required to create perfectly synced subtitles. It eliminates the frustration of manual transcription while keeping your data within the Cloudflare ecosystem you already use.
Supported tools
- Frameworks: Cloudflare Workers AI, OpenAI Whisper
- Audio Formats: WAV, MP3, MP4, M4A, OGG, FLAC, WebM, and more
- Outputs: Plain text transcripts (.txt) and SubRip Subtitle files (.srt)
The Output
The skill produces two primary artifacts: a clean, high-accuracy text transcript and a professionally formatted SRT file. The output includes metadata such as word count and relies on word-level timestamps to ensure subtitle synchronization is precise.
Use cases
- Generate SRT subtitles for videos with precise word-level timestamps
- Convert voice memos and podcast episodes into searchable text transcripts
- Transcribe recorded meetings without paying for expensive third-party seats
- Batch process audio folders into text using Cloudflare's free AI tier
Example
Prompt
Sample output preview is available after purchase.