Skip to main content
PROMPT SPACE
C
$12.00Universal

cf-ai-whisper

Free speech-to-text and SRT subtitle generation using OpenAI Whisper on Cloudflare Workers AI.

skill install https://www.promptspace.in/skills/cf-ai-whisper

What it does

The cf-ai-whisper skill provides professional-grade audio transcription by leveraging OpenAI Whisper running on Cloudflare Workers AI. It allows developers and users to convert speech to text with high accuracy without requiring an OpenAI API key or account. The skill handles diverse audio formats and can automatically generate SRT subtitle files with word-level timestamps.

Why use this skill

Most transcription services are expensive or require complex local setups. This skill offers a cost-effective alternative by utilizing Cloudflare’s free tier infrastructure. Unlike basic LLM prompting, it manages the binary data handling, audio MIME-type mapping, and complex timestamp calculations required to create perfectly synced subtitles. It eliminates the frustration of manual transcription while keeping your data within the Cloudflare ecosystem you already use.

Supported tools

  • Frameworks: Cloudflare Workers AI, OpenAI Whisper
  • Audio Formats: WAV, MP3, MP4, M4A, OGG, FLAC, WebM, and more
  • Outputs: Plain text transcripts (.txt) and SubRip Subtitle files (.srt)

The Output

The skill produces two primary artifacts: a clean, high-accuracy text transcript and a professionally formatted SRT file. The output includes metadata such as word count and relies on word-level timestamps to ensure subtitle synchronization is precise.

Use cases

  • Generate SRT subtitles for videos with precise word-level timestamps
  • Convert voice memos and podcast episodes into searchable text transcripts
  • Transcribe recorded meetings without paying for expensive third-party seats
  • Batch process audio folders into text using Cloudflare's free AI tier

Example

Prompt

Transcribe my meeting.mp3 and generate an SRT file for it.

Sample output preview is available after purchase.

Frequently asked questions

This skill provides accurate speech-to-text transcription and automated SRT subtitle generation by leveraging OpenAI Whisper on the Cloudflare Workers AI infrastructure. It solves the problem of high costs and complex setups associated with professional-grade audio processing.