by PromptSpace
SoundTag AI: Automatically describe and batch-rename audio files based on their actual sound using local ML or Gemini AI.
$10
One-time purchase
⚡ Skill ready to install in Claude Code, Gemini CLI, or any MCP-compatible client.
This skill solves the problem of messy, auto-generated audio filenames like audio_track_v2_final_99.wav. It analyzes the actual content of each sound file and renames it with a human-readable, descriptive title. For example, ElevenLabs_2024-07-21T15_43_56_George_pre_s50_sb75_se0_b_m2.mp3 becomes Elevenlabs_George_Voice_Speech.mp3, and other files end up with names such as Bright_Trumpet_Fanfare.wav or Large_Crowd_Cheering.mp3.
Unlike simple prompting, this skill implements a two-step workflow. It first attempts a fast local classification, which saves on API costs and protects privacy. For ambiguous sounds, it provides a structured "improvement pass" using Gemini. It combines the ML labels with hints found in the original filename so that context is not lost. It also handles environment constraints automatically, including the use of specific dependency versions (Torch/Transformers) that fit within sandboxed resource limits.
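The two-step workflow described above can be sketched roughly as follows. This is an illustration under stated assumptions, not the skill's actual code: the names filename_hints and describe, the 0.6 confidence cutoff, and the rule for what counts as a filename hint are all hypothetical, and the local classifier and the Gemini pass are passed in as plain callables so that no real API is implied.

```python
import re
from typing import Callable, List, Optional, Tuple

# Assumed cutoff; the listing does not state the skill's real threshold.
CONFIDENCE_THRESHOLD = 0.6


def filename_hints(filename: str) -> List[str]:
    """Keep alphabetic words of 4+ letters; drop dates, counters, short codes."""
    stem = filename.rsplit(".", 1)[0]
    tokens = re.split(r"[^A-Za-z]+", stem)
    return [t for t in tokens if len(t) >= 4]


def describe(
    filename: str,
    classify_local: Callable[[str], Tuple[str, float]],
    improve_remote: Optional[Callable[[str, List[str]], str]] = None,
) -> str:
    """Return a description, using the remote pass only for ambiguous sounds."""
    label, score = classify_local(filename)  # step 1: local model
    hints = filename_hints(filename)
    if score >= CONFIDENCE_THRESHOLD or improve_remote is None:
        # Confident enough (or no remote pass configured): stay local.
        return " ".join(hints + [label])
    # Step 2: hand the weak label plus filename hints to the improvement pass.
    return improve_remote(label, hints)
```

With a stub classifier that returns ("Voice Speech", 0.9) for the ElevenLabs file shown earlier, describe yields "ElevenLabs George Voice Speech" without calling the remote pass at all.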
The result is a clean, organized directory where every sound file follows a consistent Title_Case_With_Underscores naming convention, making your sample libraries and field recordings instantly searchable.
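As a rough illustration of the Title_Case_With_Underscores convention, the helper below turns a free-text description into a filename. The function name to_title_filename and its exact rules (keep letters and digits, capitalize each word, lowercase the extension) are assumptions made for this sketch; the skill's own formatting rules may differ.

```python
import re


def to_title_filename(description: str, extension: str) -> str:
    """Build a Title_Case_With_Underscores filename from a description."""
    # Keep runs of letters and digits; punctuation and spaces act as separators.
    words = re.findall(r"[A-Za-z0-9]+", description)
    if not words:
        raise ValueError("description contains no usable words")
    # str.capitalize() upper-cases the first letter and lower-cases the rest.
    stem = "_".join(word.capitalize() for word in words)
    return f"{stem}.{extension.lstrip('.').lower()}"
```

For example, to_title_filename("bright trumpet fanfare", "wav") gives Bright_Trumpet_Fanfare.wav, and "ElevenLabs" becomes "Elevenlabs", which matches the capitalization in the renamed example above.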
Requires Python 3.8+. The model download is ~350MB on first run. Works best on clearly identifiable sounds; abstract or cinematic SFX may need the optional Gemini enhancement step. Only the first 10 seconds of each file are processed.
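To make the 10-second limit concrete, here is a minimal sketch of reading only the leading portion of a file. It assumes an uncompressed WAV input and uses Python's standard wave module; how the skill itself decodes and truncates audio (including MP3) is not stated in this listing, and the name read_leading_frames is hypothetical.

```python
import wave

MAX_SECONDS = 10  # the listing says only the first 10 seconds are processed


def read_leading_frames(path: str, max_seconds: int = MAX_SECONDS) -> bytes:
    """Return raw PCM bytes for at most the first max_seconds of a WAV file."""
    with wave.open(path, "rb") as wav:
        # Frames per second times seconds, capped at the file's actual length.
        frames_wanted = min(wav.getnframes(), wav.getframerate() * max_seconds)
        return wav.readframes(frames_wanted)
```

A file shorter than 10 seconds is returned in full, so short one-shot samples are unaffected by the cap.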
mkdir -p ~/.claude/skills/describe-rename-sound-files && curl -s -X POST 'https://api.promptspace.in/api/skills/describe-rename-sound-files/install' | python3 -c "import sys,json; sys.stdout.write(json.load(sys.stdin).get('installInstructions') or '')" > ~/.claude/skills/describe-rename-sound-files/SKILL.md
Free skills install directly. Paid skills require purchase - use the download button above after buying.
Security Scanned
Passed automated security review
No special permissions declared or detected
OpenClaw, Cursor, Claude Code, Codex CLI
PromptSpace
We build AI agent skill packages for content creators. Specializing in Chinese social media automation.