Is my audio data processed privately?

The skill primarily uses a local Audio Spectrogram Transformer (AST) model that runs entirely within your sandbox environment. This ensures your audio files never leave your system, though an optional Gemini API mode is available for more complex descriptions if you provide a key.

How does the local model fit within sandbox resource limits?

The local ML model is set up for the sandbox with specific versions of PyTorch and Transformers so that it fits within a 2GB disk limit. It uses the MIT AST model, fine-tuned on 527 AudioSet categories, to classify sounds accurately without requiring high-end hardware.
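As a rough illustration of local classification, the sketch below uses the Hugging Face `pipeline` API. The checkpoint name `MIT/ast-finetuned-audioset-10-10-0.4593` is an assumption (a public AST model trained on AudioSet); this listing does not state which checkpoint or library versions the skill actually pins. The helper names `top_labels` and `classify` and the `min_score` cutoff are hypothetical.

```python
from typing import Dict, List

# Assumed checkpoint: a public AST model fine-tuned on AudioSet.
# The skill's real model ID and pinned versions are not given in the listing.
MODEL_ID = "MIT/ast-finetuned-audioset-10-10-0.4593"


def top_labels(predictions: List[Dict], k: int = 3, min_score: float = 0.1) -> List[str]:
    """Keep the k highest-scoring labels that clear a minimum score."""
    ranked = sorted(predictions, key=lambda p: p["score"], reverse=True)
    return [p["label"] for p in ranked[:k] if p["score"] >= min_score]


def classify(path: str, k: int = 3) -> List[str]:
    """Classify one audio file locally (needs torch, transformers, and ffmpeg)."""
    from transformers import pipeline  # imported lazily because it is a heavy dependency

    # Building the pipeline on every call is simple but slow; a real batch
    # job would create it once and reuse it.
    classifier = pipeline("audio-classification", model=MODEL_ID)
    return top_labels(classifier(path, top_k=k), k=k)
```

Calling `classify("clip.wav")` would download the model on first use and then run offline; `top_labels` can be exercised on its own with any list of `{"label", "score"}` dictionaries.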

Can it handle different audio formats and batch renaming?

Yes. It supports batch processing for .wav, .mp3, .ogg, .flac, .aac, and .m4a formats. It combines ML classification with hints from the existing filename to create descriptive, human-readable titles.

What happens if the local model can't identify a sound?

If the local model (Method B) produces ambiguous labels, you can use the Gemini API (Method A) or a browser-based fallback (Method C). Gemini provides more nuanced, cinematic descriptions for complex soundscapes or music moods.

describe-rename-sound-files

by PromptSpace

SoundTag AI: Automatically describe and batch-rename audio files based on their actual sound using local ML or Gemini AI.

Identify specific instruments to rename generic music project tracks
Convert cryptic field recording names into descriptive environmental labels
Organize voiceover exports by speaker name and performance style

Security scanned
Instant install

$10

One-time purchase

Included in download

Downloadable skill package
Works with OpenClaw, Cursor
Instant install

PromptSpace

Trust & Verification

Last updated: Recently
Tested on: OpenClaw, Cursor, Claude Code
Security: Scanned, no malicious code detected
Support: Community support via contact page
License: Commercial use, single seat

70 views

⚡ Skill ready to install in Claude Code, Gemini CLI, or any MCP-compatible client.

About This Skill

SoundTag AI: Listens & Renames Your Sound Files

What it does

This skill solves the problem of messy, auto-generated audio filenames like audio_track_v2_final_99.wav. It analyzes the actual content of each sound file and renames it with a human-readable, descriptive title. For example, ElevenLabs_2024-07-21T15_43_56_George_pre_s50_sb75_se0_b_m2.mp3 becomes Elevenlabs_George_Voice_Speech.mp3. Other typical results are Bright_Trumpet_Fanfare.wav and Large_Crowd_Cheering.mp3.

Supported tools

Local ML (AST): Uses the MIT Audio Spectrogram Transformer to classify sounds into 527 categories (Speech, Music, Explosion, etc.) entirely offline.
Google Gemini API: Leverages advanced multimodal AI for nuanced descriptions of cinematic SFX, moods, and complex textures.
Batch Processing: Supports .wav, .mp3, .ogg, .flac, .aac, .m4a, and more.
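A batch run has to start by picking out the files it can handle. The sketch below is one minimal way to do that with the standard library, assuming only the six extensions named in this listing (the "and more" formats are not enumerated, so they are left out). The names `AUDIO_EXTENSIONS`, `is_audio_file`, and `find_audio_files` are hypothetical.

```python
from pathlib import Path
from typing import List

# Extensions named in the listing; "and more" is not enumerated there.
AUDIO_EXTENSIONS = {".wav", ".mp3", ".ogg", ".flac", ".aac", ".m4a"}


def is_audio_file(name: str) -> bool:
    """True when the filename ends in a supported extension (case-insensitive)."""
    return Path(name).suffix.lower() in AUDIO_EXTENSIONS


def find_audio_files(folder: str) -> List[Path]:
    """Return the supported audio files directly inside a folder, sorted by name."""
    return sorted(
        p for p in Path(folder).iterdir()
        if p.is_file() and is_audio_file(p.name)
    )
```

Matching on the lowercased suffix means files such as `TAKE_01.WAV` are picked up as well.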

Why use this skill

Unlike simple prompting, this skill implements a two-step workflow. It first attempts a fast local classification, which avoids API costs and keeps your audio on your machine. For ambiguous sounds, it provides a structured "improvement pass" using Gemini. It combines ML labels with hints from the original filename so that context is not lost. It also handles environment constraints automatically, including specific dependency versions (Torch/Transformers) to fit within sandboxed resource limits.
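The workflow above can be sketched in a few lines. This is an illustration under stated assumptions, not the skill's actual logic: the `CONFIDENCE_THRESHOLD` value, the rule for what counts as "ambiguous", and the hint-extraction heuristic are all hypothetical.

```python
import re
from typing import List, Tuple

CONFIDENCE_THRESHOLD = 0.5  # hypothetical cutoff, not taken from the skill


def needs_improvement_pass(predictions: List[Tuple[str, float]]) -> bool:
    """Flag a file for the optional Gemini pass when the best local score is weak."""
    if not predictions:
        return True
    best_score = max(score for _, score in predictions)
    return best_score < CONFIDENCE_THRESHOLD


def filename_hints(stem: str) -> List[str]:
    """Keep alphabetic tokens of 4+ letters; drop timestamps and parameter codes."""
    tokens = re.split(r"[_\-\s.]+", stem)
    return [t for t in tokens if t.isalpha() and len(t) >= 4]
```

With the ElevenLabs filename from this page, the heuristic keeps `ElevenLabs` and `George` and discards the timestamp and parameter codes. A real implementation would also need a stop-word list, since generic tokens such as "audio" or "final" would otherwise survive.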

Output

The result is a clean, organized directory where every sound file follows a consistent Title_Case_With_Underscores naming convention, making your sample libraries and field recordings instantly searchable.
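One possible way to produce the Title_Case_With_Underscores convention is shown below. The function name `build_filename` is hypothetical, and the skill may normalize names differently; the sketch does reproduce the `Elevenlabs_George_Voice_Speech.mp3` example from this page.

```python
import re
from typing import Iterable


def build_filename(parts: Iterable[str], extension: str) -> str:
    """Join descriptive words into Title_Case_With_Underscores plus the extension."""
    words = []
    for part in parts:
        # Split on anything that is not a letter or digit.
        words.extend(w for w in re.split(r"[^A-Za-z0-9]+", part) if w)
    return "_".join(w.capitalize() for w in words) + extension.lower()
```

For instance, `build_filename(["large crowd", "cheering"], ".MP3")` yields `Large_Crowd_Cheering.mp3`. A batch renamer would additionally have to handle name collisions, which this sketch leaves out.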

Use Cases

Identify specific instruments to rename generic music project tracks
Convert cryptic field recording names into descriptive environmental labels
Organize voiceover exports by speaker name and performance style
Batch-process sound effect libraries using AI-generated content tags

Known Limitations

Requires Python 3.8+. The model download is ~350MB on first run. Works best on clearly identifiable sounds; abstract or cinematic SFX may need the optional Gemini enhancement step. Only the first 10 seconds of each file are processed.
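To make the 10-second limit concrete, the sketch below reads only the leading segment of a WAV file using the standard-library `wave` module. It is an assumption about how such a limit could be applied, not the skill's code; compressed formats such as .mp3 would need a separate decoder. The names `frames_to_read` and `read_head` are hypothetical.

```python
import wave

MAX_SECONDS = 10  # the listing states that only the first 10 seconds are processed


def frames_to_read(sample_rate: int, total_frames: int, seconds: int = MAX_SECONDS) -> int:
    """Number of frames covering at most the first `seconds` of audio."""
    return min(total_frames, sample_rate * seconds)


def read_head(path: str) -> bytes:
    """Read the leading segment of a WAV file as raw PCM bytes."""
    with wave.open(path, "rb") as wav:
        count = frames_to_read(wav.getframerate(), wav.getnframes())
        return wav.readframes(count)
```

One practical consequence: a sound that only appears after the 10-second mark will not influence the label.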

How to Install

mkdir -p ~/.claude/skills/describe-rename-sound-files && curl -s -X POST 'https://api.promptspace.in/api/skills/describe-rename-sound-files/install' | python3 -c "import sys,json; sys.stdout.write(json.load(sys.stdin).get('installInstructions') or '')" > ~/.claude/skills/describe-rename-sound-files/SKILL.md

Free skills install directly. Paid skills require purchase; use the download button above after buying.

Security Scanned

Passed automated security review

Permissions

No special permissions declared or detected

Creator

PromptSpace

We build AI agent skill packages for content creators, specializing in Chinese social media automation.
