by PromptSpace
High-precision OCR for images, tables, and handwriting using NVIDIA NeMo Retriever.
$12
One-time purchase
Skill ready to install in Claude Code, Gemini CLI, or any MCP-compatible client.
This skill provides high-performance Optical Character Recognition (OCR) by leveraging the NVIDIA NeMo Retriever API. It allows your AI agent to "see" and extract text from images and documents with professional-grade accuracy. It handles complex structures like tables, charts, receipts, and even handwriting, returning structured text along with confidence scores and bounding box data.
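As an illustration of how confidence scores and bounding boxes might be used downstream, here is a minimal sketch. The `Detection` shape (a text string, a confidence between 0 and 1, and a pixel bounding box) is an assumption made for this example and is not the skill's documented output format; adapt the field names to whatever the skill actually returns.

```python
from dataclasses import dataclass


@dataclass
class Detection:
    """Hypothetical shape of one OCR result; not the skill's documented schema."""
    text: str
    confidence: float  # assumed to lie in the range 0.0 to 1.0
    bbox: tuple        # assumed (x_min, y_min, x_max, y_max) in pixels


def split_by_confidence(detections, threshold=0.8):
    """Separate detections into accepted ones and ones flagged for review."""
    accepted = [d for d in detections if d.confidence >= threshold]
    flagged = [d for d in detections if d.confidence < threshold]
    return accepted, flagged


def reading_order(detections):
    """Sort detections top-to-bottom, then left-to-right, using the bbox origin."""
    return sorted(detections, key=lambda d: (d.bbox[1], d.bbox[0]))


if __name__ == "__main__":
    sample = [
        Detection("Total", 0.98, (10, 50, 60, 70)),
        Detection("$12.00", 0.55, (70, 50, 120, 70)),
        Detection("Receipt", 0.91, (10, 10, 90, 30)),
    ]
    accepted, flagged = split_by_confidence(sample)
    print("accepted:", [d.text for d in reading_order(accepted)])
    print("needs review:", [d.text for d in flagged])
```

Flagging low-confidence items instead of discarding them keeps a human-review path open for values such as amounts on receipts, where a silent drop would be worse than a visible uncertainty.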
Standard LLM vision capabilities can sometimes hallucinate text or struggle with small, dense data like tables or low-quality screenshots. This skill uses a specialized OCR model optimized for precision. It supports batch processing of entire directories, provides confidence metrics to ensure data reliability, and automatically saves output to structured files for further analysis. It is significantly faster and more accurate for data extraction tasks than generic vision prompting.
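The batch workflow described above (walk a directory, run OCR on each image, save structured output) can be sketched roughly as follows. The `run_ocr` callable is a hypothetical stand-in for whatever call the skill exposes, and the one-JSON-file-per-image layout and the suffix list are assumptions for this example, not the skill's actual behavior.

```python
import json
from pathlib import Path

# Assumed set of image extensions; the formats the skill supports may differ.
IMAGE_SUFFIXES = {".png", ".jpg", ".jpeg", ".tif", ".tiff", ".bmp"}


def batch_ocr(input_dir, output_dir, run_ocr):
    """Run `run_ocr` on every image in input_dir and write one JSON file per image.

    `run_ocr` is a hypothetical callable taking a Path and returning a
    JSON-serializable result. Returns the list of files written.
    """
    in_path = Path(input_dir)
    out_path = Path(output_dir)
    out_path.mkdir(parents=True, exist_ok=True)

    written = []
    for image in sorted(in_path.iterdir()):
        # Skip subdirectories and files that are not recognized images.
        if not image.is_file() or image.suffix.lower() not in IMAGE_SUFFIXES:
            continue
        result = run_ocr(image)
        target = out_path / f"{image.stem}.json"
        target.write_text(json.dumps(result, indent=2), encoding="utf-8")
        written.append(target)
    return written
```

Passing the OCR function in as a parameter keeps the directory handling testable with a stub, independent of network access or API credentials.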
mkdir -p ~/.claude/skills/nvidia-ocr && curl -s -X POST 'https://api.promptspace.in/api/skills/nvidia-ocr/install' | python3 -c "import sys,json; sys.stdout.write(json.load(sys.stdin).get('installInstructions') or '')" > ~/.claude/skills/nvidia-ocr/SKILL.md
Free skills install directly. Paid skills require purchase: use the download button above after buying.
Security Scanned
Passed automated security review
No special permissions declared or detected
OpenClaw, Cursor, Claude Code, Codex CLI
PromptSpace
We build AI agent skill packages for content creators. Specializing in Chinese social media automation.