nvidia-ocr
by PromptSpace
High-precision OCR for images, tables, and handwriting using NVIDIA NeMo Retriever.
- Extract tabular data from screenshots or PDFs into structured text.
- Digitize handwritten notes and save them as searchable markdown.
- Batch process a folder of images to extract and aggregate text data.
$12
One-time purchase
Included in download
- Downloadable skill package
- Works with OpenClaw, Cursor
- Instant install
nvidia-ocr
by PromptSpace
High-precision OCR for images, tables, and handwriting using NVIDIA NeMo Retriever.
$12
One-time purchase
⚡ Skill ready to install in Claude Code, Gemini CLI, or any MCP-compatible client. Read the install guides →
Included in download
- Downloadable skill package
- Works with OpenClaw, Cursor
- Instant install
About This Skill
What it does
This skill provides high-performance Optical Character Recognition (OCR) by leveraging the NVIDIA NeMo Retriever API. It allows your AI agent to "see" and extract text from images and documents with professional-grade accuracy. It handles complex structures like tables, charts, receipts, and even handwriting, returning structured text along with confidence scores and bounding box data.
Why use this skill
Standard LLM vision capabilities can sometimes hallucinate text or struggle with small, dense data like tables or low-quality screenshots. This skill uses a specialized OCR model optimized for precision. It supports batch processing of entire directories, provides confidence metrics to ensure data reliability, and automatically saves output to structured files for further analysis. It is significantly faster and more accurate for data extraction tasks than generic vision prompting.
Supported tools
- NVIDIA NeMo Retriever: State-of-the-art OCR foundation model.
- Python Integration: Built-in handling for Base64 encoding and batch file processing.
- Exporting: Saves results locally in .txt or .md formats for easy developer access.
Use Cases
- Extract tabular data from screenshots or PDFs into structured text.
- Digitize handwritten notes and save them as searchable markdown.
- Batch process a folder of images to extract and aggregate text data.
- Verify automated test results by extracting text from UI screenshots.
How to Install
mkdir -p ~/.claude/skills/nvidia-ocr && curl -s -X POST 'https://api.promptspace.in/api/skills/nvidia-ocr/install' | python3 -c "import sys,json; sys.stdout.write(json.load(sys.stdin).get('installInstructions') or '')" > ~/.claude/skills/nvidia-ocr/SKILL.mdFree skills install directly. Paid skills require purchase - use the download button above after buying.
Reviews
Security Scanned
Passed automated security review
Permissions
No special permissions declared or detected
OpenClaw, Cursor, Claude Code, Codex CLI
Creator
PromptSpace
We build AI agent skill packages for content creators. Specializing in Chinese social media automation.