rag-architect
by PromptSpace
Design, debug, and optimize production RAG systems with expert architecture, hybrid search, and grounding strategies.
- Construct hybrid search pipelines combining semantic and keyword retrieval
- Debug hallucination risks by implementing strict source grounding protocols
- Optimize indexing strategies for low-latency document retrieval at scale
$5
One-time purchase
Included in download
- Downloadable skill package
- Works with OpenClaw, Cursor
- Instant install
rag-architect
by PromptSpace
Design, debug, and optimize production RAG systems with expert architecture, hybrid search, and grounding strategies.
$5
One-time purchase
⚡ Skill ready to install in Claude Code, Gemini CLI, or any MCP-compatible client. Read the install guides →
Included in download
- Downloadable skill package
- Works with OpenClaw, Cursor
- Instant install
About This Skill
Advanced RAG System Architecture & Debugging
Designing a production-ready Retrieval-Augmented Generation (RAG) system requires more than just a vector database and a prompt. The RAG Architect skill provides a developer-centric framework for building, hardening, and troubleshooting complex retrieval stacks, moving beyond generic implementations to high-performance architecture.
What it does
This skill acts as a senior systems architect for your AI pipeline. It analyzes ingestion workflows, document parsing, chunking strategies, embedding selection, and vector store performance. Whether you are building from scratch or fixing a broken implementation, it applies a rigorous, evidence-based methodology to ensure your agent stays grounded and accurate.
Supported Capabilities
- Architecture Design: Decisions for hybrid search, reranking, and context packing tailored to your specific corpus (Legal, Code, Product Docs, etc.).
- Truth-First Debugging: Systematic isolation of failures across the pipeline—from bad parsing to stale indexes and tenant leakage.
- Infrastructure Selection: Unbiased tradeoff analysis for vector databases (pgvector, Qdrant, Milvus), embedding models, and rerankers.
- Production Hardening: Implementing multi-tenant isolation, citation grounding, and incremental re-indexing strategies.
- Evaluation Frameworks: Establishing metrics for recall@k, precision, and faithfulness to ensure changes are data-driven rather than anecdotal.
Why use this skill?
Standard LLM prompts often treat "bad answers" as model hallucinations. This skill identifies when the problem is actually a metadata filter mismatch, poor chunking semantics, or an inefficient reranker. It helps you reduce latency and cost by optimizing the weakest stage of your pipeline rather than over-relying on expensive long-context windows.
Use Cases
- Construct hybrid search pipelines combining semantic and keyword retrieval
- Debug hallucination risks by implementing strict source grounding protocols
- Optimize indexing strategies for low-latency document retrieval at scale
- Architect multi-stage re-ranking workflows to improve answer precision
Known Limitations
- Cannot perform the actual vector DB migration or infrastructure provisioning.
- Effectiveness is limited without access to specific log samples or retrieval metrics.
- Does not generate frontend UI.
How to Install
mkdir -p ~/.claude/skills/rag-architect && curl -s -X POST 'https://api.promptspace.in/api/skills/rag-architect/install' | python3 -c "import sys,json; sys.stdout.write(json.load(sys.stdin).get('installInstructions') or '')" > ~/.claude/skills/rag-architect/SKILL.mdFree skills install directly. Paid skills require purchase - use the download button above after buying.
Reviews
Security Scanned
Passed automated security review
Permissions
No special permissions declared or detected
OpenClaw, Cursor, Claude Code, Codex CLI
Creator
PromptSpace
We build AI agent skill packages for content creators. Specializing in Chinese social media automation.