
MODULAR RAG MCP SERVER
A modular RAG (Retrieval-Augmented Generation) system with MCP Server architecture. Using Skill to make AI follow each step of the spec and complete the code 100% by AI.
Send to Your Agent
Copy this prompt and paste it into Claude, Cursor, or any MCP client to install instantly.
Install the MODULAR RAG MCP SERVER MCP server for me.
Server: MODULAR RAG MCP SERVER
Description: A modular RAG (Retrieval-Augmented Generation) system with MCP Server architecture. Using Skill to make AI follow each step of the spec and complete the code 100% by AI.
Install command: docker run jerry-ai-dev/MODULAR-RAG-MCP-SERVER
GitHub: https://github.com/jerry-ai-dev/MODULAR-RAG-MCP-SERVER
Please install this MCP server and confirm when ready. Paste into any AI agent with MCP support — works with Claude Desktop, Cursor, Cline, and more.
Quick Install
Config file: ~/Library/Application Support/Claude/claude_desktop_config.json
- Open Claude Desktop app
- Go to Settings → Developer → Edit Config
- Paste the config below into the file
- Restart Claude Desktop
{
"mcpServers": {
"modular-rag-mcp-server": {
"command": "docker",
"args": [
"run",
"jerry-ai-dev/MODULAR-RAG-MCP-SERVER"
]
}
}
} Add this to your ~/Library/Application Support/Claude/claude_desktop_config.json
Installation Methods
docker run jerry-ai-dev/MODULAR-RAG-MCP-SERVERTools (1)
execute Execute MODULAR-RAG-MCP-SERVER operations
Compatibility
About
Modular RAG MCP Server
一个可插拔、可观测的模块化 RAG(检索增强生成)服务框架,通过 MCP(Model Context Protocol)协议对外暴露工具接口,支持 Copilot / Claude 等 AI 助手直接调用。同时也是一份专为大模型相关岗位学习与面试求职设计的实战项目与配套教学资源。
Overview
A modular RAG (Retrieval-Augmented Generation) system with MCP Server architecture. Using Skill to make AI follow each step of the spec and complete the code 100% by AI.
Installation
DOCKER
docker run jerry-ai-dev/MODULAR-RAG-MCP-SERVER
Related Resources
MCP Registry Server
MCP ServerPublish and discover MCP servers via the official MCP Registry. Powered by HAPI MCP server.
MCP Server Builder
SkillBuild high-quality Model Context Protocol (MCP) servers to integrate external APIs and services with AI assistants. Follow best practices for tool design, security, and cross-platform compatibility.
RAG Pipeline Architect
PromptDesign production-ready Retrieval-Augmented Generation pipelines with advanced chunking strategies, embedding optimization, and hybrid search capabilities for enterprise knowledge bases.
RAG
GlossaryRAG stands for Retrieval-Augmented Generation. It is a pattern that gives a language model access to information outside its training data by fetching relevant documents at query time and including them in the prompt. Instead of memorizing facts, the model reasons over retrieved snippets, which makes answers more accurate, current, and traceable. A typical RAG pipeline has four stages. First, documents are split into chunks and converted into embeddings using an embedding model. Second, those embeddings are stored in a vector database. Third, when a user asks a question, the system embeds the query and searches the database for the closest chunks. Finally, the retrieved chunks are added to the prompt as context, and the model generates an answer grounded in that evidence. RAG is especially useful when answers depend on private data, such as internal wikis, support tickets, or product documentation. It also reduces hallucination because the model can cite the retrieved text. Teams often tune RAG by changing chunk size, overlap, reranking algorithms, and query rewriting strategies.
Mcp
MCP ServerCatalog of official Microsoft MCP (Model Context Protocol) server implementations for AI-powered data access and tool integration
Compare LLM Pricing
This server works with Claude, GPT-4o, and other models. Compare API costs side-by-side to find the cheapest option for your workflow.