Skip to main content
MCP Server

A modular RAG (Retrieval-Augmented Generation) system with MCP Server architecture. Using Skill to make AI follow each step of the spec and complete the code 100% by AI.

Share
955 stars Python MIT Updated: 2026-06-04

Send to Your Agent

Copy this prompt and paste it into Claude, Cursor, or any MCP client to install instantly.

Install the MODULAR RAG MCP SERVER MCP server for me.

Server: MODULAR RAG MCP SERVER
Description: A modular RAG (Retrieval-Augmented Generation) system with MCP Server architecture. Using Skill to make AI follow each step of the spec and complete the code 100% by AI.
Install command: docker run jerry-ai-dev/MODULAR-RAG-MCP-SERVER

GitHub: https://github.com/jerry-ai-dev/MODULAR-RAG-MCP-SERVER

Please install this MCP server and confirm when ready.

Paste into any AI agent with MCP support — works with Claude Desktop, Cursor, Cline, and more.

Quick Install

Config file: ~/Library/Application Support/Claude/claude_desktop_config.json

  1. Open Claude Desktop app
  2. Go to Settings → Developer → Edit Config
  3. Paste the config below into the file
  4. Restart Claude Desktop
{
  "mcpServers": {
    "modular-rag-mcp-server": {
      "command": "docker",
      "args": [
        "run",
        "jerry-ai-dev/MODULAR-RAG-MCP-SERVER"
      ]
    }
  }
}

Add this to your ~/Library/Application Support/Claude/claude_desktop_config.json

Installation Methods

docker Run with Docker
docker run jerry-ai-dev/MODULAR-RAG-MCP-SERVER

Tools (1)

execute

Execute MODULAR-RAG-MCP-SERVER operations

Compatibility

Claude Desktop
Cursor
Cline
VS Code
Devin (formerly Windsurf)

About

Modular RAG MCP Server

一个可插拔、可观测的模块化 RAG(检索增强生成)服务框架,通过 MCP(Model Context Protocol)协议对外暴露工具接口,支持 Copilot / Claude 等 AI 助手直接调用。同时也是一份专为大模型相关岗位学习与面试求职设计的实战项目与配套教学资源。


Overview

A modular RAG (Retrieval-Augmented Generation) system with MCP Server architecture. Using Skill to make AI follow each step of the spec and complete the code 100% by AI.

Installation

DOCKER

docker run jerry-ai-dev/MODULAR-RAG-MCP-SERVER

Related Resources

MCP Registry Server

MCP Server

Publish and discover MCP servers via the official MCP Registry. Powered by HAPI MCP server.

MCP Server Builder

Skill

Build high-quality Model Context Protocol (MCP) servers to integrate external APIs and services with AI assistants. Follow best practices for tool design, security, and cross-platform compatibility.

RAG Pipeline Architect

Prompt

Design production-ready Retrieval-Augmented Generation pipelines with advanced chunking strategies, embedding optimization, and hybrid search capabilities for enterprise knowledge bases.

RAG

Glossary

RAG stands for Retrieval-Augmented Generation. It is a pattern that gives a language model access to information outside its training data by fetching relevant documents at query time and including them in the prompt. Instead of memorizing facts, the model reasons over retrieved snippets, which makes answers more accurate, current, and traceable. A typical RAG pipeline has four stages. First, documents are split into chunks and converted into embeddings using an embedding model. Second, those embeddings are stored in a vector database. Third, when a user asks a question, the system embeds the query and searches the database for the closest chunks. Finally, the retrieved chunks are added to the prompt as context, and the model generates an answer grounded in that evidence. RAG is especially useful when answers depend on private data, such as internal wikis, support tickets, or product documentation. It also reduces hallucination because the model can cite the retrieved text. Teams often tune RAG by changing chunk size, overlap, reranking algorithms, and query rewriting strategies.

Mcp

MCP Server

Catalog of official Microsoft MCP (Model Context Protocol) server implementations for AI-powered data access and tool integration

Compare LLM Pricing

This server works with Claude, GPT-4o, and other models. Compare API costs side-by-side to find the cheapest option for your workflow.