AI Memory & Search🔴Developer

RAGFlow

Name: RAGFlow
Brand: RAGFlow
Availability: InStock

Open-source RAG engine with deep document understanding, chunk visualization, citation tracking, hybrid search, and agent workflow capabilities for enterprise knowledge bases.

Starting atFree

Visit RAGFlow →

💡

In Plain English

An open-source system for building AI that answers questions from your documents — with deep understanding of complex document formats.

Overview

RAGFlow is an Apache-2.0 open-source Retrieval-Augmented Generation engine from InfiniFlow, with self-hosting available at no software license cost and hosted cloud pricing spanning Free, Starter, Pro, and Enterprise tiers, designed to act as a context layer for LLM applications and AI agents. Its public positioning is broader than a simple vector database wrapper: the project combines document ingestion, deep document understanding, chunking, hybrid retrieval, reranking, citations, configurable LLM and embedding models, and agent workflow tooling in one platform. The GitHub README describes RAGFlow as a RAG engine that fuses RAG with agent capabilities, while the product site frames it as a way to build a superior context layer for AI agents and enterprise use cases.

The strongest part of RAGFlow is its focus on messy enterprise data. The project emphasizes deep document understanding for unstructured data with complicated formats and supports a broad range of input types, including Word documents, slide decks, spreadsheets, text files, images, scanned copies, structured data, and web pages. It also includes built-in ingestion and ETL-style processing intended to cleanse and structure multi-format data into semantic representations before retrieval. For teams building knowledge-base assistants over PDFs, scanned documents, internal files, and mixed business records, that ingestion layer is a major part of the value proposition.

RAGFlow also puts unusual emphasis on explainability and grounding. It offers template-based chunking with multiple options, chunk visualization for human inspection, and traceable citations that let users quickly view references behind generated answers. This matters for professional and regulated workflows where the answer alone is not enough and users need to inspect where a claim came from. Its retrieval stack combines vector search, BM25/full-text search, custom scoring, multiple recall, and fused reranking, which gives teams more retrieval control than a bare vector-only setup.

Beyond retrieval, RAGFlow has been evolving toward agent orchestration. The website describes unified AI agent orchestration that integrates RAG, tools, MCPs, web search, chat, datasets, models, and visual workflows. The listed industry examples include equity investment research, legal precedent analysis, and manufacturing maintenance support, each using agent-style steps such as search, retrieval, HTTP calls, conditional logic, report generation, clarification, and instruction output. Recent updates listed in the repository also show support for agent memory, agentic workflow and MCP, a Python/JavaScript code executor component, and multiple chat channels.

RAGFlow is suitable for engineering teams that want an open-source, self-hostable RAG platform with a UI and production-oriented components, not just a developer library. It can be deployed via Docker Compose, configured with different LLM and embedding providers, and integrated through APIs. However, it is not a zero-maintenance tool. Self-hosting has meaningful infrastructure requirements, including at least 4 CPU cores, 16 GB RAM, 50 GB disk, Docker, Docker Compose, and Python 3.13 according to the README. The deployment stack also involves services such as Elasticsearch by default, with Infinity as an alternative document engine, plus MySQL, MinIO, and Redis in the development setup. Teams should expect real DevOps ownership for production use.

The commercial cloud offering lowers the operational burden and includes Free, Starter, Pro, and Enterprise tiers. The published cloud limits make the lower tiers best for evaluation and small team usage: Free includes 5 apps, 1 team member, 0.1 GB dataset storage, and 500 monthly credits; Starter increases this to 50 apps, 5 team members, 5 GB storage, and 5,000 credits; Pro includes unlimited apps, 20 team members, 50 GB storage, and 20,000 credits. Enterprise adds BYOC deployment, on-premises deployment, dedicated support, and custom SLA. Overall, RAGFlow is best viewed as a serious RAG and agent platform for teams that value document parsing quality, retrieval transparency, and deployment control, while accepting the complexity that comes with a full-stack open-source system.

🎨

Vibe Coding Friendly?

▼

Difficulty:intermediate

Suitability for vibe coding depends on your experience level and the specific use case.

Learn about Vibe Coding →

Was this helpful?

Key Features

Deep Document Understanding+

Parses PDFs, Word docs, and more with structure-aware chunking that preserves tables, headers, figures, and hierarchical relationships.

Use Case:

Processing financial reports where table data and section context must be preserved for accurate retrieval.

Visual Chunk Editor+

Web UI showing exactly how each document was chunked, with the ability to manually adjust boundaries and verify parsing quality.

Use Case:

Quality-checking document parsing before deploying a knowledge base to production users.

Citation Tracking+

Every generated answer includes links to specific source chunks, enabling users to verify claims against original documents.

Use Case:

Building a compliance knowledge assistant where every answer must be traceable to source policy documents.

Multi-Turn Conversation+

Maintains conversation context across multiple exchanges, enabling follow-up questions and clarification without losing thread.

Use Case:

Creating a customer-facing knowledge assistant that handles complex multi-step inquiries.

Table Understanding+

Specialized parsing for complex tables that maintains row/column relationships during indexing and retrieval.

Use Case:

Querying data from annual reports, spec sheets, or compliance matrices embedded in PDF documents.

Multi-Tenancy Support+

Built-in tenant isolation enabling multiple teams or clients to have separate knowledge bases within one deployment.

Use Case:

Deploying a shared RAG platform across departments with isolated data access controls.

Pricing Plans

Open Source Self-Hosted

Free software license; infrastructure and model costs not included

Free

$0/month

Starter

$29/month shown with a higher $59/month reference price on the site

Pro

$129/month shown with a higher $259/month reference price on the site

Enterprise

Contact sales

See Full Pricing →Free vs Paid →Is it worth it? →

Ready to get started with RAGFlow?

View Pricing Options →

Best Use Cases

🎯

Building enterprise knowledge-base assistants over PDFs, scanned files, office documents, spreadsheets, images, structured records, and web pages.

⚡

Legal or compliance research workflows that require source-grounded answers, precedent retrieval, and traceable citations.

🔧

Financial research assistants that combine internal records, external sources, retrieval, metrics, and report generation.

🚀

Manufacturing or field-support copilots that retrieve validated maintenance procedures from internal manuals and supplement them with external technical references.

💡

Teams that need a self-hostable RAG platform with a UI, APIs, configurable LLMs, configurable embedding models, and visible chunking controls.

🔄

Agent workflows where RAG needs to be combined with tools, MCPs, web search, conditional steps, code execution, and report-style outputs.

Limitations & What It Can't Do

We believe in transparent reviews. Here's what RAGFlow doesn't handle well:

⚠RAGFlow is a full platform rather than a lightweight library, so it may be excessive for simple prototypes that only need a few vector-search calls.
⚠Production self-hosting requires DevOps work around Docker, search/storage services, environment configuration, model provider keys, backups, and monitoring.
⚠The README notes x86-only prebuilt Docker images, creating friction for teams standardized on ARM64 infrastructure.
⚠Using the code executor sandbox requires gVisor, adding another deployment prerequisite for that feature.
⚠Published cloud storage limits on Free, Starter, and Pro plans may constrain large document collections unless teams use Enterprise or self-host.

Pros & Cons

✓ Pros

✓Strong document-ingestion focus: supports complex unstructured formats as well as Word, slides, spreadsheets, text, images, scanned copies, structured data, and web pages.
✓Explainable chunking workflow with template-based chunking options and visualization of text chunks so humans can inspect or intervene before retrieval quality problems become answer quality problems.
✓Grounded answer design includes quick reference views and traceable citations, which is useful for legal, finance, compliance, and internal knowledge workflows where source evidence matters.
✓Hybrid retrieval stack combines vector search, BM25/full-text search, custom scoring, multiple recall, and fused reranking rather than relying only on embeddings.
✓Open-source Apache-2.0 project with substantial GitHub traction, public documentation, Docker-based deployment, APIs, and active release history.
✓Agent capabilities are built into the product direction, including visual workflows, tools, MCP integration, web search, chat channels, agent memory, and code executor support.

✗ Cons

✗Self-hosting is infrastructure-heavy for casual users: the README lists minimum requirements of 4 CPU cores, 16 GB RAM, 50 GB disk, Docker, Docker Compose, and Python 3.13.
✗Prebuilt Docker images are documented as x86 only; ARM64 users must build compatible images themselves, and switching Infinity on Linux ARM64 is not officially supported.
✗The Docker image is now a slim edition that relies on external LLM and embedding services, so teams still need to configure and pay for model providers or run compatible model infrastructure.
✗The full stack has several moving parts, including document engine configuration, Docker environment files, backend service settings, and storage/search dependencies, which raises operational complexity.
✗Cloud lower tiers have tight dataset-storage limits, especially the Free tier at 0.1 GB and Starter at 5 GB, which may be too small for realistic enterprise document collections.

Frequently Asked Questions

Is RAGFlow open source?+

Yes. The GitHub repository lists RAGFlow under the Apache-2.0 license. The product also offers a hosted cloud service with Free, Starter, Pro, and Enterprise tiers.

What kinds of data can RAGFlow process?+

RAGFlow states support for Word documents, slides, spreadsheets, text files, images, scanned copies, structured data, web pages, and other heterogeneous sources. Its website also describes a built-in ingestion pipeline for cleansing and processing multi-format data.

Does RAGFlow only use vector search?+

No. The website describes high-precision hybrid search that combines vector search, BM25, custom scoring, and advanced reranking. The README also mentions multiple recall paired with fused reranking.

Can RAGFlow be self-hosted?+

Yes. The README provides Docker Compose and source-development instructions. Documented self-hosting prerequisites include at least 4 CPU cores, 16 GB RAM, 50 GB disk, Docker 24.0.0 or later, Docker Compose v2.26.1 or later, and Python 3.13.

Does RAGFlow support AI agents?+

Yes. RAGFlow describes unified AI agent orchestration with RAG, tools, MCPs, visual workflows, web search, chat, models, retrieval, and datasets. Recent listed updates include agentic workflow and MCP, agent memory, and a Python/JavaScript code executor component.

🦞

New to AI tools?

Read practical guides for choosing and using AI tools

Read Guides →

Get updates on RAGFlow and 370+ other AI tools

Weekly insights on the latest AI tools, features, and trends delivered to your inbox.

What's New in 2026

•The GitHub README references v0.26.0 dated June 11, 2026.

•The README lists multiple chat-channel integrations such as Feishu, Discord, Telegram, and Line.

•The README lists a 2026-04-24 update adding support for DeepSeek v4.

•The README lists a 2026-03-24 update for an official RAGFlow Skill on OpenClaw to access RAGFlow datasets.

•The website’s 2026 positioning emphasizes enterprise agent context, ETL for AI data, high-precision hybrid search, and unified AI agent orchestration.

Alternatives to RAGFlow

GraphRAG

Knowledge & Documents

Microsoft's graph-based retrieval augmented generation for complex document understanding and multi-hop reasoning.

LlamaIndex

AI agent framework

LlamaIndex is an open-source Python and TypeScript framework for building RAG, document workflows, and AI agents — with LlamaCloud for managed parsing, extraction, and indexing.

Dify

LLM app platform

Dify is an open-source LLM app development platform that combines a visual workflow builder, RAG pipelines, agent tools, and an LLMOps backbone.

Unstructured

Document Processing & OCR

Unstructured data platform for GenAI that connects to any source, processes 64+ file types, and outputs clean AI-ready inputs.

View All Alternatives & Detailed Comparison →

User Reviews

No reviews yet. Be the first to share your experience!

Quick Info

Try RAGFlow Today

Get started with RAGFlow and see if it's the right fit for your needs.

Get Started →

Need help choosing the right AI stack?

Take our 60-second quiz to get personalized tool recommendations

Find Your Perfect AI Stack →

Want a faster launch?

Explore 20 ready-to-deploy AI agent templates for sales, support, dev, research, and operations.

Browse Agent Templates →

More about RAGFlow

Pricing Review Alternatives Free vs Paid Pros & Cons Worth It?Tutorial

Overview

Key Features

Deep Document Understanding+

Parses PDFs, Word docs, and more with structure-aware chunking that preserves tables, headers, figures, and hierarchical relationships.

Use Case:

Processing financial reports where table data and section context must be preserved for accurate retrieval.

Visual Chunk Editor+

Web UI showing exactly how each document was chunked, with the ability to manually adjust boundaries and verify parsing quality.

Use Case:

Quality-checking document parsing before deploying a knowledge base to production users.

Citation Tracking+

Every generated answer includes links to specific source chunks, enabling users to verify claims against original documents.

Use Case:

Building a compliance knowledge assistant where every answer must be traceable to source policy documents.

Multi-Turn Conversation+

Maintains conversation context across multiple exchanges, enabling follow-up questions and clarification without losing thread.

Use Case:

Creating a customer-facing knowledge assistant that handles complex multi-step inquiries.

Table Understanding+

Specialized parsing for complex tables that maintains row/column relationships during indexing and retrieval.

Use Case:

Querying data from annual reports, spec sheets, or compliance matrices embedded in PDF documents.

Multi-Tenancy Support+

Built-in tenant isolation enabling multiple teams or clients to have separate knowledge bases within one deployment.

Use Case:

Deploying a shared RAG platform across departments with isolated data access controls.

Pricing Plans

Open Source Self-Hosted

Free software license; infrastructure and model costs not included

Free

$0/month

Starter

$29/month shown with a higher $59/month reference price on the site

Pro

$129/month shown with a higher $259/month reference price on the site

Enterprise

Contact sales

Best Use Cases

🎯

Building enterprise knowledge-base assistants over PDFs, scanned files, office documents, spreadsheets, images, structured records, and web pages.

⚡

Legal or compliance research workflows that require source-grounded answers, precedent retrieval, and traceable citations.

🔧

Financial research assistants that combine internal records, external sources, retrieval, metrics, and report generation.

🚀

Manufacturing or field-support copilots that retrieve validated maintenance procedures from internal manuals and supplement them with external technical references.

💡

Teams that need a self-hostable RAG platform with a UI, APIs, configurable LLMs, configurable embedding models, and visible chunking controls.

🔄

Agent workflows where RAG needs to be combined with tools, MCPs, web search, conditional steps, code execution, and report-style outputs.

Limitations & What It Can't Do

We believe in transparent reviews. Here's what RAGFlow doesn't handle well:

⚠RAGFlow is a full platform rather than a lightweight library, so it may be excessive for simple prototypes that only need a few vector-search calls.

⚠Production self-hosting requires DevOps work around Docker, search/storage services, environment configuration, model provider keys, backups, and monitoring.

⚠The README notes x86-only prebuilt Docker images, creating friction for teams standardized on ARM64 infrastructure.

⚠Using the code executor sandbox requires gVisor, adding another deployment prerequisite for that feature.

⚠Published cloud storage limits on Free, Starter, and Pro plans may constrain large document collections unless teams use Enterprise or self-host.

Pros & Cons

✓ Pros

✓Strong document-ingestion focus: supports complex unstructured formats as well as Word, slides, spreadsheets, text, images, scanned copies, structured data, and web pages.
✓Explainable chunking workflow with template-based chunking options and visualization of text chunks so humans can inspect or intervene before retrieval quality problems become answer quality problems.
✓Grounded answer design includes quick reference views and traceable citations, which is useful for legal, finance, compliance, and internal knowledge workflows where source evidence matters.
✓Hybrid retrieval stack combines vector search, BM25/full-text search, custom scoring, multiple recall, and fused reranking rather than relying only on embeddings.
✓Open-source Apache-2.0 project with substantial GitHub traction, public documentation, Docker-based deployment, APIs, and active release history.
✓Agent capabilities are built into the product direction, including visual workflows, tools, MCP integration, web search, chat channels, agent memory, and code executor support.

✗ Cons

✗Self-hosting is infrastructure-heavy for casual users: the README lists minimum requirements of 4 CPU cores, 16 GB RAM, 50 GB disk, Docker, Docker Compose, and Python 3.13.
✗Prebuilt Docker images are documented as x86 only; ARM64 users must build compatible images themselves, and switching Infinity on Linux ARM64 is not officially supported.
✗The Docker image is now a slim edition that relies on external LLM and embedding services, so teams still need to configure and pay for model providers or run compatible model infrastructure.
✗The full stack has several moving parts, including document engine configuration, Docker environment files, backend service settings, and storage/search dependencies, which raises operational complexity.
✗Cloud lower tiers have tight dataset-storage limits, especially the Free tier at 0.1 GB and Starter at 5 GB, which may be too small for realistic enterprise document collections.

Frequently Asked Questions

Is RAGFlow open source?+

Yes. The GitHub repository lists RAGFlow under the Apache-2.0 license. The product also offers a hosted cloud service with Free, Starter, Pro, and Enterprise tiers.

What kinds of data can RAGFlow process?+

Does RAGFlow only use vector search?+

No. The website describes high-precision hybrid search that combines vector search, BM25, custom scoring, and advanced reranking. The README also mentions multiple recall paired with fused reranking.

Can RAGFlow be self-hosted?+

Does RAGFlow support AI agents?+

What's New in 2026

•The GitHub README references v0.26.0 dated June 11, 2026.

•The README lists multiple chat-channel integrations such as Feishu, Discord, Telegram, and Line.

•The README lists a 2026-04-24 update adding support for DeepSeek v4.

•The README lists a 2026-03-24 update for an official RAGFlow Skill on OpenClaw to access RAGFlow datasets.

•The website’s 2026 positioning emphasizes enterprise agent context, ETL for AI data, high-precision hybrid search, and unified AI agent orchestration.