High-performance open-source tool that converts PDFs, images, PPTX, DOCX, XLSX, HTML, EPUB, and other documents to markdown, JSON, chunks, or HTML with deep-learning-powered OCR, layout detection, and optional LLM cleanup.
Converts PDFs and documents to clean markdown or JSON — fast, accurate, handles tables, equations, and complex layouts with AI.
Marker is a free open-source document conversion pipeline for permitted research, personal, and qualifying startup use, with Datalab managed API pricing at $4 per 1,000 pages for Fast/Balanced mode, $6 per 1,000 pages for High Accuracy and extraction workflows, and custom self-hosting pricing. Marker is designed to turn complex documents into clean, structured outputs for AI, search, analytics, and knowledge-base workflows. Its core job is converting PDFs and other document formats into markdown, JSON, chunks, or HTML while preserving useful document structure such as headings, tables, forms, equations, inline math, links, references, images, and code blocks. Five concrete implementation facts define the tool: it is installable as the Python package marker-pdf; the README requires Python 3.10+; non-PDF support is enabled through the fuller marker-pdf[full] installation; the project supports local execution through CLI commands such as markersingle and marker; and it can also be used through Python APIs, a Streamlit GUI, or a lightweight FastAPI server. The input coverage is broad for a document AI converter: PDF, image, PPTX, DOCX, XLSX, HTML, and EPUB files are all described as supported inputs, while markdown, HTML, JSON, and chunks are documented outputs. The chunk output is especially relevant for retrieval workflows because it flattens top-level blocks for easier ingestion into downstream RAG or search systems. Marker also includes specialized modes for narrower tasks, including table-only conversion, OCR-only conversion, and beta structured extraction. Its optional --usellm mode can connect to services such as Gemini, Google Vertex, Ollama, Claude, OpenAI-compatible endpoints, and Azure OpenAI to improve hard cases like table merging across pages, inline math, table formatting, and form value extraction. Local deployment is practical for developers who can manage PyTorch and model dependencies, but resource planning matters: the README states Marker may use about 5GB of VRAM per worker at peak and about 3.5GB on average. Licensing also matters. The repository states that the code is GPL-3.0 and that model weights are free for research, personal use, and startups under $2M in funding or revenue, while broader commercial licensing or removing GPL requirements requires Datalab commercial licensing. For teams that do not want to operate the stack directly, Datalab offers a managed API, high-volume batch arrangements, and commercial self-hosted or on-premise options with custom terms.
Was this helpful?
Marker is a leading open-source document conversion tool, combining deep learning layout detection with OCR to produce clean markdown, JSON, chunks, or HTML from complex documents. Its LLM-enhanced mode can improve difficult cases such as tables, forms, and math. Main limitations are GPL licensing restrictions, resource requirements for local throughput, and the need to use Datalab’s paid API or commercial self-hosting for many production commercial scenarios.
Uses Surya models for detecting document regions: text blocks, headers, tables, figures, equations, code blocks, page headers, and footers. Handles multi-column layouts and complex page structures with reading order detection.
Use Case:
Converting a two-column research paper into single-column markdown with correct reading order and section hierarchy.
Integrated Surya OCR engine optimized for document text recognition. Supports 90+ languages and handles mixed-language documents with higher accuracy than Tesseract for most document types.
Use Case:
Processing scanned technical documents in multiple languages where Tesseract OCR produces too many errors.
Detects tables and converts them to properly formatted markdown tables or structured JSON with column alignment. Handles simple and moderately complex table structures, with LLM-enhanced mode for merging tables across pages.
Use Case:
Converting a technical specification PDF with comparison tables into structured data where table relationships are preserved.
Optional --use_llm flag pairs Marker with Gemini, Claude, OpenAI, or Ollama models to improve table formatting, handle inline math, extract form values, and merge tables split across pages. Benchmarks in the README report higher accuracy when LLM assistance is enabled for relevant cases.
Use Case:
Processing complex financial reports where tables span multiple pages and inline calculations need accurate LaTeX conversion.
Accepts PDF, image, PPTX, DOCX, XLSX, HTML, and EPUB files. Outputs markdown, JSON (structured), chunks (pre-segmented for RAG), or HTML. Extensible with custom processors for specialized formatting logic.
Use Case:
Building an ingestion pipeline that converts a mix of PowerPoint presentations, Word documents, and PDFs into chunked JSON for a vector database.
Datalab offers a hosted managed API with published page-based pricing and custom self-hosted deployments for teams that do not want to operate the open-source stack directly. Published managed pricing lists $4 per 1,000 pages for Fast and Balanced mode and $6 per 1,000 pages for High Accuracy mode, structured extraction, track changes, and spreadsheets.
Use Case:
A compliance team that processes thousands of regulatory PDFs monthly using the managed API or a commercial self-hosted deployment to avoid maintaining a custom document conversion service.
Free for permitted uses
$4 per 1,000 pages for Fast and Balanced mode; $6 per 1,000 pages for High Accuracy mode, structured extraction, track changes, and spreadsheets; $25 monthly credit included on the managed plan
Custom pricing
Custom pricing
Ready to get started with Marker?
View Pricing Options →Marker works with these platforms and services:
We believe in transparent reviews. Here's what Marker doesn't handle well:
Weekly insights on the latest AI tools, features, and trends delivered to your inbox.
MCP / Agent Infrastructure
IBM-originated open-source document processing software for parsing, understanding, serializing, and chunking complex documents for AI pipelines.
Document AI
LlamaParse: Extract and analyze structured data from complex PDFs and documents using LLM-powered parsing.
Document Processing & OCR
Unstructured data platform for GenAI that connects to any source, processes 64+ file types, and outputs clean AI-ready inputs.
Automation & Workflows
Enterprise-grade text extraction and document processing framework that detects and extracts content from 1,000+ file formats. Free, containerized, and battle-tested across 18 years of production deployment.
Automation & Workflows
AWS document intelligence service that extracts text, tables, forms, and handwriting from scanned documents using machine learning — with specialized APIs for invoices, IDs, and lending documents.
No reviews yet. Be the first to share your experience!
Get started with Marker and see if it's the right fit for your needs.
Get Started →Take our 60-second quiz to get personalized tool recommendations
Find Your Perfect AI Stack →Explore 20 ready-to-deploy AI agent templates for sales, support, dev, research, and operations.
Browse Agent Templates →