docling-project/docling

Analysis updated 2026-06-20

★ 59,251PythonAudience · developerComplexity · 3/5Setup · moderate

Mindmap

mindmap
  root((repo))
    Input Formats
      PDF
      Word and PowerPoint
      Excel and HTML
      Images and Audio
    AI Output
      Markdown
      JSON
      HTML
    Key Capabilities
      Table extraction
      Layout detection
      Reading order
      Local processing
    Integrations
      LangChain
      LlamaIndex
      Haystack
    Origin
      IBM Research Zurich
      Linux Foundation AI

mindmap root((repo)) Input Formats PDF Word and PowerPoint Excel and HTML Images and Audio AI Output Markdown JSON HTML Key Capabilities Table extraction Layout detection Reading order Local processing Integrations LangChain LlamaIndex Haystack Origin IBM Research Zurich Linux Foundation AI

Click or tap to explore — scroll the page freely

What do people build with it?

USE CASE 1

Turn a library of PDFs into AI-readable text for a chatbot or search tool, preserving tables and structure.

USE CASE 2

Build a document Q&A app that ingests Word, PowerPoint, and Excel files without losing formatting context.

USE CASE 3

Process sensitive internal documents locally without sending data to any external service.

USE CASE 4

Plug document parsing into an existing LangChain or LlamaIndex pipeline with minimal setup.

What is it built with?

PythonLangChainLlamaIndexHaystackHeron layout modelLaTeXSpeech recognition

How does it compare?

	docling-project/docling	666ghj/mirofish	meta-llama/llama
Stars	59,251	59,373	59,389
Language	Python	Python	Python
Setup difficulty	moderate	hard	hard
Complexity	3/5	4/5	3/5
Audience	developer	pm founder	developer

Figures from each repo's GitHub metadata at analysis time.

How do you get it running?

Difficulty · moderate Time to first run · 30min

Install via pip. First run downloads layout detection models automatically. GPU speeds up PDF processing but is not required. Works fully offline after initial model download.

No license was mentioned in the explanation.

In plain English

Docling is a Python library and command-line tool for converting documents from many different file formats into structured, AI-friendly output. The problem it solves is that before you can use a document as context for an AI system, you need to extract its text and structure in a clean, organized form, which is especially difficult for PDFs because they were designed for printing rather than machine reading. PDFs often contain tables, multi-column layouts, headers, footnotes, charts, and mathematical formulas that simple text extraction tools mangle or miss entirely. Docling handles these challenges with purpose-built understanding of page layout, reading order, table structure, and image content. The library accepts a wide range of input formats including PDF, Word documents (DOCX), PowerPoint (PPTX), Excel (XLSX), HTML, images in formats like PNG and JPEG, LaTeX, and audio files through speech recognition. It converts all of these into a unified internal document representation and then exports to Markdown, HTML, or JSON, preserving the structural information that makes the content useful for AI processing. For PDFs, it uses a layout detection model called Heron that identifies different regions of each page. The tool can run entirely locally, which matters when handling sensitive documents in environments without internet access. It integrates directly with popular AI application frameworks like LangChain, LlamaIndex, and Haystack, so you can plug it into an existing retrieval-augmented generation pipeline. The project was started by IBM Research Zurich and is now hosted under the Linux Foundation AI and Data initiative. You would use Docling when building an AI application that needs to ingest and understand a variety of real-world document formats.

Copy-paste prompts

Prompt 1

I'm using Docling to convert PDFs for a RAG pipeline. Show me a Python example that loads a PDF, converts it with Docling, and feeds the output into LlamaIndex for question answering.

Prompt 2

How does Docling handle multi-column PDF layouts and tables? Explain what the Heron layout model does and when it might fail.

Prompt 3

I need to process confidential documents without internet access. Walk me through running Docling fully locally, including any model downloads needed upfront.

Prompt 4

Show me how to use Docling's CLI to batch-convert a folder of DOCX and PPTX files to Markdown.

Prompt 5

What output format should I choose from Docling, Markdown, HTML, or JSON, if I want to feed documents into a vector database for semantic search? Explain the trade-offs.

Frequently asked questions

What is docling?

Docling converts PDFs, Word docs, PowerPoint, Excel, images, and more into clean, structured text that AI tools can actually read and understand, handling tricky layouts, tables, and multi-column pages automatically.

What language is docling written in?

Mainly Python. The stack also includes Python, LangChain, LlamaIndex.

What license does docling use?

No license was mentioned in the explanation.

How hard is docling to set up?

Setup difficulty is rated moderate, with roughly 30min to a first successful run.

Who is docling for?

Mainly developer.

Open on GitHub → Explain another repo

This repo across BitVibe Labs

Scan in gitsafehub Deploy in gitdeployhub docling-project on gitmyhub

Verify against the repo before relying on details.