docling/docs/index.md at ad2f738231e414a75e0b649250b58f11c86c7c17

mirror of https://github.com/DS4SD/docling.git synced 2025-12-08 20:58:11 +00:00

Files

Roy Derks e5cd7020bd docs: Add instructions for using Docling with MCP to README (#2219 )

* docs: Add instructions for using Docling with MCP to README

* DCO Remediation Commit for Roy Derks <10717410+royderks@users.noreply.github.com>

Signed-off-by: Roy Derks <roy.derks@ibm.com>

* DCO Remediation Commit for Roy Derks <10717410+royderks@users.noreply.github.com>

I, Roy Derks <10717410+royderks@users.noreply.github.com>, hereby add my Signed-off-by to this commit: 4b9ba1d0ef

Signed-off-by: Roy Derks <roy.derks@ibm.com>

* docs: reorganize documentation on MCP server

Signed-off-by: Cesar Berrospi Ramis <ceb@zurich.ibm.com>

* docs: align README with documentation index page

Signed-off-by: Cesar Berrospi Ramis <ceb@zurich.ibm.com>

---------

Signed-off-by: Roy Derks <roy.derks@ibm.com>
Signed-off-by: Cesar Berrospi Ramis <ceb@zurich.ibm.com>
Co-authored-by: Roy Derks <roy.derks@ibm.com>
Co-authored-by: Cesar Berrospi Ramis <ceb@zurich.ibm.com>

2025-09-10 10:02:28 +02:00

4.7 KiB

Vendored

Raw Blame History

Docling simplifies document processing, parsing diverse formats — including advanced PDF understanding — and providing seamless integrations with the gen AI ecosystem.

Features

🗂️ Parsing of multiple document formats incl. PDF, DOCX, PPTX, XLSX, HTML, WAV, MP3, images (PNG, TIFF, JPEG, ...), and more
📑 Advanced PDF understanding incl. page layout, reading order, table structure, code, formulas, image classification, and more
🧬 Unified, expressive DoclingDocument representation format
↪️ Various export formats and options, including Markdown, HTML, DocTags and lossless JSON
🔒 Local execution capabilities for sensitive data and air-gapped environments
🤖 Plug-and-play integrations incl. LangChain, LlamaIndex, Crew AI & Haystack for agentic AI
🔍 Extensive OCR support for scanned PDFs and images
👓 Support of several Visual Language Models (SmolDocling)
🎙️ Support for Audio with Automatic Speech Recognition (ASR) models
🔌 Connect to any agent using the Docling MCP server
💻 Simple and convenient CLI

What's new

📤 Structured [information extraction][extraction] [🧪 beta]
📑 New layout model (Heron) by default, for faster PDF parsing
🔌 MCP server for agentic applications

Coming soon

📝 Metadata extraction, including title, authors, references & language
📝 Chart understanding (Barchart, Piechart, LinePlot, etc)
📝 Complex chemistry understanding (Molecular structures)
📝 Parsing of Web Video Text Tracks (WebVTT) files

Get started

Check out our getting started page to get the ball rolling!

Live assistant

Do you want to leverage the power of AI and get live support on Docling? Try out the Chat with Dosu functionalities provided by our friends at Dosu.

LF AI & Data

Docling is hosted as a project in the LF AI & Data Foundation.

IBM ❤️ Open Source AI

The project was started by the AI for knowledge team at IBM Research Zurich.

4.7 KiB Vendored Raw Blame History