mirror of
https://github.com/DS4SD/docling.git
synced 2025-12-08 12:48:28 +00:00
docs: Add instructions for using Docling with MCP to README (#2219)
* docs: Add instructions for using Docling with MCP to README
* DCO Remediation Commit for Roy Derks <10717410+royderks@users.noreply.github.com>
Signed-off-by: Roy Derks <roy.derks@ibm.com>
* DCO Remediation Commit for Roy Derks <10717410+royderks@users.noreply.github.com>
I, Roy Derks <10717410+royderks@users.noreply.github.com>, hereby add my Signed-off-by to this commit: 4b9ba1d0ef
Signed-off-by: Roy Derks <roy.derks@ibm.com>
* docs: reorganize documentation on MCP server
Signed-off-by: Cesar Berrospi Ramis <ceb@zurich.ibm.com>
* docs: align README with documentation index page
Signed-off-by: Cesar Berrospi Ramis <ceb@zurich.ibm.com>
---------
Signed-off-by: Roy Derks <roy.derks@ibm.com>
Signed-off-by: Cesar Berrospi Ramis <ceb@zurich.ibm.com>
Co-authored-by: Roy Derks <roy.derks@ibm.com>
Co-authored-by: Cesar Berrospi Ramis <ceb@zurich.ibm.com>
This commit is contained in:
7
docs/index.md
vendored
7
docs/index.md
vendored
@@ -30,13 +30,20 @@ Docling simplifies document processing, parsing diverse formats — including ad
|
||||
* 🔍 Extensive OCR support for scanned PDFs and images
|
||||
* 👓 Support of several Visual Language Models ([SmolDocling](https://huggingface.co/ds4sd/SmolDocling-256M-preview))
|
||||
* 🎙️ Support for Audio with Automatic Speech Recognition (ASR) models
|
||||
* 🔌 Connect to any agent using the [Docling MCP](https://docling-project.github.io/docling/usage/mcp/) server
|
||||
* 💻 Simple and convenient CLI
|
||||
|
||||
### What's new
|
||||
* 📤 Structured [information extraction][extraction] \[🧪 beta\]
|
||||
* 📑 New layout model (**Heron**) by default, for faster PDF parsing
|
||||
* 🔌 [MCP server](https://docling-project.github.io/docling/usage/mcp/) for agentic applications
|
||||
|
||||
### Coming soon
|
||||
|
||||
* 📝 Metadata extraction, including title, authors, references & language
|
||||
* 📝 Chart understanding (Barchart, Piechart, LinePlot, etc)
|
||||
* 📝 Complex chemistry understanding (Molecular structures)
|
||||
* 📝 Parsing of Web Video Text Tracks (WebVTT) files
|
||||
|
||||
## Get started
|
||||
|
||||
|
||||
31
docs/usage/mcp.md
vendored
Normal file
31
docs/usage/mcp.md
vendored
Normal file
@@ -0,0 +1,31 @@
|
||||
New AI trends focus on Agentic AI, an artificial intelligence system that can accomplish a specific goal with limited supervision.
|
||||
Agents can act autonomously to understand, plan, and execute a specific task.
|
||||
|
||||
To address the integration problem, the [Model Context Protocol](https://modelcontextprotocol.io) (MCP) emerges as a popular standard for connecting AI applications to external tools.
|
||||
|
||||
## Docling MCP
|
||||
|
||||
Docling supports the development of AI agents by providing an MCP Server. It allows you to experiment with document processing in different MCP Clients. Adding [Docling MCP](https://github.com/docling-project/docling-mcp) in your favorite client is usually as simple as adding the following entry in the configuration file:
|
||||
|
||||
```json
|
||||
{
|
||||
"mcpServers": {
|
||||
"docling": {
|
||||
"command": "uvx",
|
||||
"args": [
|
||||
"--from=docling-mcp",
|
||||
"docling-mcp-server"
|
||||
]
|
||||
}
|
||||
}
|
||||
}
|
||||
```
|
||||
|
||||
When using [Claude on your desktop](https://claude.ai/download), just edit the config file `claude_desktop_config.json` with the snippet above or the example provided [here](https://github.com/docling-project/docling-mcp/blob/main/docs/integrations/claude_desktop_config.json).
|
||||
|
||||
In **[LM Studio](https://lmstudio.ai/)**, edit the `mcp.json` file with the appropriate section or simply clik on the button below for a direct install.
|
||||
|
||||
[](https://lmstudio.ai/install-mcp?name=docling&config=eyJjb21tYW5kIjoidXZ4IiwiYXJncyI6WyItLWZyb209ZG9jbGluZy1tY3AiLCJkb2NsaW5nLW1jcC1zZXJ2ZXIiXX0%3D)
|
||||
|
||||
|
||||
Docling MCP also provides tools specific for some applications and frameworks. See the [Docling MCP](https://github.com/docling-project/docling-mcp) Server repository for more details. You will find examples of building agents powered by Docling capabilities and leveraging frameworks like [LlamaIndex](https://www.llamaindex.ai/), [Llama Stack](https://github.com/llamastack/llama-stack), [Pydantic AI](https://ai.pydantic.dev/), or [smolagents](https://github.com/huggingface/smolagents).
|
||||
Reference in New Issue
Block a user