mirror of
https://github.com/DS4SD/docling.git
synced 2025-12-09 13:18:24 +00:00
Merge branch 'main' of https://github.com/docling-project/docling
This commit is contained in:
4
docs/examples/minimal_vlm_pipeline.py
vendored
4
docs/examples/minimal_vlm_pipeline.py
vendored
@@ -3,7 +3,7 @@
|
||||
#
|
||||
# What this example does
|
||||
# - Runs the VLM-powered pipeline on a PDF (by URL) and prints Markdown output.
|
||||
# - Shows two setups: default (Transformers/SmolDocling) and macOS MPS/MLX.
|
||||
# - Shows two setups: default (Transformers/GraniteDocling) and macOS MPS/MLX.
|
||||
#
|
||||
# Prerequisites
|
||||
# - Install Docling with VLM extras and the appropriate backend (Transformers or MLX).
|
||||
@@ -15,7 +15,7 @@
|
||||
#
|
||||
# Notes
|
||||
# - `source` may be a local path or a URL to a PDF.
|
||||
# - The second section demonstrates macOS MPS acceleration via MLX (`vlm_model_specs.SMOLDOCLING_MLX`).
|
||||
# - The second section demonstrates macOS MPS acceleration via MLX (`vlm_model_specs.GRANITEDOCLING_MLX`).
|
||||
# - For more configurations and model comparisons, see `docs/examples/compare_vlm_models.py`.
|
||||
|
||||
# %%
|
||||
|
||||
4
docs/index.md
vendored
4
docs/index.md
vendored
@@ -21,7 +21,7 @@ Docling simplifies document processing, parsing diverse formats — including ad
|
||||
|
||||
## Features
|
||||
|
||||
* 🗂️ Parsing of [multiple document formats][supported_formats] incl. PDF, DOCX, PPTX, XLSX, HTML, WAV, MP3, images (PNG, TIFF, JPEG, ...), and more
|
||||
* 🗂️ Parsing of [multiple document formats][supported_formats] incl. PDF, DOCX, PPTX, XLSX, HTML, WAV, MP3, VTT, images (PNG, TIFF, JPEG, ...), and more
|
||||
* 📑 Advanced PDF understanding incl. page layout, reading order, table structure, code, formulas, image classification, and more
|
||||
* 🧬 Unified, expressive [DoclingDocument][docling_document] representation format
|
||||
* ↪️ Various [export formats][supported_formats] and options, including Markdown, HTML, [DocTags](https://arxiv.org/abs/2503.11576) and lossless JSON
|
||||
@@ -37,13 +37,13 @@ Docling simplifies document processing, parsing diverse formats — including ad
|
||||
* 📤 Structured [information extraction][extraction] \[🧪 beta\]
|
||||
* 📑 New layout model (**Heron**) by default, for faster PDF parsing
|
||||
* 🔌 [MCP server](https://docling-project.github.io/docling/usage/mcp/) for agentic applications
|
||||
* 💬 Parsing of Web Video Text Tracks (WebVTT) files
|
||||
|
||||
### Coming soon
|
||||
|
||||
* 📝 Metadata extraction, including title, authors, references & language
|
||||
* 📝 Chart understanding (Barchart, Piechart, LinePlot, etc)
|
||||
* 📝 Complex chemistry understanding (Molecular structures)
|
||||
* 📝 Parsing of Web Video Text Tracks (WebVTT) files
|
||||
|
||||
## Get started
|
||||
|
||||
|
||||
5
docs/usage/supported_formats.md
vendored
5
docs/usage/supported_formats.md
vendored
@@ -11,10 +11,11 @@ Below you can find a listing of all supported input and output formats.
|
||||
| PDF | |
|
||||
| DOCX, XLSX, PPTX | Default formats in MS Office 2007+, based on Office Open XML |
|
||||
| Markdown | |
|
||||
| AsciiDoc | |
|
||||
| AsciiDoc | Human-readable, plain-text markup language for structured technical content |
|
||||
| HTML, XHTML | |
|
||||
| CSV | |
|
||||
| PNG, JPEG, TIFF, BMP, WEBP | Image formats |
|
||||
| WebVTT | Web Video Text Tracks format for displaying timed text |
|
||||
|
||||
Schema-specific support:
|
||||
|
||||
@@ -32,4 +33,4 @@ Schema-specific support:
|
||||
| Markdown | |
|
||||
| JSON | Lossless serialization of Docling Document |
|
||||
| Text | Plain text, i.e. without Markdown markers |
|
||||
| Doctags | |
|
||||
| [Doctags](https://arxiv.org/pdf/2503.11576) | Markup format for efficiently representing the full content and layout characteristics of a document |
|
||||
|
||||
Reference in New Issue
Block a user