Merge branch 'dev/mlx' of github.com:DS4SD/docling into dev/mlx

This commit is contained in:
Christoph Auer 2025-03-19 14:56:28 +01:00
commit 39a949df6e
4 changed files with 33 additions and 13 deletions

View File

@@ -35,7 +35,7 @@ Docling simplifies document processing, parsing diverse formats — including ad
 * 🔒 Local execution capabilities for sensitive data and air-gapped environments
 * 🤖 Plug-and-play [integrations][integrations] incl. LangChain, LlamaIndex, Crew AI & Haystack for agentic AI
 * 🔍 Extensive OCR support for scanned PDFs and images
-* 🥚 Support of Visual Language Models ([SmolDocling](https://huggingface.co/ds4sd/SmolDocling-256M-preview))
+* 🥚 Support of Visual Language Models ([SmolDocling](https://huggingface.co/ds4sd/SmolDocling-256M-preview)) 🆕
 * 💻 Simple and convenient CLI

 ### Coming soon
@@ -57,7 +57,7 @@ More [detailed installation instructions](https://docling-project.github.io/docl
 ## Getting started

-To convert individual documents, use `convert()`, for example:
+To convert individual documents with Python, use `convert()`, for example:

 ```python
 from docling.document_converter import DocumentConverter

@@ -71,6 +71,21 @@ print(result.document.export_to_markdown())  # output: "## Docling Technical Rep
 More [advanced usage options](https://docling-project.github.io/docling/usage/) are available in
 the docs.
+## CLI
+
+Docling has a built-in CLI to run conversions.
+A simple example would look like this:
+
+```bash
+docling https://arxiv.org/pdf/2206.01062
+```
+
+You can also use 🥚[SmolDocling](https://huggingface.co/ds4sd/SmolDocling-256M-preview) via the Docling CLI:
+
+```bash
+docling --pipeline vlm --vlm-model smoldocling https://arxiv.org/pdf/2206.01062
+```
+
+This will use MLX acceleration on supported Apple Silicon hardware.
+Read more [here](https://docling-project.github.io/docling/usage/).
 ## Documentation

 Check out Docling's [documentation](https://docling-project.github.io/docling/) for details on

View File

@@ -68,18 +68,13 @@ for source in sources:
     res = converter.convert(source)

-    print("------------------------------------------------")
-    print("MD:")
-    print("------------------------------------------------")
     print("")
     print(res.document.export_to_markdown())

-    doctags = ""
     for page in res.pages:
         print("")
         print("Predicted page in DOCTAGS:")
         print(page.predictions.vlm_response.text)
-        doctags += page.predictions.vlm_response.text

     res.document.save_as_html(
         filename=Path("{}/{}.html".format(out_path, res.input.file.stem)),
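The `"{}/{}.html".format(...)` expression above only joins the output directory with the input file's stem; a stdlib-only sketch with stand-in names:

```python
from pathlib import Path

out_path = Path("scratch")           # stand-in for the script's out_path
input_file = Path("2206.01062.pdf")  # stand-in for res.input.file
html_name = Path("{}/{}.html".format(out_path, input_file.stem))
print(html_name.as_posix())  # scratch/2206.01062.html
```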
@@ -90,14 +85,17 @@ for source in sources:
     with (out_path / f"{res.input.file.stem}.json").open("w") as fp:
         fp.write(json.dumps(res.document.export_to_dict()))

-    with (out_path / f"{res.input.file.stem}.md").open("w") as fp:
-        fp.write(res.document.export_to_markdown())
+    res.document.save_as_json(
+        out_path / f"{res.input.file.stem}.json",
+        image_mode=ImageRefMode.PLACEHOLDER,
+    )

-    with (out_path / f"{res.input.file.stem}.doctag").open("w") as fp:
-        fp.write(doctags)
+    res.document.save_as_markdown(
+        out_path / f"{res.input.file.stem}.md",
+        image_mode=ImageRefMode.PLACEHOLDER,
+    )
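The JSON branch above is a plain dict-to-file dump; the pattern round-trips with the standard library alone (stand-in dict in place of `res.document.export_to_dict()`):

```python
import json
import tempfile
from pathlib import Path

doc_dict = {"schema_name": "DoclingDocument", "texts": []}  # stand-in dict
out_path = Path(tempfile.mkdtemp())
stem = "2206.01062"  # stand-in for res.input.file.stem

# Write the dict as JSON next to the other per-document outputs.
with (out_path / f"{stem}.json").open("w") as fp:
    fp.write(json.dumps(doc_dict))

# Reading it back recovers the original dict.
loaded = json.loads((out_path / f"{stem}.json").read_text())
```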
     pg_num = res.document.num_pages()
     print("")

     inference_time = time.time() - start_time
     print(
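The `inference_time` bookkeeping follows the usual wall-clock pattern (a minimal sketch; in the full script `start_time` is set before the conversion loop):

```python
import time

start_time = time.time()
# ... conversion work would run here ...
inference_time = time.time() - start_time
print(f"Inference time: {inference_time:.2f} sec.")
```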

View File

@@ -26,7 +26,7 @@ Docling simplifies document processing, parsing diverse formats — including ad
 * 🔒 Local execution capabilities for sensitive data and air-gapped environments
 * 🤖 Plug-and-play [integrations][integrations] incl. LangChain, LlamaIndex, Crew AI & Haystack for agentic AI
 * 🔍 Extensive OCR support for scanned PDFs and images
-* 🥚 Support of Visual Language Models ([SmolDocling](https://huggingface.co/ds4sd/SmolDocling-256M-preview))
+* 🥚 Support of Visual Language Models ([SmolDocling](https://huggingface.co/ds4sd/SmolDocling-256M-preview)) 🆕
 * 💻 Simple and convenient CLI

 ### Coming soon

View File

@@ -26,6 +26,13 @@ To see all available options (export formats etc.) run `docling --help`. More de
 ### Advanced options

+#### SmolDocling via CLI
+
+You can also use 🥚[SmolDocling](https://huggingface.co/ds4sd/SmolDocling-256M-preview) via the Docling CLI:
+
+```bash
+docling --pipeline vlm --vlm-model smoldocling https://arxiv.org/pdf/2206.01062
+```
+
+This will use MLX acceleration on supported Apple Silicon hardware.
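Whether a machine falls into that hardware class can be probed from Python (a stdlib sketch; the actual accelerator selection happens inside docling/MLX):

```python
import platform

# Rough check for the Apple Silicon class mentioned above; docling/MLX
# make the real accelerator decision internally.
is_apple_silicon = platform.system() == "Darwin" and platform.machine() == "arm64"
print(is_apple_silicon)
```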
 #### Model prefetching and offline usage

 By default, models are downloaded automatically upon first usage. If you would prefer