docling/docling/backend
Cesar Berrospi Ramis c2ae1cc4ca
docs: description of supported formats and backends (#788)
* chore: remove type-ignore marks for attaching text to non GroupItems

After commit b74208 of docling-core, text items can be attached to any NodeItem
and therefore the ignore[arg-type] type marks can be removed.

Signed-off-by: Cesar Berrospi Ramis <75900930+ceberam@users.noreply.github.com>

* test: remove unnecessary imports

Signed-off-by: Cesar Berrospi Ramis <75900930+ceberam@users.noreply.github.com>

* docs: add documentation on supported formats and backends

Signed-off-by: Cesar Berrospi Ramis <75900930+ceberam@users.noreply.github.com>

* docs: add notebook example with XML backends

Signed-off-by: Cesar Berrospi Ramis <75900930+ceberam@users.noreply.github.com>

---------

Signed-off-by: Cesar Berrospi Ramis <75900930+ceberam@users.noreply.github.com>
2025-01-26 08:10:33 +01:00
..
json feat: add Docling JSON ingestion (#783) 2025-01-24 18:05:23 +01:00
xml docs: description of supported formats and backends (#788) 2025-01-26 08:10:33 +01:00
__init__.py Initial commit 2024-07-15 09:42:42 +02:00
abstract_backend.py feat: add Docling JSON ingestion (#783) 2025-01-24 18:05:23 +01:00
asciidoc_backend.py feat: Code and equation model for PDF and code blocks in markdown (#752) 2025-01-24 16:54:22 +01:00
docling_parse_backend.py refactor: allow the usage of backends in the enrich models and generalize the interface (#742) 2025-01-15 09:52:38 +01:00
docling_parse_v2_backend.py refactor: allow the usage of backends in the enrich models and generalize the interface (#742) 2025-01-15 09:52:38 +01:00
html_backend.py feat: Code and equation model for PDF and code blocks in markdown (#752) 2025-01-24 16:54:22 +01:00
md_backend.py feat: Code and equation model for PDF and code blocks in markdown (#752) 2025-01-24 16:54:22 +01:00
msexcel_backend.py fix: added extraction of byte-images in excel (#804) 2025-01-24 18:48:02 +01:00
mspowerpoint_backend.py fix(mspowerpoint): handle invalid images in PowerPoint slides (#650) 2025-01-07 13:58:10 +01:00
msword_backend.py feat: Code and equation model for PDF and code blocks in markdown (#752) 2025-01-24 16:54:22 +01:00
pdf_backend.py feat: Code and equation model for PDF and code blocks in markdown (#752) 2025-01-24 16:54:22 +01:00
pypdfium2_backend.py refactor: allow the usage of backends in the enrich models and generalize the interface (#742) 2025-01-15 09:52:38 +01:00