docling/docling/backend
Pedro Ribeiro ca07927792 get merged_text from boundingbox instead of merging it to prevent overlaps
Signed-off-by: Pedro Ribeiro <pedro_ribeiro_93@hotmail.com>
2025-05-18 16:42:23 +01:00
docx ci: add coverage and ruff (#1383) 2025-04-14 18:01:26 +02:00
json feat: add Docling JSON ingestion (#783) 2025-01-24 18:05:23 +01:00
xml chore: typo fix (#1465) 2025-04-28 08:52:09 +02:00
__init__.py Initial commit 2024-07-15 09:42:42 +02:00
abstract_backend.py feat: add Docling JSON ingestion (#783) 2025-01-24 18:05:23 +01:00
asciidoc_backend.py ci: add coverage and ruff (#1383) 2025-04-14 18:01:26 +02:00
csv_backend.py ci: add coverage and ruff (#1383) 2025-04-14 18:01:26 +02:00
docling_parse_backend.py ci: add coverage and ruff (#1383) 2025-04-14 18:01:26 +02:00
docling_parse_v2_backend.py ci: add coverage and ruff (#1383) 2025-04-14 18:01:26 +02:00
docling_parse_v4_backend.py ci: add coverage and ruff (#1383) 2025-04-14 18:01:26 +02:00
html_backend.py fix(html): handle address, details, and summary tags (#1436) 2025-04-23 09:30:59 +02:00
md_backend.py chore: typo fix (#1465) 2025-04-28 08:52:09 +02:00
msexcel_backend.py ci: add coverage and ruff (#1383) 2025-04-14 18:01:26 +02:00
mspowerpoint_backend.py ci: add coverage and ruff (#1383) 2025-04-14 18:01:26 +02:00
msword_backend.py chore: typo fix (#1465) 2025-04-28 08:52:09 +02:00
pdf_backend.py ci: add coverage and ruff (#1383) 2025-04-14 18:01:26 +02:00
pypdfium2_backend.py get merged_text from boundingbox instead of merging it to prevent overlaps 2025-05-18 16:42:23 +01:00