docling/docling
Christoph Auer 4200fb5632 fix: Improve OCR results, stricten criteria before dropping bitmap areas (#719)
fix: Properly care for all bitmap elements in OCR

Signed-off-by: Christoph Auer <cau@zurich.ibm.com>
Signed-off-by: Adam Kliment <adam@netmilk.net>
2025-03-13 10:26:27 +01:00
..
backend fix: Improve OCR results, stricten criteria before dropping bitmap areas (#719) 2025-03-13 10:26:27 +01:00
chunking feat: expose new hybrid chunker, update docs (#384) 2024-12-09 08:28:29 +01:00
cli feat: added http header support for document converter and cli (#642) 2025-01-07 10:15:14 +01:00
datamodel fix: Improve OCR results, stricten criteria before dropping bitmap areas (#719) 2025-03-13 10:26:27 +01:00
models fix: Improve OCR results, stricten criteria before dropping bitmap areas (#719) 2025-03-13 10:26:27 +01:00
pipeline feat: Updated Layout processing with forms and key-value areas (#530) 2024-12-17 17:32:24 +01:00
utils feat: Updated Layout processing with forms and key-value areas (#530) 2024-12-17 17:32:24 +01:00
__init__.py Initial commit 2024-07-15 09:42:42 +02:00
document_converter.py feat: added http header support for document converter and cli (#642) 2025-01-07 10:15:14 +01:00
exceptions.py fix: improve handling of disallowed formats (#429) 2024-12-03 12:45:32 +01:00
py.typed fix: Add py.typed marker file (#531) 2024-12-06 13:42:14 +01:00