docling/docling
Maxim Lysak d0a1180478
fix: Fixes for wordx (#432)
* fixes for referencing drawing blip in wordx

Signed-off-by: Maksym Lysak <mly@zurich.ibm.com>

* Added a safety try-except around loading a Pillow image from a docx blob. Added an explicit dependency on lxml.

Signed-off-by: Maksym Lysak <mly@zurich.ibm.com>

* Added a test for a Word file with embedded EMF images, regenerated the full docx tests, and relaxed the lxml dependency

Signed-off-by: Maksym Lysak <mly@zurich.ibm.com>

* Updated lxml dependency version

Signed-off-by: Maksym Lysak <mly@zurich.ibm.com>

---------

Signed-off-by: Maksym Lysak <mly@zurich.ibm.com>
Co-authored-by: Maksym Lysak <mly@zurich.ibm.com>
2024-11-26 14:44:43 +01:00
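
The commit message above mentions a safety try-except around loading a Pillow image from a docx blob. The following is a minimal sketch of that general pattern, not the code from this commit: the helper name `load_image_or_none` is hypothetical, and the choice to return `None` on failure is an assumption made for illustration. Formats such as EMF may not be decodable by Pillow on every platform, which is the kind of case such a guard is meant to absorb.

```python
from io import BytesIO
from typing import Optional

from PIL import Image, UnidentifiedImageError


def load_image_or_none(blob: bytes) -> Optional[Image.Image]:
    """Return a decoded Pillow image, or None if the blob cannot be decoded.

    Hypothetical helper for illustration; not part of docling's API.
    """
    try:
        image = Image.open(BytesIO(blob))
        # Image.open is lazy, so force decoding here to make any
        # decoder error surface inside the try block.
        image.load()
        return image
    except (UnidentifiedImageError, OSError):
        # Unknown or unsupported format: skip the picture
        # instead of failing the whole document conversion.
        return None
```

A caller could then test the result for `None` and skip the picture, so that one undecodable image does not abort conversion of the rest of the document.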
Name | Last commit | Date
backend | fix: Fixes for wordx (#432) | 2024-11-26 14:44:43 +01:00
cli | fix: python3.9 support (#396) | 2024-11-20 15:21:40 +01:00
datamodel | feat: add support for ocrmac OCR engine on macOS (#276) | 2024-11-20 12:51:19 +01:00
models | feat: add support for ocrmac OCR engine on macOS (#276) | 2024-11-20 12:51:19 +01:00
pipeline | feat: add support for ocrmac OCR engine on macOS (#276) | 2024-11-20 12:51:19 +01:00
utils | feat: Add pipeline timings and toggle visualization, establish debug settings (#183) | 2024-10-30 15:04:19 +01:00
__init__.py | Initial commit | 2024-07-15 09:42:42 +02:00
document_converter.py | fix: python3.9 support (#396) | 2024-11-20 15:21:40 +01:00