mirror of
https://github.com/DS4SD/docling.git
synced 2025-12-08 20:58:11 +00:00
fix: Fixes for wordx (#432)
* fixes for referencing drawing blip in wordx

Signed-off-by: Maksym Lysak <mly@zurich.ibm.com>

* Added safety try-except when trying to load pillow image from a docx blob. Added explicit dependency on lxml.

Signed-off-by: Maksym Lysak <mly@zurich.ibm.com>

* Added test for word file with embedded emf images, re-generated full tests for docx, eased up dependency on lxml

Signed-off-by: Maksym Lysak <mly@zurich.ibm.com>

* Updated lxml dependency version

Signed-off-by: Maksym Lysak <mly@zurich.ibm.com>

---------

Signed-off-by: Maksym Lysak <mly@zurich.ibm.com>
Co-authored-by: Maksym Lysak <mly@zurich.ibm.com>
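The "safety try-except" item could look roughly like the sketch below. This is not the code from the commit: `load_image_or_none` is a hypothetical helper name, and the choice to return `None` on failure is an assumption made for illustration. It relies only on Pillow's `Image.open` and `Image.load`, and on the fact that Pillow's `UnidentifiedImageError` is a subclass of `OSError`.

```python
from io import BytesIO

try:
    from PIL import Image  # Pillow is a third-party dependency
except ImportError:
    Image = None  # without Pillow this sketch always returns None


def load_image_or_none(blob: bytes):
    """Hypothetical helper: decode an embedded image blob, or return None.

    Formats such as EMF may not be decodable by Pillow on every platform,
    so a failure here is treated as "no image" instead of aborting the
    whole document conversion.
    """
    if Image is None:
        return None
    try:
        img = Image.open(BytesIO(blob))
        img.load()  # force decoding now so errors surface inside the try
        return img
    except (OSError, ValueError):
        # UnidentifiedImageError is a subclass of OSError
        return None
```

A caller would then emit an image placeholder without pixel data when the helper returns `None`, which matches the `<!-- image -->` markers in the groundtruth file added by this commit.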
13
tests/data/groundtruth/docling_v2/test_emf_docx.docx.md
Normal file
@@ -0,0 +1,13 @@
Test with three images in unusual formats

Raster in emf:

<!-- image -->

Vector in emf:

<!-- image -->

Raster in webp:

<!-- image -->