feat: Extracting picture data for raster images found in PPTX (#349)

* Added picture data for pptx pictures

Signed-off-by: Maksym Lysak <mly@zurich.ibm.com>

* Added tests for pptx

Signed-off-by: Maksym Lysak <mly@zurich.ibm.com>

* Inferring image DPI from pptx file

Signed-off-by: Maksym Lysak <mly@zurich.ibm.com>

---------

Signed-off-by: Maksym Lysak <mly@zurich.ibm.com>
Co-authored-by: Maksym Lysak <mly@zurich.ibm.com>
Maxim Lysak
2024-11-18 15:22:28 +01:00
committed by GitHub
parent 7dbdbdeaf3
commit 7a97d7119f
9 changed files with 2467 additions and 1 deletions

@@ -0,0 +1,5 @@
item-0 at level 0: unspecified: group _root_
item-1 at level 1: chapter: group slide-0
item-2 at level 2: title: Docling
item-3 at level 2: paragraph: Image test
item-4 at level 2: picture