Commit Graph

18 Commits

Author SHA1 Message Date
Maksym Lysak
a7a1f32b10 Added example on how to get original predicted doctags in minimal_smol_docling
Signed-off-by: Maksym Lysak <mly@zurich.ibm.com>
2025-02-24 14:39:18 +01:00
Maksym Lysak
853544ba11 Addressing PR comments, added enabled property to SmolDocling, and related VLM pipeline option, few other minor things
Signed-off-by: Maksym Lysak <mly@zurich.ibm.com>
2025-02-24 13:46:47 +01:00
Christoph Auer
55fa4eb4e3 Fix repo id
Signed-off-by: Christoph Auer <cau@zurich.ibm.com>
2025-02-24 13:20:05 +01:00
Christoph Auer
6f9f4f4aee Update minimal smoldocling example
Signed-off-by: Christoph Auer <cau@zurich.ibm.com>
2025-02-24 13:18:25 +01:00
Maksym Lysak
d7abe1b1cd Updated example of Smol Docling usage
Signed-off-by: Maksym Lysak <mly@zurich.ibm.com>
2025-02-24 13:15:19 +01:00
Maksym Lysak
7c4ab5c716 Moved artifacts_path for SmolDocling into vlm_options instead of global pipeline option
Signed-off-by: Maksym Lysak <mly@zurich.ibm.com>
2025-02-24 13:15:19 +01:00
Maksym Lysak
f2751e11f9 Introduced SmolDoclingOptions to configure model parameters (such as query and artifacts path) via client code, see example in minimal_smol_docling. Provisioning for other potential vlm all-in-one models.
Signed-off-by: Maksym Lysak <mly@zurich.ibm.com>
2025-02-24 13:15:15 +01:00
Maksym Lysak
0fe12d819a Updated vlm pipeline assembly and smol docling model code to support updated doctags
Signed-off-by: Maksym Lysak <mly@zurich.ibm.com>
2025-02-24 13:12:55 +01:00
Maksym Lysak
9901729d8c Exposed "force_backend_text" as pipeline parameter
Signed-off-by: Maksym Lysak <mly@zurich.ibm.com>
2025-02-24 13:12:51 +01:00
Maksym Lysak
0dc3ac43b1 Added capability for vlm_pipeline to grab text from preconfigured backend
Signed-off-by: Maksym Lysak <mly@zurich.ibm.com>
2025-02-24 12:56:57 +01:00
Maksym Lysak
e0929781f4 Added tokens/sec measurement, improved example
Signed-off-by: Maksym Lysak <mly@zurich.ibm.com>
2025-02-24 12:56:57 +01:00
Maksym Lysak
2a43c199d5 Cleaned up logs, added pages to vlm_pipeline, basic timing per page measurement in smol_docling models
Signed-off-by: Maksym Lysak <mly@zurich.ibm.com>
2025-02-24 12:56:57 +01:00
Maksym Lysak
61bb9dbba2 Properly propagating image data per page, together with predicted tags in VLM pipeline. This enables correct figure extraction and page numbers in provenances
Signed-off-by: Maksym Lysak <mly@zurich.ibm.com>
2025-02-24 12:56:56 +01:00
Maksym Lysak
01c46e24b1 Fix for table span compute in vlm_pipeline
Signed-off-by: Maksym Lysak <mly@zurich.ibm.com>
2025-02-24 12:56:56 +01:00
Maksym Lysak
1b968e4984 Fixes to preserve page image and demo export to html
Signed-off-by: Maksym Lysak <mly@zurich.ibm.com>
2025-02-24 12:56:56 +01:00
Maksym Lysak
3c4c647615 WIP, first working code for inference of SmolDocling, and vlm pipeline assembly code, example included.
Signed-off-by: Maksym Lysak <mly@zurich.ibm.com>
2025-02-24 12:56:56 +01:00
Maksym Lysak
03c8d45790 wip smolDocling inference and vlm pipeline
Signed-off-by: Maksym Lysak <mly@zurich.ibm.com>
2025-02-24 12:56:52 +01:00
Christoph Auer
dc3a388aa2 Skeleton for SmolDocling model and VLM Pipeline
Signed-off-by: Christoph Auer <cau@zurich.ibm.com>
Signed-off-by: Maksym Lysak <mly@zurich.ibm.com>
2025-02-24 11:46:04 +01:00