Maksym Lysak
|
a7a1f32b10
|
Added example on how to get original predicted doctags in minimal_smol_docling
Signed-off-by: Maksym Lysak <mly@zurich.ibm.com>
|
2025-02-24 14:39:18 +01:00 |
|
Maksym Lysak
|
853544ba11
|
Addressing PR comments: added enabled property to SmolDocling and the related VLM pipeline option, plus a few other minor things
Signed-off-by: Maksym Lysak <mly@zurich.ibm.com>
|
2025-02-24 13:46:47 +01:00 |
|
Christoph Auer
|
55fa4eb4e3
|
Fix repo id
Signed-off-by: Christoph Auer <cau@zurich.ibm.com>
|
2025-02-24 13:20:05 +01:00 |
|
Christoph Auer
|
6f9f4f4aee
|
Update minimal smoldocling example
Signed-off-by: Christoph Auer <cau@zurich.ibm.com>
|
2025-02-24 13:18:25 +01:00 |
|
Maksym Lysak
|
d7abe1b1cd
|
Updated example of Smol Docling usage
Signed-off-by: Maksym Lysak <mly@zurich.ibm.com>
|
2025-02-24 13:15:19 +01:00 |
|
Maksym Lysak
|
7c4ab5c716
|
Moved artifacts_path for SmolDocling into vlm_options instead of global pipeline option
Signed-off-by: Maksym Lysak <mly@zurich.ibm.com>
|
2025-02-24 13:15:19 +01:00 |
|
Maksym Lysak
|
f2751e11f9
|
Introduced SmolDoclingOptions to configure model parameters (such as query and artifacts path) via client code, see example in minimal_smol_docling. Provisioning for other potential vlm all-in-one models.
Signed-off-by: Maksym Lysak <mly@zurich.ibm.com>
|
2025-02-24 13:15:15 +01:00 |
|
Maksym Lysak
|
0fe12d819a
|
Updated vlm pipeline assembly and smol docling model code to support updated doctags
Signed-off-by: Maksym Lysak <mly@zurich.ibm.com>
|
2025-02-24 13:12:55 +01:00 |
|
Maksym Lysak
|
9901729d8c
|
Exposed "force_backend_text" as pipeline parameter
Signed-off-by: Maksym Lysak <mly@zurich.ibm.com>
|
2025-02-24 13:12:51 +01:00 |
|
Maksym Lysak
|
0dc3ac43b1
|
Added capability for vlm_pipeline to grab text from preconfigured backend
Signed-off-by: Maksym Lysak <mly@zurich.ibm.com>
|
2025-02-24 12:56:57 +01:00 |
|
Maksym Lysak
|
e0929781f4
|
Added tokens/sec measurement, improved example
Signed-off-by: Maksym Lysak <mly@zurich.ibm.com>
|
2025-02-24 12:56:57 +01:00 |
|
Maksym Lysak
|
2a43c199d5
|
Cleaned up logs, added pages to vlm_pipeline, added basic per-page timing measurement in smol_docling models
Signed-off-by: Maksym Lysak <mly@zurich.ibm.com>
|
2025-02-24 12:56:57 +01:00 |
|
Maksym Lysak
|
61bb9dbba2
|
Properly propagating image data per page, together with predicted tags in VLM pipeline. This enables correct figure extraction and page numbers in provenances
Signed-off-by: Maksym Lysak <mly@zurich.ibm.com>
|
2025-02-24 12:56:56 +01:00 |
|
Maksym Lysak
|
01c46e24b1
|
Fix for table span compute in vlm_pipeline
Signed-off-by: Maksym Lysak <mly@zurich.ibm.com>
|
2025-02-24 12:56:56 +01:00 |
|
Maksym Lysak
|
1b968e4984
|
Fixes to preserve page image and demo export to html
Signed-off-by: Maksym Lysak <mly@zurich.ibm.com>
|
2025-02-24 12:56:56 +01:00 |
|
Maksym Lysak
|
3c4c647615
|
WIP, first working code for inference of SmolDocling, and vlm pipeline assembly code, example included.
Signed-off-by: Maksym Lysak <mly@zurich.ibm.com>
|
2025-02-24 12:56:56 +01:00 |
|
Maksym Lysak
|
03c8d45790
|
WIP: SmolDocling inference and VLM pipeline
Signed-off-by: Maksym Lysak <mly@zurich.ibm.com>
|
2025-02-24 12:56:52 +01:00 |
|
Christoph Auer
|
dc3a388aa2
|
Skeleton for SmolDocling model and VLM Pipeline
Signed-off-by: Christoph Auer <cau@zurich.ibm.com>
Signed-off-by: Maksym Lysak <mly@zurich.ibm.com>
|
2025-02-24 11:46:04 +01:00 |
|