Christoph Auer
e00f362405
Update tests, use TextCell.from_ocr property
...
Run Docs CI / build-docs (push) Failing after 1m26s
Run CI / code-checks (push) Failing after 6m37s
Signed-off-by: Christoph Auer <cau@zurich.ibm.com>
2025-03-13 16:04:08 +01:00
Christoph Auer
f1cce8ff07
Ground-truth files updated
...
Signed-off-by: Christoph Auer <cau@zurich.ibm.com>
2025-03-12 19:57:18 +01:00
Christoph Auer
15282547cb
update test cases
...
Signed-off-by: Christoph Auer <cau@zurich.ibm.com>
2025-03-12 11:04:48 +01:00
Christoph Auer
1b9fcf0edf
Fix streams
...
Signed-off-by: Christoph Auer <cau@zurich.ibm.com>
2025-03-11 16:31:42 +01:00
Christoph Auer
c56ab3a66b
fix: Proper handling of orphan IDs in layout postprocessing ( #1118 )
...
* Fix the handling of orphan IDs in layout postprocessing
Signed-off-by: Christoph Auer <cau@zurich.ibm.com>
* Update test cases
Signed-off-by: Christoph Auer <cau@zurich.ibm.com>
---------
Signed-off-by: Christoph Auer <cau@zurich.ibm.com>
2025-03-05 14:30:59 +01:00
Christoph Auer
c93e36988f
feat: Implement new reading-order model ( #916 )
...
* Implement new reading-order model, replacing DS GLM model (WIP)
Signed-off-by: Christoph Auer <cau@zurich.ibm.com>
* Update reading-order model branch
Signed-off-by: Christoph Auer <cau@zurich.ibm.com>
* Update lockfile [skip ci]
Signed-off-by: Christoph Auer <cau@zurich.ibm.com>
* Add captions, footnotes and merges [skip ci]
Signed-off-by: Christoph Auer <cau@zurich.ibm.com>
* Updates for reading-order implementation
Signed-off-by: Christoph Auer <cau@zurich.ibm.com>
* Updates for reading-order implementation
Signed-off-by: Christoph Auer <cau@zurich.ibm.com>
* Update tests and lockfile
Signed-off-by: Christoph Auer <cau@zurich.ibm.com>
* Fixes, update tests
Signed-off-by: Christoph Auer <cau@zurich.ibm.com>
* Add normalization, update tests again
Signed-off-by: Christoph Auer <cau@zurich.ibm.com>
* Update tests with code
Signed-off-by: Christoph Auer <cau@zurich.ibm.com>
* Push final lockfile
Signed-off-by: Christoph Auer <cau@zurich.ibm.com>
* sanitize text
Signed-off-by: Michele Dolfi <dol@zurich.ibm.com>
* Inlcude furniture, Update tests with furniture
Signed-off-by: Christoph Auer <cau@zurich.ibm.com>
* Fix content_layer assignment
Signed-off-by: Christoph Auer <cau@zurich.ibm.com>
* chore: Delete empty file docling/models/ds_glm_model.py
Signed-off-by: Nikos Livathinos <nli@zurich.ibm.com>
---------
Signed-off-by: Christoph Auer <cau@zurich.ibm.com>
Signed-off-by: Michele Dolfi <dol@zurich.ibm.com>
Signed-off-by: Nikos Livathinos <nli@zurich.ibm.com>
Co-authored-by: Michele Dolfi <dol@zurich.ibm.com>
Co-authored-by: Nikos Livathinos <nli@zurich.ibm.com>
2025-02-20 17:51:17 +01:00
Michele Dolfi
d01a2e73ee
test: update results with new docling-core ( #839 )
...
* test: update results with new docling-core
Signed-off-by: Michele Dolfi <dol@zurich.ibm.com>
* fix table output in 2203.01017v2.md
Signed-off-by: Michele Dolfi <dol@zurich.ibm.com>
---------
Signed-off-by: Michele Dolfi <dol@zurich.ibm.com>
2025-01-30 14:07:52 +01:00
Christoph Auer
f9144f2bb6
docs: Add example for inspection of picture content ( #624 )
...
* chore: Add example for inspection of picture content
Signed-off-by: Christoph Auer <cau@zurich.ibm.com>
* fix: Test case re-generation
Signed-off-by: Christoph Auer <cau@zurich.ibm.com>
* fix: Test case re-generation only on CPU
Signed-off-by: Christoph Auer <cau@zurich.ibm.com>
* fix: Add missing GT files
Signed-off-by: Christoph Auer <cau@zurich.ibm.com>
---------
Signed-off-by: Christoph Auer <cau@zurich.ibm.com>
2025-01-29 10:39:00 +01:00