mirror of
https://github.com/DS4SD/docling.git
synced 2025-12-11 06:08:09 +00:00
fix: updated the render_as_doctags with the new arguments from docling-core (#93)
* updated the render_as_doctags with the new arguments from docling-core Signed-off-by: Peter Staar <taa@zurich.ibm.com> * ensuring that docling-core is >1.5.0 to accomodate with the latest export-to-doctags parameters Signed-off-by: Peter Staar <taa@zurich.ibm.com> * added the doctags tests Signed-off-by: Peter Staar <taa@zurich.ibm.com> * updated the README Signed-off-by: Peter Staar <taa@zurich.ibm.com> * fix poetry lock Signed-off-by: Michele Dolfi <dol@zurich.ibm.com> * Fix formatting problems Signed-off-by: Christoph Auer <cau@zurich.ibm.com> * fixed the doctag export in docling/utils/export.py Signed-off-by: Peter Staar <taa@zurich.ibm.com> * propagate xsize and ysize Signed-off-by: Michele Dolfi <dol@zurich.ibm.com> --------- Signed-off-by: Peter Staar <taa@zurich.ibm.com> Signed-off-by: Michele Dolfi <dol@zurich.ibm.com> Signed-off-by: Christoph Auer <cau@zurich.ibm.com> Co-authored-by: Michele Dolfi <dol@zurich.ibm.com> Co-authored-by: Christoph Auer <cau@zurich.ibm.com>
This commit is contained in:
committed by
GitHub
parent
dce9934a0f
commit
4794ce460a
@@ -111,7 +111,7 @@ def generate_multimodal_pages(
|
||||
)
|
||||
# No page-tagging since we only do 1 page at the time
|
||||
content_dt = doc.export_to_document_tokens(
|
||||
main_text_start=start_ix, main_text_stop=end_ix, page_tagging=False
|
||||
main_text_start=start_ix, main_text_stop=end_ix, add_page_index=False
|
||||
)
|
||||
|
||||
return content_text, content_md, content_dt, page_cells, page_segments, page
|
||||
|
||||
Reference in New Issue
Block a user