Mirror of https://github.com/DS4SD/docling.git (synced 2025-07-27 20:44:16 +00:00)
* Fix for docx when headers are also lists, now recorded as appropriate headers and subheaders, unit test included
* Update docling/backend/msword_backend.py
* Update docling/backend/msword_backend.py

Signed-off-by: Maksym Lysak <mly@zurich.ibm.com>
Signed-off-by: Maxim Lysak <101627549+maxmnemonic@users.noreply.github.com>
Co-authored-by: Maksym Lysak <mly@zurich.ibm.com>
Co-authored-by: Cesar Berrospi Ramis <75900930+ceberam@users.noreply.github.com>

||
---|---|---|
docx | ||
groundtruth | ||
html | ||
md | ||
pptx | ||
pubmed | ||
uspto | ||
xlsx | ||
2203.01017v2.pdf | ||
2206.01062.pdf | ||
2305.03393v1-pg9-img.png | ||
2305.03393v1-pg9.pdf | ||
2305.03393v1.pdf | ||
amt_handbook_sample.pdf | ||
code_and_formula.pdf | ||
picture_classification.pdf | ||
redp5110_sampled.pdf | ||
test_01.asciidoc | ||
test_02.asciidoc |