mirror of
https://github.com/DS4SD/docling.git
synced 2025-07-25 19:44:34 +00:00
* add real e2e tests for html and docx Signed-off-by: Peter Staar <taa@zurich.ibm.com> * updated the output of itxt Signed-off-by: Peter Staar <taa@zurich.ibm.com> * reformatted the text Signed-off-by: Peter Staar <taa@zurich.ibm.com> * fixed the tests Signed-off-by: Peter Staar <taa@zurich.ibm.com> * fixed the tests (2) Signed-off-by: Peter Staar <taa@zurich.ibm.com> * fixed the examples (1) Signed-off-by: Peter Staar <taa@zurich.ibm.com> * fixed the output of the test Signed-off-by: Peter Staar <taa@zurich.ibm.com> * updated the tests, moved the ground-truth Signed-off-by: Peter Staar <taa@zurich.ibm.com> * moved the ground-truth data Signed-off-by: Peter Staar <taa@zurich.ibm.com> * fixed the html tests Signed-off-by: Peter Staar <taa@zurich.ibm.com> * restructure title fix (#187) Signed-off-by: Panos Vagenas <35837085+vagenas@users.noreply.github.com> --------- Signed-off-by: Peter Staar <taa@zurich.ibm.com> Signed-off-by: Panos Vagenas <35837085+vagenas@users.noreply.github.com> Co-authored-by: Panos Vagenas <35837085+vagenas@users.noreply.github.com>
11 lines
655 B
Plaintext
11 lines
655 B
Plaintext
item-0 at level 0: unspecified: group _root_
|
|
item-1 at level 1: title: Introduction
|
|
item-2 at level 2: paragraph: This is the first paragraph of the introduction.
|
|
item-3 at level 2: section_header: Background
|
|
item-4 at level 3: paragraph: Some background information here.
|
|
item-5 at level 3: list: group list
|
|
item-6 at level 4: list_item: First item in unordered list
|
|
item-7 at level 4: list_item: Second item in unordered list
|
|
item-8 at level 3: ordered_list: group ordered list
|
|
item-9 at level 4: list_item: First item in ordered list
|
|
item-10 at level 4: list_item: Second item in ordered list |