docling/tests/data/groundtruth/docling_v2
Maxim Lysak 88c1673057
fix: MD Backend, fixes to properly handle trailing inline text and emphasis in headers (#178)
* Small fix to properly handle trailing inline text in the md backend

Signed-off-by: Maksym Lysak <mly@zurich.ibm.com>

* Added proper handling of headers with bold, italic or emphasis

Signed-off-by: Maksym Lysak <mly@zurich.ibm.com>

* Removed print

Signed-off-by: Maksym Lysak <mly@zurich.ibm.com>

* Made header processing smarter, handling arbitrary styling

Signed-off-by: Maksym Lysak <mly@zurich.ibm.com>

* Updated docling-core to 2.2.1

Signed-off-by: Maksym Lysak <mly@zurich.ibm.com>

* Updated tests because of the change in Markdown export in docling-core

Signed-off-by: Maksym Lysak <mly@zurich.ibm.com>

---------

Signed-off-by: Maksym Lysak <mly@zurich.ibm.com>
Co-authored-by: Maksym Lysak <mly@zurich.ibm.com>
2024-10-25 18:02:20 +02:00
2203.01017v2.doctags.txt feat!: Docling v2 (#117) 2024-10-16 21:02:03 +02:00
2203.01017v2.json feat: Support AsciiDoc and Markdown input format (#168) 2024-10-23 16:14:26 +02:00
2203.01017v2.md fix: MD Backend, fixes to properly handle trailing inline text and emphasis in headers (#178) 2024-10-25 18:02:20 +02:00
2203.01017v2.pages.json feat!: Docling v2 (#117) 2024-10-16 21:02:03 +02:00
2206.01062.doctags.txt feat!: Docling v2 (#117) 2024-10-16 21:02:03 +02:00
2206.01062.json feat: Support AsciiDoc and Markdown input format (#168) 2024-10-23 16:14:26 +02:00
2206.01062.md feat: Support AsciiDoc and Markdown input format (#168) 2024-10-23 16:14:26 +02:00
2206.01062.pages.json feat!: Docling v2 (#117) 2024-10-16 21:02:03 +02:00
2305.03393v1-pg9.doctags.txt feat!: Docling v2 (#117) 2024-10-16 21:02:03 +02:00
2305.03393v1-pg9.json feat: Support AsciiDoc and Markdown input format (#168) 2024-10-23 16:14:26 +02:00
2305.03393v1-pg9.md feat: Support AsciiDoc and Markdown input format (#168) 2024-10-23 16:14:26 +02:00
2305.03393v1-pg9.pages.json feat!: Docling v2 (#117) 2024-10-16 21:02:03 +02:00
2305.03393v1.doctags.txt feat!: Docling v2 (#117) 2024-10-16 21:02:03 +02:00
2305.03393v1.json feat: Support AsciiDoc and Markdown input format (#168) 2024-10-23 16:14:26 +02:00
2305.03393v1.md fix: MD Backend, fixes to properly handle trailing inline text and emphasis in headers (#178) 2024-10-25 18:02:20 +02:00
2305.03393v1.pages.json feat!: Docling v2 (#117) 2024-10-16 21:02:03 +02:00
redp5110.doctags.txt feat!: Docling v2 (#117) 2024-10-16 21:02:03 +02:00
redp5110.json feat: Support AsciiDoc and Markdown input format (#168) 2024-10-23 16:14:26 +02:00
redp5110.md fix: MD Backend, fixes to properly handle trailing inline text and emphasis in headers (#178) 2024-10-25 18:02:20 +02:00
redp5110.pages.json feat!: Docling v2 (#117) 2024-10-16 21:02:03 +02:00
redp5695.doctags.txt feat!: Docling v2 (#117) 2024-10-16 21:02:03 +02:00
redp5695.json feat: Support AsciiDoc and Markdown input format (#168) 2024-10-23 16:14:26 +02:00
redp5695.md feat: Support AsciiDoc and Markdown input format (#168) 2024-10-23 16:14:26 +02:00
redp5695.pages.json feat!: Docling v2 (#117) 2024-10-16 21:02:03 +02:00
test_01.asciidoc.md feat: Support AsciiDoc and Markdown input format (#168) 2024-10-23 16:14:26 +02:00
test_02.asciidoc.md feat: Support AsciiDoc and Markdown input format (#168) 2024-10-23 16:14:26 +02:00