docling/tests/data
Maxim Lysak 88c1673057
fix: MD Backend, fixes to properly handle trailing inline text and emphasis in headers (#178)
* Small fix to properly handle trailing inline text in the md backend

Signed-off-by: Maksym Lysak <mly@zurich.ibm.com>

* Added proper handling of headers with bold, italic or emphasis

Signed-off-by: Maksym Lysak <mly@zurich.ibm.com>

* Removed leftover print statement

Signed-off-by: Maksym Lysak <mly@zurich.ibm.com>

* Made header processing smarter, so headers with arbitrary styling are handled

Signed-off-by: Maksym Lysak <mly@zurich.ibm.com>

* Updated docling-core to 2.2.1

Signed-off-by: Maksym Lysak <mly@zurich.ibm.com>

* Updated tests because of the change in Markdown export in docling-core

Signed-off-by: Maksym Lysak <mly@zurich.ibm.com>

---------

Signed-off-by: Maksym Lysak <mly@zurich.ibm.com>
Co-authored-by: Maksym Lysak <mly@zurich.ibm.com>
2024-10-25 18:02:20 +02:00
| Name | Last commit | Date |
| --- | --- | --- |
| groundtruth | fix: MD Backend, fixes to properly handle trailing inline text and emphasis in headers (#178) | 2024-10-25 18:02:20 +02:00 |
| 2203.01017v2.pdf | fix: Add unit tests (#51) | 2024-08-30 14:08:20 +02:00 |
| 2206.01062.pdf | fix: Add unit tests (#51) | 2024-08-30 14:08:20 +02:00 |
| 2305.03393v1-pg9-img.png | feat!: Docling v2 (#117) | 2024-10-16 21:02:03 +02:00 |
| 2305.03393v1-pg9.pdf | fix: Add unit tests (#51) | 2024-08-30 14:08:20 +02:00 |
| 2305.03393v1.pdf | fix: Add unit tests (#51) | 2024-08-30 14:08:20 +02:00 |
| lorem_ipsum.docx | feat!: Docling v2 (#117) | 2024-10-16 21:02:03 +02:00 |
| powerpoint_sample.pptx | feat: Support AsciiDoc and Markdown input format (#168) | 2024-10-23 16:14:26 +02:00 |
| redp5110.pdf | fix: Add unit tests (#51) | 2024-08-30 14:08:20 +02:00 |
| redp5695.pdf | fix: Add unit tests (#51) | 2024-08-30 14:08:20 +02:00 |
| test_01.asciidoc | feat: Support AsciiDoc and Markdown input format (#168) | 2024-10-23 16:14:26 +02:00 |
| test_02.asciidoc | feat: Support AsciiDoc and Markdown input format (#168) | 2024-10-23 16:14:26 +02:00 |
| wiki_duck.html | feat!: Docling v2 (#117) | 2024-10-16 21:02:03 +02:00 |
| word_sample.docx | feat!: Docling v2 (#117) | 2024-10-16 21:02:03 +02:00 |