docling/tests
Maxim Lysak 88c1673057
fix: MD Backend, fixes to properly handle trailing inline text and emphasis in headers (#178)
* Small fix to properly handle trailing inline text in the md backend

Signed-off-by: Maksym Lysak <mly@zurich.ibm.com>

* Added proper handling of headers with bold, italic or emphasis

Signed-off-by: Maksym Lysak <mly@zurich.ibm.com>

* removed print

Signed-off-by: Maksym Lysak <mly@zurich.ibm.com>

* Made smarter processing of headers, with arbitrary styling

Signed-off-by: Maksym Lysak <mly@zurich.ibm.com>

* Updated docling-core to 2.2.1

Signed-off-by: Maksym Lysak <mly@zurich.ibm.com>

* Updated tests because of the change in Markdown export in docling-core

Signed-off-by: Maksym Lysak <mly@zurich.ibm.com>

---------

Signed-off-by: Maksym Lysak <mly@zurich.ibm.com>
Co-authored-by: Maksym Lysak <mly@zurich.ibm.com>
2024-10-25 18:02:20 +02:00
..
data fix: MD Backend, fixes to properly handle trailing inline text and emphasis in headers (#178) 2024-10-25 18:02:20 +02:00
data_scanned feat!: Docling v2 (#117) 2024-10-16 21:02:03 +02:00
__init__.py fix: Add unit tests (#51) 2024-08-30 14:08:20 +02:00
test_backend_asciidoc.py feat: Support AsciiDoc and Markdown input format (#168) 2024-10-23 16:14:26 +02:00
test_backend_docling_parse_v2.py feat!: Docling v2 (#117) 2024-10-16 21:02:03 +02:00
test_backend_docling_parse.py feat!: Docling v2 (#117) 2024-10-16 21:02:03 +02:00
test_backend_pdfium.py feat!: Docling v2 (#117) 2024-10-16 21:02:03 +02:00
test_cli.py feat: add docling cli (#75) 2024-09-13 14:03:09 +02:00
test_e2e_conversion.py feat!: Docling v2 (#117) 2024-10-16 21:02:03 +02:00
test_e2e_ocr_conversion.py feat!: Docling v2 (#117) 2024-10-16 21:02:03 +02:00
test_input_doc.py fix: set valid=false for invalid backends (#171) 2024-10-23 15:52:30 +02:00
test_interfaces.py feat!: Docling v2 (#117) 2024-10-16 21:02:03 +02:00
test_legacy_format_transform.py feat!: Docling v2 (#117) 2024-10-16 21:02:03 +02:00
test_options.py feat: add coverage_threshold to skip OCR for small images (#161) 2024-10-18 13:58:23 +02:00
verify_utils.py fix: fix legacy doc ref (#162) 2024-10-18 13:11:20 +02:00