docling/tests
Copilot 8d50a59d48
fix: multi-page image support (tiff) (#1928)
* Initial plan

* Fix multi-page TIFF image support

Co-authored-by: cau-git <60343111+cau-git@users.noreply.github.com>

* add RGB conversion

Signed-off-by: Christoph Auer <cau@zurich.ibm.com>

* Remove pointless test

Signed-off-by: Christoph Auer <cau@zurich.ibm.com>

* Add multi-page TIFF test data and verification tests

Co-authored-by: cau-git <60343111+cau-git@users.noreply.github.com>

* Revert "Add multi-page TIFF test data and verification tests"

This reverts commit 130a10e2d9.

* Proper test for 2 page tiff file

Signed-off-by: Christoph Auer <cau@zurich.ibm.com>

* DCO Remediation Commit for copilot-swe-agent[bot] <198982749+Copilot@users.noreply.github.com>

I, copilot-swe-agent[bot] <198982749+Copilot@users.noreply.github.com>, hereby add my Signed-off-by to this commit: 420df478f3
I, copilot-swe-agent[bot] <198982749+Copilot@users.noreply.github.com>, hereby add my Signed-off-by to this commit: c1d722725f
I, Christoph Auer <cau@zurich.ibm.com>, hereby add my Signed-off-by to this commit: 6aa85cc933
I, copilot-swe-agent[bot] <198982749+Copilot@users.noreply.github.com>, hereby add my Signed-off-by to this commit: 130a10e2d9
I, Christoph Auer <cau@zurich.ibm.com>, hereby add my Signed-off-by to this commit: d571f36299
I, Christoph Auer <cau@zurich.ibm.com>, hereby add my Signed-off-by to this commit: 2aab66288b

Signed-off-by: Christoph Auer <cau@zurich.ibm.com>

* Proper test for 2 page tiff file (2)

Signed-off-by: Christoph Auer <cau@zurich.ibm.com>

---------

Signed-off-by: Christoph Auer <cau@zurich.ibm.com>
Co-authored-by: copilot-swe-agent[bot] <198982749+Copilot@users.noreply.github.com>
Co-authored-by: cau-git <60343111+cau-git@users.noreply.github.com>
Co-authored-by: Christoph Auer <cau@zurich.ibm.com>
2025-07-23 09:55:40 +02:00
..
data fix: multi-page image support (tiff) (#1928) 2025-07-23 09:55:40 +02:00
data_scanned test: Update tests to use default PDF backend (DPv4) (#1923) 2025-07-10 15:16:56 +02:00
__init__.py fix: Add unit tests (#51) 2024-08-30 14:08:20 +02:00
test_asr_pipeline.py feat: Support audio input (#1763) 2025-06-23 14:47:26 +02:00
test_backend_asciidoc.py fix(asciidoc): set default size when missing in image directive (#1769) 2025-06-16 10:38:46 +02:00
test_backend_csv.py chore: fix or ignore runtime and deprecation warnings (#1660) 2025-05-28 17:55:31 +02:00
test_backend_docling_json.py feat: add Docling JSON ingestion (#783) 2025-01-24 18:05:23 +01:00
test_backend_docling_parse_v2.py ci: add coverage and ruff (#1383) 2025-04-14 18:01:26 +02:00
test_backend_docling_parse_v4.py chore: Safer unloading of DPv4 backend (#1867) 2025-06-30 14:41:21 +02:00
test_backend_docling_parse.py ci: add coverage and ruff (#1383) 2025-04-14 18:01:26 +02:00
test_backend_html.py refactor(HTML): handle text from styled html (#1960) 2025-07-22 13:16:31 +02:00
test_backend_jats.py fix: fix HTML table parser and JATS backend bugs (#1948) 2025-07-16 10:49:24 +02:00
test_backend_markdown.py feat: enable precision control in float serialization (#1914) 2025-07-09 16:39:17 +02:00
test_backend_msexcel.py feat: support xlsm files (#1520) 2025-06-10 16:55:59 +02:00
test_backend_msword.py fix(docx): ensure list items have a list parent (#1827) 2025-06-20 14:47:25 +02:00
test_backend_patent_uspto.py feat: enable precision control in float serialization (#1914) 2025-07-09 16:39:17 +02:00
test_backend_pdfium.py fix(pypdfium): resolve overlapping text when merging bounding boxes (#1549) 2025-05-19 15:26:00 +02:00
test_backend_pptx.py feat: leverage new list modeling, capture default markers (#1856) 2025-06-27 16:37:15 +02:00
test_backend_webp.py feat: support image/webp file type (#1415) 2025-05-14 09:47:28 +02:00
test_cli.py fix: Test cases for RTL programmatic PDFs and fixes for the formula model (#903) 2025-02-07 08:43:31 +01:00
test_code_formula.py fix: formula conversion with page_range param set (#1791) 2025-06-17 13:58:45 +02:00
test_data_gen_flag.py fix(markdown): handle nested lists (#910) 2025-02-07 12:55:12 +01:00
test_document_picture_classifier.py fix: use only backend for picture classifier (#1904) 2025-07-07 16:23:16 +02:00
test_e2e_conversion.py test: Update tests to use default PDF backend (DPv4) (#1923) 2025-07-10 15:16:56 +02:00
test_e2e_ocr_conversion.py test: Update tests to use default PDF backend (DPv4) (#1923) 2025-07-10 15:16:56 +02:00
test_input_doc.py fix: multi-page image support (tiff) (#1928) 2025-07-23 09:55:40 +02:00
test_interfaces.py test: Update tests to use default PDF backend (DPv4) (#1923) 2025-07-10 15:16:56 +02:00
test_invalid_input.py ci: add coverage and ruff (#1383) 2025-04-14 18:01:26 +02:00
test_legacy_format_transform.py feat: enable precision control in float serialization (#1914) 2025-07-09 16:39:17 +02:00
test_ocr_utils.py fix(ocr-utils): unit test and fix the rotate_bounding_box function (#1897) 2025-07-08 18:03:29 +02:00
test_options.py feat: new vlm-models support (#1570) 2025-06-02 17:01:06 +02:00
test_settings_load.py fix(settings): fix nested settings load via environment variables (#1551) 2025-05-14 13:42:10 +02:00
verify_utils.py feat: enable precision control in float serialization (#1914) 2025-07-09 16:39:17 +02:00