docling/tests/data
Matteo-Omenetti fae4d10479 updated poetry to ref correct branch in docling-core. Updated tests gt
Signed-off-by: Matteo-Omenetti <Matteo.Omenetti1@ibm.com>
2025-02-12 17:04:26 +01:00
..
asciidoc fix: Test cases for RTL programmatic PDFs and fixes for the formula model (#903) 2025-02-07 08:43:31 +01:00
docx fix(docx): merged table cells not properly converted (#857) 2025-02-03 10:20:03 +01:00
groundtruth updated poetry to ref correct branch in docling-core. Updated tests gt 2025-02-12 17:04:26 +01:00
html fix: parse html with omitted body tag (#818) 2025-01-27 16:59:00 +01:00
md fix(markdown): handle nested lists (#910) 2025-02-07 12:55:12 +01:00
pdf fix: Test cases for RTL programmatic PDFs and fixes for the formula model (#903) 2025-02-07 08:43:31 +01:00
pptx feat: Extracting picture data for raster images found in PPTX (#349) 2024-11-18 15:22:28 +01:00
pubmed feat: Create a backend to transform PubMed XML files to DoclingDocument (#557) 2024-12-17 19:27:09 +01:00
uspto feat: create a backend to parse USPTO patents into DoclingDocument (#606) 2024-12-17 16:35:23 +01:00
xlsx fix: added extraction of byte-images in excel (#804) 2025-01-24 18:48:02 +01:00
2305.03393v1-pg9-img.png feat!: Docling v2 (#117) 2024-10-16 21:02:03 +02:00