docling/tests/data
Cesar Berrospi Ramis d26dac61a8
fix(docx): ensure list items have a list parent (#1827)
Signed-off-by: Cesar Berrospi Ramis <75900930+ceberam@users.noreply.github.com>
2025-06-20 14:47:25 +02:00
..
asciidoc fix(asciidoc): set default size when missing in image directive (#1769) 2025-06-16 10:38:46 +02:00
csv feat: Add support for CSV input with new backend to transform CSV files to DoclingDocument (#945) 2025-02-14 08:55:09 +01:00
docx fix(msword_backend): Identify text in the same line after an image #1425 (#1610) 2025-06-20 10:55:30 +02:00
groundtruth fix(docx): ensure list items have a list parent (#1827) 2025-06-20 14:47:25 +02:00
html test: add missing ground truth files (#1667) 2025-05-28 13:26:49 +02:00
jats feat(xml-jats): parse XML JATS documents (#967) 2025-02-17 10:43:31 +01:00
md feat(markdown): add formatting & improve inline support (#1804) 2025-06-18 15:57:57 +02:00
pdf fix(pypdfium): resolve overlapping text when merging bounding boxes (#1549) 2025-05-19 15:26:00 +02:00
pptx fix: pptx line break and space handling (#1664) 2025-06-16 10:44:30 +02:00
uspto feat: create a backend to parse USPTO patents into DoclingDocument (#606) 2024-12-17 16:35:23 +01:00
webp feat: Make Page.parsed_page the only source of truth for text cells, add OCR cells to it (#1745) 2025-06-13 19:01:55 +02:00
xlsx feat: support xlsm files (#1520) 2025-06-10 16:55:59 +02:00
2305.03393v1-pg9-img.png feat!: Docling v2 (#117) 2024-10-16 21:02:03 +02:00