docling/tests/data
Christoph Auer c0447206af Merge from main
Signed-off-by: Christoph Auer <cau@zurich.ibm.com>
2024-10-08 14:42:33 +02:00
..
2203.01017v2.doctags.txt feat: new torch-based docling models (#120) 2024-10-03 18:42:33 +02:00
2203.01017v2.json feat: new torch-based docling models (#120) 2024-10-03 18:42:33 +02:00
2203.01017v2.md feat: new torch-based docling models (#120) 2024-10-03 18:42:33 +02:00
2203.01017v2.pages.json feat: new torch-based docling models (#120) 2024-10-03 18:42:33 +02:00
2203.01017v2.pdf fix: Add unit tests (#51) 2024-08-30 14:08:20 +02:00
2206.01062.doctags.txt feat: new torch-based docling models (#120) 2024-10-03 18:42:33 +02:00
2206.01062.json feat: new torch-based docling models (#120) 2024-10-03 18:42:33 +02:00
2206.01062.md feat: new torch-based docling models (#120) 2024-10-03 18:42:33 +02:00
2206.01062.pages.json feat: new torch-based docling models (#120) 2024-10-03 18:42:33 +02:00
2206.01062.pdf fix: Add unit tests (#51) 2024-08-30 14:08:20 +02:00
2305.03393v1-pg9.doctags.txt feat: new torch-based docling models (#120) 2024-10-03 18:42:33 +02:00
2305.03393v1-pg9.json feat: new torch-based docling models (#120) 2024-10-03 18:42:33 +02:00
2305.03393v1-pg9.md feat: new torch-based docling models (#120) 2024-10-03 18:42:33 +02:00
2305.03393v1-pg9.pages.json feat: new torch-based docling models (#120) 2024-10-03 18:42:33 +02:00
2305.03393v1-pg9.pdf fix: Add unit tests (#51) 2024-08-30 14:08:20 +02:00
2305.03393v1.doctags.txt feat: new torch-based docling models (#120) 2024-10-03 18:42:33 +02:00
2305.03393v1.json feat: new torch-based docling models (#120) 2024-10-03 18:42:33 +02:00
2305.03393v1.md feat: new torch-based docling models (#120) 2024-10-03 18:42:33 +02:00
2305.03393v1.pages.json feat: new torch-based docling models (#120) 2024-10-03 18:42:33 +02:00
2305.03393v1.pdf fix: Add unit tests (#51) 2024-08-30 14:08:20 +02:00
lorem_ipsum.docx Working on a first version of DOCX native backend 2024-10-04 18:19:40 +02:00
powerpoint_sample.pptx Fundamental refactoring for multi-format support 2024-10-01 16:54:09 +02:00
redp5110.doctags.txt feat: new torch-based docling models (#120) 2024-10-03 18:42:33 +02:00
redp5110.json feat: new torch-based docling models (#120) 2024-10-03 18:42:33 +02:00
redp5110.md feat: new torch-based docling models (#120) 2024-10-03 18:42:33 +02:00
redp5110.pages.json feat: new torch-based docling models (#120) 2024-10-03 18:42:33 +02:00
redp5110.pdf fix: Add unit tests (#51) 2024-08-30 14:08:20 +02:00
redp5695.doctags.txt feat: new torch-based docling models (#120) 2024-10-03 18:42:33 +02:00
redp5695.json feat: new torch-based docling models (#120) 2024-10-03 18:42:33 +02:00
redp5695.md feat: new torch-based docling models (#120) 2024-10-03 18:42:33 +02:00
redp5695.pages.json feat: new torch-based docling models (#120) 2024-10-03 18:42:33 +02:00
redp5695.pdf fix: Add unit tests (#51) 2024-08-30 14:08:20 +02:00
wiki_duck.html Fundamental refactoring for multi-format support 2024-10-01 16:54:09 +02:00
word_sample.docx Improved docx parsing 2024-10-07 13:00:50 +02:00