docling/tests/data/groundtruth/docling_v2/example_03.html.md
Alexander Vaagan 733360c7b2 A new HTML backend that handles styled html (ignors it) as well as images.
- Updated unit tests
- Added documentation (Example notebook)

Note: MyPy fails.
Seems to be a known issue with BeautifulSoup:
https://github.com/python/typeshed/pull/13604

Signed-off-by: Alexander Vaagan <alexander.vaagan@gmail.com>
Signed-off-by: vaaale <2428222+vaaale@users.noreply.github.com>
2025-05-24 22:29:22 +02:00

685 B

Example Document

Introduction

This is the first paragraph of the introduction.

Background

Some background information here.

  • First item in unordered list Nested item 1 Nested item 2
    • Nested item 1
    • Nested item 2
  • Second item in unordered list
  1. First item in ordered list Nested ordered item 1 Nested ordered item 2
    1. Nested ordered item 1
    2. Nested ordered item 2
  2. Second item in ordered list

Data Table

Header 1 Header 2 Header 3
Row 1, Col 1 Row 1, Col 2 Row 1, Col 3
Row 2, Col 1 Row 2, Col 2 Row 2, Col 3
Row 3, Col 1 Row 3, Col 2 Row 3, Col 3