docling/tests/data/groundtruth/docling_v2/nested.md.md
Alexander Vaagan 733360c7b2 A new HTML backend that handles styled html (ignors it) as well as images.
- Updated unit tests
- Added documentation (Example notebook)

Note: MyPy fails.
Seems to be a known issue with BeautifulSoup:
https://github.com/python/typeshed/pull/13604

Signed-off-by: Alexander Vaagan <alexander.vaagan@gmail.com>
Signed-off-by: vaaale <2428222+vaaale@users.noreply.github.com>
2025-05-24 22:29:22 +02:00

746 B
Raw Blame History

Nesting

A list featuring nesting:

  • abc

  • def

  • after one empty line

  • afer two empty lines

  • changing symbol

A nested HTML list:

  • First item
  • Second item with subitems: Subitem 1 Subitem 2
    • Subitem 1
    • Subitem 2
  • Last list item

Table nesting apparently not yet suported by HTML backend:

<table>

<tr>

<td>Cell</td>

<td>Nested Table

<table>

<tr>

<td>Cell 1</td>

<>

</tr>

<tr>

<td>Cell 2</td>

</tr>

<tr>

<td>Cell 3</td>

</tr>

<tr>

<td>Cell 4</td>

</tr>

</table>

</td>

</tr>

<tr><td>additional row</td></tr>

</table>