docling/tests/data/groundtruth/docling_v2/nested.md.md
Alexander Vaagan 733360c7b2 A new HTML backend that handles styled HTML (ignores it) as well as images.
- Updated unit tests
- Added documentation (Example notebook)

Note: MyPy fails.
Seems to be a known issue with BeautifulSoup:
https://github.com/python/typeshed/pull/13604

Signed-off-by: Alexander Vaagan <alexander.vaagan@gmail.com>
Signed-off-by: vaaale <2428222+vaaale@users.noreply.github.com>
2025-05-24 22:29:22 +02:00


# Nesting
A list featuring nesting:
- abc
- def
- after one empty line
- after two empty lines
- changing symbol
A nested HTML list:
- First item
- Second item with subitems: Subitem 1 Subitem 2
- Subitem 1
- Subitem 2
- Last list item
Table nesting apparently not yet supported by HTML backend:
&lt;table&gt;
&lt;tr&gt;
&lt;td&gt;Cell&lt;/td&gt;
&lt;td&gt;Nested Table
&lt;table&gt;
&lt;tr&gt;
&lt;td&gt;Cell 1&lt;/td&gt;
&lt;&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Cell 2&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Cell 3&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Cell 4&lt;/td&gt;
&lt;/tr&gt;
&lt;/table&gt;
&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;&lt;td&gt;additional row&lt;/td&gt;&lt;/tr&gt;
&lt;/table&gt;
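
As a rough illustration of the nesting shown above, the following sketch measures how deeply `<table>` elements are nested in an entity-escaped snippet like this one. It uses only the Python standard library. `TableDepthParser` and `max_table_depth` are hypothetical names made up for this example; they are not part of docling or its HTML backend, and the sample string is a shortened stand-in for the table above.

```python
from html import unescape
from html.parser import HTMLParser


class TableDepthParser(HTMLParser):
    """Track how deeply <table> elements are nested (hypothetical helper)."""

    def __init__(self):
        super().__init__()
        self.depth = 0      # number of currently open <table> elements
        self.max_depth = 0  # deepest nesting seen so far

    def handle_starttag(self, tag, attrs):
        if tag == "table":
            self.depth += 1
            self.max_depth = max(self.max_depth, self.depth)

    def handle_endtag(self, tag):
        # Ignore stray closing tags so the depth never goes negative.
        if tag == "table" and self.depth > 0:
            self.depth -= 1


def max_table_depth(escaped_html: str) -> int:
    """Return the maximum <table> nesting depth of entity-escaped HTML."""
    parser = TableDepthParser()
    # The text stores tags as &lt;table&gt;, so unescape before parsing.
    parser.feed(unescape(escaped_html))
    parser.close()
    return parser.max_depth


# Shortened stand-in for the escaped table above (an assumption, not the full text).
sample = (
    "&lt;table&gt;&lt;tr&gt;&lt;td&gt;Nested Table"
    "&lt;table&gt;&lt;tr&gt;&lt;td&gt;Cell 1&lt;/td&gt;&lt;/tr&gt;&lt;/table&gt;"
    "&lt;/td&gt;&lt;/tr&gt;&lt;/table&gt;"
)
print(max_table_depth(sample))
```

A depth greater than 1 indicates a table inside a table cell, which is the case the note above describes as not yet handled.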