diff --git a/tests/data/groundtruth/docling_v2/mixed.md.md b/tests/data/groundtruth/docling_v2/mixed.md.md index bbb3e63b..6cd5d52b 100644 --- a/tests/data/groundtruth/docling_v2/mixed.md.md +++ b/tests/data/groundtruth/docling_v2/mixed.md.md @@ -6,48 +6,20 @@ Some text Here is a table: -<table> - <tr> - <th>Character</th> - <th>Name in German</th> - <th>Name in French</th> - <th>Name in Italian</th> - </tr> - <tr> - <td>Scrooge McDuck</td> - <td>Dagobert Duck</td> - <td>Balthazar Picsou</td> - <td>Paperone</td> - </tr> - <tr> - <td>Huey</td> - <td>Tick</td> - <td>Riri</td> - <td>Qui</td> - </tr> - <tr> - <td>Dewey</td> - <td>Trick</td> - <td>Fifi</td> - <td>Quo</td> - </tr> - <tr> - <td>Louie</td> - <td>Track</td> - <td>Loulou</td> - <td>Qua</td> - </tr> -</table> +| Character | Name in German | Name in French | Name in Italian | +|----------------|------------------|------------------|-------------------| +| Scrooge McDuck | Dagobert Duck | Balthazar Picsou | Paperone | +| Huey | Tick | Riri | Qui | +| Dewey | Trick | Fifi | Quo | +| Louie | Track | Loulou | Qua | And here is more HTML: -<p>Some paragraph.</p> -<div> - <p>Now a div — almost there...</p> - <ul> - <li>foo</li> - <li>bar</li> - </ul> -</div> +Some paragraph. + +Now a div — almost there... + +- foo +- bar The end! diff --git a/tests/data/groundtruth/docling_v2/mixed_without_h1.md.md b/tests/data/groundtruth/docling_v2/mixed_without_h1.md.md index a2b88864..5f76d50c 100644 --- a/tests/data/groundtruth/docling_v2/mixed_without_h1.md.md +++ b/tests/data/groundtruth/docling_v2/mixed_without_h1.md.md @@ -1,7 +1,5 @@ ## Some heading -<!-- This is HTML --> - - A. first - subitem - B. second diff --git a/tests/data/groundtruth/docling_v2/nested.md.md b/tests/data/groundtruth/docling_v2/nested.md.md index 252a5d57..6e430e0c 100644 --- a/tests/data/groundtruth/docling_v2/nested.md.md +++ b/tests/data/groundtruth/docling_v2/nested.md.md @@ -24,40 +24,8 @@ A list featuring nesting: A nested HTML list: -<ul> - <li>First item</li> - <li>Second item with subitems: - <ul> - <li>Subitem 1</li> - <li>Subitem 2</li> - </ul> - </li> - <li>Last list item</li> -</ul> -<!-- -Table nesting apparently not yet suported by HTML backend: - -<table> - <tr> - <td>Cell</td> - <td>Nested Table - <table> - <tr> - <td>Cell 1</td> - <> - </tr> - <tr> - <td>Cell 2</td> - </tr> - <tr> - <td>Cell 3</td> - </tr> - <tr> - <td>Cell 4</td> - </tr> - </table> - </td> - </tr> - <tr><td>additional row</td></tr> -</table> ---> +- First item +- Second item with subitems: + - Subitem 1 + - Subitem 2 +- Last list item