mirror of
https://github.com/DS4SD/docling.git
synced 2025-12-08 12:48:28 +00:00
* Rich tables support for HTML backend Signed-off-by: Maksym Lysak <mly@zurich.ibm.com> * Decoupling JATS backend from HTML backend, ways of creating tables changed significantly Signed-off-by: Maksym Lysak <mly@zurich.ibm.com> * updated and added tests Signed-off-by: Maksym Lysak <mly@zurich.ibm.com> * Refactored parse_table_data in html_backend into few smaller functions Signed-off-by: Maksym Lysak <mly@zurich.ibm.com> * Changing scope of few functions in html_backend.py, making them static, when possible Signed-off-by: Maksym Lysak <mly@zurich.ibm.com> * Fix for HTML tables that have tbody and/or thead, now these tables are also properly supported Signed-off-by: Maksym Lysak <mly@zurich.ibm.com> --------- Signed-off-by: Maksym Lysak <mly@zurich.ibm.com> Co-authored-by: Maksym Lysak <mly@zurich.ibm.com>
60 lines
2.4 KiB
HTML
Vendored
60 lines
2.4 KiB
HTML
Vendored
<html>
|
|
<head>
|
|
<style>
|
|
table, th, td {border: 1px solid black; border-collapse: collapse;}
|
|
td {padding:30px;}
|
|
table {margin: 30px;}
|
|
</style>
|
|
</head>
|
|
<body>
|
|
<h1>Header</h1>
|
|
<p>This is the first paragraph.</p>
|
|
<table>
|
|
<tr>
|
|
<td>A</td>
|
|
<td>B</td>
|
|
</tr>
|
|
<tr>
|
|
<td>
|
|
<table>
|
|
<tr>
|
|
<td>A1</td><td>B1</td><td>C1</td>
|
|
</tr>
|
|
<tr>
|
|
<td>D1</td>
|
|
<td>
|
|
<table>
|
|
<tr>
|
|
<td>I</td><td>II</td>
|
|
</tr>
|
|
<tr>
|
|
<td>III</td><td>IV</td>
|
|
</tr>
|
|
<tr>
|
|
<td>V</td>
|
|
<td>
|
|
<table>
|
|
<tr>
|
|
<td>E1</td><td>E2</td>
|
|
</tr>
|
|
<tr>
|
|
<td>E3</td><td>E4</td>
|
|
</tr>
|
|
</table>
|
|
</td>
|
|
</tr>
|
|
<tr>
|
|
<td>VII</td><td>VIII</td>
|
|
</tr>
|
|
</table>
|
|
</td>
|
|
<td>F1</td>
|
|
</tr>
|
|
</table>
|
|
</td>
|
|
<td>2...</td>
|
|
</tr>
|
|
</table>
|
|
After table
|
|
</body>
|
|
</html> |