mirror of
https://github.com/DS4SD/docling.git
synced 2025-07-25 19:44:34 +00:00
feat(html): Parse text in div elements as TextItem Signed-off-by: Cesar Berrospi Ramis <75900930+ceberam@users.noreply.github.com>
13 lines
336 B
HTML
13 lines
336 B
HTML
<html>
|
|
<head>
|
|
<title>Sample HTML File</title>
|
|
</head>
|
|
<body>
|
|
<div>This is a div with text.</div>
|
|
<div>This is another div with text.</div>
|
|
<p>This is a regular paragraph.</p>
|
|
<div>This is a third div<br/>with a new line.</div>
|
|
<div><p>This is a fourth div with a <b>bold</b> paragraph.</p></div>
|
|
</body>
|
|
</html>
|