mirror of
https://github.com/DS4SD/docling.git
synced 2025-12-08 20:58:11 +00:00
fix(html): Parse text in div elements as TextItem (#1041)
feat(html): Parse text in div elements as TextItem Signed-off-by: Cesar Berrospi Ramis <75900930+ceberam@users.noreply.github.com>
This commit is contained in:
committed by
GitHub
parent
1d17e7397a
commit
1b0ead6907
12
tests/data/html/example_06.html
Normal file
12
tests/data/html/example_06.html
Normal file
@@ -0,0 +1,12 @@
|
||||
<html>
|
||||
<head>
|
||||
<title>Sample HTML File</title>
|
||||
</head>
|
||||
<body>
|
||||
<div>This is a div with text.</div>
|
||||
<div>This is another div with text.</div>
|
||||
<p>This is a regular paragraph.</p>
|
||||
<div>This is a third div<br/>with a new line.</div>
|
||||
<div><p>This is a fourth div with a <b>bold</b> paragraph.</p></div>
|
||||
</body>
|
||||
</html>
|
||||
Reference in New Issue
Block a user