mirror of
https://github.com/DS4SD/docling.git
synced 2025-12-08 20:58:11 +00:00
fix(html): Parse text in div elements as TextItem (#1041)
feat(html): Parse text in div elements as TextItem Signed-off-by: Cesar Berrospi Ramis <75900930+ceberam@users.noreply.github.com>
This commit is contained in:
committed by
GitHub
parent
1d17e7397a
commit
1b0ead6907
10
tests/data/groundtruth/docling_v2/example_06.html.md
Normal file
10
tests/data/groundtruth/docling_v2/example_06.html.md
Normal file
@@ -0,0 +1,10 @@
|
||||
This is a div with text.
|
||||
|
||||
This is another div with text.
|
||||
|
||||
This is a regular paragraph.
|
||||
|
||||
This is a third div
|
||||
with a new line.
|
||||
|
||||
This is a fourth div with a bold paragraph.
|
||||
Reference in New Issue
Block a user