mirror of
https://github.com/DS4SD/docling.git
synced 2025-12-09 13:18:24 +00:00
feat(html): Support in-line anchor tags in HTML texts (#1659)
* re-implement links for html backend. Signed-off-by: Roman Kayan BAZG <roman.kayan@bazg.admin.ch> * fix inline groups in list items. write specific test for find_parent_annotation of _extract_text_and_hyperlink_recursively. Signed-off-by: Roman Kayan BAZG <roman.kayan@bazg.admin.ch> * implement hack for images. Signed-off-by: Roman Kayan BAZG <roman.kayan@bazg.admin.ch> --------- Signed-off-by: Roman Kayan BAZG <roman.kayan@bazg.admin.ch>
This commit is contained in:
6
tests/data/groundtruth/docling_v2/hyperlink_01.html.itxt
vendored
Normal file
6
tests/data/groundtruth/docling_v2/hyperlink_01.html.itxt
vendored
Normal file
@@ -0,0 +1,6 @@
|
||||
item-0 at level 0: unspecified: group _root_
|
||||
item-1 at level 1: title: Something
|
||||
item-2 at level 2: inline: group group
|
||||
item-3 at level 3: text: Please follow the link to:
|
||||
item-4 at level 3: text: This page
|
||||
item-5 at level 3: text: .
|
||||
Reference in New Issue
Block a user