mirror of
https://github.com/DS4SD/docling.git
synced 2025-12-09 05:08:14 +00:00
feat(html): Support in-line anchor tags in HTML texts (#1659)
* re-implement links for html backend. Signed-off-by: Roman Kayan BAZG <roman.kayan@bazg.admin.ch> * fix inline groups in list items. write specific test for find_parent_annotation of _extract_text_and_hyperlink_recursively. Signed-off-by: Roman Kayan BAZG <roman.kayan@bazg.admin.ch> * implement hack for images. Signed-off-by: Roman Kayan BAZG <roman.kayan@bazg.admin.ch> --------- Signed-off-by: Roman Kayan BAZG <roman.kayan@bazg.admin.ch>
This commit is contained in:
8
tests/data/groundtruth/docling_v2/hyperlink_03.html.md
vendored
Normal file
8
tests/data/groundtruth/docling_v2/hyperlink_03.html.md
vendored
Normal file
@@ -0,0 +1,8 @@
|
||||
- [My Section](#)
|
||||
- [Some page](/start.html)
|
||||
- [A sub page](/home2.html)
|
||||
- This is my [Homepage](/home.html)
|
||||
- List item inner
|
||||
- More text [with some links](/some_links.html) and more text.
|
||||
- [Main navigation](#main-navigation)
|
||||
- [My organisation](#)
|
||||
Reference in New Issue
Block a user