Mirror of https://github.com/DS4SD/docling.git (synced 2025-12-09 13:18:24 +00:00)
feat(html): Support in-line anchor tags in HTML texts (#1659)
* Re-implement links for html backend.
* Fix inline groups in list items. Write specific test for find_parent_annotation of _extract_text_and_hyperlink_recursively.
* Implement hack for images.

Signed-off-by: Roman Kayan BAZG <roman.kayan@bazg.admin.ch>
tests/data/html/hyperlink_03.html (vendored, normal file, 40 lines)
@@ -0,0 +1,40 @@
<html>

<body>
<ul class="nav navbar-nav">
<li class="dropdown">
<a id="main-dropdown" href="#" aria-label="My Section" class="dropdown-toggle" data-toggle="dropdown"><span
class="icon icon--right"></span> My Section</a>
<ul class="dropdown-menu" role="menu">
<li class="dropdown-header">
<a href="/start.html" aria-label="Some page" target="_blank" title="">Some
page</a>
<ul>
<li>
<a href="/home2.html" aria-label="Some other page" target="_blank" title=""> A sub page</a>
</li>
</ul>
<ul>
<li>This is my <a href="/home.html">Homepage</a>
<ul>
<li>
List item inner
</li>
<li>
More text <a href="/some_links.html">with some links</a> and more text.
</li>
</ul>
</li>
<li><a href="#main-navigation">Main navigation</a></li>
</ul>
</li>
</ul>
</li>
<li class="dropdown">
<a id="other-dropdown" href="#" aria-label="My Org" class="dropdown-toggle"><span
class="icon icon--right"></span> My organisation</a>
</li>
</ul>
</body>

</html>
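
The fixture mixes plain text and anchors inside the same list item, which is the case the commit title describes. As a rough illustration only, the sketch below splits such a fragment into (text, href) runs using Python's standard-library `html.parser`. It is not docling's implementation: the names `InlineLinkCollector` and `extract_runs` are hypothetical, and the whitespace handling is an assumption made for this example.

```python
from html.parser import HTMLParser


class InlineLinkCollector(HTMLParser):
    """Hypothetical helper: collect (text, href) runs from an HTML fragment.

    href is None for text that is not inside an <a> element.
    """

    def __init__(self):
        super().__init__()
        self.runs = []    # list of (text, href-or-None) in document order
        self._hrefs = []  # stack of hrefs for the currently open <a> tags

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            # attrs is a list of (name, value) pairs; href may be missing
            self._hrefs.append(dict(attrs).get("href"))

    def handle_endtag(self, tag):
        if tag == "a" and self._hrefs:
            self._hrefs.pop()

    def handle_data(self, data):
        # Collapse whitespace so text broken across lines becomes one run
        text = " ".join(data.split())
        if text:
            href = self._hrefs[-1] if self._hrefs else None
            self.runs.append((text, href))


def extract_runs(html):
    """Return the (text, href) runs found in an HTML fragment."""
    parser = InlineLinkCollector()
    parser.feed(html)
    parser.close()  # flush any trailing text that is still buffered
    return parser.runs


# A list item taken from the fixture: text, an inline link, then more text
inline_item = (
    '<li>More text <a href="/some_links.html">with some links</a>'
    " and more text.</li>"
)
inline_runs = extract_runs(inline_item)

# An anchor whose text is broken across two lines, as in the fixture
wrapped_anchor = '<a href="/start.html" target="_blank" title="">Some\npage</a>'
wrapped_runs = extract_runs(wrapped_anchor)
```

Here `inline_runs` keeps the surrounding text and the link as separate runs in reading order, so a converter could emit them as one inline group instead of dropping the link or splitting the list item.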