mirror of
https://github.com/DS4SD/docling.git
synced 2025-12-08 20:58:11 +00:00
fix: support escaped characters in markdown backend (#2304)
fix: improve markdown backend to support input documents with escaped characters Signed-off-by: Lucas Morin <lucas.morin222@gmail.com>
This commit is contained in:
41
tests/data/groundtruth/docling_v2/escaped_characters.md.md
vendored
Normal file
41
tests/data/groundtruth/docling_v2/escaped_characters.md.md
vendored
Normal file
@@ -0,0 +1,41 @@
|
||||
# Headers:
|
||||
|
||||
## & < > " '
|
||||
|
||||
Text: 00:16.000 ----> 00:18.000 & < > " '
|
||||
|
||||
# Lists
|
||||
|
||||
1. & < > " '
|
||||
|
||||
- & < > " '
|
||||
|
||||
# Inline code
|
||||
|
||||
```
|
||||
& < > " '
|
||||
```
|
||||
|
||||
# Code block
|
||||
|
||||
```
|
||||
& < > " '
|
||||
```
|
||||
|
||||
# Table
|
||||
|
||||
| Key | Example |
|
||||
|--------------|-----------|
|
||||
| Ampersand | & |
|
||||
| Less-than | < |
|
||||
| Greater-than | > |
|
||||
| Quotes | " |
|
||||
| Apostrophes | ' |
|
||||
|
||||
# Raw HTML
|
||||
|
||||
& < > " '/div>
|
||||
|
||||
## Link
|
||||
|
||||
[& < > " '](https://en.wikipedia.org/wiki/Albert_Einstein)
|
||||
Reference in New Issue
Block a user