Improve numbered list detection for msword docs

This fixes the list detection in MSWord docs by properly tracking and counting
the list entries. It fixes
https://github.com/docling-project/docling/issues/2090
This commit is contained in:
Nikhil Verma
2025-08-19 15:15:59 +05:30
parent d2494da8b8
commit 509da6658e
3 changed files with 135 additions and 25 deletions

View File

@@ -18,9 +18,9 @@ To get started with swimming, first lay down in a water and try not to drown:
Also, dont forget:
- Wear sunglasses
- Dont forget to drink water
- Use sun cream
1. Wear sunglasses
2. Dont forget to drink water
3. Use sun cream
Hmm, what else…
@@ -40,6 +40,6 @@ Here are some interesting things a respectful duck could eat:
And lets add another list in the end:
- Leaves
- Berries
- Grain
1. Leaves
2. Berries
3. Grain