fix: Improve numbered list detection for msword docs (#2100)

* Improve numbered list detection for msword docs

This fixes numbered list detection in MSWord docs by properly tracking and
counting the list entries.
Fixes https://github.com/docling-project/docling/issues/2090

* DCO Remediation Commit for Nikhil Verma <nikhilgotmail@gmail.com>

I, Nikhil Verma <nikhilgotmail@gmail.com>, hereby add my Signed-off-by to this commit: 509da6658e

Signed-off-by: Nikhil Verma <nikhilgotmail@gmail.com>

---------

Signed-off-by: Nikhil Verma <nikhilgotmail@gmail.com>

Author: Nikhil Verma
Date: 2025-08-22 14:08:34 +05:30
Committer: GitHub
Parent: 94fcc46aa9
Commit: 3f03709885
3 changed files with 135 additions and 25 deletions

@@ -12,9 +12,9 @@ Paragraph 2.1.2
 #### Test 2:
-- List item a
-- List item b
-- List item c
+1. List item a
+2. List item b
+3. List item c
 #### Test 3:

@@ -18,9 +18,9 @@ To get started with swimming, first lay down in a water and try not to drown:
 Also, dont forget:
-- Wear sunglasses
-- Dont forget to drink water
-- Use sun cream
+1. Wear sunglasses
+2. Dont forget to drink water
+3. Use sun cream
 Hmm, what else…
@@ -40,6 +40,6 @@ Here are some interesting things a respectful duck could eat:
 And lets add another list in the end:
-- Leaves
-- Berries
-- Grain
+1. Leaves
+2. Berries
+3. Grain
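
The expected-output changes in this commit replace bullet markers with running numbers, which matches the commit message's description of "tracking and counting the list entries". A minimal sketch of that idea follows. It is not the code from this commit: `ListCounter`, `render_list_items`, and the `(list_id, level, is_numbered, text)` tuple shape are hypothetical names chosen for illustration, and the sketch assumes each paragraph already carries a list identifier and nesting level.

```python
from dataclasses import dataclass, field


@dataclass
class ListCounter:
    """Hypothetical helper: keeps a running count per (list id, nesting level)."""

    counters: dict = field(default_factory=dict)

    def next_number(self, list_id: int, level: int) -> int:
        key = (list_id, level)
        self.counters[key] = self.counters.get(key, 0) + 1
        # A new item at this level restarts numbering of deeper levels
        # within the same list.
        deeper = [k for k in self.counters if k[0] == list_id and k[1] > level]
        for k in deeper:
            del self.counters[k]
        return self.counters[key]


def render_list_items(items):
    """Render (list_id, level, is_numbered, text) tuples as Markdown list lines."""
    counter = ListCounter()
    lines = []
    for list_id, level, is_numbered, text in items:
        indent = "    " * level
        if is_numbered:
            marker = f"{counter.next_number(list_id, level)}."
        else:
            marker = "-"
        lines.append(f"{indent}{marker} {text}")
    return lines


if __name__ == "__main__":
    sample = [
        (1, 0, True, "Wear sunglasses"),
        (1, 0, True, "Dont forget to drink water"),
        (1, 0, True, "Use sun cream"),
    ]
    print("\n".join(render_list_items(sample)))
```

Keying the counter on both the list identifier and the level means a second numbered list in the same document starts again at 1, while unnumbered items keep their `-` marker.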