docs(gpu): Add benchmarks of standard pipeline with OCR (#2764)

* add results for standard + OCR and more Windows timings

Signed-off-by: Michele Dolfi <dol@zurich.ibm.com>

* fix runtime selection for py 3.14 in CI

Signed-off-by: Michele Dolfi <dol@zurich.ibm.com>

---------

Signed-off-by: Michele Dolfi <dol@zurich.ibm.com>
This commit is contained in:
Michele Dolfi
2025-12-10 20:43:20 +01:00
committed by GitHub
parent da7678a754
commit d03439ccc5
2 changed files with 28 additions and 3 deletions

3
docs/usage/gpu.md vendored
View File

@@ -156,7 +156,8 @@ TBA.
</thead>
<tbody>
<tr><td>Standard - Inline (no OCR)</td><td>3.1 pages/second</td><td>-</td><td>7.9 pages/second<br /><small><em>[cpu-only]* 1.5 pages/second</em></small></td><td>-</td><td>4.2 pages/second<br /><small><em>[cpu-only]* 1.2 pages/second</em></small></td><td>-</td></tr>
<tr><td>VLM - Inference server (GraniteDocling)</td><td>2.4 pages/second</td><td>-</td><td>3.8 pages/second</td><td>3.6-4.5 pages/second</td><td>-</td><td>-</td></tr>
<tr><td>Standard - Inline (with OCR)</td><td></td><td></td><td>tba</td><td>1.6 pages/second</td><td>tba</td><td>1.1 pages/second</td></tr>
<tr><td>VLM - Inference server (GraniteDocling)</td><td>2.4 pages/second</td><td>-</td><td>3.8 pages/second</td><td>3.6-4.5 pages/second</td><td>2.0 pages/second</td><td>2.8-3.2 pages/second</td></tr>
</tbody>
</table>