docs: DPK pipeline example using docling library (#2112)

* Notebook showing example on how to use docling transforms in DPK

Signed-off-by: Maroun Touma <touma@us.ibm.com>

* fix HF Token name

Signed-off-by: Maroun Touma <touma@us.ibm.com>

* use %pip instead of pip install jupyter lab

Signed-off-by: Maroun Touma <touma@us.ibm.com>

* run formatter

Signed-off-by: Maroun Touma <touma@us.ibm.com>

* add example to mkdocs and fix typo

Signed-off-by: Maroun Touma <touma@us.ibm.com>

---------

Signed-off-by: Maroun Touma <touma@us.ibm.com>
This commit is contained in:
Maroun Touma
2025-08-21 04:14:36 -04:00
committed by GitHub
parent 8996d612aa
commit e76298c40d
2 changed files with 773 additions and 0 deletions

View File

@@ -99,6 +99,8 @@ nav:
- examples/serialization.ipynb
- examples/hybrid_chunking.ipynb
- examples/advanced_chunking_and_serialization.ipynb
- ✂️ Data Preparation and Embedding Pipeline:
- examples/dpk-ingest-chunck-tokenize.ipynb
- 🤖 RAG with AI dev frameworks:
- examples/rag_haystack.ipynb
- examples/rag_langchain.ipynb