docling/docs/integrations/data_prep_kit.md
Panos Vagenas 32055fe9d6 docs: add Data Prep Kit integration
Signed-off-by: Panos Vagenas <35837085+vagenas@users.noreply.github.com>
2024-11-12 11:09:40 +01:00

753 B

Get started

Docling is used by the Data Prep Kit [↗] open-source toolkit for preparing unstructured data for LLM application development ranging from laptop scale to datacenter scale.

Below you find the Data Prep Kit modules powered by Docling.

PDF ingestion to Parquet

Document chunking