docling/docs/integrations/data_prep_kit.md
Panos Vagenas 6fd739fb7b docs: add DocETL, Kotaemon, spaCy integrations; minor docs improvements
Signed-off-by: Panos Vagenas <35837085+vagenas@users.noreply.github.com>
2024-11-21 14:19:08 +01:00

761 B

Get started

Docling is used by the Data Prep Kit open-source toolkit for preparing unstructured data for LLM application development ranging from laptop scale to datacenter scale.

Below you find the Data Prep Kit modules powered by Docling.

PDF ingestion to Parquet

Document chunking