docling/docs
Panos Vagenas ce38baf7f7 add multiple improvements and fixes
Add typing, switch to list comprehensions where possible,
encapsulate all methods within new chunker implementation,
use dataclass instead of unmanaged dictionary,
list dependencies in setup installation line.

Fix token counting bug due to static initialization of
`semchunk.Chunker`.

Use expanded chunk typing (from -core) including
embedding-specific and gen-specific texts.

Signed-off-by: Panos Vagenas <35837085+vagenas@users.noreply.github.com>
2024-11-19 23:36:50 +01:00
assets docs: add graphical band in readme (#154) 2024-10-17 18:15:40 +02:00
concepts chore: update the with input formats and DoclingDocument (#188) 2024-10-30 15:02:28 +01:00
examples add multiple improvements and fixes 2024-11-19 23:36:50 +01:00
integrations docs: update LlamaIndex docs for Docling v2 (#182) 2024-10-28 14:28:26 +01:00
overrides feat!: Docling v2 (#117) 2024-10-16 21:02:03 +02:00
stylesheets docs: introduce docs site (#141) 2024-10-14 14:13:13 +02:00
index.md chore: update the with input formats and DoclingDocument (#188) 2024-10-30 15:02:28 +01:00
installation.md docs: introduce docs site (#141) 2024-10-14 14:13:13 +02:00
usage.md chore: various minor docs fixes (#169) 2024-10-22 15:29:36 +02:00
v2.md Ensure all models work only on valid pages (#158) 2024-10-18 08:54:06 +02:00