mirror of
https://github.com/DS4SD/docling.git
synced 2025-07-30 22:14:37 +00:00
add multiple improvements and fixes
Add typing, switch to list comprehensions where possible, encapsulate all methods within the new chunker implementation, use a dataclass instead of an unmanaged dictionary, and list dependencies in the setup installation line. Fix a token-counting bug caused by static initialization of `semchunk.Chunker`. Use expanded chunk typing (from -core), including embedding-specific and gen-specific texts.

Signed-off-by: Panos Vagenas <35837085+vagenas@users.noreply.github.com>
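To illustrate the kind of bug the message describes, here is a minimal sketch, not the code from this commit. It assumes the token-counting problem came from a counter bound once at class-definition time instead of per instance. All names (`Chunk`, `StaticCounterChunker`, `PerInstanceChunker`, `count_words`, `count_chars`) are hypothetical, and the sketch deliberately does not call `semchunk`. It also shows a dataclass replacing a loose dictionary and a list comprehension building the result.

```python
from dataclasses import dataclass
from typing import Callable, List


def count_words(text: str) -> int:
    """Hypothetical default counter: whitespace-separated tokens."""
    return len(text.split())


def count_chars(text: str) -> int:
    """Hypothetical alternative counter: one token per character."""
    return len(text)


@dataclass
class Chunk:
    """Typed record used instead of an unmanaged dictionary."""
    text: str
    token_count: int


class StaticCounterChunker:
    """Buggy pattern: the counter is fixed when the class body runs."""

    # Shared by every instance, regardless of constructor arguments.
    _count = staticmethod(count_words)

    def __init__(self, counter: Callable[[str], int]) -> None:
        self.counter = counter  # accepted but never consulted

    def chunk(self, texts: List[str]) -> List[Chunk]:
        return [Chunk(text=t, token_count=self._count(t)) for t in texts]


class PerInstanceChunker:
    """Fixed pattern: each instance keeps the counter it was given."""

    def __init__(self, counter: Callable[[str], int]) -> None:
        self._count = counter

    def chunk(self, texts: List[str]) -> List[Chunk]:
        return [Chunk(text=t, token_count=self._count(t)) for t in texts]


buggy = StaticCounterChunker(counter=count_chars).chunk(["ab cd"])
fixed = PerInstanceChunker(counter=count_chars).chunk(["ab cd"])
```

In this sketch the static variant silently ignores the counter passed to the constructor, so the two chunkers report different token counts for the same text.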
parent 5a8186b8fb
commit ce38baf7f7