Christoph Auer
74e0452b6a
Add migration instructions to doc (wip)
...
Signed-off-by: Christoph Auer <cau@zurich.ibm.com>
2024-10-15 17:08:48 +02:00
Christoph Auer
ba9eaf1bd7
CLI and error handling fixes
...
Signed-off-by: Christoph Auer <cau@zurich.ibm.com>
2024-10-15 15:58:39 +02:00
Christoph Auer
27f4ed3620
Enable mypy and fix many reported errors
...
Signed-off-by: Christoph Auer <cau@zurich.ibm.com>
2024-10-15 14:58:00 +02:00
Christoph Auer
dac82ca7f2
Import statement updates from docling-core
...
Signed-off-by: Christoph Auer <cau@zurich.ibm.com>
2024-10-15 10:11:10 +02:00
Christoph Auer
afafb97b87
Update CLI
...
Signed-off-by: Christoph Auer <cau@zurich.ibm.com>
2024-10-15 09:50:06 +02:00
Christoph Auer
497ddb34a8
Big refactoring for legacy_document support
...
Signed-off-by: Christoph Auer <cau@zurich.ibm.com>
2024-10-14 16:36:11 +02:00
Panos Vagenas
136f16e85a
feat!: simplify conversion API ( #139 )
...
Signed-off-by: Panos Vagenas <35837085+vagenas@users.noreply.github.com>
2024-10-11 14:52:37 +02:00
Christoph Auer
304d16029a
More renaming, design enrichment interface
...
Signed-off-by: Christoph Auer <cau@zurich.ibm.com>
2024-10-11 10:21:31 +02:00
Christoph Auer
7cad290ceb
Refactor test data, legacy usage and more
...
Signed-off-by: Christoph Auer <cau@zurich.ibm.com>
2024-10-10 13:54:44 +02:00
Christoph Auer
b5a27386c1
Merge from main, update OCR model and test cases
...
Signed-off-by: Christoph Auer <cau@zurich.ibm.com>
2024-10-09 16:04:19 +02:00
Christoph Auer
0dfbd0b6fc
Update examples and test cases
...
Signed-off-by: Christoph Auer <cau@zurich.ibm.com>
2024-10-09 15:20:27 +02:00
Michele Dolfi
f96ea86a00
feat: add options for choosing OCR engines ( #118 )
...
---------
Signed-off-by: Michele Dolfi <dol@zurich.ibm.com>
Signed-off-by: Nikos Livathinos <nli@zurich.ibm.com>
Signed-off-by: Peter Staar <taa@zurich.ibm.com>
Co-authored-by: Nikos Livathinos <nli@zurich.ibm.com>
Co-authored-by: Peter Staar <taa@zurich.ibm.com>
2024-10-08 19:07:08 +02:00
Christoph Auer
1fa7cd9855
Fundamental refactoring for multi-format support
...
Signed-off-by: Christoph Auer <cau@zurich.ibm.com>
2024-10-01 16:54:09 +02:00
Christoph Auer
d6df76f90b
feat: Support tableformer model choice ( #90 )
...
* Support tableformer model choice
Signed-off-by: Christoph Auer <cau@zurich.ibm.com>
* Update datamodel structure
Signed-off-by: Christoph Auer <cau@zurich.ibm.com>
* Update docs
Signed-off-by: Christoph Auer <cau@zurich.ibm.com>
* Cleanup
Signed-off-by: Christoph Auer <cau@zurich.ibm.com>
* Add test unit for table options
Signed-off-by: Christoph Auer <cau@zurich.ibm.com>
* Ensure import backwards-compatibility for PipelineOptions
Signed-off-by: Christoph Auer <cau@zurich.ibm.com>
* Update README
Signed-off-by: Christoph Auer <cau@zurich.ibm.com>
* Adjust parameters on custom_convert
Signed-off-by: Christoph Auer <60343111+cau-git@users.noreply.github.com>
* Update Dockerfile
Signed-off-by: Christoph Auer <60343111+cau-git@users.noreply.github.com>
---------
Signed-off-by: Christoph Auer <cau@zurich.ibm.com>
Signed-off-by: Christoph Auer <60343111+cau-git@users.noreply.github.com>
2024-09-26 21:37:08 +02:00
Panos Vagenas
d96b96c848
fix: fix OCR setting for pypdfium, minor refactor ( #102 )
...
Signed-off-by: Panos Vagenas <35837085+vagenas@users.noreply.github.com>
2024-09-24 14:36:00 +02:00
Panos Vagenas
3c46e4266c
feat: add URL support to CLI ( #99 )
...
Signed-off-by: Panos Vagenas <35837085+vagenas@users.noreply.github.com>
2024-09-24 08:47:53 +02:00
Michele Dolfi
2870fdc857
fix: CLI compatibility with python 3.10 and 3.11 ( #79 )
...
Signed-off-by: Michele Dolfi <dol@zurich.ibm.com>
2024-09-16 12:32:45 +02:00
Peter W. J. Staar
98990784df
feat: add docling cli ( #75 )
...
* chore: add simple convert script
Signed-off-by: Peter Staar <taa@zurich.ibm.com>
* reformatted all
Signed-off-by: Peter Staar <taa@zurich.ibm.com>
* reformatted all
Signed-off-by: Peter Staar <taa@zurich.ibm.com>
* added default arg
Signed-off-by: Peter Staar <taa@zurich.ibm.com>
* use typer for the docling CLI
Signed-off-by: Michele Dolfi <dol@zurich.ibm.com>
* describe output when saving
Signed-off-by: Michele Dolfi <dol@zurich.ibm.com>
* add tests for CLI
Signed-off-by: Michele Dolfi <dol@zurich.ibm.com>
* add export options
Signed-off-by: Michele Dolfi <dol@zurich.ibm.com>
---------
Signed-off-by: Peter Staar <taa@zurich.ibm.com>
Signed-off-by: Michele Dolfi <dol@zurich.ibm.com>
Co-authored-by: Michele Dolfi <dol@zurich.ibm.com>
2024-09-13 14:03:09 +02:00