* Initial async pdf pipeline
Signed-off-by: Christoph Auer <cau@zurich.ibm.com>
* UpstreamAwareQueue
Signed-off-by: Christoph Auer <cau@zurich.ibm.com>
* Refactoring into async pipeline primitives and graph
Signed-off-by: Christoph Auer <cau@zurich.ibm.com>
* Cleanups and safety improvements
Signed-off-by: Christoph Auer <cau@zurich.ibm.com>
* Better threaded PDF pipeline
Signed-off-by: Christoph Auer <cau@zurich.ibm.com>
* Pin docling-ibm-models
Signed-off-by: Christoph Auer <cau@zurich.ibm.com>
* Remove unused args
Signed-off-by: Christoph Auer <cau@zurich.ibm.com>
* Add test
Signed-off-by: Christoph Auer <cau@zurich.ibm.com>
* Revise pipeline
Signed-off-by: Christoph Auer <cau@zurich.ibm.com>
* Unload doc backend
Signed-off-by: Christoph Auer <cau@zurich.ibm.com>
* Revert "Unload doc backend"
This reverts commit 01066f0b6e.
* Remove redundant method
Signed-off-by: Christoph Auer <cau@zurich.ibm.com>
* Update threaded test
Signed-off-by: Ubuntu <ubuntu@ip-172-31-30-253.eu-central-1.compute.internal>
* Stop accumulating docs in test run
Signed-off-by: Christoph Auer <cau@zurich.ibm.com>
* Fix: don't starve on docs with > max_queue_size pages
Signed-off-by: Christoph Auer <cau@zurich.ibm.com>
* Fix: don't starve on docs with > max_queue_size pages
Signed-off-by: Christoph Auer <cau@zurich.ibm.com>
* DCO Remediation Commit for Christoph Auer <cau@zurich.ibm.com>
I, Christoph Auer <cau@zurich.ibm.com>, hereby add my Signed-off-by to this commit: fa71cde950
I, Ubuntu <ubuntu@ip-172-31-30-253.eu-central-1.compute.internal>, hereby add my Signed-off-by to this commit: d66da87d96
Signed-off-by: Christoph Auer <cau@zurich.ibm.com>
* Fix: python3.9 compat
Signed-off-by: Christoph Auer <cau@zurich.ibm.com>
* Option to enable threadpool with doc_batch_concurrency setting
Signed-off-by: Christoph Auer <cau@zurich.ibm.com>
* Clean up unused code
Signed-off-by: Christoph Auer <cau@zurich.ibm.com>
* Fix settings defaults expectations
Signed-off-by: Christoph Auer <cau@zurich.ibm.com>
* Use released docling-ibm-models
Signed-off-by: Christoph Auer <cau@zurich.ibm.com>
* Remove ignores for typing/linting
Signed-off-by: Christoph Auer <cau@zurich.ibm.com>
---------
Signed-off-by: Christoph Auer <cau@zurich.ibm.com>
Signed-off-by: Ubuntu <ubuntu@ip-172-31-30-253.eu-central-1.compute.internal>
Co-authored-by: Ubuntu <ubuntu@ip-172-31-30-253.eu-central-1.compute.internal>