mirror of
https://github.com/DS4SD/docling.git
synced 2025-07-27 04:24:45 +00:00
* Initial async pdf pipeline Signed-off-by: Christoph Auer <cau@zurich.ibm.com> * UpstreamAwareQueue Signed-off-by: Christoph Auer <cau@zurich.ibm.com> * Refactoring into async pipeline primitives and graph Signed-off-by: Christoph Auer <cau@zurich.ibm.com> * Cleanups and safety improvements Signed-off-by: Christoph Auer <cau@zurich.ibm.com> * Better threaded PDF pipeline Signed-off-by: Christoph Auer <cau@zurich.ibm.com> * Pin docling-ibm-models Signed-off-by: Christoph Auer <cau@zurich.ibm.com> * Remove unused args Signed-off-by: Christoph Auer <cau@zurich.ibm.com> * Add test Signed-off-by: Christoph Auer <cau@zurich.ibm.com> * Revise pipeline Signed-off-by: Christoph Auer <cau@zurich.ibm.com> * Unload doc backend Signed-off-by: Christoph Auer <cau@zurich.ibm.com> * Revert "Unload doc backend" This reverts commit01066f0b6e
. * Remove redundant method Signed-off-by: Christoph Auer <cau@zurich.ibm.com> * Update threaded test Signed-off-by: Ubuntu <ubuntu@ip-172-31-30-253.eu-central-1.compute.internal> * Stop accumulating docs in test run Signed-off-by: Christoph Auer <cau@zurich.ibm.com> * Fix: don't starve on docs with > max_queue_size pages Signed-off-by: Christoph Auer <cau@zurich.ibm.com> * Fix: don't starve on docs with > max_queue_size pages Signed-off-by: Christoph Auer <cau@zurich.ibm.com> * DCO Remediation Commit for Christoph Auer <cau@zurich.ibm.com> I, Christoph Auer <cau@zurich.ibm.com>, hereby add my Signed-off-by to this commit:fa71cde950
I, Ubuntu <ubuntu@ip-172-31-30-253.eu-central-1.compute.internal>, hereby add my Signed-off-by to this commit:d66da87d96
Signed-off-by: Christoph Auer <cau@zurich.ibm.com> * Fix: python3.9 compat Signed-off-by: Christoph Auer <cau@zurich.ibm.com> * Option to enable threadpool with doc_batch_concurrency setting Signed-off-by: Christoph Auer <cau@zurich.ibm.com> * Clean up unused code Signed-off-by: Christoph Auer <cau@zurich.ibm.com> * Fix settings defaults expectations Signed-off-by: Christoph Auer <cau@zurich.ibm.com> * Use released docling-ibm-models Signed-off-by: Christoph Auer <cau@zurich.ibm.com> * Remove ignores for typing/linting Signed-off-by: Christoph Auer <cau@zurich.ibm.com> --------- Signed-off-by: Christoph Auer <cau@zurich.ibm.com> Signed-off-by: Ubuntu <ubuntu@ip-172-31-30-253.eu-central-1.compute.internal> Co-authored-by: Ubuntu <ubuntu@ip-172-31-30-253.eu-central-1.compute.internal>
30 lines
756 B
Python
30 lines
756 B
Python
import os
|
|
|
|
|
|
def _setup_env():
|
|
os.environ["DOCLING_PERF_PAGE_BATCH_SIZE"] = "12"
|
|
os.environ["DOCLING_DEBUG_VISUALIZE_RAW_LAYOUT"] = "True"
|
|
os.environ["DOCLING_ARTIFACTS_PATH"] = "/path/to/artifacts"
|
|
|
|
|
|
def test_settings():
|
|
_setup_env()
|
|
|
|
import importlib
|
|
|
|
import docling.datamodel.settings as m
|
|
|
|
# Reinitialize settings module
|
|
importlib.reload(m)
|
|
|
|
# Check top level setting
|
|
assert str(m.settings.artifacts_path) == "/path/to/artifacts"
|
|
|
|
# Check nested set via environment variables
|
|
assert m.settings.perf.page_batch_size == 12
|
|
assert m.settings.debug.visualize_raw_layout is True
|
|
|
|
# Check nested defaults
|
|
assert m.settings.perf.doc_batch_size == 1
|
|
assert m.settings.debug.visualize_ocr is False
|