mirror of
https://github.com/DS4SD/docling.git
synced 2025-12-08 12:48:28 +00:00
fix: Tesseract OCR CLI can't process images composed with numbers only (#1201)
Some checks failed
Run Docs CD / build-deploy-docs (push) Failing after 1m28s
Run Docs CI / build-docs (push) Failing after 53s
Some checks failed
Run Docs CD / build-deploy-docs (push) Failing after 1m28s
Run Docs CI / build-docs (push) Failing after 53s
fix wrong type text extracted by tesseract_ocr_cli_model Signed-off-by: gvl4 <Guilhem.VERMOREL@3ds.com> Co-authored-by: gvl4 <Guilhem.VERMOREL@3ds.com>
This commit is contained in:
@@ -247,7 +247,7 @@ class TesseractOcrCliModel(BaseOcrModel):
|
||||
|
||||
cell = TextCell(
|
||||
index=ix,
|
||||
text=text,
|
||||
text=str(text),
|
||||
orig=text,
|
||||
from_ocr=True,
|
||||
confidence=conf / 100.0,
|
||||
|
||||
Reference in New Issue
Block a user