mirror of
https://github.com/DS4SD/docling.git
synced 2025-12-08 20:58:11 +00:00
fix: Tesseract OCR CLI can't process images composed with numbers only (#1201)
Some checks failed
Run Docs CD / build-deploy-docs (push) Failing after 1m28s
Run Docs CI / build-docs (push) Failing after 53s
Some checks failed
Run Docs CD / build-deploy-docs (push) Failing after 1m28s
Run Docs CI / build-docs (push) Failing after 53s
fix wrong type text extracted by tesseract_ocr_cli_model Signed-off-by: gvl4 <Guilhem.VERMOREL@3ds.com> Co-authored-by: gvl4 <Guilhem.VERMOREL@3ds.com>
This commit is contained in:
@@ -247,7 +247,7 @@ class TesseractOcrCliModel(BaseOcrModel):
|
|||||||
|
|
||||||
cell = TextCell(
|
cell = TextCell(
|
||||||
index=ix,
|
index=ix,
|
||||||
text=text,
|
text=str(text),
|
||||||
orig=text,
|
orig=text,
|
||||||
from_ocr=True,
|
from_ocr=True,
|
||||||
confidence=conf / 100.0,
|
confidence=conf / 100.0,
|
||||||
|
|||||||
Reference in New Issue
Block a user