From d737bb941678392c8e3dabe7004ff4e0e1df7896 Mon Sep 17 00:00:00 2001 From: Matteo Omenetti Date: Fri, 24 Jan 2025 08:19:33 -0500 Subject: [PATCH] gt for new pdf Signed-off-by: Matteo Omenetti --- .../docling_v1/code_and_formula.doctags.txt | 13 ++++++++++ .../docling_v1/code_and_formula.json | 1 + .../docling_v1/code_and_formula.md | 19 ++++++++++++++ .../docling_v1/code_and_formula.pages.json | 1 + .../docling_v2/code_and_formula.doctags.txt | 14 +++++++++++ .../docling_v2/code_and_formula.json | 1 + .../docling_v2/code_and_formula.md | 25 +++++++++++++++++++ .../docling_v2/code_and_formula.pages.json | 1 + 8 files changed, 75 insertions(+) create mode 100644 tests/data/groundtruth/docling_v1/code_and_formula.doctags.txt create mode 100644 tests/data/groundtruth/docling_v1/code_and_formula.json create mode 100644 tests/data/groundtruth/docling_v1/code_and_formula.md create mode 100644 tests/data/groundtruth/docling_v1/code_and_formula.pages.json create mode 100644 tests/data/groundtruth/docling_v2/code_and_formula.doctags.txt create mode 100644 tests/data/groundtruth/docling_v2/code_and_formula.json create mode 100644 tests/data/groundtruth/docling_v2/code_and_formula.md create mode 100644 tests/data/groundtruth/docling_v2/code_and_formula.pages.json diff --git a/tests/data/groundtruth/docling_v1/code_and_formula.doctags.txt b/tests/data/groundtruth/docling_v1/code_and_formula.doctags.txt new file mode 100644 index 00000000..1f1a0db7 --- /dev/null +++ b/tests/data/groundtruth/docling_v1/code_and_formula.doctags.txt @@ -0,0 +1,13 @@ + +Java Code Example +Lorem ipsum dolor sit amet, consetetur sadipscing elitr, sed diam nonumy eirmod tempor invidunt ut labore et dolore magna aliquyam erat, sed diam voluptua. At vero eos et accusam et justo duo dolores et ea rebum. Stet clita kasd gubergren, no sea takimata sanctus est Lorem ipsum dolor sit amet. Lorem ipsum dolor sit amet, consetetur sadipscing elitr, sed diam nonumy eirmod tempor invidunt ut labore et dolore magna aliquyam erat, sed diam voluptua. At vero eos et accusam et justo duo dolores et ea rebum. Stet clita kasd gubergren, no sea takimata sanctus est Lorem ipsum dolor sit amet. Lorem ipsum dolor sit amet, consetetur sadipscing elitr, sed diam nonumy eirmod tempor invidunt ut labore et dolore magna aliquyam erat, sed diam voluptua. At vero eos et accusam et justo duo dolores et ea rebum. Stet clita kasd gubergren, no sea takimata sanctus est Lorem ipsum dolor sit amet. +Listing 1: Simple Java Program +public static void print() { System.out.println( "Java Code" ); } +Lorem ipsum dolor sit amet, consetetur sadipscing elitr, sed diam nonumy eirmod tempor invidunt ut labore et dolore magna aliquyam erat, sed diam voluptua. At vero eos et accusam et justo duo dolores et ea rebum. Stet clita kasd gubergren, no sea takimata sanctus est Lorem ipsum dolor sit amet. Lorem ipsum dolor sit amet, consetetur sadipscing elitr, sed diam nonumy eirmod tempor invidunt ut labore et dolore magna aliquyam erat, sed diam voluptua. At vero eos et accusam et justo duo dolores et ea rebum. Stet clita kasd gubergren, no sea takimata sanctus est Lorem ipsum dolor sit amet. Lorem ipsum dolor sit amet, consetetur sadipscing elitr, sed diam nonumy eirmod tempor invidunt ut labore et dolore magna aliquyam erat, sed diam voluptua. At vero eos et accusam et justo duo dolores et ea rebum. Stet clita kasd gubergren, no sea takimata sanctus est Lorem ipsum dolor sit amet. +Formula +Lorem ipsum dolor sit amet, consetetur sadipscing elitr, sed diam nonumy eirmod tempor invidunt ut labore et dolore magna aliquyam erat, sed diam voluptua. At vero eos et accusam et justo duo dolores et ea rebum. Stet clita kasd gubergren, no sea takimata sanctus est Lorem ipsum dolor sit amet. Lorem ipsum dolor sit amet, consetetur sadipscing elitr, sed diam nonumy eirmod tempor invidunt ut labore et dolore magna aliquyam erat, sed diam voluptua. At vero eos et accusam et justo duo dolores et ea rebum. Stet clita kasd gubergren, no sea takimata sanctus est Lorem ipsum dolor sit amet. Lorem ipsum dolor sit amet, consetetur sadipscing elitr, sed diam nonumy eirmod tempor invidunt ut labore et dolore magna aliquyam erat, sed diam voluptua. At vero eos et accusam et justo duo dolores et ea rebum. Stet clita kasd gubergren, no sea takimata sanctus est Lorem ipsum dolor sit amet. +Duis autem vel eum iriure dolor in hendrerit in vulputate velit esse molestie consequat, vel illum dolore eu feugiat nulla facilisis at vero eros et accumsan et iusto odio dignissim qui blandit praesent luptatum zzril delenit augue duis dolore te feugait nulla facilisi. Lorem ipsum dolor sit amet, consectetuer adipiscing elit, sed diam nonummy nibh euismod tincidunt. +Lorem ipsum dolor sit amet, consetetur sadipscing elitr, sed diam nonumy eirmod tempor invidunt ut labore et dolore magna aliquyam erat, sed diam voluptua. At vero eos et accusam et justo duo dolores et ea rebum. Stet clita kasd gubergren, no sea takimata sanctus est Lorem ipsum dolor sit amet. Lorem ipsum dolor sit amet, consetetur sadipscing elitr, sed diam nonumy eirmod tempor invidunt ut labore et dolore magna aliquyam erat, sed diam voluptua. At vero eos et accusam et justo duo dolores et ea rebum. Stet clita kasd gubergren, no sea takimata sanctus est Lorem ipsum dolor sit amet. Lorem ipsum dolor sit amet, consetetur sadipscing elitr, sed diam nonumy eirmod tempor invidunt ut labore et dolore magna aliquyam erat, sed diam voluptua. At vero eos et accusam et justo duo dolores et ea rebum. Stet clita kasd gubergren, no sea takimata sanctus est Lorem ipsum dolor sit amet. +Duis autem vel eum iriure dolor in hendrerit in vulputate velit esse molestie consequat, vel illum dolore eu feugiat nulla facilisis at vero eros et accumsan et iusto odio dignissim qui blandit praesent luptatum zzril delenit augue duis dolore te feugait nulla facilisi. Lorem ipsum dolor sit amet, consectetuer adipiscing elit, sed diam nonummy nibh euismod tincidunt ut laoreet dolore magna aliquam erat volutpat. +Duis autem vel eum iriure dolor in hendrerit in vulputate velit esse molestie consequat, vel illum dolore eu feugiat nulla facilisis at vero eros et accumsan et iusto odio dignissim qui blandit praesent luptatum zzril delenit augue duis dolore te feugait nulla facilisi. Lorem ipsum dolor sit amet, consectetuer adipiscing elit, sed diam nonummy nibh euismod tincidunt ut laoreet dolore magna aliquam erat volutpat. + \ No newline at end of file diff --git a/tests/data/groundtruth/docling_v1/code_and_formula.json b/tests/data/groundtruth/docling_v1/code_and_formula.json new file mode 100644 index 00000000..8b90e794 --- /dev/null +++ b/tests/data/groundtruth/docling_v1/code_and_formula.json @@ -0,0 +1 @@ +{"_name": "", "type": "pdf-document", "description": {"title": null, "abstract": null, "authors": null, "affiliations": null, "subjects": null, "keywords": null, "publication_date": null, "languages": null, "license": null, "publishers": null, "url_refs": null, "references": null, "publication": null, "reference_count": null, "citation_count": null, "citation_date": null, "advanced": null, "analytics": null, "logs": [], "collection": null, "acquisition": null}, "file-info": {"filename": "code_and_formula.pdf", "filename-prov": null, "document-hash": "71dd5c52aa241a6d5275d5f681023b6bb3dee3346e171075d1247c87d49e7a82", "#-pages": 2, "collection-name": null, "description": null, "page-hashes": [{"hash": "d3280840ba2dff9749506fce3f05166336b768d2096f6b332fcbfacbc0540edf", "model": "default", "page": 1}, {"hash": "e9aac72f810d71aeb3710b13689d8c55e6eeda390fb497f09e865f1e94f74f2b", "model": "default", "page": 2}]}, "main-text": [{"prov": [{"bbox": [133.76800537109375, 653.6340942382812, 273.4540100097656, 667.99462890625], "page": 1, "span": [0, 17], "__ref_s3_data": null}], "text": "Java Code Example", "type": "subtitle-level-1", "payload": null, "name": "Section-header", "font": null}, {"prov": [{"bbox": [133.76800537109375, 501.4163513183594, 477.48065185546875, 642.8859252929688], "page": 1, "span": [0, 887], "__ref_s3_data": null}], "text": "Lorem ipsum dolor sit amet, consetetur sadipscing elitr, sed diam nonumy eirmod tempor invidunt ut labore et dolore magna aliquyam erat, sed diam voluptua. At vero eos et accusam et justo duo dolores et ea rebum. Stet clita kasd gubergren, no sea takimata sanctus est Lorem ipsum dolor sit amet. Lorem ipsum dolor sit amet, consetetur sadipscing elitr, sed diam nonumy eirmod tempor invidunt ut labore et dolore magna aliquyam erat, sed diam voluptua. At vero eos et accusam et justo duo dolores et ea rebum. Stet clita kasd gubergren, no sea takimata sanctus est Lorem ipsum dolor sit amet. Lorem ipsum dolor sit amet, consetetur sadipscing elitr, sed diam nonumy eirmod tempor invidunt ut labore et dolore magna aliquyam erat, sed diam voluptua. At vero eos et accusam et justo duo dolores et ea rebum. Stet clita kasd gubergren, no sea takimata sanctus est Lorem ipsum dolor sit amet.", "type": "paragraph", "payload": null, "name": "Text", "font": null}, {"prov": [{"bbox": [236.17599487304688, 480.4953308105469, 375.069580078125, 490.45794677734375], "page": 1, "span": [0, 30], "__ref_s3_data": null}], "text": "Listing 1: Simple Java Program", "type": "paragraph", "payload": null, "name": "paragraph", "font": null}, {"prov": [{"bbox": [134.23899841308594, 443.9358215332031, 337.5928649902344, 474.2005310058594], "page": 1, "span": [0, 65], "__ref_s3_data": null}], "text": "public static void print() { System.out.println( \"Java Code\" ); }", "type": "paragraph", "payload": null, "name": "Code", "font": null}, {"prov": [{"bbox": [133.76800537109375, 290.80633544921875, 477.47589111328125, 432.27593994140625], "page": 1, "span": [0, 887], "__ref_s3_data": null}], "text": "Lorem ipsum dolor sit amet, consetetur sadipscing elitr, sed diam nonumy eirmod tempor invidunt ut labore et dolore magna aliquyam erat, sed diam voluptua. At vero eos et accusam et justo duo dolores et ea rebum. Stet clita kasd gubergren, no sea takimata sanctus est Lorem ipsum dolor sit amet. Lorem ipsum dolor sit amet, consetetur sadipscing elitr, sed diam nonumy eirmod tempor invidunt ut labore et dolore magna aliquyam erat, sed diam voluptua. At vero eos et accusam et justo duo dolores et ea rebum. Stet clita kasd gubergren, no sea takimata sanctus est Lorem ipsum dolor sit amet. Lorem ipsum dolor sit amet, consetetur sadipscing elitr, sed diam nonumy eirmod tempor invidunt ut labore et dolore magna aliquyam erat, sed diam voluptua. At vero eos et accusam et justo duo dolores et ea rebum. Stet clita kasd gubergren, no sea takimata sanctus est Lorem ipsum dolor sit amet.", "type": "paragraph", "payload": null, "name": "Text", "font": null}, {"prov": [{"bbox": [303.13299560546875, 86.87435150146484, 308.1142883300781, 96.83694458007812], "page": 1, "span": [0, 1], "__ref_s3_data": null}], "text": "1", "type": "page-footer", "payload": null, "name": "Page-footer", "font": null}, {"prov": [{"bbox": [133.76800537109375, 703.5241088867188, 191.51429748535156, 717.8846435546875], "page": 2, "span": [0, 7], "__ref_s3_data": null}], "text": "Formula", "type": "subtitle-level-1", "payload": null, "name": "Section-header", "font": null}, {"prov": [{"bbox": [133.76800537109375, 551.3063354492188, 477.48065185546875, 692.7759399414062], "page": 2, "span": [0, 887], "__ref_s3_data": null}], "text": "Lorem ipsum dolor sit amet, consetetur sadipscing elitr, sed diam nonumy eirmod tempor invidunt ut labore et dolore magna aliquyam erat, sed diam voluptua. At vero eos et accusam et justo duo dolores et ea rebum. Stet clita kasd gubergren, no sea takimata sanctus est Lorem ipsum dolor sit amet. Lorem ipsum dolor sit amet, consetetur sadipscing elitr, sed diam nonumy eirmod tempor invidunt ut labore et dolore magna aliquyam erat, sed diam voluptua. At vero eos et accusam et justo duo dolores et ea rebum. Stet clita kasd gubergren, no sea takimata sanctus est Lorem ipsum dolor sit amet. Lorem ipsum dolor sit amet, consetetur sadipscing elitr, sed diam nonumy eirmod tempor invidunt ut labore et dolore magna aliquyam erat, sed diam voluptua. At vero eos et accusam et justo duo dolores et ea rebum. Stet clita kasd gubergren, no sea takimata sanctus est Lorem ipsum dolor sit amet.", "type": "paragraph", "payload": null, "name": "Text", "font": null}, {"prov": [{"bbox": [133.76800537109375, 491.53033447265625, 477.4748229980469, 549.3139038085938], "page": 2, "span": [0, 369], "__ref_s3_data": null}], "text": "Duis autem vel eum iriure dolor in hendrerit in vulputate velit esse molestie consequat, vel illum dolore eu feugiat nulla facilisis at vero eros et accumsan et iusto odio dignissim qui blandit praesent luptatum zzril delenit augue duis dolore te feugait nulla facilisi. Lorem ipsum dolor sit amet, consectetuer adipiscing elit, sed diam nonummy nibh euismod tincidunt.", "type": "paragraph", "payload": null, "name": "Text", "font": null}, {"prov": [{"bbox": [280.5539855957031, 467.6203308105469, 330.69659423828125, 479.4553527832031], "page": 2, "span": [0, 12], "__ref_s3_data": null}], "text": "a 2 + 8 = 12", "type": "equation", "payload": null, "name": "Formula", "font": null}, {"prov": [{"bbox": [133.76800537109375, 318.1803283691406, 477.47589111328125, 459.64996337890625], "page": 2, "span": [0, 887], "__ref_s3_data": null}], "text": "Lorem ipsum dolor sit amet, consetetur sadipscing elitr, sed diam nonumy eirmod tempor invidunt ut labore et dolore magna aliquyam erat, sed diam voluptua. At vero eos et accusam et justo duo dolores et ea rebum. Stet clita kasd gubergren, no sea takimata sanctus est Lorem ipsum dolor sit amet. Lorem ipsum dolor sit amet, consetetur sadipscing elitr, sed diam nonumy eirmod tempor invidunt ut labore et dolore magna aliquyam erat, sed diam voluptua. At vero eos et accusam et justo duo dolores et ea rebum. Stet clita kasd gubergren, no sea takimata sanctus est Lorem ipsum dolor sit amet. Lorem ipsum dolor sit amet, consetetur sadipscing elitr, sed diam nonumy eirmod tempor invidunt ut labore et dolore magna aliquyam erat, sed diam voluptua. At vero eos et accusam et justo duo dolores et ea rebum. Stet clita kasd gubergren, no sea takimata sanctus est Lorem ipsum dolor sit amet.", "type": "paragraph", "payload": null, "name": "Text", "font": null}, {"prov": [{"bbox": [133.76800537109375, 246.44935607910156, 477.4748229980469, 316.1879577636719], "page": 2, "span": [0, 415], "__ref_s3_data": null}], "text": "Duis autem vel eum iriure dolor in hendrerit in vulputate velit esse molestie consequat, vel illum dolore eu feugiat nulla facilisis at vero eros et accumsan et iusto odio dignissim qui blandit praesent luptatum zzril delenit augue duis dolore te feugait nulla facilisi. Lorem ipsum dolor sit amet, consectetuer adipiscing elit, sed diam nonummy nibh euismod tincidunt ut laoreet dolore magna aliquam erat volutpat.", "type": "paragraph", "payload": null, "name": "Text", "font": null}, {"prov": [{"bbox": [133.76800537109375, 174.71835327148438, 477.4748229980469, 244.4569549560547], "page": 2, "span": [0, 415], "__ref_s3_data": null}], "text": "Duis autem vel eum iriure dolor in hendrerit in vulputate velit esse molestie consequat, vel illum dolore eu feugiat nulla facilisis at vero eros et accumsan et iusto odio dignissim qui blandit praesent luptatum zzril delenit augue duis dolore te feugait nulla facilisi. Lorem ipsum dolor sit amet, consectetuer adipiscing elit, sed diam nonummy nibh euismod tincidunt ut laoreet dolore magna aliquam erat volutpat.", "type": "paragraph", "payload": null, "name": "Text", "font": null}, {"prov": [{"bbox": [303.13299560546875, 136.7633514404297, 308.1142883300781, 146.7259521484375], "page": 2, "span": [0, 1], "__ref_s3_data": null}], "text": "1", "type": "page-footer", "payload": null, "name": "Page-footer", "font": null}], "figures": [], "tables": [], "bitmaps": null, "equations": [], "footnotes": [], "page-dimensions": [{"height": 792.0, "page": 1, "width": 612.0}, {"height": 841.8900146484375, "page": 2, "width": 595.2760009765625}], "page-footers": [], "page-headers": [], "_s3_data": null, "identifiers": null} \ No newline at end of file diff --git a/tests/data/groundtruth/docling_v1/code_and_formula.md b/tests/data/groundtruth/docling_v1/code_and_formula.md new file mode 100644 index 00000000..4d77dc5c --- /dev/null +++ b/tests/data/groundtruth/docling_v1/code_and_formula.md @@ -0,0 +1,19 @@ +## Java Code Example + +Lorem ipsum dolor sit amet, consetetur sadipscing elitr, sed diam nonumy eirmod tempor invidunt ut labore et dolore magna aliquyam erat, sed diam voluptua. At vero eos et accusam et justo duo dolores et ea rebum. Stet clita kasd gubergren, no sea takimata sanctus est Lorem ipsum dolor sit amet. Lorem ipsum dolor sit amet, consetetur sadipscing elitr, sed diam nonumy eirmod tempor invidunt ut labore et dolore magna aliquyam erat, sed diam voluptua. At vero eos et accusam et justo duo dolores et ea rebum. Stet clita kasd gubergren, no sea takimata sanctus est Lorem ipsum dolor sit amet. Lorem ipsum dolor sit amet, consetetur sadipscing elitr, sed diam nonumy eirmod tempor invidunt ut labore et dolore magna aliquyam erat, sed diam voluptua. At vero eos et accusam et justo duo dolores et ea rebum. Stet clita kasd gubergren, no sea takimata sanctus est Lorem ipsum dolor sit amet. + +Listing 1: Simple Java Program + +public static void print() { System.out.println( "Java Code" ); } + +Lorem ipsum dolor sit amet, consetetur sadipscing elitr, sed diam nonumy eirmod tempor invidunt ut labore et dolore magna aliquyam erat, sed diam voluptua. At vero eos et accusam et justo duo dolores et ea rebum. Stet clita kasd gubergren, no sea takimata sanctus est Lorem ipsum dolor sit amet. Lorem ipsum dolor sit amet, consetetur sadipscing elitr, sed diam nonumy eirmod tempor invidunt ut labore et dolore magna aliquyam erat, sed diam voluptua. At vero eos et accusam et justo duo dolores et ea rebum. Stet clita kasd gubergren, no sea takimata sanctus est Lorem ipsum dolor sit amet. Lorem ipsum dolor sit amet, consetetur sadipscing elitr, sed diam nonumy eirmod tempor invidunt ut labore et dolore magna aliquyam erat, sed diam voluptua. At vero eos et accusam et justo duo dolores et ea rebum. Stet clita kasd gubergren, no sea takimata sanctus est Lorem ipsum dolor sit amet. + +## Formula + +Lorem ipsum dolor sit amet, consetetur sadipscing elitr, sed diam nonumy eirmod tempor invidunt ut labore et dolore magna aliquyam erat, sed diam voluptua. At vero eos et accusam et justo duo dolores et ea rebum. Stet clita kasd gubergren, no sea takimata sanctus est Lorem ipsum dolor sit amet. Lorem ipsum dolor sit amet, consetetur sadipscing elitr, sed diam nonumy eirmod tempor invidunt ut labore et dolore magna aliquyam erat, sed diam voluptua. At vero eos et accusam et justo duo dolores et ea rebum. Stet clita kasd gubergren, no sea takimata sanctus est Lorem ipsum dolor sit amet. Lorem ipsum dolor sit amet, consetetur sadipscing elitr, sed diam nonumy eirmod tempor invidunt ut labore et dolore magna aliquyam erat, sed diam voluptua. At vero eos et accusam et justo duo dolores et ea rebum. Stet clita kasd gubergren, no sea takimata sanctus est Lorem ipsum dolor sit amet. + +Duis autem vel eum iriure dolor in hendrerit in vulputate velit esse molestie consequat, vel illum dolore eu feugiat nulla facilisis at vero eros et accumsan et iusto odio dignissim qui blandit praesent luptatum zzril delenit augue duis dolore te feugait nulla facilisi. Lorem ipsum dolor sit amet, consectetuer adipiscing elit, sed diam nonummy nibh euismod tincidunt. + +Lorem ipsum dolor sit amet, consetetur sadipscing elitr, sed diam nonumy eirmod tempor invidunt ut labore et dolore magna aliquyam erat, sed diam voluptua. At vero eos et accusam et justo duo dolores et ea rebum. Stet clita kasd gubergren, no sea takimata sanctus est Lorem ipsum dolor sit amet. Lorem ipsum dolor sit amet, consetetur sadipscing elitr, sed diam nonumy eirmod tempor invidunt ut labore et dolore magna aliquyam erat, sed diam voluptua. At vero eos et accusam et justo duo dolores et ea rebum. Stet clita kasd gubergren, no sea takimata sanctus est Lorem ipsum dolor sit amet. Lorem ipsum dolor sit amet, consetetur sadipscing elitr, sed diam nonumy eirmod tempor invidunt ut labore et dolore magna aliquyam erat, sed diam voluptua. At vero eos et accusam et justo duo dolores et ea rebum. Stet clita kasd gubergren, no sea takimata sanctus est Lorem ipsum dolor sit amet. + +Duis autem vel eum iriure dolor in hendrerit in vulputate velit esse molestie consequat, vel illum dolore eu feugiat nulla facilisis at vero eros et accumsan et iusto odio dignissim qui blandit praesent luptatum zzril delenit augue duis dolore te feugait nulla facilisi. Lorem ipsum dolor sit amet, consectetuer adipiscing elit, sed diam nonummy nibh euismod tincidunt ut laoreet dolore magna aliquam erat volutpat. \ No newline at end of file diff --git a/tests/data/groundtruth/docling_v1/code_and_formula.pages.json b/tests/data/groundtruth/docling_v1/code_and_formula.pages.json new file mode 100644 index 00000000..27682fdd --- /dev/null +++ b/tests/data/groundtruth/docling_v1/code_and_formula.pages.json @@ -0,0 +1 @@ +[{"page_no": 0, "size": {"width": 612.0, "height": 792.0}, "cells": [{"id": 0, "text": "Java Code Example", "bbox": {"l": 133.76801, "t": 124.00536999999997, "r": 273.45401, "b": 138.36590999999999, "coord_origin": "TOPLEFT"}}, {"id": 1, "text": "Lorem ipsum dolor sit amet, consetetur sadipscing elitr, sed diam nonumy eir-", "bbox": {"l": 133.76801, "t": 149.11406999999997, "r": 477.47687, "b": 159.07665999999995, "coord_origin": "TOPLEFT"}}, {"id": 2, "text": "mod tempor invidunt ut labore et dolore magna aliquyam erat, sed diam volup-", "bbox": {"l": 133.76801, "t": 161.06908999999996, "r": 477.47781000000003, "b": 171.03168000000005, "coord_origin": "TOPLEFT"}}, {"id": 3, "text": "tua. At vero eos et accusam et justo duo dolores et ea rebum. Stet clita kasd", "bbox": {"l": 133.76801, "t": 173.02405, "r": 477.46985, "b": 182.98663, "coord_origin": "TOPLEFT"}}, {"id": 4, "text": "gubergren, no sea takimata sanctus est Lorem ipsum dolor sit amet. Lorem ip-", "bbox": {"l": 133.76801, "t": 184.97906, "r": 477.47186, "b": 194.94164999999998, "coord_origin": "TOPLEFT"}}, {"id": 5, "text": "sum dolor sit amet, consetetur sadipscing elitr, sed diam nonumy eirmod tempor", "bbox": {"l": 133.76801, "t": 196.93408, "r": 477.46993999999995, "b": 206.89666999999997, "coord_origin": "TOPLEFT"}}, {"id": 6, "text": "invidunt ut labore et dolore magna aliquyam erat, sed diam voluptua. At vero", "bbox": {"l": 133.76801, "t": 208.88909999999998, "r": 477.4806500000001, "b": 218.85168, "coord_origin": "TOPLEFT"}}, {"id": 7, "text": "eos et accusam et justo duo dolores et ea rebum.", "bbox": {"l": 133.76801, "t": 220.84509000000003, "r": 355.20087, "b": 230.80768, "coord_origin": "TOPLEFT"}}, {"id": 8, "text": "Stet clita kasd gubergren,", "bbox": {"l": 362.60107, "t": 220.84509000000003, "r": 477.47495000000004, "b": 230.80768, "coord_origin": "TOPLEFT"}}, {"id": 9, "text": "no sea takimata sanctus est Lorem ipsum dolor sit amet. Lorem ipsum dolor", "bbox": {"l": 133.76801, "t": 232.80005000000006, "r": 477.47083, "b": 242.76262999999994, "coord_origin": "TOPLEFT"}}, {"id": 10, "text": "sit amet, consetetur sadipscing elitr, sed diam nonumy eirmod tempor invidunt", "bbox": {"l": 133.76801, "t": 244.75507000000005, "r": 477.47092, "b": 254.71765000000005, "coord_origin": "TOPLEFT"}}, {"id": 11, "text": "ut labore et dolore magna aliquyam erat, sed diam voluptua. At vero eos et", "bbox": {"l": 133.76801, "t": 256.71007999999995, "r": 477.46588, "b": 266.67267000000004, "coord_origin": "TOPLEFT"}}, {"id": 12, "text": "accusam et justo duo dolores et ea rebum. Stet clita kasd gubergren, no sea", "bbox": {"l": 133.76801, "t": 268.66510000000005, "r": 477.47586, "b": 278.62769000000003, "coord_origin": "TOPLEFT"}}, {"id": 13, "text": "takimata sanctus est Lorem ipsum dolor sit amet.", "bbox": {"l": 133.76801, "t": 280.62103, "r": 351.47672, "b": 290.58365, "coord_origin": "TOPLEFT"}}, {"id": 14, "text": "Listing 1: Simple Java Program", "bbox": {"l": 236.17598999999998, "t": 301.54204999999996, "r": 375.06958, "b": 311.50467, "coord_origin": "TOPLEFT"}}, {"id": 15, "text": "public static void", "bbox": {"l": 134.575, "t": 317.79947000000004, "r": 235.44695, "b": 326.14719, "coord_origin": "TOPLEFT"}}, {"id": 16, "text": "print() {", "bbox": {"l": 241.8759, "t": 317.79947000000004, "r": 292.24908, "b": 326.14719, "coord_origin": "TOPLEFT"}}, {"id": 17, "text": "System.out.println(", "bbox": {"l": 157.17101, "t": 328.75745, "r": 264.02368, "b": 337.10516000000007, "coord_origin": "TOPLEFT"}}, {"id": 18, "text": "\"Java Code\"", "bbox": {"l": 264.15811, "t": 328.75745, "r": 325.35376, "b": 337.10516000000007, "coord_origin": "TOPLEFT"}}, {"id": 19, "text": ");", "bbox": {"l": 326.2952, "t": 328.75745, "r": 337.59286, "b": 337.10516000000007, "coord_origin": "TOPLEFT"}}, {"id": 20, "text": "}", "bbox": {"l": 134.239, "t": 339.71646, "r": 138.94637, "b": 348.06418, "coord_origin": "TOPLEFT"}}, {"id": 21, "text": "Lorem ipsum dolor sit amet, consetetur sadipscing elitr, sed diam nonumy", "bbox": {"l": 148.71201, "t": 359.72406, "r": 477.46985, "b": 369.68667999999997, "coord_origin": "TOPLEFT"}}, {"id": 22, "text": "eirmod tempor invidunt ut labore et dolore magna aliquyam erat, sed diam", "bbox": {"l": 133.76801, "t": 371.67905, "r": 477.4758, "b": 381.64166000000006, "coord_origin": "TOPLEFT"}}, {"id": 23, "text": "voluptua. At vero eos et accusam et justo duo dolores et ea rebum. Stet clita", "bbox": {"l": 133.76801, "t": 383.63403, "r": 477.4758, "b": 393.59665, "coord_origin": "TOPLEFT"}}, {"id": 24, "text": "kasd gubergren, no sea takimata sanctus est Lorem ipsum dolor sit amet. Lorem", "bbox": {"l": 133.76801, "t": 395.58905, "r": 477.47284, "b": 405.55167, "coord_origin": "TOPLEFT"}}, {"id": 25, "text": "ipsum dolor sit amet, consetetur sadipscing elitr, sed diam nonumy eirmod", "bbox": {"l": 133.76801, "t": 407.54404, "r": 477.47589, "b": 417.50665, "coord_origin": "TOPLEFT"}}, {"id": 26, "text": "tempor invidunt ut labore et dolore magna aliquyam erat, sed diam voluptua. At", "bbox": {"l": 133.76801, "t": 419.50003000000004, "r": 477.47177000000005, "b": 429.46265, "coord_origin": "TOPLEFT"}}, {"id": 27, "text": "vero eos et accusam et justo duo dolores et ea rebum. Stet clita kasd gubergren,", "bbox": {"l": 133.76801, "t": 431.45505, "r": 477.47387999999995, "b": 441.41766000000007, "coord_origin": "TOPLEFT"}}, {"id": 28, "text": "no sea takimata sanctus est Lorem ipsum dolor sit amet. Lorem ipsum dolor", "bbox": {"l": 133.76801, "t": 443.41003, "r": 477.47083, "b": 453.37265, "coord_origin": "TOPLEFT"}}, {"id": 29, "text": "sit amet, consetetur sadipscing elitr, sed diam nonumy eirmod tempor invidunt", "bbox": {"l": 133.76801, "t": 455.36505, "r": 477.47092, "b": 465.32767, "coord_origin": "TOPLEFT"}}, {"id": 30, "text": "ut labore et dolore magna aliquyam erat, sed diam voluptua. At vero eos et", "bbox": {"l": 133.76801, "t": 467.32004, "r": 477.46585, "b": 477.28265, "coord_origin": "TOPLEFT"}}, {"id": 31, "text": "accusam et justo duo dolores et ea rebum. Stet clita kasd gubergren, no sea", "bbox": {"l": 133.76801, "t": 479.27603, "r": 477.47586, "b": 489.23865, "coord_origin": "TOPLEFT"}}, {"id": 32, "text": "takimata sanctus est Lorem ipsum dolor sit amet.", "bbox": {"l": 133.76801, "t": 491.23105, "r": 351.47672, "b": 501.19366, "coord_origin": "TOPLEFT"}}, {"id": 33, "text": "1", "bbox": {"l": 303.133, "t": 695.163055, "r": 308.11429, "b": 705.125648, "coord_origin": "TOPLEFT"}}], "predictions": {"layout": {"clusters": [{"id": 2, "label": "section_header", "bbox": {"l": 133.76801, "t": 124.00536999999997, "r": 273.45401, "b": 138.36590999999999, "coord_origin": "TOPLEFT"}, "confidence": 0.9581764936447144, "cells": [{"id": 0, "text": "Java Code Example", "bbox": {"l": 133.76801, "t": 124.00536999999997, "r": 273.45401, "b": 138.36590999999999, "coord_origin": "TOPLEFT"}}], "children": []}, {"id": 0, "label": "text", "bbox": {"l": 133.76801, "t": 149.11406999999997, "r": 477.4806500000001, "b": 290.58365, "coord_origin": "TOPLEFT"}, "confidence": 0.9872456789016724, "cells": [{"id": 1, "text": "Lorem ipsum dolor sit amet, consetetur sadipscing elitr, sed diam nonumy eir-", "bbox": {"l": 133.76801, "t": 149.11406999999997, "r": 477.47687, "b": 159.07665999999995, "coord_origin": "TOPLEFT"}}, {"id": 2, "text": "mod tempor invidunt ut labore et dolore magna aliquyam erat, sed diam volup-", "bbox": {"l": 133.76801, "t": 161.06908999999996, "r": 477.47781000000003, "b": 171.03168000000005, "coord_origin": "TOPLEFT"}}, {"id": 3, "text": "tua. At vero eos et accusam et justo duo dolores et ea rebum. Stet clita kasd", "bbox": {"l": 133.76801, "t": 173.02405, "r": 477.46985, "b": 182.98663, "coord_origin": "TOPLEFT"}}, {"id": 4, "text": "gubergren, no sea takimata sanctus est Lorem ipsum dolor sit amet. Lorem ip-", "bbox": {"l": 133.76801, "t": 184.97906, "r": 477.47186, "b": 194.94164999999998, "coord_origin": "TOPLEFT"}}, {"id": 5, "text": "sum dolor sit amet, consetetur sadipscing elitr, sed diam nonumy eirmod tempor", "bbox": {"l": 133.76801, "t": 196.93408, "r": 477.46993999999995, "b": 206.89666999999997, "coord_origin": "TOPLEFT"}}, {"id": 6, "text": "invidunt ut labore et dolore magna aliquyam erat, sed diam voluptua. At vero", "bbox": {"l": 133.76801, "t": 208.88909999999998, "r": 477.4806500000001, "b": 218.85168, "coord_origin": "TOPLEFT"}}, {"id": 7, "text": "eos et accusam et justo duo dolores et ea rebum.", "bbox": {"l": 133.76801, "t": 220.84509000000003, "r": 355.20087, "b": 230.80768, "coord_origin": "TOPLEFT"}}, {"id": 8, "text": "Stet clita kasd gubergren,", "bbox": {"l": 362.60107, "t": 220.84509000000003, "r": 477.47495000000004, "b": 230.80768, "coord_origin": "TOPLEFT"}}, {"id": 9, "text": "no sea takimata sanctus est Lorem ipsum dolor sit amet. Lorem ipsum dolor", "bbox": {"l": 133.76801, "t": 232.80005000000006, "r": 477.47083, "b": 242.76262999999994, "coord_origin": "TOPLEFT"}}, {"id": 10, "text": "sit amet, consetetur sadipscing elitr, sed diam nonumy eirmod tempor invidunt", "bbox": {"l": 133.76801, "t": 244.75507000000005, "r": 477.47092, "b": 254.71765000000005, "coord_origin": "TOPLEFT"}}, {"id": 11, "text": "ut labore et dolore magna aliquyam erat, sed diam voluptua. At vero eos et", "bbox": {"l": 133.76801, "t": 256.71007999999995, "r": 477.46588, "b": 266.67267000000004, "coord_origin": "TOPLEFT"}}, {"id": 12, "text": "accusam et justo duo dolores et ea rebum. Stet clita kasd gubergren, no sea", "bbox": {"l": 133.76801, "t": 268.66510000000005, "r": 477.47586, "b": 278.62769000000003, "coord_origin": "TOPLEFT"}}, {"id": 13, "text": "takimata sanctus est Lorem ipsum dolor sit amet.", "bbox": {"l": 133.76801, "t": 280.62103, "r": 351.47672, "b": 290.58365, "coord_origin": "TOPLEFT"}}], "children": []}, {"id": 5, "label": "caption", "bbox": {"l": 236.17598999999998, "t": 301.54204999999996, "r": 375.06958, "b": 311.50467, "coord_origin": "TOPLEFT"}, "confidence": 0.5425266027450562, "cells": [{"id": 14, "text": "Listing 1: Simple Java Program", "bbox": {"l": 236.17598999999998, "t": 301.54204999999996, "r": 375.06958, "b": 311.50467, "coord_origin": "TOPLEFT"}}], "children": []}, {"id": 4, "label": "code", "bbox": {"l": 134.239, "t": 317.79947000000004, "r": 337.59286, "b": 348.06418, "coord_origin": "TOPLEFT"}, "confidence": 0.6383119821548462, "cells": [{"id": 15, "text": "public static void", "bbox": {"l": 134.575, "t": 317.79947000000004, "r": 235.44695, "b": 326.14719, "coord_origin": "TOPLEFT"}}, {"id": 16, "text": "print() {", "bbox": {"l": 241.8759, "t": 317.79947000000004, "r": 292.24908, "b": 326.14719, "coord_origin": "TOPLEFT"}}, {"id": 17, "text": "System.out.println(", "bbox": {"l": 157.17101, "t": 328.75745, "r": 264.02368, "b": 337.10516000000007, "coord_origin": "TOPLEFT"}}, {"id": 18, "text": "\"Java Code\"", "bbox": {"l": 264.15811, "t": 328.75745, "r": 325.35376, "b": 337.10516000000007, "coord_origin": "TOPLEFT"}}, {"id": 19, "text": ");", "bbox": {"l": 326.2952, "t": 328.75745, "r": 337.59286, "b": 337.10516000000007, "coord_origin": "TOPLEFT"}}, {"id": 20, "text": "}", "bbox": {"l": 134.239, "t": 339.71646, "r": 138.94637, "b": 348.06418, "coord_origin": "TOPLEFT"}}], "children": []}, {"id": 1, "label": "text", "bbox": {"l": 133.76801, "t": 359.72406, "r": 477.47589, "b": 501.19366, "coord_origin": "TOPLEFT"}, "confidence": 0.9869542717933655, "cells": [{"id": 21, "text": "Lorem ipsum dolor sit amet, consetetur sadipscing elitr, sed diam nonumy", "bbox": {"l": 148.71201, "t": 359.72406, "r": 477.46985, "b": 369.68667999999997, "coord_origin": "TOPLEFT"}}, {"id": 22, "text": "eirmod tempor invidunt ut labore et dolore magna aliquyam erat, sed diam", "bbox": {"l": 133.76801, "t": 371.67905, "r": 477.4758, "b": 381.64166000000006, "coord_origin": "TOPLEFT"}}, {"id": 23, "text": "voluptua. At vero eos et accusam et justo duo dolores et ea rebum. Stet clita", "bbox": {"l": 133.76801, "t": 383.63403, "r": 477.4758, "b": 393.59665, "coord_origin": "TOPLEFT"}}, {"id": 24, "text": "kasd gubergren, no sea takimata sanctus est Lorem ipsum dolor sit amet. Lorem", "bbox": {"l": 133.76801, "t": 395.58905, "r": 477.47284, "b": 405.55167, "coord_origin": "TOPLEFT"}}, {"id": 25, "text": "ipsum dolor sit amet, consetetur sadipscing elitr, sed diam nonumy eirmod", "bbox": {"l": 133.76801, "t": 407.54404, "r": 477.47589, "b": 417.50665, "coord_origin": "TOPLEFT"}}, {"id": 26, "text": "tempor invidunt ut labore et dolore magna aliquyam erat, sed diam voluptua. At", "bbox": {"l": 133.76801, "t": 419.50003000000004, "r": 477.47177000000005, "b": 429.46265, "coord_origin": "TOPLEFT"}}, {"id": 27, "text": "vero eos et accusam et justo duo dolores et ea rebum. Stet clita kasd gubergren,", "bbox": {"l": 133.76801, "t": 431.45505, "r": 477.47387999999995, "b": 441.41766000000007, "coord_origin": "TOPLEFT"}}, {"id": 28, "text": "no sea takimata sanctus est Lorem ipsum dolor sit amet. Lorem ipsum dolor", "bbox": {"l": 133.76801, "t": 443.41003, "r": 477.47083, "b": 453.37265, "coord_origin": "TOPLEFT"}}, {"id": 29, "text": "sit amet, consetetur sadipscing elitr, sed diam nonumy eirmod tempor invidunt", "bbox": {"l": 133.76801, "t": 455.36505, "r": 477.47092, "b": 465.32767, "coord_origin": "TOPLEFT"}}, {"id": 30, "text": "ut labore et dolore magna aliquyam erat, sed diam voluptua. At vero eos et", "bbox": {"l": 133.76801, "t": 467.32004, "r": 477.46585, "b": 477.28265, "coord_origin": "TOPLEFT"}}, {"id": 31, "text": "accusam et justo duo dolores et ea rebum. Stet clita kasd gubergren, no sea", "bbox": {"l": 133.76801, "t": 479.27603, "r": 477.47586, "b": 489.23865, "coord_origin": "TOPLEFT"}}, {"id": 32, "text": "takimata sanctus est Lorem ipsum dolor sit amet.", "bbox": {"l": 133.76801, "t": 491.23105, "r": 351.47672, "b": 501.19366, "coord_origin": "TOPLEFT"}}], "children": []}, {"id": 3, "label": "page_footer", "bbox": {"l": 303.133, "t": 695.163055, "r": 308.11429, "b": 705.125648, "coord_origin": "TOPLEFT"}, "confidence": 0.8374724984169006, "cells": [{"id": 33, "text": "1", "bbox": {"l": 303.133, "t": 695.163055, "r": 308.11429, "b": 705.125648, "coord_origin": "TOPLEFT"}}], "children": []}]}, "tablestructure": {"table_map": {}}, "figures_classification": null, "equations_prediction": null}, "assembled": {"elements": [{"label": "section_header", "id": 2, "page_no": 0, "cluster": {"id": 2, "label": "section_header", "bbox": {"l": 133.76801, "t": 124.00536999999997, "r": 273.45401, "b": 138.36590999999999, "coord_origin": "TOPLEFT"}, "confidence": 0.9581764936447144, "cells": [{"id": 0, "text": "Java Code Example", "bbox": {"l": 133.76801, "t": 124.00536999999997, "r": 273.45401, "b": 138.36590999999999, "coord_origin": "TOPLEFT"}}], "children": []}, "text": "Java Code Example"}, {"label": "text", "id": 0, "page_no": 0, "cluster": {"id": 0, "label": "text", "bbox": {"l": 133.76801, "t": 149.11406999999997, "r": 477.4806500000001, "b": 290.58365, "coord_origin": "TOPLEFT"}, "confidence": 0.9872456789016724, "cells": [{"id": 1, "text": "Lorem ipsum dolor sit amet, consetetur sadipscing elitr, sed diam nonumy eir-", "bbox": {"l": 133.76801, "t": 149.11406999999997, "r": 477.47687, "b": 159.07665999999995, "coord_origin": "TOPLEFT"}}, {"id": 2, "text": "mod tempor invidunt ut labore et dolore magna aliquyam erat, sed diam volup-", "bbox": {"l": 133.76801, "t": 161.06908999999996, "r": 477.47781000000003, "b": 171.03168000000005, "coord_origin": "TOPLEFT"}}, {"id": 3, "text": "tua. At vero eos et accusam et justo duo dolores et ea rebum. Stet clita kasd", "bbox": {"l": 133.76801, "t": 173.02405, "r": 477.46985, "b": 182.98663, "coord_origin": "TOPLEFT"}}, {"id": 4, "text": "gubergren, no sea takimata sanctus est Lorem ipsum dolor sit amet. Lorem ip-", "bbox": {"l": 133.76801, "t": 184.97906, "r": 477.47186, "b": 194.94164999999998, "coord_origin": "TOPLEFT"}}, {"id": 5, "text": "sum dolor sit amet, consetetur sadipscing elitr, sed diam nonumy eirmod tempor", "bbox": {"l": 133.76801, "t": 196.93408, "r": 477.46993999999995, "b": 206.89666999999997, "coord_origin": "TOPLEFT"}}, {"id": 6, "text": "invidunt ut labore et dolore magna aliquyam erat, sed diam voluptua. At vero", "bbox": {"l": 133.76801, "t": 208.88909999999998, "r": 477.4806500000001, "b": 218.85168, "coord_origin": "TOPLEFT"}}, {"id": 7, "text": "eos et accusam et justo duo dolores et ea rebum.", "bbox": {"l": 133.76801, "t": 220.84509000000003, "r": 355.20087, "b": 230.80768, "coord_origin": "TOPLEFT"}}, {"id": 8, "text": "Stet clita kasd gubergren,", "bbox": {"l": 362.60107, "t": 220.84509000000003, "r": 477.47495000000004, "b": 230.80768, "coord_origin": "TOPLEFT"}}, {"id": 9, "text": "no sea takimata sanctus est Lorem ipsum dolor sit amet. Lorem ipsum dolor", "bbox": {"l": 133.76801, "t": 232.80005000000006, "r": 477.47083, "b": 242.76262999999994, "coord_origin": "TOPLEFT"}}, {"id": 10, "text": "sit amet, consetetur sadipscing elitr, sed diam nonumy eirmod tempor invidunt", "bbox": {"l": 133.76801, "t": 244.75507000000005, "r": 477.47092, "b": 254.71765000000005, "coord_origin": "TOPLEFT"}}, {"id": 11, "text": "ut labore et dolore magna aliquyam erat, sed diam voluptua. At vero eos et", "bbox": {"l": 133.76801, "t": 256.71007999999995, "r": 477.46588, "b": 266.67267000000004, "coord_origin": "TOPLEFT"}}, {"id": 12, "text": "accusam et justo duo dolores et ea rebum. Stet clita kasd gubergren, no sea", "bbox": {"l": 133.76801, "t": 268.66510000000005, "r": 477.47586, "b": 278.62769000000003, "coord_origin": "TOPLEFT"}}, {"id": 13, "text": "takimata sanctus est Lorem ipsum dolor sit amet.", "bbox": {"l": 133.76801, "t": 280.62103, "r": 351.47672, "b": 290.58365, "coord_origin": "TOPLEFT"}}], "children": []}, "text": "Lorem ipsum dolor sit amet, consetetur sadipscing elitr, sed diam nonumy eirmod tempor invidunt ut labore et dolore magna aliquyam erat, sed diam voluptua. At vero eos et accusam et justo duo dolores et ea rebum. Stet clita kasd gubergren, no sea takimata sanctus est Lorem ipsum dolor sit amet. Lorem ipsum dolor sit amet, consetetur sadipscing elitr, sed diam nonumy eirmod tempor invidunt ut labore et dolore magna aliquyam erat, sed diam voluptua. At vero eos et accusam et justo duo dolores et ea rebum. Stet clita kasd gubergren, no sea takimata sanctus est Lorem ipsum dolor sit amet. Lorem ipsum dolor sit amet, consetetur sadipscing elitr, sed diam nonumy eirmod tempor invidunt ut labore et dolore magna aliquyam erat, sed diam voluptua. At vero eos et accusam et justo duo dolores et ea rebum. Stet clita kasd gubergren, no sea takimata sanctus est Lorem ipsum dolor sit amet."}, {"label": "caption", "id": 5, "page_no": 0, "cluster": {"id": 5, "label": "caption", "bbox": {"l": 236.17598999999998, "t": 301.54204999999996, "r": 375.06958, "b": 311.50467, "coord_origin": "TOPLEFT"}, "confidence": 0.5425266027450562, "cells": [{"id": 14, "text": "Listing 1: Simple Java Program", "bbox": {"l": 236.17598999999998, "t": 301.54204999999996, "r": 375.06958, "b": 311.50467, "coord_origin": "TOPLEFT"}}], "children": []}, "text": "Listing 1: Simple Java Program"}, {"label": "code", "id": 4, "page_no": 0, "cluster": {"id": 4, "label": "code", "bbox": {"l": 134.239, "t": 317.79947000000004, "r": 337.59286, "b": 348.06418, "coord_origin": "TOPLEFT"}, "confidence": 0.6383119821548462, "cells": [{"id": 15, "text": "public static void", "bbox": {"l": 134.575, "t": 317.79947000000004, "r": 235.44695, "b": 326.14719, "coord_origin": "TOPLEFT"}}, {"id": 16, "text": "print() {", "bbox": {"l": 241.8759, "t": 317.79947000000004, "r": 292.24908, "b": 326.14719, "coord_origin": "TOPLEFT"}}, {"id": 17, "text": "System.out.println(", "bbox": {"l": 157.17101, "t": 328.75745, "r": 264.02368, "b": 337.10516000000007, "coord_origin": "TOPLEFT"}}, {"id": 18, "text": "\"Java Code\"", "bbox": {"l": 264.15811, "t": 328.75745, "r": 325.35376, "b": 337.10516000000007, "coord_origin": "TOPLEFT"}}, {"id": 19, "text": ");", "bbox": {"l": 326.2952, "t": 328.75745, "r": 337.59286, "b": 337.10516000000007, "coord_origin": "TOPLEFT"}}, {"id": 20, "text": "}", "bbox": {"l": 134.239, "t": 339.71646, "r": 138.94637, "b": 348.06418, "coord_origin": "TOPLEFT"}}], "children": []}, "text": "public static void print() { System.out.println( \"Java Code\" ); }"}, {"label": "text", "id": 1, "page_no": 0, "cluster": {"id": 1, "label": "text", "bbox": {"l": 133.76801, "t": 359.72406, "r": 477.47589, "b": 501.19366, "coord_origin": "TOPLEFT"}, "confidence": 0.9869542717933655, "cells": [{"id": 21, "text": "Lorem ipsum dolor sit amet, consetetur sadipscing elitr, sed diam nonumy", "bbox": {"l": 148.71201, "t": 359.72406, "r": 477.46985, "b": 369.68667999999997, "coord_origin": "TOPLEFT"}}, {"id": 22, "text": "eirmod tempor invidunt ut labore et dolore magna aliquyam erat, sed diam", "bbox": {"l": 133.76801, "t": 371.67905, "r": 477.4758, "b": 381.64166000000006, "coord_origin": "TOPLEFT"}}, {"id": 23, "text": "voluptua. At vero eos et accusam et justo duo dolores et ea rebum. Stet clita", "bbox": {"l": 133.76801, "t": 383.63403, "r": 477.4758, "b": 393.59665, "coord_origin": "TOPLEFT"}}, {"id": 24, "text": "kasd gubergren, no sea takimata sanctus est Lorem ipsum dolor sit amet. Lorem", "bbox": {"l": 133.76801, "t": 395.58905, "r": 477.47284, "b": 405.55167, "coord_origin": "TOPLEFT"}}, {"id": 25, "text": "ipsum dolor sit amet, consetetur sadipscing elitr, sed diam nonumy eirmod", "bbox": {"l": 133.76801, "t": 407.54404, "r": 477.47589, "b": 417.50665, "coord_origin": "TOPLEFT"}}, {"id": 26, "text": "tempor invidunt ut labore et dolore magna aliquyam erat, sed diam voluptua. At", "bbox": {"l": 133.76801, "t": 419.50003000000004, "r": 477.47177000000005, "b": 429.46265, "coord_origin": "TOPLEFT"}}, {"id": 27, "text": "vero eos et accusam et justo duo dolores et ea rebum. Stet clita kasd gubergren,", "bbox": {"l": 133.76801, "t": 431.45505, "r": 477.47387999999995, "b": 441.41766000000007, "coord_origin": "TOPLEFT"}}, {"id": 28, "text": "no sea takimata sanctus est Lorem ipsum dolor sit amet. Lorem ipsum dolor", "bbox": {"l": 133.76801, "t": 443.41003, "r": 477.47083, "b": 453.37265, "coord_origin": "TOPLEFT"}}, {"id": 29, "text": "sit amet, consetetur sadipscing elitr, sed diam nonumy eirmod tempor invidunt", "bbox": {"l": 133.76801, "t": 455.36505, "r": 477.47092, "b": 465.32767, "coord_origin": "TOPLEFT"}}, {"id": 30, "text": "ut labore et dolore magna aliquyam erat, sed diam voluptua. At vero eos et", "bbox": {"l": 133.76801, "t": 467.32004, "r": 477.46585, "b": 477.28265, "coord_origin": "TOPLEFT"}}, {"id": 31, "text": "accusam et justo duo dolores et ea rebum. Stet clita kasd gubergren, no sea", "bbox": {"l": 133.76801, "t": 479.27603, "r": 477.47586, "b": 489.23865, "coord_origin": "TOPLEFT"}}, {"id": 32, "text": "takimata sanctus est Lorem ipsum dolor sit amet.", "bbox": {"l": 133.76801, "t": 491.23105, "r": 351.47672, "b": 501.19366, "coord_origin": "TOPLEFT"}}], "children": []}, "text": "Lorem ipsum dolor sit amet, consetetur sadipscing elitr, sed diam nonumy eirmod tempor invidunt ut labore et dolore magna aliquyam erat, sed diam voluptua. At vero eos et accusam et justo duo dolores et ea rebum. Stet clita kasd gubergren, no sea takimata sanctus est Lorem ipsum dolor sit amet. Lorem ipsum dolor sit amet, consetetur sadipscing elitr, sed diam nonumy eirmod tempor invidunt ut labore et dolore magna aliquyam erat, sed diam voluptua. At vero eos et accusam et justo duo dolores et ea rebum. Stet clita kasd gubergren, no sea takimata sanctus est Lorem ipsum dolor sit amet. Lorem ipsum dolor sit amet, consetetur sadipscing elitr, sed diam nonumy eirmod tempor invidunt ut labore et dolore magna aliquyam erat, sed diam voluptua. At vero eos et accusam et justo duo dolores et ea rebum. Stet clita kasd gubergren, no sea takimata sanctus est Lorem ipsum dolor sit amet."}, {"label": "page_footer", "id": 3, "page_no": 0, "cluster": {"id": 3, "label": "page_footer", "bbox": {"l": 303.133, "t": 695.163055, "r": 308.11429, "b": 705.125648, "coord_origin": "TOPLEFT"}, "confidence": 0.8374724984169006, "cells": [{"id": 33, "text": "1", "bbox": {"l": 303.133, "t": 695.163055, "r": 308.11429, "b": 705.125648, "coord_origin": "TOPLEFT"}}], "children": []}, "text": "1"}], "body": [{"label": "section_header", "id": 2, "page_no": 0, "cluster": {"id": 2, "label": "section_header", "bbox": {"l": 133.76801, "t": 124.00536999999997, "r": 273.45401, "b": 138.36590999999999, "coord_origin": "TOPLEFT"}, "confidence": 0.9581764936447144, "cells": [{"id": 0, "text": "Java Code Example", "bbox": {"l": 133.76801, "t": 124.00536999999997, "r": 273.45401, "b": 138.36590999999999, "coord_origin": "TOPLEFT"}}], "children": []}, "text": "Java Code Example"}, {"label": "text", "id": 0, "page_no": 0, "cluster": {"id": 0, "label": "text", "bbox": {"l": 133.76801, "t": 149.11406999999997, "r": 477.4806500000001, "b": 290.58365, "coord_origin": "TOPLEFT"}, "confidence": 0.9872456789016724, "cells": [{"id": 1, "text": "Lorem ipsum dolor sit amet, consetetur sadipscing elitr, sed diam nonumy eir-", "bbox": {"l": 133.76801, "t": 149.11406999999997, "r": 477.47687, "b": 159.07665999999995, "coord_origin": "TOPLEFT"}}, {"id": 2, "text": "mod tempor invidunt ut labore et dolore magna aliquyam erat, sed diam volup-", "bbox": {"l": 133.76801, "t": 161.06908999999996, "r": 477.47781000000003, "b": 171.03168000000005, "coord_origin": "TOPLEFT"}}, {"id": 3, "text": "tua. At vero eos et accusam et justo duo dolores et ea rebum. Stet clita kasd", "bbox": {"l": 133.76801, "t": 173.02405, "r": 477.46985, "b": 182.98663, "coord_origin": "TOPLEFT"}}, {"id": 4, "text": "gubergren, no sea takimata sanctus est Lorem ipsum dolor sit amet. Lorem ip-", "bbox": {"l": 133.76801, "t": 184.97906, "r": 477.47186, "b": 194.94164999999998, "coord_origin": "TOPLEFT"}}, {"id": 5, "text": "sum dolor sit amet, consetetur sadipscing elitr, sed diam nonumy eirmod tempor", "bbox": {"l": 133.76801, "t": 196.93408, "r": 477.46993999999995, "b": 206.89666999999997, "coord_origin": "TOPLEFT"}}, {"id": 6, "text": "invidunt ut labore et dolore magna aliquyam erat, sed diam voluptua. At vero", "bbox": {"l": 133.76801, "t": 208.88909999999998, "r": 477.4806500000001, "b": 218.85168, "coord_origin": "TOPLEFT"}}, {"id": 7, "text": "eos et accusam et justo duo dolores et ea rebum.", "bbox": {"l": 133.76801, "t": 220.84509000000003, "r": 355.20087, "b": 230.80768, "coord_origin": "TOPLEFT"}}, {"id": 8, "text": "Stet clita kasd gubergren,", "bbox": {"l": 362.60107, "t": 220.84509000000003, "r": 477.47495000000004, "b": 230.80768, "coord_origin": "TOPLEFT"}}, {"id": 9, "text": "no sea takimata sanctus est Lorem ipsum dolor sit amet. Lorem ipsum dolor", "bbox": {"l": 133.76801, "t": 232.80005000000006, "r": 477.47083, "b": 242.76262999999994, "coord_origin": "TOPLEFT"}}, {"id": 10, "text": "sit amet, consetetur sadipscing elitr, sed diam nonumy eirmod tempor invidunt", "bbox": {"l": 133.76801, "t": 244.75507000000005, "r": 477.47092, "b": 254.71765000000005, "coord_origin": "TOPLEFT"}}, {"id": 11, "text": "ut labore et dolore magna aliquyam erat, sed diam voluptua. At vero eos et", "bbox": {"l": 133.76801, "t": 256.71007999999995, "r": 477.46588, "b": 266.67267000000004, "coord_origin": "TOPLEFT"}}, {"id": 12, "text": "accusam et justo duo dolores et ea rebum. Stet clita kasd gubergren, no sea", "bbox": {"l": 133.76801, "t": 268.66510000000005, "r": 477.47586, "b": 278.62769000000003, "coord_origin": "TOPLEFT"}}, {"id": 13, "text": "takimata sanctus est Lorem ipsum dolor sit amet.", "bbox": {"l": 133.76801, "t": 280.62103, "r": 351.47672, "b": 290.58365, "coord_origin": "TOPLEFT"}}], "children": []}, "text": "Lorem ipsum dolor sit amet, consetetur sadipscing elitr, sed diam nonumy eirmod tempor invidunt ut labore et dolore magna aliquyam erat, sed diam voluptua. At vero eos et accusam et justo duo dolores et ea rebum. Stet clita kasd gubergren, no sea takimata sanctus est Lorem ipsum dolor sit amet. Lorem ipsum dolor sit amet, consetetur sadipscing elitr, sed diam nonumy eirmod tempor invidunt ut labore et dolore magna aliquyam erat, sed diam voluptua. At vero eos et accusam et justo duo dolores et ea rebum. Stet clita kasd gubergren, no sea takimata sanctus est Lorem ipsum dolor sit amet. Lorem ipsum dolor sit amet, consetetur sadipscing elitr, sed diam nonumy eirmod tempor invidunt ut labore et dolore magna aliquyam erat, sed diam voluptua. At vero eos et accusam et justo duo dolores et ea rebum. Stet clita kasd gubergren, no sea takimata sanctus est Lorem ipsum dolor sit amet."}, {"label": "caption", "id": 5, "page_no": 0, "cluster": {"id": 5, "label": "caption", "bbox": {"l": 236.17598999999998, "t": 301.54204999999996, "r": 375.06958, "b": 311.50467, "coord_origin": "TOPLEFT"}, "confidence": 0.5425266027450562, "cells": [{"id": 14, "text": "Listing 1: Simple Java Program", "bbox": {"l": 236.17598999999998, "t": 301.54204999999996, "r": 375.06958, "b": 311.50467, "coord_origin": "TOPLEFT"}}], "children": []}, "text": "Listing 1: Simple Java Program"}, {"label": "code", "id": 4, "page_no": 0, "cluster": {"id": 4, "label": "code", "bbox": {"l": 134.239, "t": 317.79947000000004, "r": 337.59286, "b": 348.06418, "coord_origin": "TOPLEFT"}, "confidence": 0.6383119821548462, "cells": [{"id": 15, "text": "public static void", "bbox": {"l": 134.575, "t": 317.79947000000004, "r": 235.44695, "b": 326.14719, "coord_origin": "TOPLEFT"}}, {"id": 16, "text": "print() {", "bbox": {"l": 241.8759, "t": 317.79947000000004, "r": 292.24908, "b": 326.14719, "coord_origin": "TOPLEFT"}}, {"id": 17, "text": "System.out.println(", "bbox": {"l": 157.17101, "t": 328.75745, "r": 264.02368, "b": 337.10516000000007, "coord_origin": "TOPLEFT"}}, {"id": 18, "text": "\"Java Code\"", "bbox": {"l": 264.15811, "t": 328.75745, "r": 325.35376, "b": 337.10516000000007, "coord_origin": "TOPLEFT"}}, {"id": 19, "text": ");", "bbox": {"l": 326.2952, "t": 328.75745, "r": 337.59286, "b": 337.10516000000007, "coord_origin": "TOPLEFT"}}, {"id": 20, "text": "}", "bbox": {"l": 134.239, "t": 339.71646, "r": 138.94637, "b": 348.06418, "coord_origin": "TOPLEFT"}}], "children": []}, "text": "public static void print() { System.out.println( \"Java Code\" ); }"}, {"label": "text", "id": 1, "page_no": 0, "cluster": {"id": 1, "label": "text", "bbox": {"l": 133.76801, "t": 359.72406, "r": 477.47589, "b": 501.19366, "coord_origin": "TOPLEFT"}, "confidence": 0.9869542717933655, "cells": [{"id": 21, "text": "Lorem ipsum dolor sit amet, consetetur sadipscing elitr, sed diam nonumy", "bbox": {"l": 148.71201, "t": 359.72406, "r": 477.46985, "b": 369.68667999999997, "coord_origin": "TOPLEFT"}}, {"id": 22, "text": "eirmod tempor invidunt ut labore et dolore magna aliquyam erat, sed diam", "bbox": {"l": 133.76801, "t": 371.67905, "r": 477.4758, "b": 381.64166000000006, "coord_origin": "TOPLEFT"}}, {"id": 23, "text": "voluptua. At vero eos et accusam et justo duo dolores et ea rebum. Stet clita", "bbox": {"l": 133.76801, "t": 383.63403, "r": 477.4758, "b": 393.59665, "coord_origin": "TOPLEFT"}}, {"id": 24, "text": "kasd gubergren, no sea takimata sanctus est Lorem ipsum dolor sit amet. Lorem", "bbox": {"l": 133.76801, "t": 395.58905, "r": 477.47284, "b": 405.55167, "coord_origin": "TOPLEFT"}}, {"id": 25, "text": "ipsum dolor sit amet, consetetur sadipscing elitr, sed diam nonumy eirmod", "bbox": {"l": 133.76801, "t": 407.54404, "r": 477.47589, "b": 417.50665, "coord_origin": "TOPLEFT"}}, {"id": 26, "text": "tempor invidunt ut labore et dolore magna aliquyam erat, sed diam voluptua. At", "bbox": {"l": 133.76801, "t": 419.50003000000004, "r": 477.47177000000005, "b": 429.46265, "coord_origin": "TOPLEFT"}}, {"id": 27, "text": "vero eos et accusam et justo duo dolores et ea rebum. Stet clita kasd gubergren,", "bbox": {"l": 133.76801, "t": 431.45505, "r": 477.47387999999995, "b": 441.41766000000007, "coord_origin": "TOPLEFT"}}, {"id": 28, "text": "no sea takimata sanctus est Lorem ipsum dolor sit amet. Lorem ipsum dolor", "bbox": {"l": 133.76801, "t": 443.41003, "r": 477.47083, "b": 453.37265, "coord_origin": "TOPLEFT"}}, {"id": 29, "text": "sit amet, consetetur sadipscing elitr, sed diam nonumy eirmod tempor invidunt", "bbox": {"l": 133.76801, "t": 455.36505, "r": 477.47092, "b": 465.32767, "coord_origin": "TOPLEFT"}}, {"id": 30, "text": "ut labore et dolore magna aliquyam erat, sed diam voluptua. At vero eos et", "bbox": {"l": 133.76801, "t": 467.32004, "r": 477.46585, "b": 477.28265, "coord_origin": "TOPLEFT"}}, {"id": 31, "text": "accusam et justo duo dolores et ea rebum. Stet clita kasd gubergren, no sea", "bbox": {"l": 133.76801, "t": 479.27603, "r": 477.47586, "b": 489.23865, "coord_origin": "TOPLEFT"}}, {"id": 32, "text": "takimata sanctus est Lorem ipsum dolor sit amet.", "bbox": {"l": 133.76801, "t": 491.23105, "r": 351.47672, "b": 501.19366, "coord_origin": "TOPLEFT"}}], "children": []}, "text": "Lorem ipsum dolor sit amet, consetetur sadipscing elitr, sed diam nonumy eirmod tempor invidunt ut labore et dolore magna aliquyam erat, sed diam voluptua. At vero eos et accusam et justo duo dolores et ea rebum. Stet clita kasd gubergren, no sea takimata sanctus est Lorem ipsum dolor sit amet. Lorem ipsum dolor sit amet, consetetur sadipscing elitr, sed diam nonumy eirmod tempor invidunt ut labore et dolore magna aliquyam erat, sed diam voluptua. At vero eos et accusam et justo duo dolores et ea rebum. Stet clita kasd gubergren, no sea takimata sanctus est Lorem ipsum dolor sit amet. Lorem ipsum dolor sit amet, consetetur sadipscing elitr, sed diam nonumy eirmod tempor invidunt ut labore et dolore magna aliquyam erat, sed diam voluptua. At vero eos et accusam et justo duo dolores et ea rebum. Stet clita kasd gubergren, no sea takimata sanctus est Lorem ipsum dolor sit amet."}], "headers": [{"label": "page_footer", "id": 3, "page_no": 0, "cluster": {"id": 3, "label": "page_footer", "bbox": {"l": 303.133, "t": 695.163055, "r": 308.11429, "b": 705.125648, "coord_origin": "TOPLEFT"}, "confidence": 0.8374724984169006, "cells": [{"id": 33, "text": "1", "bbox": {"l": 303.133, "t": 695.163055, "r": 308.11429, "b": 705.125648, "coord_origin": "TOPLEFT"}}], "children": []}, "text": "1"}]}}, {"page_no": 1, "size": {"width": 595.2760009765625, "height": 841.8900146484375}, "cells": [{"id": 0, "text": "Formula", "bbox": {"l": 133.76801021944917, "t": 124.00537068468714, "r": 191.51430031418315, "b": 138.36590076397772, "coord_origin": "TOPLEFT"}}, {"id": 1, "text": "Lorem ipsum dolor sit amet, consetetur sadipscing elitr, sed diam nonumy eir-", "bbox": {"l": 133.76801021944917, "t": 149.114070823323, "r": 477.47687078331063, "b": 159.0766608783307, "coord_origin": "TOPLEFT"}}, {"id": 2, "text": "mod tempor invidunt ut labore et dolore magna aliquyam erat, sed diam volup-", "bbox": {"l": 133.76801021944917, "t": 161.06909088933185, "r": 477.4778107833122, "b": 171.03167094433945, "coord_origin": "TOPLEFT"}}, {"id": 3, "text": "tua. At vero eos et accusam et justo duo dolores et ea rebum. Stet clita kasd", "bbox": {"l": 133.76801021944917, "t": 173.0241009553406, "r": 477.46985078329914, "b": 182.9866910103483, "coord_origin": "TOPLEFT"}}, {"id": 4, "text": "gubergren, no sea takimata sanctus est Lorem ipsum dolor sit amet. Lorem ip-", "bbox": {"l": 133.76801021944917, "t": 184.97906102134914, "r": 477.4718607833024, "b": 194.94165107635683, "coord_origin": "TOPLEFT"}}, {"id": 5, "text": "sum dolor sit amet, consetetur sadipscing elitr, sed diam nonumy eirmod tempor", "bbox": {"l": 133.76801021944917, "t": 196.93511108736357, "r": 477.4699407832993, "b": 206.89770114237137, "coord_origin": "TOPLEFT"}}, {"id": 6, "text": "invidunt ut labore et dolore magna aliquyam erat, sed diam voluptua. At vero", "bbox": {"l": 133.76801021944917, "t": 208.890071153372, "r": 477.4806507833169, "b": 218.8526612083799, "coord_origin": "TOPLEFT"}}, {"id": 7, "text": "eos et accusam et justo duo dolores et ea rebum.", "bbox": {"l": 133.76801021944917, "t": 220.84509121938083, "r": 355.20087058271434, "b": 230.80767127438855, "coord_origin": "TOPLEFT"}}, {"id": 8, "text": "Stet clita kasd gubergren,", "bbox": {"l": 362.6010705948545, "t": 220.84509121938083, "r": 477.4749507833075, "b": 230.80767127438855, "coord_origin": "TOPLEFT"}}, {"id": 9, "text": "no sea takimata sanctus est Lorem ipsum dolor sit amet. Lorem ipsum dolor", "bbox": {"l": 133.76801021944917, "t": 232.8001112853898, "r": 477.4708307833007, "b": 242.76269134039728, "coord_origin": "TOPLEFT"}}, {"id": 10, "text": "sit amet, consetetur sadipscing elitr, sed diam nonumy eirmod tempor invidunt", "bbox": {"l": 133.76801021944917, "t": 244.75506135139813, "r": 477.4709207833008, "b": 254.71765140640582, "coord_origin": "TOPLEFT"}}, {"id": 11, "text": "ut labore et dolore magna aliquyam erat, sed diam voluptua. At vero eos et", "bbox": {"l": 133.76801021944917, "t": 256.71008141740685, "r": 477.4658807832926, "b": 266.6726614724146, "coord_origin": "TOPLEFT"}}, {"id": 12, "text": "accusam et justo duo dolores et ea rebum. Stet clita kasd gubergren, no sea", "bbox": {"l": 133.76801021944917, "t": 268.6660714834211, "r": 477.47586078330903, "b": 278.6286615384289, "coord_origin": "TOPLEFT"}}, {"id": 13, "text": "takimata sanctus est Lorem ipsum dolor sit amet.", "bbox": {"l": 133.76801021944917, "t": 280.62109154942993, "r": 351.4767205766048, "b": 290.58367160443765, "coord_origin": "TOPLEFT"}}, {"id": 14, "text": "Duis autem vel eum iriure dolor in hendrerit in vulputate velit esse molestie", "bbox": {"l": 148.71201024396512, "t": 292.5761116154388, "r": 477.4710707833011, "b": 302.5386916704465, "coord_origin": "TOPLEFT"}}, {"id": 15, "text": "consequat, vel illum dolore eu feugiat nulla facilisis at vero eros et accumsan et", "bbox": {"l": 133.76801021944917, "t": 304.5310616814471, "r": 477.47293078330415, "b": 314.4936517364549, "coord_origin": "TOPLEFT"}}, {"id": 16, "text": "iusto odio dignissim qui blandit praesent luptatum zzril delenit augue duis dolore", "bbox": {"l": 133.76801021944917, "t": 316.48608174745607, "r": 477.47482078330734, "b": 326.4486618024637, "coord_origin": "TOPLEFT"}}, {"id": 17, "text": "te feugait nulla facilisi. Lorem ipsum dolor sit amet, consectetuer adipiscing elit,", "bbox": {"l": 133.76801021944917, "t": 328.4411018134648, "r": 477.472810783304, "b": 338.40368186847246, "coord_origin": "TOPLEFT"}}, {"id": 18, "text": "sed diam nonummy nibh euismod tincidunt.", "bbox": {"l": 133.76801021944917, "t": 340.3970618794788, "r": 326.20752053515014, "b": 350.3596819344868, "coord_origin": "TOPLEFT"}}, {"id": 19, "text": "a", "bbox": {"l": 280.5539904602546, "t": 364.3070620114962, "r": 285.8142404688841, "b": 374.2696820665041, "coord_origin": "TOPLEFT"}}, {"id": 20, "text": "2", "bbox": {"l": 285.8200104688936, "t": 362.4346620011579, "r": 289.78809047540335, "b": 369.40844203966316, "coord_origin": "TOPLEFT"}}, {"id": 21, "text": "+ 8 = 12", "bbox": {"l": 292.50400047985886, "t": 364.3070620114962, "r": 330.6965905425146, "b": 374.2696820665041, "coord_origin": "TOPLEFT"}}, {"id": 22, "text": "Lorem ipsum dolor sit amet, consetetur sadipscing elitr, sed diam nonumy", "bbox": {"l": 148.71201024396512, "t": 382.24005211051195, "r": 477.4698807832991, "b": 392.2026621655199, "coord_origin": "TOPLEFT"}}, {"id": 23, "text": "eirmod tempor invidunt ut labore et dolore magna aliquyam erat, sed diam", "bbox": {"l": 133.76801021944917, "t": 394.19506217652076, "r": 477.47580078330884, "b": 404.15768223152867, "coord_origin": "TOPLEFT"}}, {"id": 24, "text": "voluptua. At vero eos et accusam et justo duo dolores et ea rebum. Stet clita", "bbox": {"l": 133.76801021944917, "t": 406.15005224252945, "r": 477.47580078330884, "b": 416.1126722975373, "coord_origin": "TOPLEFT"}}, {"id": 25, "text": "kasd gubergren, no sea takimata sanctus est Lorem ipsum dolor sit amet. Lorem", "bbox": {"l": 133.76801021944917, "t": 418.10507230853824, "r": 477.47284078330404, "b": 428.06768236354606, "coord_origin": "TOPLEFT"}}, {"id": 26, "text": "ipsum dolor sit amet, consetetur sadipscing elitr, sed diam nonumy eirmod", "bbox": {"l": 133.76801021944917, "t": 430.0600523745468, "r": 477.47589078330907, "b": 440.0226724295547, "coord_origin": "TOPLEFT"}}, {"id": 27, "text": "tempor invidunt ut labore et dolore magna aliquyam erat, sed diam voluptua. At", "bbox": {"l": 133.76801021944917, "t": 442.01605244056105, "r": 477.4717707833023, "b": 451.97866249556887, "coord_origin": "TOPLEFT"}}, {"id": 28, "text": "vero eos et accusam et justo duo dolores et ea rebum. Stet clita kasd gubergren,", "bbox": {"l": 133.76801021944917, "t": 453.9710625065698, "r": 477.47388078330573, "b": 463.93368256157777, "coord_origin": "TOPLEFT"}}, {"id": 29, "text": "no sea takimata sanctus est Lorem ipsum dolor sit amet. Lorem ipsum dolor", "bbox": {"l": 133.76801021944917, "t": 465.9260525725785, "r": 477.4708307833007, "b": 475.88867262758635, "coord_origin": "TOPLEFT"}}, {"id": 30, "text": "sit amet, consetetur sadipscing elitr, sed diam nonumy eirmod tempor invidunt", "bbox": {"l": 133.76801021944917, "t": 477.8810726385873, "r": 477.4709207833008, "b": 487.84368269359516, "coord_origin": "TOPLEFT"}}, {"id": 31, "text": "ut labore et dolore magna aliquyam erat, sed diam voluptua. At vero eos et", "bbox": {"l": 133.76801021944917, "t": 489.8360527045959, "r": 477.4658807832926, "b": 499.7986727596038, "coord_origin": "TOPLEFT"}}, {"id": 32, "text": "accusam et justo duo dolores et ea rebum. Stet clita kasd gubergren, no sea", "bbox": {"l": 133.76801021944917, "t": 501.7910427706045, "r": 477.47586078330903, "b": 511.7536628256125, "coord_origin": "TOPLEFT"}}, {"id": 33, "text": "takimata sanctus est Lorem ipsum dolor sit amet.", "bbox": {"l": 133.76801021944917, "t": 513.7470728366188, "r": 351.4767205766048, "b": 523.7096828916267, "coord_origin": "TOPLEFT"}}, {"id": 34, "text": "Duis autem vel eum iriure dolor in hendrerit in vulputate velit esse molestie", "bbox": {"l": 148.71201024396512, "t": 525.7020529026275, "r": 477.4710707833011, "b": 535.6646729576354, "coord_origin": "TOPLEFT"}}, {"id": 35, "text": "consequat, vel illum dolore eu feugiat nulla facilisis at vero eros et accumsan et", "bbox": {"l": 133.76801021944917, "t": 537.6570729686364, "r": 477.47293078330415, "b": 547.6196930236442, "coord_origin": "TOPLEFT"}}, {"id": 36, "text": "iusto odio dignissim qui blandit praesent luptatum zzril delenit augue duis dolore", "bbox": {"l": 133.76801021944917, "t": 549.612063034645, "r": 477.47482078330734, "b": 559.5746730896528, "coord_origin": "TOPLEFT"}}, {"id": 37, "text": "te feugait nulla facilisi. Lorem ipsum dolor sit amet, consectetuer adipiscing elit,", "bbox": {"l": 133.76801021944917, "t": 561.5670431006536, "r": 477.472810783304, "b": 571.5296631556615, "coord_origin": "TOPLEFT"}}, {"id": 38, "text": "sed diam nonummy nibh euismod tincidunt ut laoreet dolore magna aliquam erat", "bbox": {"l": 133.76801021944917, "t": 573.5220631666624, "r": 477.47479078330724, "b": 583.4846832216704, "coord_origin": "TOPLEFT"}}, {"id": 39, "text": "volutpat.", "bbox": {"l": 133.76801021944917, "t": 585.4780532326765, "r": 173.0565202839028, "b": 595.4406532876844, "coord_origin": "TOPLEFT"}}, {"id": 40, "text": "Duis autem vel eum iriure dolor in hendrerit in vulputate velit esse molestie", "bbox": {"l": 148.71201024396512, "t": 597.4330632986853, "r": 477.4710707833011, "b": 607.3956633536932, "coord_origin": "TOPLEFT"}}, {"id": 41, "text": "consequat, vel illum dolore eu feugiat nulla facilisis at vero eros et accumsan et", "bbox": {"l": 133.76801021944917, "t": 609.3880633646941, "r": 477.47293078330415, "b": 619.3506634197018, "coord_origin": "TOPLEFT"}}, {"id": 42, "text": "iusto odio dignissim qui blandit praesent luptatum zzril delenit augue duis dolore", "bbox": {"l": 133.76801021944917, "t": 621.3430634307027, "r": 477.47482078330734, "b": 631.3056634857105, "coord_origin": "TOPLEFT"}}, {"id": 43, "text": "te feugait nulla facilisi. Lorem ipsum dolor sit amet, consectetuer adipiscing elit,", "bbox": {"l": 133.76801021944917, "t": 633.2980634967114, "r": 477.472810783304, "b": 643.2606635517193, "coord_origin": "TOPLEFT"}}, {"id": 44, "text": "sed diam nonummy nibh euismod tincidunt ut laoreet dolore magna aliquam erat", "bbox": {"l": 133.76801021944917, "t": 645.2540535627256, "r": 477.47479078330724, "b": 655.2166536177334, "coord_origin": "TOPLEFT"}}, {"id": 45, "text": "volutpat.", "bbox": {"l": 133.76801021944917, "t": 657.2090636287344, "r": 173.0565202839028, "b": 667.1716636837423, "coord_origin": "TOPLEFT"}}, {"id": 46, "text": "1", "bbox": {"l": 303.13300049729594, "t": 695.1640638383003, "r": 308.1142905054678, "b": 705.1266638933081, "coord_origin": "TOPLEFT"}}], "predictions": {"layout": {"clusters": [{"id": 5, "label": "section_header", "bbox": {"l": 133.76801021944917, "t": 124.00537068468714, "r": 191.51430031418315, "b": 138.36590076397772, "coord_origin": "TOPLEFT"}, "confidence": 0.95060133934021, "cells": [{"id": 0, "text": "Formula", "bbox": {"l": 133.76801021944917, "t": 124.00537068468714, "r": 191.51430031418315, "b": 138.36590076397772, "coord_origin": "TOPLEFT"}}], "children": []}, {"id": 1, "label": "text", "bbox": {"l": 133.76801021944917, "t": 149.114070823323, "r": 477.4806507833169, "b": 290.58367160443765, "coord_origin": "TOPLEFT"}, "confidence": 0.9878177046775818, "cells": [{"id": 1, "text": "Lorem ipsum dolor sit amet, consetetur sadipscing elitr, sed diam nonumy eir-", "bbox": {"l": 133.76801021944917, "t": 149.114070823323, "r": 477.47687078331063, "b": 159.0766608783307, "coord_origin": "TOPLEFT"}}, {"id": 2, "text": "mod tempor invidunt ut labore et dolore magna aliquyam erat, sed diam volup-", "bbox": {"l": 133.76801021944917, "t": 161.06909088933185, "r": 477.4778107833122, "b": 171.03167094433945, "coord_origin": "TOPLEFT"}}, {"id": 3, "text": "tua. At vero eos et accusam et justo duo dolores et ea rebum. Stet clita kasd", "bbox": {"l": 133.76801021944917, "t": 173.0241009553406, "r": 477.46985078329914, "b": 182.9866910103483, "coord_origin": "TOPLEFT"}}, {"id": 4, "text": "gubergren, no sea takimata sanctus est Lorem ipsum dolor sit amet. Lorem ip-", "bbox": {"l": 133.76801021944917, "t": 184.97906102134914, "r": 477.4718607833024, "b": 194.94165107635683, "coord_origin": "TOPLEFT"}}, {"id": 5, "text": "sum dolor sit amet, consetetur sadipscing elitr, sed diam nonumy eirmod tempor", "bbox": {"l": 133.76801021944917, "t": 196.93511108736357, "r": 477.4699407832993, "b": 206.89770114237137, "coord_origin": "TOPLEFT"}}, {"id": 6, "text": "invidunt ut labore et dolore magna aliquyam erat, sed diam voluptua. At vero", "bbox": {"l": 133.76801021944917, "t": 208.890071153372, "r": 477.4806507833169, "b": 218.8526612083799, "coord_origin": "TOPLEFT"}}, {"id": 7, "text": "eos et accusam et justo duo dolores et ea rebum.", "bbox": {"l": 133.76801021944917, "t": 220.84509121938083, "r": 355.20087058271434, "b": 230.80767127438855, "coord_origin": "TOPLEFT"}}, {"id": 8, "text": "Stet clita kasd gubergren,", "bbox": {"l": 362.6010705948545, "t": 220.84509121938083, "r": 477.4749507833075, "b": 230.80767127438855, "coord_origin": "TOPLEFT"}}, {"id": 9, "text": "no sea takimata sanctus est Lorem ipsum dolor sit amet. Lorem ipsum dolor", "bbox": {"l": 133.76801021944917, "t": 232.8001112853898, "r": 477.4708307833007, "b": 242.76269134039728, "coord_origin": "TOPLEFT"}}, {"id": 10, "text": "sit amet, consetetur sadipscing elitr, sed diam nonumy eirmod tempor invidunt", "bbox": {"l": 133.76801021944917, "t": 244.75506135139813, "r": 477.4709207833008, "b": 254.71765140640582, "coord_origin": "TOPLEFT"}}, {"id": 11, "text": "ut labore et dolore magna aliquyam erat, sed diam voluptua. At vero eos et", "bbox": {"l": 133.76801021944917, "t": 256.71008141740685, "r": 477.4658807832926, "b": 266.6726614724146, "coord_origin": "TOPLEFT"}}, {"id": 12, "text": "accusam et justo duo dolores et ea rebum. Stet clita kasd gubergren, no sea", "bbox": {"l": 133.76801021944917, "t": 268.6660714834211, "r": 477.47586078330903, "b": 278.6286615384289, "coord_origin": "TOPLEFT"}}, {"id": 13, "text": "takimata sanctus est Lorem ipsum dolor sit amet.", "bbox": {"l": 133.76801021944917, "t": 280.62109154942993, "r": 351.4767205766048, "b": 290.58367160443765, "coord_origin": "TOPLEFT"}}], "children": []}, {"id": 3, "label": "text", "bbox": {"l": 133.76801021944917, "t": 292.5761116154388, "r": 477.47482078330734, "b": 350.3596819344868, "coord_origin": "TOPLEFT"}, "confidence": 0.9855858683586121, "cells": [{"id": 14, "text": "Duis autem vel eum iriure dolor in hendrerit in vulputate velit esse molestie", "bbox": {"l": 148.71201024396512, "t": 292.5761116154388, "r": 477.4710707833011, "b": 302.5386916704465, "coord_origin": "TOPLEFT"}}, {"id": 15, "text": "consequat, vel illum dolore eu feugiat nulla facilisis at vero eros et accumsan et", "bbox": {"l": 133.76801021944917, "t": 304.5310616814471, "r": 477.47293078330415, "b": 314.4936517364549, "coord_origin": "TOPLEFT"}}, {"id": 16, "text": "iusto odio dignissim qui blandit praesent luptatum zzril delenit augue duis dolore", "bbox": {"l": 133.76801021944917, "t": 316.48608174745607, "r": 477.47482078330734, "b": 326.4486618024637, "coord_origin": "TOPLEFT"}}, {"id": 17, "text": "te feugait nulla facilisi. Lorem ipsum dolor sit amet, consectetuer adipiscing elit,", "bbox": {"l": 133.76801021944917, "t": 328.4411018134648, "r": 477.472810783304, "b": 338.40368186847246, "coord_origin": "TOPLEFT"}}, {"id": 18, "text": "sed diam nonummy nibh euismod tincidunt.", "bbox": {"l": 133.76801021944917, "t": 340.3970618794788, "r": 326.20752053515014, "b": 350.3596819344868, "coord_origin": "TOPLEFT"}}], "children": []}, {"id": 6, "label": "formula", "bbox": {"l": 280.5539904602546, "t": 362.4346620011579, "r": 330.6965905425146, "b": 374.2696820665041, "coord_origin": "TOPLEFT"}, "confidence": 0.8711639046669006, "cells": [{"id": 19, "text": "a", "bbox": {"l": 280.5539904602546, "t": 364.3070620114962, "r": 285.8142404688841, "b": 374.2696820665041, "coord_origin": "TOPLEFT"}}, {"id": 20, "text": "2", "bbox": {"l": 285.8200104688936, "t": 362.4346620011579, "r": 289.78809047540335, "b": 369.40844203966316, "coord_origin": "TOPLEFT"}}, {"id": 21, "text": "+ 8 = 12", "bbox": {"l": 292.50400047985886, "t": 364.3070620114962, "r": 330.6965905425146, "b": 374.2696820665041, "coord_origin": "TOPLEFT"}}], "children": []}, {"id": 0, "label": "text", "bbox": {"l": 133.76801021944917, "t": 382.24005211051195, "r": 477.47589078330907, "b": 523.7096828916267, "coord_origin": "TOPLEFT"}, "confidence": 0.9880034923553467, "cells": [{"id": 22, "text": "Lorem ipsum dolor sit amet, consetetur sadipscing elitr, sed diam nonumy", "bbox": {"l": 148.71201024396512, "t": 382.24005211051195, "r": 477.4698807832991, "b": 392.2026621655199, "coord_origin": "TOPLEFT"}}, {"id": 23, "text": "eirmod tempor invidunt ut labore et dolore magna aliquyam erat, sed diam", "bbox": {"l": 133.76801021944917, "t": 394.19506217652076, "r": 477.47580078330884, "b": 404.15768223152867, "coord_origin": "TOPLEFT"}}, {"id": 24, "text": "voluptua. At vero eos et accusam et justo duo dolores et ea rebum. Stet clita", "bbox": {"l": 133.76801021944917, "t": 406.15005224252945, "r": 477.47580078330884, "b": 416.1126722975373, "coord_origin": "TOPLEFT"}}, {"id": 25, "text": "kasd gubergren, no sea takimata sanctus est Lorem ipsum dolor sit amet. Lorem", "bbox": {"l": 133.76801021944917, "t": 418.10507230853824, "r": 477.47284078330404, "b": 428.06768236354606, "coord_origin": "TOPLEFT"}}, {"id": 26, "text": "ipsum dolor sit amet, consetetur sadipscing elitr, sed diam nonumy eirmod", "bbox": {"l": 133.76801021944917, "t": 430.0600523745468, "r": 477.47589078330907, "b": 440.0226724295547, "coord_origin": "TOPLEFT"}}, {"id": 27, "text": "tempor invidunt ut labore et dolore magna aliquyam erat, sed diam voluptua. At", "bbox": {"l": 133.76801021944917, "t": 442.01605244056105, "r": 477.4717707833023, "b": 451.97866249556887, "coord_origin": "TOPLEFT"}}, {"id": 28, "text": "vero eos et accusam et justo duo dolores et ea rebum. Stet clita kasd gubergren,", "bbox": {"l": 133.76801021944917, "t": 453.9710625065698, "r": 477.47388078330573, "b": 463.93368256157777, "coord_origin": "TOPLEFT"}}, {"id": 29, "text": "no sea takimata sanctus est Lorem ipsum dolor sit amet. Lorem ipsum dolor", "bbox": {"l": 133.76801021944917, "t": 465.9260525725785, "r": 477.4708307833007, "b": 475.88867262758635, "coord_origin": "TOPLEFT"}}, {"id": 30, "text": "sit amet, consetetur sadipscing elitr, sed diam nonumy eirmod tempor invidunt", "bbox": {"l": 133.76801021944917, "t": 477.8810726385873, "r": 477.4709207833008, "b": 487.84368269359516, "coord_origin": "TOPLEFT"}}, {"id": 31, "text": "ut labore et dolore magna aliquyam erat, sed diam voluptua. At vero eos et", "bbox": {"l": 133.76801021944917, "t": 489.8360527045959, "r": 477.4658807832926, "b": 499.7986727596038, "coord_origin": "TOPLEFT"}}, {"id": 32, "text": "accusam et justo duo dolores et ea rebum. Stet clita kasd gubergren, no sea", "bbox": {"l": 133.76801021944917, "t": 501.7910427706045, "r": 477.47586078330903, "b": 511.7536628256125, "coord_origin": "TOPLEFT"}}, {"id": 33, "text": "takimata sanctus est Lorem ipsum dolor sit amet.", "bbox": {"l": 133.76801021944917, "t": 513.7470728366188, "r": 351.4767205766048, "b": 523.7096828916267, "coord_origin": "TOPLEFT"}}], "children": []}, {"id": 4, "label": "text", "bbox": {"l": 133.76801021944917, "t": 525.7020529026275, "r": 477.47482078330734, "b": 595.4406532876844, "coord_origin": "TOPLEFT"}, "confidence": 0.9844803214073181, "cells": [{"id": 34, "text": "Duis autem vel eum iriure dolor in hendrerit in vulputate velit esse molestie", "bbox": {"l": 148.71201024396512, "t": 525.7020529026275, "r": 477.4710707833011, "b": 535.6646729576354, "coord_origin": "TOPLEFT"}}, {"id": 35, "text": "consequat, vel illum dolore eu feugiat nulla facilisis at vero eros et accumsan et", "bbox": {"l": 133.76801021944917, "t": 537.6570729686364, "r": 477.47293078330415, "b": 547.6196930236442, "coord_origin": "TOPLEFT"}}, {"id": 36, "text": "iusto odio dignissim qui blandit praesent luptatum zzril delenit augue duis dolore", "bbox": {"l": 133.76801021944917, "t": 549.612063034645, "r": 477.47482078330734, "b": 559.5746730896528, "coord_origin": "TOPLEFT"}}, {"id": 37, "text": "te feugait nulla facilisi. Lorem ipsum dolor sit amet, consectetuer adipiscing elit,", "bbox": {"l": 133.76801021944917, "t": 561.5670431006536, "r": 477.472810783304, "b": 571.5296631556615, "coord_origin": "TOPLEFT"}}, {"id": 38, "text": "sed diam nonummy nibh euismod tincidunt ut laoreet dolore magna aliquam erat", "bbox": {"l": 133.76801021944917, "t": 573.5220631666624, "r": 477.47479078330724, "b": 583.4846832216704, "coord_origin": "TOPLEFT"}}, {"id": 39, "text": "volutpat.", "bbox": {"l": 133.76801021944917, "t": 585.4780532326765, "r": 173.0565202839028, "b": 595.4406532876844, "coord_origin": "TOPLEFT"}}], "children": []}, {"id": 2, "label": "text", "bbox": {"l": 133.76801021944917, "t": 597.4330632986853, "r": 477.47482078330734, "b": 667.1716636837423, "coord_origin": "TOPLEFT"}, "confidence": 0.9872003793716431, "cells": [{"id": 40, "text": "Duis autem vel eum iriure dolor in hendrerit in vulputate velit esse molestie", "bbox": {"l": 148.71201024396512, "t": 597.4330632986853, "r": 477.4710707833011, "b": 607.3956633536932, "coord_origin": "TOPLEFT"}}, {"id": 41, "text": "consequat, vel illum dolore eu feugiat nulla facilisis at vero eros et accumsan et", "bbox": {"l": 133.76801021944917, "t": 609.3880633646941, "r": 477.47293078330415, "b": 619.3506634197018, "coord_origin": "TOPLEFT"}}, {"id": 42, "text": "iusto odio dignissim qui blandit praesent luptatum zzril delenit augue duis dolore", "bbox": {"l": 133.76801021944917, "t": 621.3430634307027, "r": 477.47482078330734, "b": 631.3056634857105, "coord_origin": "TOPLEFT"}}, {"id": 43, "text": "te feugait nulla facilisi. Lorem ipsum dolor sit amet, consectetuer adipiscing elit,", "bbox": {"l": 133.76801021944917, "t": 633.2980634967114, "r": 477.472810783304, "b": 643.2606635517193, "coord_origin": "TOPLEFT"}}, {"id": 44, "text": "sed diam nonummy nibh euismod tincidunt ut laoreet dolore magna aliquam erat", "bbox": {"l": 133.76801021944917, "t": 645.2540535627256, "r": 477.47479078330724, "b": 655.2166536177334, "coord_origin": "TOPLEFT"}}, {"id": 45, "text": "volutpat.", "bbox": {"l": 133.76801021944917, "t": 657.2090636287344, "r": 173.0565202839028, "b": 667.1716636837423, "coord_origin": "TOPLEFT"}}], "children": []}, {"id": 7, "label": "page_footer", "bbox": {"l": 303.13300049729594, "t": 695.1640638383003, "r": 308.1142905054678, "b": 705.1266638933081, "coord_origin": "TOPLEFT"}, "confidence": 0.850279688835144, "cells": [{"id": 46, "text": "1", "bbox": {"l": 303.13300049729594, "t": 695.1640638383003, "r": 308.1142905054678, "b": 705.1266638933081, "coord_origin": "TOPLEFT"}}], "children": []}]}, "tablestructure": {"table_map": {}}, "figures_classification": null, "equations_prediction": null}, "assembled": {"elements": [{"label": "section_header", "id": 5, "page_no": 1, "cluster": {"id": 5, "label": "section_header", "bbox": {"l": 133.76801021944917, "t": 124.00537068468714, "r": 191.51430031418315, "b": 138.36590076397772, "coord_origin": "TOPLEFT"}, "confidence": 0.95060133934021, "cells": [{"id": 0, "text": "Formula", "bbox": {"l": 133.76801021944917, "t": 124.00537068468714, "r": 191.51430031418315, "b": 138.36590076397772, "coord_origin": "TOPLEFT"}}], "children": []}, "text": "Formula"}, {"label": "text", "id": 1, "page_no": 1, "cluster": {"id": 1, "label": "text", "bbox": {"l": 133.76801021944917, "t": 149.114070823323, "r": 477.4806507833169, "b": 290.58367160443765, "coord_origin": "TOPLEFT"}, "confidence": 0.9878177046775818, "cells": [{"id": 1, "text": "Lorem ipsum dolor sit amet, consetetur sadipscing elitr, sed diam nonumy eir-", "bbox": {"l": 133.76801021944917, "t": 149.114070823323, "r": 477.47687078331063, "b": 159.0766608783307, "coord_origin": "TOPLEFT"}}, {"id": 2, "text": "mod tempor invidunt ut labore et dolore magna aliquyam erat, sed diam volup-", "bbox": {"l": 133.76801021944917, "t": 161.06909088933185, "r": 477.4778107833122, "b": 171.03167094433945, "coord_origin": "TOPLEFT"}}, {"id": 3, "text": "tua. At vero eos et accusam et justo duo dolores et ea rebum. Stet clita kasd", "bbox": {"l": 133.76801021944917, "t": 173.0241009553406, "r": 477.46985078329914, "b": 182.9866910103483, "coord_origin": "TOPLEFT"}}, {"id": 4, "text": "gubergren, no sea takimata sanctus est Lorem ipsum dolor sit amet. Lorem ip-", "bbox": {"l": 133.76801021944917, "t": 184.97906102134914, "r": 477.4718607833024, "b": 194.94165107635683, "coord_origin": "TOPLEFT"}}, {"id": 5, "text": "sum dolor sit amet, consetetur sadipscing elitr, sed diam nonumy eirmod tempor", "bbox": {"l": 133.76801021944917, "t": 196.93511108736357, "r": 477.4699407832993, "b": 206.89770114237137, "coord_origin": "TOPLEFT"}}, {"id": 6, "text": "invidunt ut labore et dolore magna aliquyam erat, sed diam voluptua. At vero", "bbox": {"l": 133.76801021944917, "t": 208.890071153372, "r": 477.4806507833169, "b": 218.8526612083799, "coord_origin": "TOPLEFT"}}, {"id": 7, "text": "eos et accusam et justo duo dolores et ea rebum.", "bbox": {"l": 133.76801021944917, "t": 220.84509121938083, "r": 355.20087058271434, "b": 230.80767127438855, "coord_origin": "TOPLEFT"}}, {"id": 8, "text": "Stet clita kasd gubergren,", "bbox": {"l": 362.6010705948545, "t": 220.84509121938083, "r": 477.4749507833075, "b": 230.80767127438855, "coord_origin": "TOPLEFT"}}, {"id": 9, "text": "no sea takimata sanctus est Lorem ipsum dolor sit amet. Lorem ipsum dolor", "bbox": {"l": 133.76801021944917, "t": 232.8001112853898, "r": 477.4708307833007, "b": 242.76269134039728, "coord_origin": "TOPLEFT"}}, {"id": 10, "text": "sit amet, consetetur sadipscing elitr, sed diam nonumy eirmod tempor invidunt", "bbox": {"l": 133.76801021944917, "t": 244.75506135139813, "r": 477.4709207833008, "b": 254.71765140640582, "coord_origin": "TOPLEFT"}}, {"id": 11, "text": "ut labore et dolore magna aliquyam erat, sed diam voluptua. At vero eos et", "bbox": {"l": 133.76801021944917, "t": 256.71008141740685, "r": 477.4658807832926, "b": 266.6726614724146, "coord_origin": "TOPLEFT"}}, {"id": 12, "text": "accusam et justo duo dolores et ea rebum. Stet clita kasd gubergren, no sea", "bbox": {"l": 133.76801021944917, "t": 268.6660714834211, "r": 477.47586078330903, "b": 278.6286615384289, "coord_origin": "TOPLEFT"}}, {"id": 13, "text": "takimata sanctus est Lorem ipsum dolor sit amet.", "bbox": {"l": 133.76801021944917, "t": 280.62109154942993, "r": 351.4767205766048, "b": 290.58367160443765, "coord_origin": "TOPLEFT"}}], "children": []}, "text": "Lorem ipsum dolor sit amet, consetetur sadipscing elitr, sed diam nonumy eirmod tempor invidunt ut labore et dolore magna aliquyam erat, sed diam voluptua. At vero eos et accusam et justo duo dolores et ea rebum. Stet clita kasd gubergren, no sea takimata sanctus est Lorem ipsum dolor sit amet. Lorem ipsum dolor sit amet, consetetur sadipscing elitr, sed diam nonumy eirmod tempor invidunt ut labore et dolore magna aliquyam erat, sed diam voluptua. At vero eos et accusam et justo duo dolores et ea rebum. Stet clita kasd gubergren, no sea takimata sanctus est Lorem ipsum dolor sit amet. Lorem ipsum dolor sit amet, consetetur sadipscing elitr, sed diam nonumy eirmod tempor invidunt ut labore et dolore magna aliquyam erat, sed diam voluptua. At vero eos et accusam et justo duo dolores et ea rebum. Stet clita kasd gubergren, no sea takimata sanctus est Lorem ipsum dolor sit amet."}, {"label": "text", "id": 3, "page_no": 1, "cluster": {"id": 3, "label": "text", "bbox": {"l": 133.76801021944917, "t": 292.5761116154388, "r": 477.47482078330734, "b": 350.3596819344868, "coord_origin": "TOPLEFT"}, "confidence": 0.9855858683586121, "cells": [{"id": 14, "text": "Duis autem vel eum iriure dolor in hendrerit in vulputate velit esse molestie", "bbox": {"l": 148.71201024396512, "t": 292.5761116154388, "r": 477.4710707833011, "b": 302.5386916704465, "coord_origin": "TOPLEFT"}}, {"id": 15, "text": "consequat, vel illum dolore eu feugiat nulla facilisis at vero eros et accumsan et", "bbox": {"l": 133.76801021944917, "t": 304.5310616814471, "r": 477.47293078330415, "b": 314.4936517364549, "coord_origin": "TOPLEFT"}}, {"id": 16, "text": "iusto odio dignissim qui blandit praesent luptatum zzril delenit augue duis dolore", "bbox": {"l": 133.76801021944917, "t": 316.48608174745607, "r": 477.47482078330734, "b": 326.4486618024637, "coord_origin": "TOPLEFT"}}, {"id": 17, "text": "te feugait nulla facilisi. Lorem ipsum dolor sit amet, consectetuer adipiscing elit,", "bbox": {"l": 133.76801021944917, "t": 328.4411018134648, "r": 477.472810783304, "b": 338.40368186847246, "coord_origin": "TOPLEFT"}}, {"id": 18, "text": "sed diam nonummy nibh euismod tincidunt.", "bbox": {"l": 133.76801021944917, "t": 340.3970618794788, "r": 326.20752053515014, "b": 350.3596819344868, "coord_origin": "TOPLEFT"}}], "children": []}, "text": "Duis autem vel eum iriure dolor in hendrerit in vulputate velit esse molestie consequat, vel illum dolore eu feugiat nulla facilisis at vero eros et accumsan et iusto odio dignissim qui blandit praesent luptatum zzril delenit augue duis dolore te feugait nulla facilisi. Lorem ipsum dolor sit amet, consectetuer adipiscing elit, sed diam nonummy nibh euismod tincidunt."}, {"label": "formula", "id": 6, "page_no": 1, "cluster": {"id": 6, "label": "formula", "bbox": {"l": 280.5539904602546, "t": 362.4346620011579, "r": 330.6965905425146, "b": 374.2696820665041, "coord_origin": "TOPLEFT"}, "confidence": 0.8711639046669006, "cells": [{"id": 19, "text": "a", "bbox": {"l": 280.5539904602546, "t": 364.3070620114962, "r": 285.8142404688841, "b": 374.2696820665041, "coord_origin": "TOPLEFT"}}, {"id": 20, "text": "2", "bbox": {"l": 285.8200104688936, "t": 362.4346620011579, "r": 289.78809047540335, "b": 369.40844203966316, "coord_origin": "TOPLEFT"}}, {"id": 21, "text": "+ 8 = 12", "bbox": {"l": 292.50400047985886, "t": 364.3070620114962, "r": 330.6965905425146, "b": 374.2696820665041, "coord_origin": "TOPLEFT"}}], "children": []}, "text": "a 2 + 8 = 12"}, {"label": "text", "id": 0, "page_no": 1, "cluster": {"id": 0, "label": "text", "bbox": {"l": 133.76801021944917, "t": 382.24005211051195, "r": 477.47589078330907, "b": 523.7096828916267, "coord_origin": "TOPLEFT"}, "confidence": 0.9880034923553467, "cells": [{"id": 22, "text": "Lorem ipsum dolor sit amet, consetetur sadipscing elitr, sed diam nonumy", "bbox": {"l": 148.71201024396512, "t": 382.24005211051195, "r": 477.4698807832991, "b": 392.2026621655199, "coord_origin": "TOPLEFT"}}, {"id": 23, "text": "eirmod tempor invidunt ut labore et dolore magna aliquyam erat, sed diam", "bbox": {"l": 133.76801021944917, "t": 394.19506217652076, "r": 477.47580078330884, "b": 404.15768223152867, "coord_origin": "TOPLEFT"}}, {"id": 24, "text": "voluptua. At vero eos et accusam et justo duo dolores et ea rebum. Stet clita", "bbox": {"l": 133.76801021944917, "t": 406.15005224252945, "r": 477.47580078330884, "b": 416.1126722975373, "coord_origin": "TOPLEFT"}}, {"id": 25, "text": "kasd gubergren, no sea takimata sanctus est Lorem ipsum dolor sit amet. Lorem", "bbox": {"l": 133.76801021944917, "t": 418.10507230853824, "r": 477.47284078330404, "b": 428.06768236354606, "coord_origin": "TOPLEFT"}}, {"id": 26, "text": "ipsum dolor sit amet, consetetur sadipscing elitr, sed diam nonumy eirmod", "bbox": {"l": 133.76801021944917, "t": 430.0600523745468, "r": 477.47589078330907, "b": 440.0226724295547, "coord_origin": "TOPLEFT"}}, {"id": 27, "text": "tempor invidunt ut labore et dolore magna aliquyam erat, sed diam voluptua. At", "bbox": {"l": 133.76801021944917, "t": 442.01605244056105, "r": 477.4717707833023, "b": 451.97866249556887, "coord_origin": "TOPLEFT"}}, {"id": 28, "text": "vero eos et accusam et justo duo dolores et ea rebum. Stet clita kasd gubergren,", "bbox": {"l": 133.76801021944917, "t": 453.9710625065698, "r": 477.47388078330573, "b": 463.93368256157777, "coord_origin": "TOPLEFT"}}, {"id": 29, "text": "no sea takimata sanctus est Lorem ipsum dolor sit amet. Lorem ipsum dolor", "bbox": {"l": 133.76801021944917, "t": 465.9260525725785, "r": 477.4708307833007, "b": 475.88867262758635, "coord_origin": "TOPLEFT"}}, {"id": 30, "text": "sit amet, consetetur sadipscing elitr, sed diam nonumy eirmod tempor invidunt", "bbox": {"l": 133.76801021944917, "t": 477.8810726385873, "r": 477.4709207833008, "b": 487.84368269359516, "coord_origin": "TOPLEFT"}}, {"id": 31, "text": "ut labore et dolore magna aliquyam erat, sed diam voluptua. At vero eos et", "bbox": {"l": 133.76801021944917, "t": 489.8360527045959, "r": 477.4658807832926, "b": 499.7986727596038, "coord_origin": "TOPLEFT"}}, {"id": 32, "text": "accusam et justo duo dolores et ea rebum. Stet clita kasd gubergren, no sea", "bbox": {"l": 133.76801021944917, "t": 501.7910427706045, "r": 477.47586078330903, "b": 511.7536628256125, "coord_origin": "TOPLEFT"}}, {"id": 33, "text": "takimata sanctus est Lorem ipsum dolor sit amet.", "bbox": {"l": 133.76801021944917, "t": 513.7470728366188, "r": 351.4767205766048, "b": 523.7096828916267, "coord_origin": "TOPLEFT"}}], "children": []}, "text": "Lorem ipsum dolor sit amet, consetetur sadipscing elitr, sed diam nonumy eirmod tempor invidunt ut labore et dolore magna aliquyam erat, sed diam voluptua. At vero eos et accusam et justo duo dolores et ea rebum. Stet clita kasd gubergren, no sea takimata sanctus est Lorem ipsum dolor sit amet. Lorem ipsum dolor sit amet, consetetur sadipscing elitr, sed diam nonumy eirmod tempor invidunt ut labore et dolore magna aliquyam erat, sed diam voluptua. At vero eos et accusam et justo duo dolores et ea rebum. Stet clita kasd gubergren, no sea takimata sanctus est Lorem ipsum dolor sit amet. Lorem ipsum dolor sit amet, consetetur sadipscing elitr, sed diam nonumy eirmod tempor invidunt ut labore et dolore magna aliquyam erat, sed diam voluptua. At vero eos et accusam et justo duo dolores et ea rebum. Stet clita kasd gubergren, no sea takimata sanctus est Lorem ipsum dolor sit amet."}, {"label": "text", "id": 4, "page_no": 1, "cluster": {"id": 4, "label": "text", "bbox": {"l": 133.76801021944917, "t": 525.7020529026275, "r": 477.47482078330734, "b": 595.4406532876844, "coord_origin": "TOPLEFT"}, "confidence": 0.9844803214073181, "cells": [{"id": 34, "text": "Duis autem vel eum iriure dolor in hendrerit in vulputate velit esse molestie", "bbox": {"l": 148.71201024396512, "t": 525.7020529026275, "r": 477.4710707833011, "b": 535.6646729576354, "coord_origin": "TOPLEFT"}}, {"id": 35, "text": "consequat, vel illum dolore eu feugiat nulla facilisis at vero eros et accumsan et", "bbox": {"l": 133.76801021944917, "t": 537.6570729686364, "r": 477.47293078330415, "b": 547.6196930236442, "coord_origin": "TOPLEFT"}}, {"id": 36, "text": "iusto odio dignissim qui blandit praesent luptatum zzril delenit augue duis dolore", "bbox": {"l": 133.76801021944917, "t": 549.612063034645, "r": 477.47482078330734, "b": 559.5746730896528, "coord_origin": "TOPLEFT"}}, {"id": 37, "text": "te feugait nulla facilisi. Lorem ipsum dolor sit amet, consectetuer adipiscing elit,", "bbox": {"l": 133.76801021944917, "t": 561.5670431006536, "r": 477.472810783304, "b": 571.5296631556615, "coord_origin": "TOPLEFT"}}, {"id": 38, "text": "sed diam nonummy nibh euismod tincidunt ut laoreet dolore magna aliquam erat", "bbox": {"l": 133.76801021944917, "t": 573.5220631666624, "r": 477.47479078330724, "b": 583.4846832216704, "coord_origin": "TOPLEFT"}}, {"id": 39, "text": "volutpat.", "bbox": {"l": 133.76801021944917, "t": 585.4780532326765, "r": 173.0565202839028, "b": 595.4406532876844, "coord_origin": "TOPLEFT"}}], "children": []}, "text": "Duis autem vel eum iriure dolor in hendrerit in vulputate velit esse molestie consequat, vel illum dolore eu feugiat nulla facilisis at vero eros et accumsan et iusto odio dignissim qui blandit praesent luptatum zzril delenit augue duis dolore te feugait nulla facilisi. Lorem ipsum dolor sit amet, consectetuer adipiscing elit, sed diam nonummy nibh euismod tincidunt ut laoreet dolore magna aliquam erat volutpat."}, {"label": "text", "id": 2, "page_no": 1, "cluster": {"id": 2, "label": "text", "bbox": {"l": 133.76801021944917, "t": 597.4330632986853, "r": 477.47482078330734, "b": 667.1716636837423, "coord_origin": "TOPLEFT"}, "confidence": 0.9872003793716431, "cells": [{"id": 40, "text": "Duis autem vel eum iriure dolor in hendrerit in vulputate velit esse molestie", "bbox": {"l": 148.71201024396512, "t": 597.4330632986853, "r": 477.4710707833011, "b": 607.3956633536932, "coord_origin": "TOPLEFT"}}, {"id": 41, "text": "consequat, vel illum dolore eu feugiat nulla facilisis at vero eros et accumsan et", "bbox": {"l": 133.76801021944917, "t": 609.3880633646941, "r": 477.47293078330415, "b": 619.3506634197018, "coord_origin": "TOPLEFT"}}, {"id": 42, "text": "iusto odio dignissim qui blandit praesent luptatum zzril delenit augue duis dolore", "bbox": {"l": 133.76801021944917, "t": 621.3430634307027, "r": 477.47482078330734, "b": 631.3056634857105, "coord_origin": "TOPLEFT"}}, {"id": 43, "text": "te feugait nulla facilisi. Lorem ipsum dolor sit amet, consectetuer adipiscing elit,", "bbox": {"l": 133.76801021944917, "t": 633.2980634967114, "r": 477.472810783304, "b": 643.2606635517193, "coord_origin": "TOPLEFT"}}, {"id": 44, "text": "sed diam nonummy nibh euismod tincidunt ut laoreet dolore magna aliquam erat", "bbox": {"l": 133.76801021944917, "t": 645.2540535627256, "r": 477.47479078330724, "b": 655.2166536177334, "coord_origin": "TOPLEFT"}}, {"id": 45, "text": "volutpat.", "bbox": {"l": 133.76801021944917, "t": 657.2090636287344, "r": 173.0565202839028, "b": 667.1716636837423, "coord_origin": "TOPLEFT"}}], "children": []}, "text": "Duis autem vel eum iriure dolor in hendrerit in vulputate velit esse molestie consequat, vel illum dolore eu feugiat nulla facilisis at vero eros et accumsan et iusto odio dignissim qui blandit praesent luptatum zzril delenit augue duis dolore te feugait nulla facilisi. Lorem ipsum dolor sit amet, consectetuer adipiscing elit, sed diam nonummy nibh euismod tincidunt ut laoreet dolore magna aliquam erat volutpat."}, {"label": "page_footer", "id": 7, "page_no": 1, "cluster": {"id": 7, "label": "page_footer", "bbox": {"l": 303.13300049729594, "t": 695.1640638383003, "r": 308.1142905054678, "b": 705.1266638933081, "coord_origin": "TOPLEFT"}, "confidence": 0.850279688835144, "cells": [{"id": 46, "text": "1", "bbox": {"l": 303.13300049729594, "t": 695.1640638383003, "r": 308.1142905054678, "b": 705.1266638933081, "coord_origin": "TOPLEFT"}}], "children": []}, "text": "1"}], "body": [{"label": "section_header", "id": 5, "page_no": 1, "cluster": {"id": 5, "label": "section_header", "bbox": {"l": 133.76801021944917, "t": 124.00537068468714, "r": 191.51430031418315, "b": 138.36590076397772, "coord_origin": "TOPLEFT"}, "confidence": 0.95060133934021, "cells": [{"id": 0, "text": "Formula", "bbox": {"l": 133.76801021944917, "t": 124.00537068468714, "r": 191.51430031418315, "b": 138.36590076397772, "coord_origin": "TOPLEFT"}}], "children": []}, "text": "Formula"}, {"label": "text", "id": 1, "page_no": 1, "cluster": {"id": 1, "label": "text", "bbox": {"l": 133.76801021944917, "t": 149.114070823323, "r": 477.4806507833169, "b": 290.58367160443765, "coord_origin": "TOPLEFT"}, "confidence": 0.9878177046775818, "cells": [{"id": 1, "text": "Lorem ipsum dolor sit amet, consetetur sadipscing elitr, sed diam nonumy eir-", "bbox": {"l": 133.76801021944917, "t": 149.114070823323, "r": 477.47687078331063, "b": 159.0766608783307, "coord_origin": "TOPLEFT"}}, {"id": 2, "text": "mod tempor invidunt ut labore et dolore magna aliquyam erat, sed diam volup-", "bbox": {"l": 133.76801021944917, "t": 161.06909088933185, "r": 477.4778107833122, "b": 171.03167094433945, "coord_origin": "TOPLEFT"}}, {"id": 3, "text": "tua. At vero eos et accusam et justo duo dolores et ea rebum. Stet clita kasd", "bbox": {"l": 133.76801021944917, "t": 173.0241009553406, "r": 477.46985078329914, "b": 182.9866910103483, "coord_origin": "TOPLEFT"}}, {"id": 4, "text": "gubergren, no sea takimata sanctus est Lorem ipsum dolor sit amet. Lorem ip-", "bbox": {"l": 133.76801021944917, "t": 184.97906102134914, "r": 477.4718607833024, "b": 194.94165107635683, "coord_origin": "TOPLEFT"}}, {"id": 5, "text": "sum dolor sit amet, consetetur sadipscing elitr, sed diam nonumy eirmod tempor", "bbox": {"l": 133.76801021944917, "t": 196.93511108736357, "r": 477.4699407832993, "b": 206.89770114237137, "coord_origin": "TOPLEFT"}}, {"id": 6, "text": "invidunt ut labore et dolore magna aliquyam erat, sed diam voluptua. At vero", "bbox": {"l": 133.76801021944917, "t": 208.890071153372, "r": 477.4806507833169, "b": 218.8526612083799, "coord_origin": "TOPLEFT"}}, {"id": 7, "text": "eos et accusam et justo duo dolores et ea rebum.", "bbox": {"l": 133.76801021944917, "t": 220.84509121938083, "r": 355.20087058271434, "b": 230.80767127438855, "coord_origin": "TOPLEFT"}}, {"id": 8, "text": "Stet clita kasd gubergren,", "bbox": {"l": 362.6010705948545, "t": 220.84509121938083, "r": 477.4749507833075, "b": 230.80767127438855, "coord_origin": "TOPLEFT"}}, {"id": 9, "text": "no sea takimata sanctus est Lorem ipsum dolor sit amet. Lorem ipsum dolor", "bbox": {"l": 133.76801021944917, "t": 232.8001112853898, "r": 477.4708307833007, "b": 242.76269134039728, "coord_origin": "TOPLEFT"}}, {"id": 10, "text": "sit amet, consetetur sadipscing elitr, sed diam nonumy eirmod tempor invidunt", "bbox": {"l": 133.76801021944917, "t": 244.75506135139813, "r": 477.4709207833008, "b": 254.71765140640582, "coord_origin": "TOPLEFT"}}, {"id": 11, "text": "ut labore et dolore magna aliquyam erat, sed diam voluptua. At vero eos et", "bbox": {"l": 133.76801021944917, "t": 256.71008141740685, "r": 477.4658807832926, "b": 266.6726614724146, "coord_origin": "TOPLEFT"}}, {"id": 12, "text": "accusam et justo duo dolores et ea rebum. Stet clita kasd gubergren, no sea", "bbox": {"l": 133.76801021944917, "t": 268.6660714834211, "r": 477.47586078330903, "b": 278.6286615384289, "coord_origin": "TOPLEFT"}}, {"id": 13, "text": "takimata sanctus est Lorem ipsum dolor sit amet.", "bbox": {"l": 133.76801021944917, "t": 280.62109154942993, "r": 351.4767205766048, "b": 290.58367160443765, "coord_origin": "TOPLEFT"}}], "children": []}, "text": "Lorem ipsum dolor sit amet, consetetur sadipscing elitr, sed diam nonumy eirmod tempor invidunt ut labore et dolore magna aliquyam erat, sed diam voluptua. At vero eos et accusam et justo duo dolores et ea rebum. Stet clita kasd gubergren, no sea takimata sanctus est Lorem ipsum dolor sit amet. Lorem ipsum dolor sit amet, consetetur sadipscing elitr, sed diam nonumy eirmod tempor invidunt ut labore et dolore magna aliquyam erat, sed diam voluptua. At vero eos et accusam et justo duo dolores et ea rebum. Stet clita kasd gubergren, no sea takimata sanctus est Lorem ipsum dolor sit amet. Lorem ipsum dolor sit amet, consetetur sadipscing elitr, sed diam nonumy eirmod tempor invidunt ut labore et dolore magna aliquyam erat, sed diam voluptua. At vero eos et accusam et justo duo dolores et ea rebum. Stet clita kasd gubergren, no sea takimata sanctus est Lorem ipsum dolor sit amet."}, {"label": "text", "id": 3, "page_no": 1, "cluster": {"id": 3, "label": "text", "bbox": {"l": 133.76801021944917, "t": 292.5761116154388, "r": 477.47482078330734, "b": 350.3596819344868, "coord_origin": "TOPLEFT"}, "confidence": 0.9855858683586121, "cells": [{"id": 14, "text": "Duis autem vel eum iriure dolor in hendrerit in vulputate velit esse molestie", "bbox": {"l": 148.71201024396512, "t": 292.5761116154388, "r": 477.4710707833011, "b": 302.5386916704465, "coord_origin": "TOPLEFT"}}, {"id": 15, "text": "consequat, vel illum dolore eu feugiat nulla facilisis at vero eros et accumsan et", "bbox": {"l": 133.76801021944917, "t": 304.5310616814471, "r": 477.47293078330415, "b": 314.4936517364549, "coord_origin": "TOPLEFT"}}, {"id": 16, "text": "iusto odio dignissim qui blandit praesent luptatum zzril delenit augue duis dolore", "bbox": {"l": 133.76801021944917, "t": 316.48608174745607, "r": 477.47482078330734, "b": 326.4486618024637, "coord_origin": "TOPLEFT"}}, {"id": 17, "text": "te feugait nulla facilisi. Lorem ipsum dolor sit amet, consectetuer adipiscing elit,", "bbox": {"l": 133.76801021944917, "t": 328.4411018134648, "r": 477.472810783304, "b": 338.40368186847246, "coord_origin": "TOPLEFT"}}, {"id": 18, "text": "sed diam nonummy nibh euismod tincidunt.", "bbox": {"l": 133.76801021944917, "t": 340.3970618794788, "r": 326.20752053515014, "b": 350.3596819344868, "coord_origin": "TOPLEFT"}}], "children": []}, "text": "Duis autem vel eum iriure dolor in hendrerit in vulputate velit esse molestie consequat, vel illum dolore eu feugiat nulla facilisis at vero eros et accumsan et iusto odio dignissim qui blandit praesent luptatum zzril delenit augue duis dolore te feugait nulla facilisi. Lorem ipsum dolor sit amet, consectetuer adipiscing elit, sed diam nonummy nibh euismod tincidunt."}, {"label": "formula", "id": 6, "page_no": 1, "cluster": {"id": 6, "label": "formula", "bbox": {"l": 280.5539904602546, "t": 362.4346620011579, "r": 330.6965905425146, "b": 374.2696820665041, "coord_origin": "TOPLEFT"}, "confidence": 0.8711639046669006, "cells": [{"id": 19, "text": "a", "bbox": {"l": 280.5539904602546, "t": 364.3070620114962, "r": 285.8142404688841, "b": 374.2696820665041, "coord_origin": "TOPLEFT"}}, {"id": 20, "text": "2", "bbox": {"l": 285.8200104688936, "t": 362.4346620011579, "r": 289.78809047540335, "b": 369.40844203966316, "coord_origin": "TOPLEFT"}}, {"id": 21, "text": "+ 8 = 12", "bbox": {"l": 292.50400047985886, "t": 364.3070620114962, "r": 330.6965905425146, "b": 374.2696820665041, "coord_origin": "TOPLEFT"}}], "children": []}, "text": "a 2 + 8 = 12"}, {"label": "text", "id": 0, "page_no": 1, "cluster": {"id": 0, "label": "text", "bbox": {"l": 133.76801021944917, "t": 382.24005211051195, "r": 477.47589078330907, "b": 523.7096828916267, "coord_origin": "TOPLEFT"}, "confidence": 0.9880034923553467, "cells": [{"id": 22, "text": "Lorem ipsum dolor sit amet, consetetur sadipscing elitr, sed diam nonumy", "bbox": {"l": 148.71201024396512, "t": 382.24005211051195, "r": 477.4698807832991, "b": 392.2026621655199, "coord_origin": "TOPLEFT"}}, {"id": 23, "text": "eirmod tempor invidunt ut labore et dolore magna aliquyam erat, sed diam", "bbox": {"l": 133.76801021944917, "t": 394.19506217652076, "r": 477.47580078330884, "b": 404.15768223152867, "coord_origin": "TOPLEFT"}}, {"id": 24, "text": "voluptua. At vero eos et accusam et justo duo dolores et ea rebum. Stet clita", "bbox": {"l": 133.76801021944917, "t": 406.15005224252945, "r": 477.47580078330884, "b": 416.1126722975373, "coord_origin": "TOPLEFT"}}, {"id": 25, "text": "kasd gubergren, no sea takimata sanctus est Lorem ipsum dolor sit amet. Lorem", "bbox": {"l": 133.76801021944917, "t": 418.10507230853824, "r": 477.47284078330404, "b": 428.06768236354606, "coord_origin": "TOPLEFT"}}, {"id": 26, "text": "ipsum dolor sit amet, consetetur sadipscing elitr, sed diam nonumy eirmod", "bbox": {"l": 133.76801021944917, "t": 430.0600523745468, "r": 477.47589078330907, "b": 440.0226724295547, "coord_origin": "TOPLEFT"}}, {"id": 27, "text": "tempor invidunt ut labore et dolore magna aliquyam erat, sed diam voluptua. At", "bbox": {"l": 133.76801021944917, "t": 442.01605244056105, "r": 477.4717707833023, "b": 451.97866249556887, "coord_origin": "TOPLEFT"}}, {"id": 28, "text": "vero eos et accusam et justo duo dolores et ea rebum. Stet clita kasd gubergren,", "bbox": {"l": 133.76801021944917, "t": 453.9710625065698, "r": 477.47388078330573, "b": 463.93368256157777, "coord_origin": "TOPLEFT"}}, {"id": 29, "text": "no sea takimata sanctus est Lorem ipsum dolor sit amet. Lorem ipsum dolor", "bbox": {"l": 133.76801021944917, "t": 465.9260525725785, "r": 477.4708307833007, "b": 475.88867262758635, "coord_origin": "TOPLEFT"}}, {"id": 30, "text": "sit amet, consetetur sadipscing elitr, sed diam nonumy eirmod tempor invidunt", "bbox": {"l": 133.76801021944917, "t": 477.8810726385873, "r": 477.4709207833008, "b": 487.84368269359516, "coord_origin": "TOPLEFT"}}, {"id": 31, "text": "ut labore et dolore magna aliquyam erat, sed diam voluptua. At vero eos et", "bbox": {"l": 133.76801021944917, "t": 489.8360527045959, "r": 477.4658807832926, "b": 499.7986727596038, "coord_origin": "TOPLEFT"}}, {"id": 32, "text": "accusam et justo duo dolores et ea rebum. Stet clita kasd gubergren, no sea", "bbox": {"l": 133.76801021944917, "t": 501.7910427706045, "r": 477.47586078330903, "b": 511.7536628256125, "coord_origin": "TOPLEFT"}}, {"id": 33, "text": "takimata sanctus est Lorem ipsum dolor sit amet.", "bbox": {"l": 133.76801021944917, "t": 513.7470728366188, "r": 351.4767205766048, "b": 523.7096828916267, "coord_origin": "TOPLEFT"}}], "children": []}, "text": "Lorem ipsum dolor sit amet, consetetur sadipscing elitr, sed diam nonumy eirmod tempor invidunt ut labore et dolore magna aliquyam erat, sed diam voluptua. At vero eos et accusam et justo duo dolores et ea rebum. Stet clita kasd gubergren, no sea takimata sanctus est Lorem ipsum dolor sit amet. Lorem ipsum dolor sit amet, consetetur sadipscing elitr, sed diam nonumy eirmod tempor invidunt ut labore et dolore magna aliquyam erat, sed diam voluptua. At vero eos et accusam et justo duo dolores et ea rebum. Stet clita kasd gubergren, no sea takimata sanctus est Lorem ipsum dolor sit amet. Lorem ipsum dolor sit amet, consetetur sadipscing elitr, sed diam nonumy eirmod tempor invidunt ut labore et dolore magna aliquyam erat, sed diam voluptua. At vero eos et accusam et justo duo dolores et ea rebum. Stet clita kasd gubergren, no sea takimata sanctus est Lorem ipsum dolor sit amet."}, {"label": "text", "id": 4, "page_no": 1, "cluster": {"id": 4, "label": "text", "bbox": {"l": 133.76801021944917, "t": 525.7020529026275, "r": 477.47482078330734, "b": 595.4406532876844, "coord_origin": "TOPLEFT"}, "confidence": 0.9844803214073181, "cells": [{"id": 34, "text": "Duis autem vel eum iriure dolor in hendrerit in vulputate velit esse molestie", "bbox": {"l": 148.71201024396512, "t": 525.7020529026275, "r": 477.4710707833011, "b": 535.6646729576354, "coord_origin": "TOPLEFT"}}, {"id": 35, "text": "consequat, vel illum dolore eu feugiat nulla facilisis at vero eros et accumsan et", "bbox": {"l": 133.76801021944917, "t": 537.6570729686364, "r": 477.47293078330415, "b": 547.6196930236442, "coord_origin": "TOPLEFT"}}, {"id": 36, "text": "iusto odio dignissim qui blandit praesent luptatum zzril delenit augue duis dolore", "bbox": {"l": 133.76801021944917, "t": 549.612063034645, "r": 477.47482078330734, "b": 559.5746730896528, "coord_origin": "TOPLEFT"}}, {"id": 37, "text": "te feugait nulla facilisi. Lorem ipsum dolor sit amet, consectetuer adipiscing elit,", "bbox": {"l": 133.76801021944917, "t": 561.5670431006536, "r": 477.472810783304, "b": 571.5296631556615, "coord_origin": "TOPLEFT"}}, {"id": 38, "text": "sed diam nonummy nibh euismod tincidunt ut laoreet dolore magna aliquam erat", "bbox": {"l": 133.76801021944917, "t": 573.5220631666624, "r": 477.47479078330724, "b": 583.4846832216704, "coord_origin": "TOPLEFT"}}, {"id": 39, "text": "volutpat.", "bbox": {"l": 133.76801021944917, "t": 585.4780532326765, "r": 173.0565202839028, "b": 595.4406532876844, "coord_origin": "TOPLEFT"}}], "children": []}, "text": "Duis autem vel eum iriure dolor in hendrerit in vulputate velit esse molestie consequat, vel illum dolore eu feugiat nulla facilisis at vero eros et accumsan et iusto odio dignissim qui blandit praesent luptatum zzril delenit augue duis dolore te feugait nulla facilisi. Lorem ipsum dolor sit amet, consectetuer adipiscing elit, sed diam nonummy nibh euismod tincidunt ut laoreet dolore magna aliquam erat volutpat."}, {"label": "text", "id": 2, "page_no": 1, "cluster": {"id": 2, "label": "text", "bbox": {"l": 133.76801021944917, "t": 597.4330632986853, "r": 477.47482078330734, "b": 667.1716636837423, "coord_origin": "TOPLEFT"}, "confidence": 0.9872003793716431, "cells": [{"id": 40, "text": "Duis autem vel eum iriure dolor in hendrerit in vulputate velit esse molestie", "bbox": {"l": 148.71201024396512, "t": 597.4330632986853, "r": 477.4710707833011, "b": 607.3956633536932, "coord_origin": "TOPLEFT"}}, {"id": 41, "text": "consequat, vel illum dolore eu feugiat nulla facilisis at vero eros et accumsan et", "bbox": {"l": 133.76801021944917, "t": 609.3880633646941, "r": 477.47293078330415, "b": 619.3506634197018, "coord_origin": "TOPLEFT"}}, {"id": 42, "text": "iusto odio dignissim qui blandit praesent luptatum zzril delenit augue duis dolore", "bbox": {"l": 133.76801021944917, "t": 621.3430634307027, "r": 477.47482078330734, "b": 631.3056634857105, "coord_origin": "TOPLEFT"}}, {"id": 43, "text": "te feugait nulla facilisi. Lorem ipsum dolor sit amet, consectetuer adipiscing elit,", "bbox": {"l": 133.76801021944917, "t": 633.2980634967114, "r": 477.472810783304, "b": 643.2606635517193, "coord_origin": "TOPLEFT"}}, {"id": 44, "text": "sed diam nonummy nibh euismod tincidunt ut laoreet dolore magna aliquam erat", "bbox": {"l": 133.76801021944917, "t": 645.2540535627256, "r": 477.47479078330724, "b": 655.2166536177334, "coord_origin": "TOPLEFT"}}, {"id": 45, "text": "volutpat.", "bbox": {"l": 133.76801021944917, "t": 657.2090636287344, "r": 173.0565202839028, "b": 667.1716636837423, "coord_origin": "TOPLEFT"}}], "children": []}, "text": "Duis autem vel eum iriure dolor in hendrerit in vulputate velit esse molestie consequat, vel illum dolore eu feugiat nulla facilisis at vero eros et accumsan et iusto odio dignissim qui blandit praesent luptatum zzril delenit augue duis dolore te feugait nulla facilisi. Lorem ipsum dolor sit amet, consectetuer adipiscing elit, sed diam nonummy nibh euismod tincidunt ut laoreet dolore magna aliquam erat volutpat."}], "headers": [{"label": "page_footer", "id": 7, "page_no": 1, "cluster": {"id": 7, "label": "page_footer", "bbox": {"l": 303.13300049729594, "t": 695.1640638383003, "r": 308.1142905054678, "b": 705.1266638933081, "coord_origin": "TOPLEFT"}, "confidence": 0.850279688835144, "cells": [{"id": 46, "text": "1", "bbox": {"l": 303.13300049729594, "t": 695.1640638383003, "r": 308.1142905054678, "b": 705.1266638933081, "coord_origin": "TOPLEFT"}}], "children": []}, "text": "1"}]}}] \ No newline at end of file diff --git a/tests/data/groundtruth/docling_v2/code_and_formula.doctags.txt b/tests/data/groundtruth/docling_v2/code_and_formula.doctags.txt new file mode 100644 index 00000000..ad417540 --- /dev/null +++ b/tests/data/groundtruth/docling_v2/code_and_formula.doctags.txt @@ -0,0 +1,14 @@ + +Java Code Example +Lorem ipsum dolor sit amet, consetetur sadipscing elitr, sed diam nonumy eirmod tempor invidunt ut labore et dolore magna aliquyam erat, sed diam voluptua. At vero eos et accusam et justo duo dolores et ea rebum. Stet clita kasd gubergren, no sea takimata sanctus est Lorem ipsum dolor sit amet. Lorem ipsum dolor sit amet, consetetur sadipscing elitr, sed diam nonumy eirmod tempor invidunt ut labore et dolore magna aliquyam erat, sed diam voluptua. At vero eos et accusam et justo duo dolores et ea rebum. Stet clita kasd gubergren, no sea takimata sanctus est Lorem ipsum dolor sit amet. Lorem ipsum dolor sit amet, consetetur sadipscing elitr, sed diam nonumy eirmod tempor invidunt ut labore et dolore magna aliquyam erat, sed diam voluptua. At vero eos et accusam et justo duo dolores et ea rebum. Stet clita kasd gubergren, no sea takimata sanctus est Lorem ipsum dolor sit amet. +Listing 1: Simple Java Program +public static void print() { System.out.println( "Java Code" ); } +Lorem ipsum dolor sit amet, consetetur sadipscing elitr, sed diam nonumy eirmod tempor invidunt ut labore et dolore magna aliquyam erat, sed diam voluptua. At vero eos et accusam et justo duo dolores et ea rebum. Stet clita kasd gubergren, no sea takimata sanctus est Lorem ipsum dolor sit amet. Lorem ipsum dolor sit amet, consetetur sadipscing elitr, sed diam nonumy eirmod tempor invidunt ut labore et dolore magna aliquyam erat, sed diam voluptua. At vero eos et accusam et justo duo dolores et ea rebum. Stet clita kasd gubergren, no sea takimata sanctus est Lorem ipsum dolor sit amet. Lorem ipsum dolor sit amet, consetetur sadipscing elitr, sed diam nonumy eirmod tempor invidunt ut labore et dolore magna aliquyam erat, sed diam voluptua. At vero eos et accusam et justo duo dolores et ea rebum. Stet clita kasd gubergren, no sea takimata sanctus est Lorem ipsum dolor sit amet. +Formula +Lorem ipsum dolor sit amet, consetetur sadipscing elitr, sed diam nonumy eirmod tempor invidunt ut labore et dolore magna aliquyam erat, sed diam voluptua. At vero eos et accusam et justo duo dolores et ea rebum. Stet clita kasd gubergren, no sea takimata sanctus est Lorem ipsum dolor sit amet. Lorem ipsum dolor sit amet, consetetur sadipscing elitr, sed diam nonumy eirmod tempor invidunt ut labore et dolore magna aliquyam erat, sed diam voluptua. At vero eos et accusam et justo duo dolores et ea rebum. Stet clita kasd gubergren, no sea takimata sanctus est Lorem ipsum dolor sit amet. Lorem ipsum dolor sit amet, consetetur sadipscing elitr, sed diam nonumy eirmod tempor invidunt ut labore et dolore magna aliquyam erat, sed diam voluptua. At vero eos et accusam et justo duo dolores et ea rebum. Stet clita kasd gubergren, no sea takimata sanctus est Lorem ipsum dolor sit amet. +Duis autem vel eum iriure dolor in hendrerit in vulputate velit esse molestie consequat, vel illum dolore eu feugiat nulla facilisis at vero eros et accumsan et iusto odio dignissim qui blandit praesent luptatum zzril delenit augue duis dolore te feugait nulla facilisi. Lorem ipsum dolor sit amet, consectetuer adipiscing elit, sed diam nonummy nibh euismod tincidunt. +a 2 + 8 = 12 +Lorem ipsum dolor sit amet, consetetur sadipscing elitr, sed diam nonumy eirmod tempor invidunt ut labore et dolore magna aliquyam erat, sed diam voluptua. At vero eos et accusam et justo duo dolores et ea rebum. Stet clita kasd gubergren, no sea takimata sanctus est Lorem ipsum dolor sit amet. Lorem ipsum dolor sit amet, consetetur sadipscing elitr, sed diam nonumy eirmod tempor invidunt ut labore et dolore magna aliquyam erat, sed diam voluptua. At vero eos et accusam et justo duo dolores et ea rebum. Stet clita kasd gubergren, no sea takimata sanctus est Lorem ipsum dolor sit amet. Lorem ipsum dolor sit amet, consetetur sadipscing elitr, sed diam nonumy eirmod tempor invidunt ut labore et dolore magna aliquyam erat, sed diam voluptua. At vero eos et accusam et justo duo dolores et ea rebum. Stet clita kasd gubergren, no sea takimata sanctus est Lorem ipsum dolor sit amet. +Duis autem vel eum iriure dolor in hendrerit in vulputate velit esse molestie consequat, vel illum dolore eu feugiat nulla facilisis at vero eros et accumsan et iusto odio dignissim qui blandit praesent luptatum zzril delenit augue duis dolore te feugait nulla facilisi. Lorem ipsum dolor sit amet, consectetuer adipiscing elit, sed diam nonummy nibh euismod tincidunt ut laoreet dolore magna aliquam erat volutpat. +Duis autem vel eum iriure dolor in hendrerit in vulputate velit esse molestie consequat, vel illum dolore eu feugiat nulla facilisis at vero eros et accumsan et iusto odio dignissim qui blandit praesent luptatum zzril delenit augue duis dolore te feugait nulla facilisi. Lorem ipsum dolor sit amet, consectetuer adipiscing elit, sed diam nonummy nibh euismod tincidunt ut laoreet dolore magna aliquam erat volutpat. + \ No newline at end of file diff --git a/tests/data/groundtruth/docling_v2/code_and_formula.json b/tests/data/groundtruth/docling_v2/code_and_formula.json new file mode 100644 index 00000000..adcc14d1 --- /dev/null +++ b/tests/data/groundtruth/docling_v2/code_and_formula.json @@ -0,0 +1 @@ +{"schema_name": "DoclingDocument", "version": "1.0.0", "name": "code_and_formula", "origin": {"mimetype": "application/pdf", "binary_hash": 2394749058180317456, "filename": "code_and_formula.pdf", "uri": null}, "furniture": {"self_ref": "#/furniture", "parent": null, "children": [], "name": "_root_", "label": "unspecified"}, "body": {"self_ref": "#/body", "parent": null, "children": [{"cref": "#/texts/0"}, {"cref": "#/texts/1"}, {"cref": "#/texts/2"}, {"cref": "#/texts/3"}, {"cref": "#/texts/4"}, {"cref": "#/texts/5"}, {"cref": "#/texts/6"}, {"cref": "#/texts/7"}, {"cref": "#/texts/8"}, {"cref": "#/texts/9"}, {"cref": "#/texts/10"}, {"cref": "#/texts/11"}, {"cref": "#/texts/12"}, {"cref": "#/texts/13"}], "name": "_root_", "label": "unspecified"}, "groups": [], "texts": [{"self_ref": "#/texts/0", "parent": {"cref": "#/body"}, "children": [], "label": "section_header", "prov": [{"page_no": 1, "bbox": {"l": 133.76800537109375, "t": 667.99462890625, "r": 273.4540100097656, "b": 653.6340942382812, "coord_origin": "BOTTOMLEFT"}, "charspan": [0, 17]}], "orig": "Java Code Example", "text": "Java Code Example", "level": 1}, {"self_ref": "#/texts/1", "parent": {"cref": "#/body"}, "children": [], "label": "text", "prov": [{"page_no": 1, "bbox": {"l": 133.76800537109375, "t": 642.8859252929688, "r": 477.48065185546875, "b": 501.4163513183594, "coord_origin": "BOTTOMLEFT"}, "charspan": [0, 887]}], "orig": "Lorem ipsum dolor sit amet, consetetur sadipscing elitr, sed diam nonumy eirmod tempor invidunt ut labore et dolore magna aliquyam erat, sed diam voluptua. At vero eos et accusam et justo duo dolores et ea rebum. Stet clita kasd gubergren, no sea takimata sanctus est Lorem ipsum dolor sit amet. Lorem ipsum dolor sit amet, consetetur sadipscing elitr, sed diam nonumy eirmod tempor invidunt ut labore et dolore magna aliquyam erat, sed diam voluptua. At vero eos et accusam et justo duo dolores et ea rebum. Stet clita kasd gubergren, no sea takimata sanctus est Lorem ipsum dolor sit amet. Lorem ipsum dolor sit amet, consetetur sadipscing elitr, sed diam nonumy eirmod tempor invidunt ut labore et dolore magna aliquyam erat, sed diam voluptua. At vero eos et accusam et justo duo dolores et ea rebum. Stet clita kasd gubergren, no sea takimata sanctus est Lorem ipsum dolor sit amet.", "text": "Lorem ipsum dolor sit amet, consetetur sadipscing elitr, sed diam nonumy eirmod tempor invidunt ut labore et dolore magna aliquyam erat, sed diam voluptua. At vero eos et accusam et justo duo dolores et ea rebum. Stet clita kasd gubergren, no sea takimata sanctus est Lorem ipsum dolor sit amet. Lorem ipsum dolor sit amet, consetetur sadipscing elitr, sed diam nonumy eirmod tempor invidunt ut labore et dolore magna aliquyam erat, sed diam voluptua. At vero eos et accusam et justo duo dolores et ea rebum. Stet clita kasd gubergren, no sea takimata sanctus est Lorem ipsum dolor sit amet. Lorem ipsum dolor sit amet, consetetur sadipscing elitr, sed diam nonumy eirmod tempor invidunt ut labore et dolore magna aliquyam erat, sed diam voluptua. At vero eos et accusam et justo duo dolores et ea rebum. Stet clita kasd gubergren, no sea takimata sanctus est Lorem ipsum dolor sit amet."}, {"self_ref": "#/texts/2", "parent": {"cref": "#/body"}, "children": [], "label": "paragraph", "prov": [{"page_no": 1, "bbox": {"l": 236.17599487304688, "t": 490.45794677734375, "r": 375.069580078125, "b": 480.4953308105469, "coord_origin": "BOTTOMLEFT"}, "charspan": [0, 30]}], "orig": "Listing 1: Simple Java Program", "text": "Listing 1: Simple Java Program"}, {"self_ref": "#/texts/3", "parent": {"cref": "#/body"}, "children": [], "label": "code", "prov": [{"page_no": 1, "bbox": {"l": 134.23899841308594, "t": 474.2005310058594, "r": 337.5928649902344, "b": 443.9358215332031, "coord_origin": "BOTTOMLEFT"}, "charspan": [0, 65]}], "orig": "public static void print() { System.out.println( \"Java Code\" ); }", "text": "public static void print() { System.out.println( \"Java Code\" ); }", "code_language": "unknown"}, {"self_ref": "#/texts/4", "parent": {"cref": "#/body"}, "children": [], "label": "text", "prov": [{"page_no": 1, "bbox": {"l": 133.76800537109375, "t": 432.27593994140625, "r": 477.47589111328125, "b": 290.80633544921875, "coord_origin": "BOTTOMLEFT"}, "charspan": [0, 887]}], "orig": "Lorem ipsum dolor sit amet, consetetur sadipscing elitr, sed diam nonumy eirmod tempor invidunt ut labore et dolore magna aliquyam erat, sed diam voluptua. At vero eos et accusam et justo duo dolores et ea rebum. Stet clita kasd gubergren, no sea takimata sanctus est Lorem ipsum dolor sit amet. Lorem ipsum dolor sit amet, consetetur sadipscing elitr, sed diam nonumy eirmod tempor invidunt ut labore et dolore magna aliquyam erat, sed diam voluptua. At vero eos et accusam et justo duo dolores et ea rebum. Stet clita kasd gubergren, no sea takimata sanctus est Lorem ipsum dolor sit amet. Lorem ipsum dolor sit amet, consetetur sadipscing elitr, sed diam nonumy eirmod tempor invidunt ut labore et dolore magna aliquyam erat, sed diam voluptua. At vero eos et accusam et justo duo dolores et ea rebum. Stet clita kasd gubergren, no sea takimata sanctus est Lorem ipsum dolor sit amet.", "text": "Lorem ipsum dolor sit amet, consetetur sadipscing elitr, sed diam nonumy eirmod tempor invidunt ut labore et dolore magna aliquyam erat, sed diam voluptua. At vero eos et accusam et justo duo dolores et ea rebum. Stet clita kasd gubergren, no sea takimata sanctus est Lorem ipsum dolor sit amet. Lorem ipsum dolor sit amet, consetetur sadipscing elitr, sed diam nonumy eirmod tempor invidunt ut labore et dolore magna aliquyam erat, sed diam voluptua. At vero eos et accusam et justo duo dolores et ea rebum. Stet clita kasd gubergren, no sea takimata sanctus est Lorem ipsum dolor sit amet. Lorem ipsum dolor sit amet, consetetur sadipscing elitr, sed diam nonumy eirmod tempor invidunt ut labore et dolore magna aliquyam erat, sed diam voluptua. At vero eos et accusam et justo duo dolores et ea rebum. Stet clita kasd gubergren, no sea takimata sanctus est Lorem ipsum dolor sit amet."}, {"self_ref": "#/texts/5", "parent": {"cref": "#/body"}, "children": [], "label": "page_footer", "prov": [{"page_no": 1, "bbox": {"l": 303.13299560546875, "t": 96.83694458007812, "r": 308.1142883300781, "b": 86.87435150146484, "coord_origin": "BOTTOMLEFT"}, "charspan": [0, 1]}], "orig": "1", "text": "1"}, {"self_ref": "#/texts/6", "parent": {"cref": "#/body"}, "children": [], "label": "section_header", "prov": [{"page_no": 2, "bbox": {"l": 133.76800537109375, "t": 717.8846435546875, "r": 191.51429748535156, "b": 703.5241088867188, "coord_origin": "BOTTOMLEFT"}, "charspan": [0, 7]}], "orig": "Formula", "text": "Formula", "level": 1}, {"self_ref": "#/texts/7", "parent": {"cref": "#/body"}, "children": [], "label": "text", "prov": [{"page_no": 2, "bbox": {"l": 133.76800537109375, "t": 692.7759399414062, "r": 477.48065185546875, "b": 551.3063354492188, "coord_origin": "BOTTOMLEFT"}, "charspan": [0, 887]}], "orig": "Lorem ipsum dolor sit amet, consetetur sadipscing elitr, sed diam nonumy eirmod tempor invidunt ut labore et dolore magna aliquyam erat, sed diam voluptua. At vero eos et accusam et justo duo dolores et ea rebum. Stet clita kasd gubergren, no sea takimata sanctus est Lorem ipsum dolor sit amet. Lorem ipsum dolor sit amet, consetetur sadipscing elitr, sed diam nonumy eirmod tempor invidunt ut labore et dolore magna aliquyam erat, sed diam voluptua. At vero eos et accusam et justo duo dolores et ea rebum. Stet clita kasd gubergren, no sea takimata sanctus est Lorem ipsum dolor sit amet. Lorem ipsum dolor sit amet, consetetur sadipscing elitr, sed diam nonumy eirmod tempor invidunt ut labore et dolore magna aliquyam erat, sed diam voluptua. At vero eos et accusam et justo duo dolores et ea rebum. Stet clita kasd gubergren, no sea takimata sanctus est Lorem ipsum dolor sit amet.", "text": "Lorem ipsum dolor sit amet, consetetur sadipscing elitr, sed diam nonumy eirmod tempor invidunt ut labore et dolore magna aliquyam erat, sed diam voluptua. At vero eos et accusam et justo duo dolores et ea rebum. Stet clita kasd gubergren, no sea takimata sanctus est Lorem ipsum dolor sit amet. Lorem ipsum dolor sit amet, consetetur sadipscing elitr, sed diam nonumy eirmod tempor invidunt ut labore et dolore magna aliquyam erat, sed diam voluptua. At vero eos et accusam et justo duo dolores et ea rebum. Stet clita kasd gubergren, no sea takimata sanctus est Lorem ipsum dolor sit amet. Lorem ipsum dolor sit amet, consetetur sadipscing elitr, sed diam nonumy eirmod tempor invidunt ut labore et dolore magna aliquyam erat, sed diam voluptua. At vero eos et accusam et justo duo dolores et ea rebum. Stet clita kasd gubergren, no sea takimata sanctus est Lorem ipsum dolor sit amet."}, {"self_ref": "#/texts/8", "parent": {"cref": "#/body"}, "children": [], "label": "text", "prov": [{"page_no": 2, "bbox": {"l": 133.76800537109375, "t": 549.3139038085938, "r": 477.4748229980469, "b": 491.53033447265625, "coord_origin": "BOTTOMLEFT"}, "charspan": [0, 369]}], "orig": "Duis autem vel eum iriure dolor in hendrerit in vulputate velit esse molestie consequat, vel illum dolore eu feugiat nulla facilisis at vero eros et accumsan et iusto odio dignissim qui blandit praesent luptatum zzril delenit augue duis dolore te feugait nulla facilisi. Lorem ipsum dolor sit amet, consectetuer adipiscing elit, sed diam nonummy nibh euismod tincidunt.", "text": "Duis autem vel eum iriure dolor in hendrerit in vulputate velit esse molestie consequat, vel illum dolore eu feugiat nulla facilisis at vero eros et accumsan et iusto odio dignissim qui blandit praesent luptatum zzril delenit augue duis dolore te feugait nulla facilisi. Lorem ipsum dolor sit amet, consectetuer adipiscing elit, sed diam nonummy nibh euismod tincidunt."}, {"self_ref": "#/texts/9", "parent": {"cref": "#/body"}, "children": [], "label": "formula", "prov": [{"page_no": 2, "bbox": {"l": 280.5539855957031, "t": 479.4553527832031, "r": 330.69659423828125, "b": 467.6203308105469, "coord_origin": "BOTTOMLEFT"}, "charspan": [0, 12]}], "orig": "a 2 + 8 = 12", "text": "a 2 + 8 = 12"}, {"self_ref": "#/texts/10", "parent": {"cref": "#/body"}, "children": [], "label": "text", "prov": [{"page_no": 2, "bbox": {"l": 133.76800537109375, "t": 459.64996337890625, "r": 477.47589111328125, "b": 318.1803283691406, "coord_origin": "BOTTOMLEFT"}, "charspan": [0, 887]}], "orig": "Lorem ipsum dolor sit amet, consetetur sadipscing elitr, sed diam nonumy eirmod tempor invidunt ut labore et dolore magna aliquyam erat, sed diam voluptua. At vero eos et accusam et justo duo dolores et ea rebum. Stet clita kasd gubergren, no sea takimata sanctus est Lorem ipsum dolor sit amet. Lorem ipsum dolor sit amet, consetetur sadipscing elitr, sed diam nonumy eirmod tempor invidunt ut labore et dolore magna aliquyam erat, sed diam voluptua. At vero eos et accusam et justo duo dolores et ea rebum. Stet clita kasd gubergren, no sea takimata sanctus est Lorem ipsum dolor sit amet. Lorem ipsum dolor sit amet, consetetur sadipscing elitr, sed diam nonumy eirmod tempor invidunt ut labore et dolore magna aliquyam erat, sed diam voluptua. At vero eos et accusam et justo duo dolores et ea rebum. Stet clita kasd gubergren, no sea takimata sanctus est Lorem ipsum dolor sit amet.", "text": "Lorem ipsum dolor sit amet, consetetur sadipscing elitr, sed diam nonumy eirmod tempor invidunt ut labore et dolore magna aliquyam erat, sed diam voluptua. At vero eos et accusam et justo duo dolores et ea rebum. Stet clita kasd gubergren, no sea takimata sanctus est Lorem ipsum dolor sit amet. Lorem ipsum dolor sit amet, consetetur sadipscing elitr, sed diam nonumy eirmod tempor invidunt ut labore et dolore magna aliquyam erat, sed diam voluptua. At vero eos et accusam et justo duo dolores et ea rebum. Stet clita kasd gubergren, no sea takimata sanctus est Lorem ipsum dolor sit amet. Lorem ipsum dolor sit amet, consetetur sadipscing elitr, sed diam nonumy eirmod tempor invidunt ut labore et dolore magna aliquyam erat, sed diam voluptua. At vero eos et accusam et justo duo dolores et ea rebum. Stet clita kasd gubergren, no sea takimata sanctus est Lorem ipsum dolor sit amet."}, {"self_ref": "#/texts/11", "parent": {"cref": "#/body"}, "children": [], "label": "text", "prov": [{"page_no": 2, "bbox": {"l": 133.76800537109375, "t": 316.1879577636719, "r": 477.4748229980469, "b": 246.44935607910156, "coord_origin": "BOTTOMLEFT"}, "charspan": [0, 415]}], "orig": "Duis autem vel eum iriure dolor in hendrerit in vulputate velit esse molestie consequat, vel illum dolore eu feugiat nulla facilisis at vero eros et accumsan et iusto odio dignissim qui blandit praesent luptatum zzril delenit augue duis dolore te feugait nulla facilisi. Lorem ipsum dolor sit amet, consectetuer adipiscing elit, sed diam nonummy nibh euismod tincidunt ut laoreet dolore magna aliquam erat volutpat.", "text": "Duis autem vel eum iriure dolor in hendrerit in vulputate velit esse molestie consequat, vel illum dolore eu feugiat nulla facilisis at vero eros et accumsan et iusto odio dignissim qui blandit praesent luptatum zzril delenit augue duis dolore te feugait nulla facilisi. Lorem ipsum dolor sit amet, consectetuer adipiscing elit, sed diam nonummy nibh euismod tincidunt ut laoreet dolore magna aliquam erat volutpat."}, {"self_ref": "#/texts/12", "parent": {"cref": "#/body"}, "children": [], "label": "text", "prov": [{"page_no": 2, "bbox": {"l": 133.76800537109375, "t": 244.4569549560547, "r": 477.4748229980469, "b": 174.71835327148438, "coord_origin": "BOTTOMLEFT"}, "charspan": [0, 415]}], "orig": "Duis autem vel eum iriure dolor in hendrerit in vulputate velit esse molestie consequat, vel illum dolore eu feugiat nulla facilisis at vero eros et accumsan et iusto odio dignissim qui blandit praesent luptatum zzril delenit augue duis dolore te feugait nulla facilisi. Lorem ipsum dolor sit amet, consectetuer adipiscing elit, sed diam nonummy nibh euismod tincidunt ut laoreet dolore magna aliquam erat volutpat.", "text": "Duis autem vel eum iriure dolor in hendrerit in vulputate velit esse molestie consequat, vel illum dolore eu feugiat nulla facilisis at vero eros et accumsan et iusto odio dignissim qui blandit praesent luptatum zzril delenit augue duis dolore te feugait nulla facilisi. Lorem ipsum dolor sit amet, consectetuer adipiscing elit, sed diam nonummy nibh euismod tincidunt ut laoreet dolore magna aliquam erat volutpat."}, {"self_ref": "#/texts/13", "parent": {"cref": "#/body"}, "children": [], "label": "page_footer", "prov": [{"page_no": 2, "bbox": {"l": 303.13299560546875, "t": 146.7259521484375, "r": 308.1142883300781, "b": 136.7633514404297, "coord_origin": "BOTTOMLEFT"}, "charspan": [0, 1]}], "orig": "1", "text": "1"}], "pictures": [], "tables": [], "key_value_items": [], "pages": {"1": {"size": {"width": 612.0, "height": 792.0}, "image": null, "page_no": 1}, "2": {"size": {"width": 595.2760009765625, "height": 841.8900146484375}, "image": null, "page_no": 2}}} \ No newline at end of file diff --git a/tests/data/groundtruth/docling_v2/code_and_formula.md b/tests/data/groundtruth/docling_v2/code_and_formula.md new file mode 100644 index 00000000..2e2636a3 --- /dev/null +++ b/tests/data/groundtruth/docling_v2/code_and_formula.md @@ -0,0 +1,25 @@ +## Java Code Example + +Lorem ipsum dolor sit amet, consetetur sadipscing elitr, sed diam nonumy eirmod tempor invidunt ut labore et dolore magna aliquyam erat, sed diam voluptua. At vero eos et accusam et justo duo dolores et ea rebum. Stet clita kasd gubergren, no sea takimata sanctus est Lorem ipsum dolor sit amet. Lorem ipsum dolor sit amet, consetetur sadipscing elitr, sed diam nonumy eirmod tempor invidunt ut labore et dolore magna aliquyam erat, sed diam voluptua. At vero eos et accusam et justo duo dolores et ea rebum. Stet clita kasd gubergren, no sea takimata sanctus est Lorem ipsum dolor sit amet. Lorem ipsum dolor sit amet, consetetur sadipscing elitr, sed diam nonumy eirmod tempor invidunt ut labore et dolore magna aliquyam erat, sed diam voluptua. At vero eos et accusam et justo duo dolores et ea rebum. Stet clita kasd gubergren, no sea takimata sanctus est Lorem ipsum dolor sit amet. + +Listing 1: Simple Java Program + +``` +public static void print() { System.out.println( "Java Code" ); } +``` + +Lorem ipsum dolor sit amet, consetetur sadipscing elitr, sed diam nonumy eirmod tempor invidunt ut labore et dolore magna aliquyam erat, sed diam voluptua. At vero eos et accusam et justo duo dolores et ea rebum. Stet clita kasd gubergren, no sea takimata sanctus est Lorem ipsum dolor sit amet. Lorem ipsum dolor sit amet, consetetur sadipscing elitr, sed diam nonumy eirmod tempor invidunt ut labore et dolore magna aliquyam erat, sed diam voluptua. At vero eos et accusam et justo duo dolores et ea rebum. Stet clita kasd gubergren, no sea takimata sanctus est Lorem ipsum dolor sit amet. Lorem ipsum dolor sit amet, consetetur sadipscing elitr, sed diam nonumy eirmod tempor invidunt ut labore et dolore magna aliquyam erat, sed diam voluptua. At vero eos et accusam et justo duo dolores et ea rebum. Stet clita kasd gubergren, no sea takimata sanctus est Lorem ipsum dolor sit amet. + +## Formula + +Lorem ipsum dolor sit amet, consetetur sadipscing elitr, sed diam nonumy eirmod tempor invidunt ut labore et dolore magna aliquyam erat, sed diam voluptua. At vero eos et accusam et justo duo dolores et ea rebum. Stet clita kasd gubergren, no sea takimata sanctus est Lorem ipsum dolor sit amet. Lorem ipsum dolor sit amet, consetetur sadipscing elitr, sed diam nonumy eirmod tempor invidunt ut labore et dolore magna aliquyam erat, sed diam voluptua. At vero eos et accusam et justo duo dolores et ea rebum. Stet clita kasd gubergren, no sea takimata sanctus est Lorem ipsum dolor sit amet. Lorem ipsum dolor sit amet, consetetur sadipscing elitr, sed diam nonumy eirmod tempor invidunt ut labore et dolore magna aliquyam erat, sed diam voluptua. At vero eos et accusam et justo duo dolores et ea rebum. Stet clita kasd gubergren, no sea takimata sanctus est Lorem ipsum dolor sit amet. + +Duis autem vel eum iriure dolor in hendrerit in vulputate velit esse molestie consequat, vel illum dolore eu feugiat nulla facilisis at vero eros et accumsan et iusto odio dignissim qui blandit praesent luptatum zzril delenit augue duis dolore te feugait nulla facilisi. Lorem ipsum dolor sit amet, consectetuer adipiscing elit, sed diam nonummy nibh euismod tincidunt. + +a 2 + 8 = 12 + +Lorem ipsum dolor sit amet, consetetur sadipscing elitr, sed diam nonumy eirmod tempor invidunt ut labore et dolore magna aliquyam erat, sed diam voluptua. At vero eos et accusam et justo duo dolores et ea rebum. Stet clita kasd gubergren, no sea takimata sanctus est Lorem ipsum dolor sit amet. Lorem ipsum dolor sit amet, consetetur sadipscing elitr, sed diam nonumy eirmod tempor invidunt ut labore et dolore magna aliquyam erat, sed diam voluptua. At vero eos et accusam et justo duo dolores et ea rebum. Stet clita kasd gubergren, no sea takimata sanctus est Lorem ipsum dolor sit amet. Lorem ipsum dolor sit amet, consetetur sadipscing elitr, sed diam nonumy eirmod tempor invidunt ut labore et dolore magna aliquyam erat, sed diam voluptua. At vero eos et accusam et justo duo dolores et ea rebum. Stet clita kasd gubergren, no sea takimata sanctus est Lorem ipsum dolor sit amet. + +Duis autem vel eum iriure dolor in hendrerit in vulputate velit esse molestie consequat, vel illum dolore eu feugiat nulla facilisis at vero eros et accumsan et iusto odio dignissim qui blandit praesent luptatum zzril delenit augue duis dolore te feugait nulla facilisi. Lorem ipsum dolor sit amet, consectetuer adipiscing elit, sed diam nonummy nibh euismod tincidunt ut laoreet dolore magna aliquam erat volutpat. + +Duis autem vel eum iriure dolor in hendrerit in vulputate velit esse molestie consequat, vel illum dolore eu feugiat nulla facilisis at vero eros et accumsan et iusto odio dignissim qui blandit praesent luptatum zzril delenit augue duis dolore te feugait nulla facilisi. Lorem ipsum dolor sit amet, consectetuer adipiscing elit, sed diam nonummy nibh euismod tincidunt ut laoreet dolore magna aliquam erat volutpat. \ No newline at end of file diff --git a/tests/data/groundtruth/docling_v2/code_and_formula.pages.json b/tests/data/groundtruth/docling_v2/code_and_formula.pages.json new file mode 100644 index 00000000..27682fdd --- /dev/null +++ b/tests/data/groundtruth/docling_v2/code_and_formula.pages.json @@ -0,0 +1 @@ +[{"page_no": 0, "size": {"width": 612.0, "height": 792.0}, "cells": [{"id": 0, "text": "Java Code Example", "bbox": {"l": 133.76801, "t": 124.00536999999997, "r": 273.45401, "b": 138.36590999999999, "coord_origin": "TOPLEFT"}}, {"id": 1, "text": "Lorem ipsum dolor sit amet, consetetur sadipscing elitr, sed diam nonumy eir-", "bbox": {"l": 133.76801, "t": 149.11406999999997, "r": 477.47687, "b": 159.07665999999995, "coord_origin": "TOPLEFT"}}, {"id": 2, "text": "mod tempor invidunt ut labore et dolore magna aliquyam erat, sed diam volup-", "bbox": {"l": 133.76801, "t": 161.06908999999996, "r": 477.47781000000003, "b": 171.03168000000005, "coord_origin": "TOPLEFT"}}, {"id": 3, "text": "tua. At vero eos et accusam et justo duo dolores et ea rebum. Stet clita kasd", "bbox": {"l": 133.76801, "t": 173.02405, "r": 477.46985, "b": 182.98663, "coord_origin": "TOPLEFT"}}, {"id": 4, "text": "gubergren, no sea takimata sanctus est Lorem ipsum dolor sit amet. Lorem ip-", "bbox": {"l": 133.76801, "t": 184.97906, "r": 477.47186, "b": 194.94164999999998, "coord_origin": "TOPLEFT"}}, {"id": 5, "text": "sum dolor sit amet, consetetur sadipscing elitr, sed diam nonumy eirmod tempor", "bbox": {"l": 133.76801, "t": 196.93408, "r": 477.46993999999995, "b": 206.89666999999997, "coord_origin": "TOPLEFT"}}, {"id": 6, "text": "invidunt ut labore et dolore magna aliquyam erat, sed diam voluptua. At vero", "bbox": {"l": 133.76801, "t": 208.88909999999998, "r": 477.4806500000001, "b": 218.85168, "coord_origin": "TOPLEFT"}}, {"id": 7, "text": "eos et accusam et justo duo dolores et ea rebum.", "bbox": {"l": 133.76801, "t": 220.84509000000003, "r": 355.20087, "b": 230.80768, "coord_origin": "TOPLEFT"}}, {"id": 8, "text": "Stet clita kasd gubergren,", "bbox": {"l": 362.60107, "t": 220.84509000000003, "r": 477.47495000000004, "b": 230.80768, "coord_origin": "TOPLEFT"}}, {"id": 9, "text": "no sea takimata sanctus est Lorem ipsum dolor sit amet. Lorem ipsum dolor", "bbox": {"l": 133.76801, "t": 232.80005000000006, "r": 477.47083, "b": 242.76262999999994, "coord_origin": "TOPLEFT"}}, {"id": 10, "text": "sit amet, consetetur sadipscing elitr, sed diam nonumy eirmod tempor invidunt", "bbox": {"l": 133.76801, "t": 244.75507000000005, "r": 477.47092, "b": 254.71765000000005, "coord_origin": "TOPLEFT"}}, {"id": 11, "text": "ut labore et dolore magna aliquyam erat, sed diam voluptua. At vero eos et", "bbox": {"l": 133.76801, "t": 256.71007999999995, "r": 477.46588, "b": 266.67267000000004, "coord_origin": "TOPLEFT"}}, {"id": 12, "text": "accusam et justo duo dolores et ea rebum. Stet clita kasd gubergren, no sea", "bbox": {"l": 133.76801, "t": 268.66510000000005, "r": 477.47586, "b": 278.62769000000003, "coord_origin": "TOPLEFT"}}, {"id": 13, "text": "takimata sanctus est Lorem ipsum dolor sit amet.", "bbox": {"l": 133.76801, "t": 280.62103, "r": 351.47672, "b": 290.58365, "coord_origin": "TOPLEFT"}}, {"id": 14, "text": "Listing 1: Simple Java Program", "bbox": {"l": 236.17598999999998, "t": 301.54204999999996, "r": 375.06958, "b": 311.50467, "coord_origin": "TOPLEFT"}}, {"id": 15, "text": "public static void", "bbox": {"l": 134.575, "t": 317.79947000000004, "r": 235.44695, "b": 326.14719, "coord_origin": "TOPLEFT"}}, {"id": 16, "text": "print() {", "bbox": {"l": 241.8759, "t": 317.79947000000004, "r": 292.24908, "b": 326.14719, "coord_origin": "TOPLEFT"}}, {"id": 17, "text": "System.out.println(", "bbox": {"l": 157.17101, "t": 328.75745, "r": 264.02368, "b": 337.10516000000007, "coord_origin": "TOPLEFT"}}, {"id": 18, "text": "\"Java Code\"", "bbox": {"l": 264.15811, "t": 328.75745, "r": 325.35376, "b": 337.10516000000007, "coord_origin": "TOPLEFT"}}, {"id": 19, "text": ");", "bbox": {"l": 326.2952, "t": 328.75745, "r": 337.59286, "b": 337.10516000000007, "coord_origin": "TOPLEFT"}}, {"id": 20, "text": "}", "bbox": {"l": 134.239, "t": 339.71646, "r": 138.94637, "b": 348.06418, "coord_origin": "TOPLEFT"}}, {"id": 21, "text": "Lorem ipsum dolor sit amet, consetetur sadipscing elitr, sed diam nonumy", "bbox": {"l": 148.71201, "t": 359.72406, "r": 477.46985, "b": 369.68667999999997, "coord_origin": "TOPLEFT"}}, {"id": 22, "text": "eirmod tempor invidunt ut labore et dolore magna aliquyam erat, sed diam", "bbox": {"l": 133.76801, "t": 371.67905, "r": 477.4758, "b": 381.64166000000006, "coord_origin": "TOPLEFT"}}, {"id": 23, "text": "voluptua. At vero eos et accusam et justo duo dolores et ea rebum. Stet clita", "bbox": {"l": 133.76801, "t": 383.63403, "r": 477.4758, "b": 393.59665, "coord_origin": "TOPLEFT"}}, {"id": 24, "text": "kasd gubergren, no sea takimata sanctus est Lorem ipsum dolor sit amet. Lorem", "bbox": {"l": 133.76801, "t": 395.58905, "r": 477.47284, "b": 405.55167, "coord_origin": "TOPLEFT"}}, {"id": 25, "text": "ipsum dolor sit amet, consetetur sadipscing elitr, sed diam nonumy eirmod", "bbox": {"l": 133.76801, "t": 407.54404, "r": 477.47589, "b": 417.50665, "coord_origin": "TOPLEFT"}}, {"id": 26, "text": "tempor invidunt ut labore et dolore magna aliquyam erat, sed diam voluptua. At", "bbox": {"l": 133.76801, "t": 419.50003000000004, "r": 477.47177000000005, "b": 429.46265, "coord_origin": "TOPLEFT"}}, {"id": 27, "text": "vero eos et accusam et justo duo dolores et ea rebum. Stet clita kasd gubergren,", "bbox": {"l": 133.76801, "t": 431.45505, "r": 477.47387999999995, "b": 441.41766000000007, "coord_origin": "TOPLEFT"}}, {"id": 28, "text": "no sea takimata sanctus est Lorem ipsum dolor sit amet. Lorem ipsum dolor", "bbox": {"l": 133.76801, "t": 443.41003, "r": 477.47083, "b": 453.37265, "coord_origin": "TOPLEFT"}}, {"id": 29, "text": "sit amet, consetetur sadipscing elitr, sed diam nonumy eirmod tempor invidunt", "bbox": {"l": 133.76801, "t": 455.36505, "r": 477.47092, "b": 465.32767, "coord_origin": "TOPLEFT"}}, {"id": 30, "text": "ut labore et dolore magna aliquyam erat, sed diam voluptua. At vero eos et", "bbox": {"l": 133.76801, "t": 467.32004, "r": 477.46585, "b": 477.28265, "coord_origin": "TOPLEFT"}}, {"id": 31, "text": "accusam et justo duo dolores et ea rebum. Stet clita kasd gubergren, no sea", "bbox": {"l": 133.76801, "t": 479.27603, "r": 477.47586, "b": 489.23865, "coord_origin": "TOPLEFT"}}, {"id": 32, "text": "takimata sanctus est Lorem ipsum dolor sit amet.", "bbox": {"l": 133.76801, "t": 491.23105, "r": 351.47672, "b": 501.19366, "coord_origin": "TOPLEFT"}}, {"id": 33, "text": "1", "bbox": {"l": 303.133, "t": 695.163055, "r": 308.11429, "b": 705.125648, "coord_origin": "TOPLEFT"}}], "predictions": {"layout": {"clusters": [{"id": 2, "label": "section_header", "bbox": {"l": 133.76801, "t": 124.00536999999997, "r": 273.45401, "b": 138.36590999999999, "coord_origin": "TOPLEFT"}, "confidence": 0.9581764936447144, "cells": [{"id": 0, "text": "Java Code Example", "bbox": {"l": 133.76801, "t": 124.00536999999997, "r": 273.45401, "b": 138.36590999999999, "coord_origin": "TOPLEFT"}}], "children": []}, {"id": 0, "label": "text", "bbox": {"l": 133.76801, "t": 149.11406999999997, "r": 477.4806500000001, "b": 290.58365, "coord_origin": "TOPLEFT"}, "confidence": 0.9872456789016724, "cells": [{"id": 1, "text": "Lorem ipsum dolor sit amet, consetetur sadipscing elitr, sed diam nonumy eir-", "bbox": {"l": 133.76801, "t": 149.11406999999997, "r": 477.47687, "b": 159.07665999999995, "coord_origin": "TOPLEFT"}}, {"id": 2, "text": "mod tempor invidunt ut labore et dolore magna aliquyam erat, sed diam volup-", "bbox": {"l": 133.76801, "t": 161.06908999999996, "r": 477.47781000000003, "b": 171.03168000000005, "coord_origin": "TOPLEFT"}}, {"id": 3, "text": "tua. At vero eos et accusam et justo duo dolores et ea rebum. Stet clita kasd", "bbox": {"l": 133.76801, "t": 173.02405, "r": 477.46985, "b": 182.98663, "coord_origin": "TOPLEFT"}}, {"id": 4, "text": "gubergren, no sea takimata sanctus est Lorem ipsum dolor sit amet. Lorem ip-", "bbox": {"l": 133.76801, "t": 184.97906, "r": 477.47186, "b": 194.94164999999998, "coord_origin": "TOPLEFT"}}, {"id": 5, "text": "sum dolor sit amet, consetetur sadipscing elitr, sed diam nonumy eirmod tempor", "bbox": {"l": 133.76801, "t": 196.93408, "r": 477.46993999999995, "b": 206.89666999999997, "coord_origin": "TOPLEFT"}}, {"id": 6, "text": "invidunt ut labore et dolore magna aliquyam erat, sed diam voluptua. At vero", "bbox": {"l": 133.76801, "t": 208.88909999999998, "r": 477.4806500000001, "b": 218.85168, "coord_origin": "TOPLEFT"}}, {"id": 7, "text": "eos et accusam et justo duo dolores et ea rebum.", "bbox": {"l": 133.76801, "t": 220.84509000000003, "r": 355.20087, "b": 230.80768, "coord_origin": "TOPLEFT"}}, {"id": 8, "text": "Stet clita kasd gubergren,", "bbox": {"l": 362.60107, "t": 220.84509000000003, "r": 477.47495000000004, "b": 230.80768, "coord_origin": "TOPLEFT"}}, {"id": 9, "text": "no sea takimata sanctus est Lorem ipsum dolor sit amet. Lorem ipsum dolor", "bbox": {"l": 133.76801, "t": 232.80005000000006, "r": 477.47083, "b": 242.76262999999994, "coord_origin": "TOPLEFT"}}, {"id": 10, "text": "sit amet, consetetur sadipscing elitr, sed diam nonumy eirmod tempor invidunt", "bbox": {"l": 133.76801, "t": 244.75507000000005, "r": 477.47092, "b": 254.71765000000005, "coord_origin": "TOPLEFT"}}, {"id": 11, "text": "ut labore et dolore magna aliquyam erat, sed diam voluptua. At vero eos et", "bbox": {"l": 133.76801, "t": 256.71007999999995, "r": 477.46588, "b": 266.67267000000004, "coord_origin": "TOPLEFT"}}, {"id": 12, "text": "accusam et justo duo dolores et ea rebum. Stet clita kasd gubergren, no sea", "bbox": {"l": 133.76801, "t": 268.66510000000005, "r": 477.47586, "b": 278.62769000000003, "coord_origin": "TOPLEFT"}}, {"id": 13, "text": "takimata sanctus est Lorem ipsum dolor sit amet.", "bbox": {"l": 133.76801, "t": 280.62103, "r": 351.47672, "b": 290.58365, "coord_origin": "TOPLEFT"}}], "children": []}, {"id": 5, "label": "caption", "bbox": {"l": 236.17598999999998, "t": 301.54204999999996, "r": 375.06958, "b": 311.50467, "coord_origin": "TOPLEFT"}, "confidence": 0.5425266027450562, "cells": [{"id": 14, "text": "Listing 1: Simple Java Program", "bbox": {"l": 236.17598999999998, "t": 301.54204999999996, "r": 375.06958, "b": 311.50467, "coord_origin": "TOPLEFT"}}], "children": []}, {"id": 4, "label": "code", "bbox": {"l": 134.239, "t": 317.79947000000004, "r": 337.59286, "b": 348.06418, "coord_origin": "TOPLEFT"}, "confidence": 0.6383119821548462, "cells": [{"id": 15, "text": "public static void", "bbox": {"l": 134.575, "t": 317.79947000000004, "r": 235.44695, "b": 326.14719, "coord_origin": "TOPLEFT"}}, {"id": 16, "text": "print() {", "bbox": {"l": 241.8759, "t": 317.79947000000004, "r": 292.24908, "b": 326.14719, "coord_origin": "TOPLEFT"}}, {"id": 17, "text": "System.out.println(", "bbox": {"l": 157.17101, "t": 328.75745, "r": 264.02368, "b": 337.10516000000007, "coord_origin": "TOPLEFT"}}, {"id": 18, "text": "\"Java Code\"", "bbox": {"l": 264.15811, "t": 328.75745, "r": 325.35376, "b": 337.10516000000007, "coord_origin": "TOPLEFT"}}, {"id": 19, "text": ");", "bbox": {"l": 326.2952, "t": 328.75745, "r": 337.59286, "b": 337.10516000000007, "coord_origin": "TOPLEFT"}}, {"id": 20, "text": "}", "bbox": {"l": 134.239, "t": 339.71646, "r": 138.94637, "b": 348.06418, "coord_origin": "TOPLEFT"}}], "children": []}, {"id": 1, "label": "text", "bbox": {"l": 133.76801, "t": 359.72406, "r": 477.47589, "b": 501.19366, "coord_origin": "TOPLEFT"}, "confidence": 0.9869542717933655, "cells": [{"id": 21, "text": "Lorem ipsum dolor sit amet, consetetur sadipscing elitr, sed diam nonumy", "bbox": {"l": 148.71201, "t": 359.72406, "r": 477.46985, "b": 369.68667999999997, "coord_origin": "TOPLEFT"}}, {"id": 22, "text": "eirmod tempor invidunt ut labore et dolore magna aliquyam erat, sed diam", "bbox": {"l": 133.76801, "t": 371.67905, "r": 477.4758, "b": 381.64166000000006, "coord_origin": "TOPLEFT"}}, {"id": 23, "text": "voluptua. At vero eos et accusam et justo duo dolores et ea rebum. Stet clita", "bbox": {"l": 133.76801, "t": 383.63403, "r": 477.4758, "b": 393.59665, "coord_origin": "TOPLEFT"}}, {"id": 24, "text": "kasd gubergren, no sea takimata sanctus est Lorem ipsum dolor sit amet. Lorem", "bbox": {"l": 133.76801, "t": 395.58905, "r": 477.47284, "b": 405.55167, "coord_origin": "TOPLEFT"}}, {"id": 25, "text": "ipsum dolor sit amet, consetetur sadipscing elitr, sed diam nonumy eirmod", "bbox": {"l": 133.76801, "t": 407.54404, "r": 477.47589, "b": 417.50665, "coord_origin": "TOPLEFT"}}, {"id": 26, "text": "tempor invidunt ut labore et dolore magna aliquyam erat, sed diam voluptua. At", "bbox": {"l": 133.76801, "t": 419.50003000000004, "r": 477.47177000000005, "b": 429.46265, "coord_origin": "TOPLEFT"}}, {"id": 27, "text": "vero eos et accusam et justo duo dolores et ea rebum. Stet clita kasd gubergren,", "bbox": {"l": 133.76801, "t": 431.45505, "r": 477.47387999999995, "b": 441.41766000000007, "coord_origin": "TOPLEFT"}}, {"id": 28, "text": "no sea takimata sanctus est Lorem ipsum dolor sit amet. Lorem ipsum dolor", "bbox": {"l": 133.76801, "t": 443.41003, "r": 477.47083, "b": 453.37265, "coord_origin": "TOPLEFT"}}, {"id": 29, "text": "sit amet, consetetur sadipscing elitr, sed diam nonumy eirmod tempor invidunt", "bbox": {"l": 133.76801, "t": 455.36505, "r": 477.47092, "b": 465.32767, "coord_origin": "TOPLEFT"}}, {"id": 30, "text": "ut labore et dolore magna aliquyam erat, sed diam voluptua. At vero eos et", "bbox": {"l": 133.76801, "t": 467.32004, "r": 477.46585, "b": 477.28265, "coord_origin": "TOPLEFT"}}, {"id": 31, "text": "accusam et justo duo dolores et ea rebum. Stet clita kasd gubergren, no sea", "bbox": {"l": 133.76801, "t": 479.27603, "r": 477.47586, "b": 489.23865, "coord_origin": "TOPLEFT"}}, {"id": 32, "text": "takimata sanctus est Lorem ipsum dolor sit amet.", "bbox": {"l": 133.76801, "t": 491.23105, "r": 351.47672, "b": 501.19366, "coord_origin": "TOPLEFT"}}], "children": []}, {"id": 3, "label": "page_footer", "bbox": {"l": 303.133, "t": 695.163055, "r": 308.11429, "b": 705.125648, "coord_origin": "TOPLEFT"}, "confidence": 0.8374724984169006, "cells": [{"id": 33, "text": "1", "bbox": {"l": 303.133, "t": 695.163055, "r": 308.11429, "b": 705.125648, "coord_origin": "TOPLEFT"}}], "children": []}]}, "tablestructure": {"table_map": {}}, "figures_classification": null, "equations_prediction": null}, "assembled": {"elements": [{"label": "section_header", "id": 2, "page_no": 0, "cluster": {"id": 2, "label": "section_header", "bbox": {"l": 133.76801, "t": 124.00536999999997, "r": 273.45401, "b": 138.36590999999999, "coord_origin": "TOPLEFT"}, "confidence": 0.9581764936447144, "cells": [{"id": 0, "text": "Java Code Example", "bbox": {"l": 133.76801, "t": 124.00536999999997, "r": 273.45401, "b": 138.36590999999999, "coord_origin": "TOPLEFT"}}], "children": []}, "text": "Java Code Example"}, {"label": "text", "id": 0, "page_no": 0, "cluster": {"id": 0, "label": "text", "bbox": {"l": 133.76801, "t": 149.11406999999997, "r": 477.4806500000001, "b": 290.58365, "coord_origin": "TOPLEFT"}, "confidence": 0.9872456789016724, "cells": [{"id": 1, "text": "Lorem ipsum dolor sit amet, consetetur sadipscing elitr, sed diam nonumy eir-", "bbox": {"l": 133.76801, "t": 149.11406999999997, "r": 477.47687, "b": 159.07665999999995, "coord_origin": "TOPLEFT"}}, {"id": 2, "text": "mod tempor invidunt ut labore et dolore magna aliquyam erat, sed diam volup-", "bbox": {"l": 133.76801, "t": 161.06908999999996, "r": 477.47781000000003, "b": 171.03168000000005, "coord_origin": "TOPLEFT"}}, {"id": 3, "text": "tua. At vero eos et accusam et justo duo dolores et ea rebum. Stet clita kasd", "bbox": {"l": 133.76801, "t": 173.02405, "r": 477.46985, "b": 182.98663, "coord_origin": "TOPLEFT"}}, {"id": 4, "text": "gubergren, no sea takimata sanctus est Lorem ipsum dolor sit amet. Lorem ip-", "bbox": {"l": 133.76801, "t": 184.97906, "r": 477.47186, "b": 194.94164999999998, "coord_origin": "TOPLEFT"}}, {"id": 5, "text": "sum dolor sit amet, consetetur sadipscing elitr, sed diam nonumy eirmod tempor", "bbox": {"l": 133.76801, "t": 196.93408, "r": 477.46993999999995, "b": 206.89666999999997, "coord_origin": "TOPLEFT"}}, {"id": 6, "text": "invidunt ut labore et dolore magna aliquyam erat, sed diam voluptua. At vero", "bbox": {"l": 133.76801, "t": 208.88909999999998, "r": 477.4806500000001, "b": 218.85168, "coord_origin": "TOPLEFT"}}, {"id": 7, "text": "eos et accusam et justo duo dolores et ea rebum.", "bbox": {"l": 133.76801, "t": 220.84509000000003, "r": 355.20087, "b": 230.80768, "coord_origin": "TOPLEFT"}}, {"id": 8, "text": "Stet clita kasd gubergren,", "bbox": {"l": 362.60107, "t": 220.84509000000003, "r": 477.47495000000004, "b": 230.80768, "coord_origin": "TOPLEFT"}}, {"id": 9, "text": "no sea takimata sanctus est Lorem ipsum dolor sit amet. Lorem ipsum dolor", "bbox": {"l": 133.76801, "t": 232.80005000000006, "r": 477.47083, "b": 242.76262999999994, "coord_origin": "TOPLEFT"}}, {"id": 10, "text": "sit amet, consetetur sadipscing elitr, sed diam nonumy eirmod tempor invidunt", "bbox": {"l": 133.76801, "t": 244.75507000000005, "r": 477.47092, "b": 254.71765000000005, "coord_origin": "TOPLEFT"}}, {"id": 11, "text": "ut labore et dolore magna aliquyam erat, sed diam voluptua. At vero eos et", "bbox": {"l": 133.76801, "t": 256.71007999999995, "r": 477.46588, "b": 266.67267000000004, "coord_origin": "TOPLEFT"}}, {"id": 12, "text": "accusam et justo duo dolores et ea rebum. Stet clita kasd gubergren, no sea", "bbox": {"l": 133.76801, "t": 268.66510000000005, "r": 477.47586, "b": 278.62769000000003, "coord_origin": "TOPLEFT"}}, {"id": 13, "text": "takimata sanctus est Lorem ipsum dolor sit amet.", "bbox": {"l": 133.76801, "t": 280.62103, "r": 351.47672, "b": 290.58365, "coord_origin": "TOPLEFT"}}], "children": []}, "text": "Lorem ipsum dolor sit amet, consetetur sadipscing elitr, sed diam nonumy eirmod tempor invidunt ut labore et dolore magna aliquyam erat, sed diam voluptua. At vero eos et accusam et justo duo dolores et ea rebum. Stet clita kasd gubergren, no sea takimata sanctus est Lorem ipsum dolor sit amet. Lorem ipsum dolor sit amet, consetetur sadipscing elitr, sed diam nonumy eirmod tempor invidunt ut labore et dolore magna aliquyam erat, sed diam voluptua. At vero eos et accusam et justo duo dolores et ea rebum. Stet clita kasd gubergren, no sea takimata sanctus est Lorem ipsum dolor sit amet. Lorem ipsum dolor sit amet, consetetur sadipscing elitr, sed diam nonumy eirmod tempor invidunt ut labore et dolore magna aliquyam erat, sed diam voluptua. At vero eos et accusam et justo duo dolores et ea rebum. Stet clita kasd gubergren, no sea takimata sanctus est Lorem ipsum dolor sit amet."}, {"label": "caption", "id": 5, "page_no": 0, "cluster": {"id": 5, "label": "caption", "bbox": {"l": 236.17598999999998, "t": 301.54204999999996, "r": 375.06958, "b": 311.50467, "coord_origin": "TOPLEFT"}, "confidence": 0.5425266027450562, "cells": [{"id": 14, "text": "Listing 1: Simple Java Program", "bbox": {"l": 236.17598999999998, "t": 301.54204999999996, "r": 375.06958, "b": 311.50467, "coord_origin": "TOPLEFT"}}], "children": []}, "text": "Listing 1: Simple Java Program"}, {"label": "code", "id": 4, "page_no": 0, "cluster": {"id": 4, "label": "code", "bbox": {"l": 134.239, "t": 317.79947000000004, "r": 337.59286, "b": 348.06418, "coord_origin": "TOPLEFT"}, "confidence": 0.6383119821548462, "cells": [{"id": 15, "text": "public static void", "bbox": {"l": 134.575, "t": 317.79947000000004, "r": 235.44695, "b": 326.14719, "coord_origin": "TOPLEFT"}}, {"id": 16, "text": "print() {", "bbox": {"l": 241.8759, "t": 317.79947000000004, "r": 292.24908, "b": 326.14719, "coord_origin": "TOPLEFT"}}, {"id": 17, "text": "System.out.println(", "bbox": {"l": 157.17101, "t": 328.75745, "r": 264.02368, "b": 337.10516000000007, "coord_origin": "TOPLEFT"}}, {"id": 18, "text": "\"Java Code\"", "bbox": {"l": 264.15811, "t": 328.75745, "r": 325.35376, "b": 337.10516000000007, "coord_origin": "TOPLEFT"}}, {"id": 19, "text": ");", "bbox": {"l": 326.2952, "t": 328.75745, "r": 337.59286, "b": 337.10516000000007, "coord_origin": "TOPLEFT"}}, {"id": 20, "text": "}", "bbox": {"l": 134.239, "t": 339.71646, "r": 138.94637, "b": 348.06418, "coord_origin": "TOPLEFT"}}], "children": []}, "text": "public static void print() { System.out.println( \"Java Code\" ); }"}, {"label": "text", "id": 1, "page_no": 0, "cluster": {"id": 1, "label": "text", "bbox": {"l": 133.76801, "t": 359.72406, "r": 477.47589, "b": 501.19366, "coord_origin": "TOPLEFT"}, "confidence": 0.9869542717933655, "cells": [{"id": 21, "text": "Lorem ipsum dolor sit amet, consetetur sadipscing elitr, sed diam nonumy", "bbox": {"l": 148.71201, "t": 359.72406, "r": 477.46985, "b": 369.68667999999997, "coord_origin": "TOPLEFT"}}, {"id": 22, "text": "eirmod tempor invidunt ut labore et dolore magna aliquyam erat, sed diam", "bbox": {"l": 133.76801, "t": 371.67905, "r": 477.4758, "b": 381.64166000000006, "coord_origin": "TOPLEFT"}}, {"id": 23, "text": "voluptua. At vero eos et accusam et justo duo dolores et ea rebum. Stet clita", "bbox": {"l": 133.76801, "t": 383.63403, "r": 477.4758, "b": 393.59665, "coord_origin": "TOPLEFT"}}, {"id": 24, "text": "kasd gubergren, no sea takimata sanctus est Lorem ipsum dolor sit amet. Lorem", "bbox": {"l": 133.76801, "t": 395.58905, "r": 477.47284, "b": 405.55167, "coord_origin": "TOPLEFT"}}, {"id": 25, "text": "ipsum dolor sit amet, consetetur sadipscing elitr, sed diam nonumy eirmod", "bbox": {"l": 133.76801, "t": 407.54404, "r": 477.47589, "b": 417.50665, "coord_origin": "TOPLEFT"}}, {"id": 26, "text": "tempor invidunt ut labore et dolore magna aliquyam erat, sed diam voluptua. At", "bbox": {"l": 133.76801, "t": 419.50003000000004, "r": 477.47177000000005, "b": 429.46265, "coord_origin": "TOPLEFT"}}, {"id": 27, "text": "vero eos et accusam et justo duo dolores et ea rebum. Stet clita kasd gubergren,", "bbox": {"l": 133.76801, "t": 431.45505, "r": 477.47387999999995, "b": 441.41766000000007, "coord_origin": "TOPLEFT"}}, {"id": 28, "text": "no sea takimata sanctus est Lorem ipsum dolor sit amet. Lorem ipsum dolor", "bbox": {"l": 133.76801, "t": 443.41003, "r": 477.47083, "b": 453.37265, "coord_origin": "TOPLEFT"}}, {"id": 29, "text": "sit amet, consetetur sadipscing elitr, sed diam nonumy eirmod tempor invidunt", "bbox": {"l": 133.76801, "t": 455.36505, "r": 477.47092, "b": 465.32767, "coord_origin": "TOPLEFT"}}, {"id": 30, "text": "ut labore et dolore magna aliquyam erat, sed diam voluptua. At vero eos et", "bbox": {"l": 133.76801, "t": 467.32004, "r": 477.46585, "b": 477.28265, "coord_origin": "TOPLEFT"}}, {"id": 31, "text": "accusam et justo duo dolores et ea rebum. Stet clita kasd gubergren, no sea", "bbox": {"l": 133.76801, "t": 479.27603, "r": 477.47586, "b": 489.23865, "coord_origin": "TOPLEFT"}}, {"id": 32, "text": "takimata sanctus est Lorem ipsum dolor sit amet.", "bbox": {"l": 133.76801, "t": 491.23105, "r": 351.47672, "b": 501.19366, "coord_origin": "TOPLEFT"}}], "children": []}, "text": "Lorem ipsum dolor sit amet, consetetur sadipscing elitr, sed diam nonumy eirmod tempor invidunt ut labore et dolore magna aliquyam erat, sed diam voluptua. At vero eos et accusam et justo duo dolores et ea rebum. Stet clita kasd gubergren, no sea takimata sanctus est Lorem ipsum dolor sit amet. Lorem ipsum dolor sit amet, consetetur sadipscing elitr, sed diam nonumy eirmod tempor invidunt ut labore et dolore magna aliquyam erat, sed diam voluptua. At vero eos et accusam et justo duo dolores et ea rebum. Stet clita kasd gubergren, no sea takimata sanctus est Lorem ipsum dolor sit amet. Lorem ipsum dolor sit amet, consetetur sadipscing elitr, sed diam nonumy eirmod tempor invidunt ut labore et dolore magna aliquyam erat, sed diam voluptua. At vero eos et accusam et justo duo dolores et ea rebum. Stet clita kasd gubergren, no sea takimata sanctus est Lorem ipsum dolor sit amet."}, {"label": "page_footer", "id": 3, "page_no": 0, "cluster": {"id": 3, "label": "page_footer", "bbox": {"l": 303.133, "t": 695.163055, "r": 308.11429, "b": 705.125648, "coord_origin": "TOPLEFT"}, "confidence": 0.8374724984169006, "cells": [{"id": 33, "text": "1", "bbox": {"l": 303.133, "t": 695.163055, "r": 308.11429, "b": 705.125648, "coord_origin": "TOPLEFT"}}], "children": []}, "text": "1"}], "body": [{"label": "section_header", "id": 2, "page_no": 0, "cluster": {"id": 2, "label": "section_header", "bbox": {"l": 133.76801, "t": 124.00536999999997, "r": 273.45401, "b": 138.36590999999999, "coord_origin": "TOPLEFT"}, "confidence": 0.9581764936447144, "cells": [{"id": 0, "text": "Java Code Example", "bbox": {"l": 133.76801, "t": 124.00536999999997, "r": 273.45401, "b": 138.36590999999999, "coord_origin": "TOPLEFT"}}], "children": []}, "text": "Java Code Example"}, {"label": "text", "id": 0, "page_no": 0, "cluster": {"id": 0, "label": "text", "bbox": {"l": 133.76801, "t": 149.11406999999997, "r": 477.4806500000001, "b": 290.58365, "coord_origin": "TOPLEFT"}, "confidence": 0.9872456789016724, "cells": [{"id": 1, "text": "Lorem ipsum dolor sit amet, consetetur sadipscing elitr, sed diam nonumy eir-", "bbox": {"l": 133.76801, "t": 149.11406999999997, "r": 477.47687, "b": 159.07665999999995, "coord_origin": "TOPLEFT"}}, {"id": 2, "text": "mod tempor invidunt ut labore et dolore magna aliquyam erat, sed diam volup-", "bbox": {"l": 133.76801, "t": 161.06908999999996, "r": 477.47781000000003, "b": 171.03168000000005, "coord_origin": "TOPLEFT"}}, {"id": 3, "text": "tua. At vero eos et accusam et justo duo dolores et ea rebum. Stet clita kasd", "bbox": {"l": 133.76801, "t": 173.02405, "r": 477.46985, "b": 182.98663, "coord_origin": "TOPLEFT"}}, {"id": 4, "text": "gubergren, no sea takimata sanctus est Lorem ipsum dolor sit amet. Lorem ip-", "bbox": {"l": 133.76801, "t": 184.97906, "r": 477.47186, "b": 194.94164999999998, "coord_origin": "TOPLEFT"}}, {"id": 5, "text": "sum dolor sit amet, consetetur sadipscing elitr, sed diam nonumy eirmod tempor", "bbox": {"l": 133.76801, "t": 196.93408, "r": 477.46993999999995, "b": 206.89666999999997, "coord_origin": "TOPLEFT"}}, {"id": 6, "text": "invidunt ut labore et dolore magna aliquyam erat, sed diam voluptua. At vero", "bbox": {"l": 133.76801, "t": 208.88909999999998, "r": 477.4806500000001, "b": 218.85168, "coord_origin": "TOPLEFT"}}, {"id": 7, "text": "eos et accusam et justo duo dolores et ea rebum.", "bbox": {"l": 133.76801, "t": 220.84509000000003, "r": 355.20087, "b": 230.80768, "coord_origin": "TOPLEFT"}}, {"id": 8, "text": "Stet clita kasd gubergren,", "bbox": {"l": 362.60107, "t": 220.84509000000003, "r": 477.47495000000004, "b": 230.80768, "coord_origin": "TOPLEFT"}}, {"id": 9, "text": "no sea takimata sanctus est Lorem ipsum dolor sit amet. Lorem ipsum dolor", "bbox": {"l": 133.76801, "t": 232.80005000000006, "r": 477.47083, "b": 242.76262999999994, "coord_origin": "TOPLEFT"}}, {"id": 10, "text": "sit amet, consetetur sadipscing elitr, sed diam nonumy eirmod tempor invidunt", "bbox": {"l": 133.76801, "t": 244.75507000000005, "r": 477.47092, "b": 254.71765000000005, "coord_origin": "TOPLEFT"}}, {"id": 11, "text": "ut labore et dolore magna aliquyam erat, sed diam voluptua. At vero eos et", "bbox": {"l": 133.76801, "t": 256.71007999999995, "r": 477.46588, "b": 266.67267000000004, "coord_origin": "TOPLEFT"}}, {"id": 12, "text": "accusam et justo duo dolores et ea rebum. Stet clita kasd gubergren, no sea", "bbox": {"l": 133.76801, "t": 268.66510000000005, "r": 477.47586, "b": 278.62769000000003, "coord_origin": "TOPLEFT"}}, {"id": 13, "text": "takimata sanctus est Lorem ipsum dolor sit amet.", "bbox": {"l": 133.76801, "t": 280.62103, "r": 351.47672, "b": 290.58365, "coord_origin": "TOPLEFT"}}], "children": []}, "text": "Lorem ipsum dolor sit amet, consetetur sadipscing elitr, sed diam nonumy eirmod tempor invidunt ut labore et dolore magna aliquyam erat, sed diam voluptua. At vero eos et accusam et justo duo dolores et ea rebum. Stet clita kasd gubergren, no sea takimata sanctus est Lorem ipsum dolor sit amet. Lorem ipsum dolor sit amet, consetetur sadipscing elitr, sed diam nonumy eirmod tempor invidunt ut labore et dolore magna aliquyam erat, sed diam voluptua. At vero eos et accusam et justo duo dolores et ea rebum. Stet clita kasd gubergren, no sea takimata sanctus est Lorem ipsum dolor sit amet. Lorem ipsum dolor sit amet, consetetur sadipscing elitr, sed diam nonumy eirmod tempor invidunt ut labore et dolore magna aliquyam erat, sed diam voluptua. At vero eos et accusam et justo duo dolores et ea rebum. Stet clita kasd gubergren, no sea takimata sanctus est Lorem ipsum dolor sit amet."}, {"label": "caption", "id": 5, "page_no": 0, "cluster": {"id": 5, "label": "caption", "bbox": {"l": 236.17598999999998, "t": 301.54204999999996, "r": 375.06958, "b": 311.50467, "coord_origin": "TOPLEFT"}, "confidence": 0.5425266027450562, "cells": [{"id": 14, "text": "Listing 1: Simple Java Program", "bbox": {"l": 236.17598999999998, "t": 301.54204999999996, "r": 375.06958, "b": 311.50467, "coord_origin": "TOPLEFT"}}], "children": []}, "text": "Listing 1: Simple Java Program"}, {"label": "code", "id": 4, "page_no": 0, "cluster": {"id": 4, "label": "code", "bbox": {"l": 134.239, "t": 317.79947000000004, "r": 337.59286, "b": 348.06418, "coord_origin": "TOPLEFT"}, "confidence": 0.6383119821548462, "cells": [{"id": 15, "text": "public static void", "bbox": {"l": 134.575, "t": 317.79947000000004, "r": 235.44695, "b": 326.14719, "coord_origin": "TOPLEFT"}}, {"id": 16, "text": "print() {", "bbox": {"l": 241.8759, "t": 317.79947000000004, "r": 292.24908, "b": 326.14719, "coord_origin": "TOPLEFT"}}, {"id": 17, "text": "System.out.println(", "bbox": {"l": 157.17101, "t": 328.75745, "r": 264.02368, "b": 337.10516000000007, "coord_origin": "TOPLEFT"}}, {"id": 18, "text": "\"Java Code\"", "bbox": {"l": 264.15811, "t": 328.75745, "r": 325.35376, "b": 337.10516000000007, "coord_origin": "TOPLEFT"}}, {"id": 19, "text": ");", "bbox": {"l": 326.2952, "t": 328.75745, "r": 337.59286, "b": 337.10516000000007, "coord_origin": "TOPLEFT"}}, {"id": 20, "text": "}", "bbox": {"l": 134.239, "t": 339.71646, "r": 138.94637, "b": 348.06418, "coord_origin": "TOPLEFT"}}], "children": []}, "text": "public static void print() { System.out.println( \"Java Code\" ); }"}, {"label": "text", "id": 1, "page_no": 0, "cluster": {"id": 1, "label": "text", "bbox": {"l": 133.76801, "t": 359.72406, "r": 477.47589, "b": 501.19366, "coord_origin": "TOPLEFT"}, "confidence": 0.9869542717933655, "cells": [{"id": 21, "text": "Lorem ipsum dolor sit amet, consetetur sadipscing elitr, sed diam nonumy", "bbox": {"l": 148.71201, "t": 359.72406, "r": 477.46985, "b": 369.68667999999997, "coord_origin": "TOPLEFT"}}, {"id": 22, "text": "eirmod tempor invidunt ut labore et dolore magna aliquyam erat, sed diam", "bbox": {"l": 133.76801, "t": 371.67905, "r": 477.4758, "b": 381.64166000000006, "coord_origin": "TOPLEFT"}}, {"id": 23, "text": "voluptua. At vero eos et accusam et justo duo dolores et ea rebum. Stet clita", "bbox": {"l": 133.76801, "t": 383.63403, "r": 477.4758, "b": 393.59665, "coord_origin": "TOPLEFT"}}, {"id": 24, "text": "kasd gubergren, no sea takimata sanctus est Lorem ipsum dolor sit amet. Lorem", "bbox": {"l": 133.76801, "t": 395.58905, "r": 477.47284, "b": 405.55167, "coord_origin": "TOPLEFT"}}, {"id": 25, "text": "ipsum dolor sit amet, consetetur sadipscing elitr, sed diam nonumy eirmod", "bbox": {"l": 133.76801, "t": 407.54404, "r": 477.47589, "b": 417.50665, "coord_origin": "TOPLEFT"}}, {"id": 26, "text": "tempor invidunt ut labore et dolore magna aliquyam erat, sed diam voluptua. At", "bbox": {"l": 133.76801, "t": 419.50003000000004, "r": 477.47177000000005, "b": 429.46265, "coord_origin": "TOPLEFT"}}, {"id": 27, "text": "vero eos et accusam et justo duo dolores et ea rebum. Stet clita kasd gubergren,", "bbox": {"l": 133.76801, "t": 431.45505, "r": 477.47387999999995, "b": 441.41766000000007, "coord_origin": "TOPLEFT"}}, {"id": 28, "text": "no sea takimata sanctus est Lorem ipsum dolor sit amet. Lorem ipsum dolor", "bbox": {"l": 133.76801, "t": 443.41003, "r": 477.47083, "b": 453.37265, "coord_origin": "TOPLEFT"}}, {"id": 29, "text": "sit amet, consetetur sadipscing elitr, sed diam nonumy eirmod tempor invidunt", "bbox": {"l": 133.76801, "t": 455.36505, "r": 477.47092, "b": 465.32767, "coord_origin": "TOPLEFT"}}, {"id": 30, "text": "ut labore et dolore magna aliquyam erat, sed diam voluptua. At vero eos et", "bbox": {"l": 133.76801, "t": 467.32004, "r": 477.46585, "b": 477.28265, "coord_origin": "TOPLEFT"}}, {"id": 31, "text": "accusam et justo duo dolores et ea rebum. Stet clita kasd gubergren, no sea", "bbox": {"l": 133.76801, "t": 479.27603, "r": 477.47586, "b": 489.23865, "coord_origin": "TOPLEFT"}}, {"id": 32, "text": "takimata sanctus est Lorem ipsum dolor sit amet.", "bbox": {"l": 133.76801, "t": 491.23105, "r": 351.47672, "b": 501.19366, "coord_origin": "TOPLEFT"}}], "children": []}, "text": "Lorem ipsum dolor sit amet, consetetur sadipscing elitr, sed diam nonumy eirmod tempor invidunt ut labore et dolore magna aliquyam erat, sed diam voluptua. At vero eos et accusam et justo duo dolores et ea rebum. Stet clita kasd gubergren, no sea takimata sanctus est Lorem ipsum dolor sit amet. Lorem ipsum dolor sit amet, consetetur sadipscing elitr, sed diam nonumy eirmod tempor invidunt ut labore et dolore magna aliquyam erat, sed diam voluptua. At vero eos et accusam et justo duo dolores et ea rebum. Stet clita kasd gubergren, no sea takimata sanctus est Lorem ipsum dolor sit amet. Lorem ipsum dolor sit amet, consetetur sadipscing elitr, sed diam nonumy eirmod tempor invidunt ut labore et dolore magna aliquyam erat, sed diam voluptua. At vero eos et accusam et justo duo dolores et ea rebum. Stet clita kasd gubergren, no sea takimata sanctus est Lorem ipsum dolor sit amet."}], "headers": [{"label": "page_footer", "id": 3, "page_no": 0, "cluster": {"id": 3, "label": "page_footer", "bbox": {"l": 303.133, "t": 695.163055, "r": 308.11429, "b": 705.125648, "coord_origin": "TOPLEFT"}, "confidence": 0.8374724984169006, "cells": [{"id": 33, "text": "1", "bbox": {"l": 303.133, "t": 695.163055, "r": 308.11429, "b": 705.125648, "coord_origin": "TOPLEFT"}}], "children": []}, "text": "1"}]}}, {"page_no": 1, "size": {"width": 595.2760009765625, "height": 841.8900146484375}, "cells": [{"id": 0, "text": "Formula", "bbox": {"l": 133.76801021944917, "t": 124.00537068468714, "r": 191.51430031418315, "b": 138.36590076397772, "coord_origin": "TOPLEFT"}}, {"id": 1, "text": "Lorem ipsum dolor sit amet, consetetur sadipscing elitr, sed diam nonumy eir-", "bbox": {"l": 133.76801021944917, "t": 149.114070823323, "r": 477.47687078331063, "b": 159.0766608783307, "coord_origin": "TOPLEFT"}}, {"id": 2, "text": "mod tempor invidunt ut labore et dolore magna aliquyam erat, sed diam volup-", "bbox": {"l": 133.76801021944917, "t": 161.06909088933185, "r": 477.4778107833122, "b": 171.03167094433945, "coord_origin": "TOPLEFT"}}, {"id": 3, "text": "tua. At vero eos et accusam et justo duo dolores et ea rebum. Stet clita kasd", "bbox": {"l": 133.76801021944917, "t": 173.0241009553406, "r": 477.46985078329914, "b": 182.9866910103483, "coord_origin": "TOPLEFT"}}, {"id": 4, "text": "gubergren, no sea takimata sanctus est Lorem ipsum dolor sit amet. Lorem ip-", "bbox": {"l": 133.76801021944917, "t": 184.97906102134914, "r": 477.4718607833024, "b": 194.94165107635683, "coord_origin": "TOPLEFT"}}, {"id": 5, "text": "sum dolor sit amet, consetetur sadipscing elitr, sed diam nonumy eirmod tempor", "bbox": {"l": 133.76801021944917, "t": 196.93511108736357, "r": 477.4699407832993, "b": 206.89770114237137, "coord_origin": "TOPLEFT"}}, {"id": 6, "text": "invidunt ut labore et dolore magna aliquyam erat, sed diam voluptua. At vero", "bbox": {"l": 133.76801021944917, "t": 208.890071153372, "r": 477.4806507833169, "b": 218.8526612083799, "coord_origin": "TOPLEFT"}}, {"id": 7, "text": "eos et accusam et justo duo dolores et ea rebum.", "bbox": {"l": 133.76801021944917, "t": 220.84509121938083, "r": 355.20087058271434, "b": 230.80767127438855, "coord_origin": "TOPLEFT"}}, {"id": 8, "text": "Stet clita kasd gubergren,", "bbox": {"l": 362.6010705948545, "t": 220.84509121938083, "r": 477.4749507833075, "b": 230.80767127438855, "coord_origin": "TOPLEFT"}}, {"id": 9, "text": "no sea takimata sanctus est Lorem ipsum dolor sit amet. Lorem ipsum dolor", "bbox": {"l": 133.76801021944917, "t": 232.8001112853898, "r": 477.4708307833007, "b": 242.76269134039728, "coord_origin": "TOPLEFT"}}, {"id": 10, "text": "sit amet, consetetur sadipscing elitr, sed diam nonumy eirmod tempor invidunt", "bbox": {"l": 133.76801021944917, "t": 244.75506135139813, "r": 477.4709207833008, "b": 254.71765140640582, "coord_origin": "TOPLEFT"}}, {"id": 11, "text": "ut labore et dolore magna aliquyam erat, sed diam voluptua. At vero eos et", "bbox": {"l": 133.76801021944917, "t": 256.71008141740685, "r": 477.4658807832926, "b": 266.6726614724146, "coord_origin": "TOPLEFT"}}, {"id": 12, "text": "accusam et justo duo dolores et ea rebum. Stet clita kasd gubergren, no sea", "bbox": {"l": 133.76801021944917, "t": 268.6660714834211, "r": 477.47586078330903, "b": 278.6286615384289, "coord_origin": "TOPLEFT"}}, {"id": 13, "text": "takimata sanctus est Lorem ipsum dolor sit amet.", "bbox": {"l": 133.76801021944917, "t": 280.62109154942993, "r": 351.4767205766048, "b": 290.58367160443765, "coord_origin": "TOPLEFT"}}, {"id": 14, "text": "Duis autem vel eum iriure dolor in hendrerit in vulputate velit esse molestie", "bbox": {"l": 148.71201024396512, "t": 292.5761116154388, "r": 477.4710707833011, "b": 302.5386916704465, "coord_origin": "TOPLEFT"}}, {"id": 15, "text": "consequat, vel illum dolore eu feugiat nulla facilisis at vero eros et accumsan et", "bbox": {"l": 133.76801021944917, "t": 304.5310616814471, "r": 477.47293078330415, "b": 314.4936517364549, "coord_origin": "TOPLEFT"}}, {"id": 16, "text": "iusto odio dignissim qui blandit praesent luptatum zzril delenit augue duis dolore", "bbox": {"l": 133.76801021944917, "t": 316.48608174745607, "r": 477.47482078330734, "b": 326.4486618024637, "coord_origin": "TOPLEFT"}}, {"id": 17, "text": "te feugait nulla facilisi. Lorem ipsum dolor sit amet, consectetuer adipiscing elit,", "bbox": {"l": 133.76801021944917, "t": 328.4411018134648, "r": 477.472810783304, "b": 338.40368186847246, "coord_origin": "TOPLEFT"}}, {"id": 18, "text": "sed diam nonummy nibh euismod tincidunt.", "bbox": {"l": 133.76801021944917, "t": 340.3970618794788, "r": 326.20752053515014, "b": 350.3596819344868, "coord_origin": "TOPLEFT"}}, {"id": 19, "text": "a", "bbox": {"l": 280.5539904602546, "t": 364.3070620114962, "r": 285.8142404688841, "b": 374.2696820665041, "coord_origin": "TOPLEFT"}}, {"id": 20, "text": "2", "bbox": {"l": 285.8200104688936, "t": 362.4346620011579, "r": 289.78809047540335, "b": 369.40844203966316, "coord_origin": "TOPLEFT"}}, {"id": 21, "text": "+ 8 = 12", "bbox": {"l": 292.50400047985886, "t": 364.3070620114962, "r": 330.6965905425146, "b": 374.2696820665041, "coord_origin": "TOPLEFT"}}, {"id": 22, "text": "Lorem ipsum dolor sit amet, consetetur sadipscing elitr, sed diam nonumy", "bbox": {"l": 148.71201024396512, "t": 382.24005211051195, "r": 477.4698807832991, "b": 392.2026621655199, "coord_origin": "TOPLEFT"}}, {"id": 23, "text": "eirmod tempor invidunt ut labore et dolore magna aliquyam erat, sed diam", "bbox": {"l": 133.76801021944917, "t": 394.19506217652076, "r": 477.47580078330884, "b": 404.15768223152867, "coord_origin": "TOPLEFT"}}, {"id": 24, "text": "voluptua. At vero eos et accusam et justo duo dolores et ea rebum. Stet clita", "bbox": {"l": 133.76801021944917, "t": 406.15005224252945, "r": 477.47580078330884, "b": 416.1126722975373, "coord_origin": "TOPLEFT"}}, {"id": 25, "text": "kasd gubergren, no sea takimata sanctus est Lorem ipsum dolor sit amet. Lorem", "bbox": {"l": 133.76801021944917, "t": 418.10507230853824, "r": 477.47284078330404, "b": 428.06768236354606, "coord_origin": "TOPLEFT"}}, {"id": 26, "text": "ipsum dolor sit amet, consetetur sadipscing elitr, sed diam nonumy eirmod", "bbox": {"l": 133.76801021944917, "t": 430.0600523745468, "r": 477.47589078330907, "b": 440.0226724295547, "coord_origin": "TOPLEFT"}}, {"id": 27, "text": "tempor invidunt ut labore et dolore magna aliquyam erat, sed diam voluptua. At", "bbox": {"l": 133.76801021944917, "t": 442.01605244056105, "r": 477.4717707833023, "b": 451.97866249556887, "coord_origin": "TOPLEFT"}}, {"id": 28, "text": "vero eos et accusam et justo duo dolores et ea rebum. Stet clita kasd gubergren,", "bbox": {"l": 133.76801021944917, "t": 453.9710625065698, "r": 477.47388078330573, "b": 463.93368256157777, "coord_origin": "TOPLEFT"}}, {"id": 29, "text": "no sea takimata sanctus est Lorem ipsum dolor sit amet. Lorem ipsum dolor", "bbox": {"l": 133.76801021944917, "t": 465.9260525725785, "r": 477.4708307833007, "b": 475.88867262758635, "coord_origin": "TOPLEFT"}}, {"id": 30, "text": "sit amet, consetetur sadipscing elitr, sed diam nonumy eirmod tempor invidunt", "bbox": {"l": 133.76801021944917, "t": 477.8810726385873, "r": 477.4709207833008, "b": 487.84368269359516, "coord_origin": "TOPLEFT"}}, {"id": 31, "text": "ut labore et dolore magna aliquyam erat, sed diam voluptua. At vero eos et", "bbox": {"l": 133.76801021944917, "t": 489.8360527045959, "r": 477.4658807832926, "b": 499.7986727596038, "coord_origin": "TOPLEFT"}}, {"id": 32, "text": "accusam et justo duo dolores et ea rebum. Stet clita kasd gubergren, no sea", "bbox": {"l": 133.76801021944917, "t": 501.7910427706045, "r": 477.47586078330903, "b": 511.7536628256125, "coord_origin": "TOPLEFT"}}, {"id": 33, "text": "takimata sanctus est Lorem ipsum dolor sit amet.", "bbox": {"l": 133.76801021944917, "t": 513.7470728366188, "r": 351.4767205766048, "b": 523.7096828916267, "coord_origin": "TOPLEFT"}}, {"id": 34, "text": "Duis autem vel eum iriure dolor in hendrerit in vulputate velit esse molestie", "bbox": {"l": 148.71201024396512, "t": 525.7020529026275, "r": 477.4710707833011, "b": 535.6646729576354, "coord_origin": "TOPLEFT"}}, {"id": 35, "text": "consequat, vel illum dolore eu feugiat nulla facilisis at vero eros et accumsan et", "bbox": {"l": 133.76801021944917, "t": 537.6570729686364, "r": 477.47293078330415, "b": 547.6196930236442, "coord_origin": "TOPLEFT"}}, {"id": 36, "text": "iusto odio dignissim qui blandit praesent luptatum zzril delenit augue duis dolore", "bbox": {"l": 133.76801021944917, "t": 549.612063034645, "r": 477.47482078330734, "b": 559.5746730896528, "coord_origin": "TOPLEFT"}}, {"id": 37, "text": "te feugait nulla facilisi. Lorem ipsum dolor sit amet, consectetuer adipiscing elit,", "bbox": {"l": 133.76801021944917, "t": 561.5670431006536, "r": 477.472810783304, "b": 571.5296631556615, "coord_origin": "TOPLEFT"}}, {"id": 38, "text": "sed diam nonummy nibh euismod tincidunt ut laoreet dolore magna aliquam erat", "bbox": {"l": 133.76801021944917, "t": 573.5220631666624, "r": 477.47479078330724, "b": 583.4846832216704, "coord_origin": "TOPLEFT"}}, {"id": 39, "text": "volutpat.", "bbox": {"l": 133.76801021944917, "t": 585.4780532326765, "r": 173.0565202839028, "b": 595.4406532876844, "coord_origin": "TOPLEFT"}}, {"id": 40, "text": "Duis autem vel eum iriure dolor in hendrerit in vulputate velit esse molestie", "bbox": {"l": 148.71201024396512, "t": 597.4330632986853, "r": 477.4710707833011, "b": 607.3956633536932, "coord_origin": "TOPLEFT"}}, {"id": 41, "text": "consequat, vel illum dolore eu feugiat nulla facilisis at vero eros et accumsan et", "bbox": {"l": 133.76801021944917, "t": 609.3880633646941, "r": 477.47293078330415, "b": 619.3506634197018, "coord_origin": "TOPLEFT"}}, {"id": 42, "text": "iusto odio dignissim qui blandit praesent luptatum zzril delenit augue duis dolore", "bbox": {"l": 133.76801021944917, "t": 621.3430634307027, "r": 477.47482078330734, "b": 631.3056634857105, "coord_origin": "TOPLEFT"}}, {"id": 43, "text": "te feugait nulla facilisi. Lorem ipsum dolor sit amet, consectetuer adipiscing elit,", "bbox": {"l": 133.76801021944917, "t": 633.2980634967114, "r": 477.472810783304, "b": 643.2606635517193, "coord_origin": "TOPLEFT"}}, {"id": 44, "text": "sed diam nonummy nibh euismod tincidunt ut laoreet dolore magna aliquam erat", "bbox": {"l": 133.76801021944917, "t": 645.2540535627256, "r": 477.47479078330724, "b": 655.2166536177334, "coord_origin": "TOPLEFT"}}, {"id": 45, "text": "volutpat.", "bbox": {"l": 133.76801021944917, "t": 657.2090636287344, "r": 173.0565202839028, "b": 667.1716636837423, "coord_origin": "TOPLEFT"}}, {"id": 46, "text": "1", "bbox": {"l": 303.13300049729594, "t": 695.1640638383003, "r": 308.1142905054678, "b": 705.1266638933081, "coord_origin": "TOPLEFT"}}], "predictions": {"layout": {"clusters": [{"id": 5, "label": "section_header", "bbox": {"l": 133.76801021944917, "t": 124.00537068468714, "r": 191.51430031418315, "b": 138.36590076397772, "coord_origin": "TOPLEFT"}, "confidence": 0.95060133934021, "cells": [{"id": 0, "text": "Formula", "bbox": {"l": 133.76801021944917, "t": 124.00537068468714, "r": 191.51430031418315, "b": 138.36590076397772, "coord_origin": "TOPLEFT"}}], "children": []}, {"id": 1, "label": "text", "bbox": {"l": 133.76801021944917, "t": 149.114070823323, "r": 477.4806507833169, "b": 290.58367160443765, "coord_origin": "TOPLEFT"}, "confidence": 0.9878177046775818, "cells": [{"id": 1, "text": "Lorem ipsum dolor sit amet, consetetur sadipscing elitr, sed diam nonumy eir-", "bbox": {"l": 133.76801021944917, "t": 149.114070823323, "r": 477.47687078331063, "b": 159.0766608783307, "coord_origin": "TOPLEFT"}}, {"id": 2, "text": "mod tempor invidunt ut labore et dolore magna aliquyam erat, sed diam volup-", "bbox": {"l": 133.76801021944917, "t": 161.06909088933185, "r": 477.4778107833122, "b": 171.03167094433945, "coord_origin": "TOPLEFT"}}, {"id": 3, "text": "tua. At vero eos et accusam et justo duo dolores et ea rebum. Stet clita kasd", "bbox": {"l": 133.76801021944917, "t": 173.0241009553406, "r": 477.46985078329914, "b": 182.9866910103483, "coord_origin": "TOPLEFT"}}, {"id": 4, "text": "gubergren, no sea takimata sanctus est Lorem ipsum dolor sit amet. Lorem ip-", "bbox": {"l": 133.76801021944917, "t": 184.97906102134914, "r": 477.4718607833024, "b": 194.94165107635683, "coord_origin": "TOPLEFT"}}, {"id": 5, "text": "sum dolor sit amet, consetetur sadipscing elitr, sed diam nonumy eirmod tempor", "bbox": {"l": 133.76801021944917, "t": 196.93511108736357, "r": 477.4699407832993, "b": 206.89770114237137, "coord_origin": "TOPLEFT"}}, {"id": 6, "text": "invidunt ut labore et dolore magna aliquyam erat, sed diam voluptua. At vero", "bbox": {"l": 133.76801021944917, "t": 208.890071153372, "r": 477.4806507833169, "b": 218.8526612083799, "coord_origin": "TOPLEFT"}}, {"id": 7, "text": "eos et accusam et justo duo dolores et ea rebum.", "bbox": {"l": 133.76801021944917, "t": 220.84509121938083, "r": 355.20087058271434, "b": 230.80767127438855, "coord_origin": "TOPLEFT"}}, {"id": 8, "text": "Stet clita kasd gubergren,", "bbox": {"l": 362.6010705948545, "t": 220.84509121938083, "r": 477.4749507833075, "b": 230.80767127438855, "coord_origin": "TOPLEFT"}}, {"id": 9, "text": "no sea takimata sanctus est Lorem ipsum dolor sit amet. Lorem ipsum dolor", "bbox": {"l": 133.76801021944917, "t": 232.8001112853898, "r": 477.4708307833007, "b": 242.76269134039728, "coord_origin": "TOPLEFT"}}, {"id": 10, "text": "sit amet, consetetur sadipscing elitr, sed diam nonumy eirmod tempor invidunt", "bbox": {"l": 133.76801021944917, "t": 244.75506135139813, "r": 477.4709207833008, "b": 254.71765140640582, "coord_origin": "TOPLEFT"}}, {"id": 11, "text": "ut labore et dolore magna aliquyam erat, sed diam voluptua. At vero eos et", "bbox": {"l": 133.76801021944917, "t": 256.71008141740685, "r": 477.4658807832926, "b": 266.6726614724146, "coord_origin": "TOPLEFT"}}, {"id": 12, "text": "accusam et justo duo dolores et ea rebum. Stet clita kasd gubergren, no sea", "bbox": {"l": 133.76801021944917, "t": 268.6660714834211, "r": 477.47586078330903, "b": 278.6286615384289, "coord_origin": "TOPLEFT"}}, {"id": 13, "text": "takimata sanctus est Lorem ipsum dolor sit amet.", "bbox": {"l": 133.76801021944917, "t": 280.62109154942993, "r": 351.4767205766048, "b": 290.58367160443765, "coord_origin": "TOPLEFT"}}], "children": []}, {"id": 3, "label": "text", "bbox": {"l": 133.76801021944917, "t": 292.5761116154388, "r": 477.47482078330734, "b": 350.3596819344868, "coord_origin": "TOPLEFT"}, "confidence": 0.9855858683586121, "cells": [{"id": 14, "text": "Duis autem vel eum iriure dolor in hendrerit in vulputate velit esse molestie", "bbox": {"l": 148.71201024396512, "t": 292.5761116154388, "r": 477.4710707833011, "b": 302.5386916704465, "coord_origin": "TOPLEFT"}}, {"id": 15, "text": "consequat, vel illum dolore eu feugiat nulla facilisis at vero eros et accumsan et", "bbox": {"l": 133.76801021944917, "t": 304.5310616814471, "r": 477.47293078330415, "b": 314.4936517364549, "coord_origin": "TOPLEFT"}}, {"id": 16, "text": "iusto odio dignissim qui blandit praesent luptatum zzril delenit augue duis dolore", "bbox": {"l": 133.76801021944917, "t": 316.48608174745607, "r": 477.47482078330734, "b": 326.4486618024637, "coord_origin": "TOPLEFT"}}, {"id": 17, "text": "te feugait nulla facilisi. Lorem ipsum dolor sit amet, consectetuer adipiscing elit,", "bbox": {"l": 133.76801021944917, "t": 328.4411018134648, "r": 477.472810783304, "b": 338.40368186847246, "coord_origin": "TOPLEFT"}}, {"id": 18, "text": "sed diam nonummy nibh euismod tincidunt.", "bbox": {"l": 133.76801021944917, "t": 340.3970618794788, "r": 326.20752053515014, "b": 350.3596819344868, "coord_origin": "TOPLEFT"}}], "children": []}, {"id": 6, "label": "formula", "bbox": {"l": 280.5539904602546, "t": 362.4346620011579, "r": 330.6965905425146, "b": 374.2696820665041, "coord_origin": "TOPLEFT"}, "confidence": 0.8711639046669006, "cells": [{"id": 19, "text": "a", "bbox": {"l": 280.5539904602546, "t": 364.3070620114962, "r": 285.8142404688841, "b": 374.2696820665041, "coord_origin": "TOPLEFT"}}, {"id": 20, "text": "2", "bbox": {"l": 285.8200104688936, "t": 362.4346620011579, "r": 289.78809047540335, "b": 369.40844203966316, "coord_origin": "TOPLEFT"}}, {"id": 21, "text": "+ 8 = 12", "bbox": {"l": 292.50400047985886, "t": 364.3070620114962, "r": 330.6965905425146, "b": 374.2696820665041, "coord_origin": "TOPLEFT"}}], "children": []}, {"id": 0, "label": "text", "bbox": {"l": 133.76801021944917, "t": 382.24005211051195, "r": 477.47589078330907, "b": 523.7096828916267, "coord_origin": "TOPLEFT"}, "confidence": 0.9880034923553467, "cells": [{"id": 22, "text": "Lorem ipsum dolor sit amet, consetetur sadipscing elitr, sed diam nonumy", "bbox": {"l": 148.71201024396512, "t": 382.24005211051195, "r": 477.4698807832991, "b": 392.2026621655199, "coord_origin": "TOPLEFT"}}, {"id": 23, "text": "eirmod tempor invidunt ut labore et dolore magna aliquyam erat, sed diam", "bbox": {"l": 133.76801021944917, "t": 394.19506217652076, "r": 477.47580078330884, "b": 404.15768223152867, "coord_origin": "TOPLEFT"}}, {"id": 24, "text": "voluptua. At vero eos et accusam et justo duo dolores et ea rebum. Stet clita", "bbox": {"l": 133.76801021944917, "t": 406.15005224252945, "r": 477.47580078330884, "b": 416.1126722975373, "coord_origin": "TOPLEFT"}}, {"id": 25, "text": "kasd gubergren, no sea takimata sanctus est Lorem ipsum dolor sit amet. Lorem", "bbox": {"l": 133.76801021944917, "t": 418.10507230853824, "r": 477.47284078330404, "b": 428.06768236354606, "coord_origin": "TOPLEFT"}}, {"id": 26, "text": "ipsum dolor sit amet, consetetur sadipscing elitr, sed diam nonumy eirmod", "bbox": {"l": 133.76801021944917, "t": 430.0600523745468, "r": 477.47589078330907, "b": 440.0226724295547, "coord_origin": "TOPLEFT"}}, {"id": 27, "text": "tempor invidunt ut labore et dolore magna aliquyam erat, sed diam voluptua. At", "bbox": {"l": 133.76801021944917, "t": 442.01605244056105, "r": 477.4717707833023, "b": 451.97866249556887, "coord_origin": "TOPLEFT"}}, {"id": 28, "text": "vero eos et accusam et justo duo dolores et ea rebum. Stet clita kasd gubergren,", "bbox": {"l": 133.76801021944917, "t": 453.9710625065698, "r": 477.47388078330573, "b": 463.93368256157777, "coord_origin": "TOPLEFT"}}, {"id": 29, "text": "no sea takimata sanctus est Lorem ipsum dolor sit amet. Lorem ipsum dolor", "bbox": {"l": 133.76801021944917, "t": 465.9260525725785, "r": 477.4708307833007, "b": 475.88867262758635, "coord_origin": "TOPLEFT"}}, {"id": 30, "text": "sit amet, consetetur sadipscing elitr, sed diam nonumy eirmod tempor invidunt", "bbox": {"l": 133.76801021944917, "t": 477.8810726385873, "r": 477.4709207833008, "b": 487.84368269359516, "coord_origin": "TOPLEFT"}}, {"id": 31, "text": "ut labore et dolore magna aliquyam erat, sed diam voluptua. At vero eos et", "bbox": {"l": 133.76801021944917, "t": 489.8360527045959, "r": 477.4658807832926, "b": 499.7986727596038, "coord_origin": "TOPLEFT"}}, {"id": 32, "text": "accusam et justo duo dolores et ea rebum. Stet clita kasd gubergren, no sea", "bbox": {"l": 133.76801021944917, "t": 501.7910427706045, "r": 477.47586078330903, "b": 511.7536628256125, "coord_origin": "TOPLEFT"}}, {"id": 33, "text": "takimata sanctus est Lorem ipsum dolor sit amet.", "bbox": {"l": 133.76801021944917, "t": 513.7470728366188, "r": 351.4767205766048, "b": 523.7096828916267, "coord_origin": "TOPLEFT"}}], "children": []}, {"id": 4, "label": "text", "bbox": {"l": 133.76801021944917, "t": 525.7020529026275, "r": 477.47482078330734, "b": 595.4406532876844, "coord_origin": "TOPLEFT"}, "confidence": 0.9844803214073181, "cells": [{"id": 34, "text": "Duis autem vel eum iriure dolor in hendrerit in vulputate velit esse molestie", "bbox": {"l": 148.71201024396512, "t": 525.7020529026275, "r": 477.4710707833011, "b": 535.6646729576354, "coord_origin": "TOPLEFT"}}, {"id": 35, "text": "consequat, vel illum dolore eu feugiat nulla facilisis at vero eros et accumsan et", "bbox": {"l": 133.76801021944917, "t": 537.6570729686364, "r": 477.47293078330415, "b": 547.6196930236442, "coord_origin": "TOPLEFT"}}, {"id": 36, "text": "iusto odio dignissim qui blandit praesent luptatum zzril delenit augue duis dolore", "bbox": {"l": 133.76801021944917, "t": 549.612063034645, "r": 477.47482078330734, "b": 559.5746730896528, "coord_origin": "TOPLEFT"}}, {"id": 37, "text": "te feugait nulla facilisi. Lorem ipsum dolor sit amet, consectetuer adipiscing elit,", "bbox": {"l": 133.76801021944917, "t": 561.5670431006536, "r": 477.472810783304, "b": 571.5296631556615, "coord_origin": "TOPLEFT"}}, {"id": 38, "text": "sed diam nonummy nibh euismod tincidunt ut laoreet dolore magna aliquam erat", "bbox": {"l": 133.76801021944917, "t": 573.5220631666624, "r": 477.47479078330724, "b": 583.4846832216704, "coord_origin": "TOPLEFT"}}, {"id": 39, "text": "volutpat.", "bbox": {"l": 133.76801021944917, "t": 585.4780532326765, "r": 173.0565202839028, "b": 595.4406532876844, "coord_origin": "TOPLEFT"}}], "children": []}, {"id": 2, "label": "text", "bbox": {"l": 133.76801021944917, "t": 597.4330632986853, "r": 477.47482078330734, "b": 667.1716636837423, "coord_origin": "TOPLEFT"}, "confidence": 0.9872003793716431, "cells": [{"id": 40, "text": "Duis autem vel eum iriure dolor in hendrerit in vulputate velit esse molestie", "bbox": {"l": 148.71201024396512, "t": 597.4330632986853, "r": 477.4710707833011, "b": 607.3956633536932, "coord_origin": "TOPLEFT"}}, {"id": 41, "text": "consequat, vel illum dolore eu feugiat nulla facilisis at vero eros et accumsan et", "bbox": {"l": 133.76801021944917, "t": 609.3880633646941, "r": 477.47293078330415, "b": 619.3506634197018, "coord_origin": "TOPLEFT"}}, {"id": 42, "text": "iusto odio dignissim qui blandit praesent luptatum zzril delenit augue duis dolore", "bbox": {"l": 133.76801021944917, "t": 621.3430634307027, "r": 477.47482078330734, "b": 631.3056634857105, "coord_origin": "TOPLEFT"}}, {"id": 43, "text": "te feugait nulla facilisi. Lorem ipsum dolor sit amet, consectetuer adipiscing elit,", "bbox": {"l": 133.76801021944917, "t": 633.2980634967114, "r": 477.472810783304, "b": 643.2606635517193, "coord_origin": "TOPLEFT"}}, {"id": 44, "text": "sed diam nonummy nibh euismod tincidunt ut laoreet dolore magna aliquam erat", "bbox": {"l": 133.76801021944917, "t": 645.2540535627256, "r": 477.47479078330724, "b": 655.2166536177334, "coord_origin": "TOPLEFT"}}, {"id": 45, "text": "volutpat.", "bbox": {"l": 133.76801021944917, "t": 657.2090636287344, "r": 173.0565202839028, "b": 667.1716636837423, "coord_origin": "TOPLEFT"}}], "children": []}, {"id": 7, "label": "page_footer", "bbox": {"l": 303.13300049729594, "t": 695.1640638383003, "r": 308.1142905054678, "b": 705.1266638933081, "coord_origin": "TOPLEFT"}, "confidence": 0.850279688835144, "cells": [{"id": 46, "text": "1", "bbox": {"l": 303.13300049729594, "t": 695.1640638383003, "r": 308.1142905054678, "b": 705.1266638933081, "coord_origin": "TOPLEFT"}}], "children": []}]}, "tablestructure": {"table_map": {}}, "figures_classification": null, "equations_prediction": null}, "assembled": {"elements": [{"label": "section_header", "id": 5, "page_no": 1, "cluster": {"id": 5, "label": "section_header", "bbox": {"l": 133.76801021944917, "t": 124.00537068468714, "r": 191.51430031418315, "b": 138.36590076397772, "coord_origin": "TOPLEFT"}, "confidence": 0.95060133934021, "cells": [{"id": 0, "text": "Formula", "bbox": {"l": 133.76801021944917, "t": 124.00537068468714, "r": 191.51430031418315, "b": 138.36590076397772, "coord_origin": "TOPLEFT"}}], "children": []}, "text": "Formula"}, {"label": "text", "id": 1, "page_no": 1, "cluster": {"id": 1, "label": "text", "bbox": {"l": 133.76801021944917, "t": 149.114070823323, "r": 477.4806507833169, "b": 290.58367160443765, "coord_origin": "TOPLEFT"}, "confidence": 0.9878177046775818, "cells": [{"id": 1, "text": "Lorem ipsum dolor sit amet, consetetur sadipscing elitr, sed diam nonumy eir-", "bbox": {"l": 133.76801021944917, "t": 149.114070823323, "r": 477.47687078331063, "b": 159.0766608783307, "coord_origin": "TOPLEFT"}}, {"id": 2, "text": "mod tempor invidunt ut labore et dolore magna aliquyam erat, sed diam volup-", "bbox": {"l": 133.76801021944917, "t": 161.06909088933185, "r": 477.4778107833122, "b": 171.03167094433945, "coord_origin": "TOPLEFT"}}, {"id": 3, "text": "tua. At vero eos et accusam et justo duo dolores et ea rebum. Stet clita kasd", "bbox": {"l": 133.76801021944917, "t": 173.0241009553406, "r": 477.46985078329914, "b": 182.9866910103483, "coord_origin": "TOPLEFT"}}, {"id": 4, "text": "gubergren, no sea takimata sanctus est Lorem ipsum dolor sit amet. Lorem ip-", "bbox": {"l": 133.76801021944917, "t": 184.97906102134914, "r": 477.4718607833024, "b": 194.94165107635683, "coord_origin": "TOPLEFT"}}, {"id": 5, "text": "sum dolor sit amet, consetetur sadipscing elitr, sed diam nonumy eirmod tempor", "bbox": {"l": 133.76801021944917, "t": 196.93511108736357, "r": 477.4699407832993, "b": 206.89770114237137, "coord_origin": "TOPLEFT"}}, {"id": 6, "text": "invidunt ut labore et dolore magna aliquyam erat, sed diam voluptua. At vero", "bbox": {"l": 133.76801021944917, "t": 208.890071153372, "r": 477.4806507833169, "b": 218.8526612083799, "coord_origin": "TOPLEFT"}}, {"id": 7, "text": "eos et accusam et justo duo dolores et ea rebum.", "bbox": {"l": 133.76801021944917, "t": 220.84509121938083, "r": 355.20087058271434, "b": 230.80767127438855, "coord_origin": "TOPLEFT"}}, {"id": 8, "text": "Stet clita kasd gubergren,", "bbox": {"l": 362.6010705948545, "t": 220.84509121938083, "r": 477.4749507833075, "b": 230.80767127438855, "coord_origin": "TOPLEFT"}}, {"id": 9, "text": "no sea takimata sanctus est Lorem ipsum dolor sit amet. Lorem ipsum dolor", "bbox": {"l": 133.76801021944917, "t": 232.8001112853898, "r": 477.4708307833007, "b": 242.76269134039728, "coord_origin": "TOPLEFT"}}, {"id": 10, "text": "sit amet, consetetur sadipscing elitr, sed diam nonumy eirmod tempor invidunt", "bbox": {"l": 133.76801021944917, "t": 244.75506135139813, "r": 477.4709207833008, "b": 254.71765140640582, "coord_origin": "TOPLEFT"}}, {"id": 11, "text": "ut labore et dolore magna aliquyam erat, sed diam voluptua. At vero eos et", "bbox": {"l": 133.76801021944917, "t": 256.71008141740685, "r": 477.4658807832926, "b": 266.6726614724146, "coord_origin": "TOPLEFT"}}, {"id": 12, "text": "accusam et justo duo dolores et ea rebum. Stet clita kasd gubergren, no sea", "bbox": {"l": 133.76801021944917, "t": 268.6660714834211, "r": 477.47586078330903, "b": 278.6286615384289, "coord_origin": "TOPLEFT"}}, {"id": 13, "text": "takimata sanctus est Lorem ipsum dolor sit amet.", "bbox": {"l": 133.76801021944917, "t": 280.62109154942993, "r": 351.4767205766048, "b": 290.58367160443765, "coord_origin": "TOPLEFT"}}], "children": []}, "text": "Lorem ipsum dolor sit amet, consetetur sadipscing elitr, sed diam nonumy eirmod tempor invidunt ut labore et dolore magna aliquyam erat, sed diam voluptua. At vero eos et accusam et justo duo dolores et ea rebum. Stet clita kasd gubergren, no sea takimata sanctus est Lorem ipsum dolor sit amet. Lorem ipsum dolor sit amet, consetetur sadipscing elitr, sed diam nonumy eirmod tempor invidunt ut labore et dolore magna aliquyam erat, sed diam voluptua. At vero eos et accusam et justo duo dolores et ea rebum. Stet clita kasd gubergren, no sea takimata sanctus est Lorem ipsum dolor sit amet. Lorem ipsum dolor sit amet, consetetur sadipscing elitr, sed diam nonumy eirmod tempor invidunt ut labore et dolore magna aliquyam erat, sed diam voluptua. At vero eos et accusam et justo duo dolores et ea rebum. Stet clita kasd gubergren, no sea takimata sanctus est Lorem ipsum dolor sit amet."}, {"label": "text", "id": 3, "page_no": 1, "cluster": {"id": 3, "label": "text", "bbox": {"l": 133.76801021944917, "t": 292.5761116154388, "r": 477.47482078330734, "b": 350.3596819344868, "coord_origin": "TOPLEFT"}, "confidence": 0.9855858683586121, "cells": [{"id": 14, "text": "Duis autem vel eum iriure dolor in hendrerit in vulputate velit esse molestie", "bbox": {"l": 148.71201024396512, "t": 292.5761116154388, "r": 477.4710707833011, "b": 302.5386916704465, "coord_origin": "TOPLEFT"}}, {"id": 15, "text": "consequat, vel illum dolore eu feugiat nulla facilisis at vero eros et accumsan et", "bbox": {"l": 133.76801021944917, "t": 304.5310616814471, "r": 477.47293078330415, "b": 314.4936517364549, "coord_origin": "TOPLEFT"}}, {"id": 16, "text": "iusto odio dignissim qui blandit praesent luptatum zzril delenit augue duis dolore", "bbox": {"l": 133.76801021944917, "t": 316.48608174745607, "r": 477.47482078330734, "b": 326.4486618024637, "coord_origin": "TOPLEFT"}}, {"id": 17, "text": "te feugait nulla facilisi. Lorem ipsum dolor sit amet, consectetuer adipiscing elit,", "bbox": {"l": 133.76801021944917, "t": 328.4411018134648, "r": 477.472810783304, "b": 338.40368186847246, "coord_origin": "TOPLEFT"}}, {"id": 18, "text": "sed diam nonummy nibh euismod tincidunt.", "bbox": {"l": 133.76801021944917, "t": 340.3970618794788, "r": 326.20752053515014, "b": 350.3596819344868, "coord_origin": "TOPLEFT"}}], "children": []}, "text": "Duis autem vel eum iriure dolor in hendrerit in vulputate velit esse molestie consequat, vel illum dolore eu feugiat nulla facilisis at vero eros et accumsan et iusto odio dignissim qui blandit praesent luptatum zzril delenit augue duis dolore te feugait nulla facilisi. Lorem ipsum dolor sit amet, consectetuer adipiscing elit, sed diam nonummy nibh euismod tincidunt."}, {"label": "formula", "id": 6, "page_no": 1, "cluster": {"id": 6, "label": "formula", "bbox": {"l": 280.5539904602546, "t": 362.4346620011579, "r": 330.6965905425146, "b": 374.2696820665041, "coord_origin": "TOPLEFT"}, "confidence": 0.8711639046669006, "cells": [{"id": 19, "text": "a", "bbox": {"l": 280.5539904602546, "t": 364.3070620114962, "r": 285.8142404688841, "b": 374.2696820665041, "coord_origin": "TOPLEFT"}}, {"id": 20, "text": "2", "bbox": {"l": 285.8200104688936, "t": 362.4346620011579, "r": 289.78809047540335, "b": 369.40844203966316, "coord_origin": "TOPLEFT"}}, {"id": 21, "text": "+ 8 = 12", "bbox": {"l": 292.50400047985886, "t": 364.3070620114962, "r": 330.6965905425146, "b": 374.2696820665041, "coord_origin": "TOPLEFT"}}], "children": []}, "text": "a 2 + 8 = 12"}, {"label": "text", "id": 0, "page_no": 1, "cluster": {"id": 0, "label": "text", "bbox": {"l": 133.76801021944917, "t": 382.24005211051195, "r": 477.47589078330907, "b": 523.7096828916267, "coord_origin": "TOPLEFT"}, "confidence": 0.9880034923553467, "cells": [{"id": 22, "text": "Lorem ipsum dolor sit amet, consetetur sadipscing elitr, sed diam nonumy", "bbox": {"l": 148.71201024396512, "t": 382.24005211051195, "r": 477.4698807832991, "b": 392.2026621655199, "coord_origin": "TOPLEFT"}}, {"id": 23, "text": "eirmod tempor invidunt ut labore et dolore magna aliquyam erat, sed diam", "bbox": {"l": 133.76801021944917, "t": 394.19506217652076, "r": 477.47580078330884, "b": 404.15768223152867, "coord_origin": "TOPLEFT"}}, {"id": 24, "text": "voluptua. At vero eos et accusam et justo duo dolores et ea rebum. Stet clita", "bbox": {"l": 133.76801021944917, "t": 406.15005224252945, "r": 477.47580078330884, "b": 416.1126722975373, "coord_origin": "TOPLEFT"}}, {"id": 25, "text": "kasd gubergren, no sea takimata sanctus est Lorem ipsum dolor sit amet. Lorem", "bbox": {"l": 133.76801021944917, "t": 418.10507230853824, "r": 477.47284078330404, "b": 428.06768236354606, "coord_origin": "TOPLEFT"}}, {"id": 26, "text": "ipsum dolor sit amet, consetetur sadipscing elitr, sed diam nonumy eirmod", "bbox": {"l": 133.76801021944917, "t": 430.0600523745468, "r": 477.47589078330907, "b": 440.0226724295547, "coord_origin": "TOPLEFT"}}, {"id": 27, "text": "tempor invidunt ut labore et dolore magna aliquyam erat, sed diam voluptua. At", "bbox": {"l": 133.76801021944917, "t": 442.01605244056105, "r": 477.4717707833023, "b": 451.97866249556887, "coord_origin": "TOPLEFT"}}, {"id": 28, "text": "vero eos et accusam et justo duo dolores et ea rebum. Stet clita kasd gubergren,", "bbox": {"l": 133.76801021944917, "t": 453.9710625065698, "r": 477.47388078330573, "b": 463.93368256157777, "coord_origin": "TOPLEFT"}}, {"id": 29, "text": "no sea takimata sanctus est Lorem ipsum dolor sit amet. Lorem ipsum dolor", "bbox": {"l": 133.76801021944917, "t": 465.9260525725785, "r": 477.4708307833007, "b": 475.88867262758635, "coord_origin": "TOPLEFT"}}, {"id": 30, "text": "sit amet, consetetur sadipscing elitr, sed diam nonumy eirmod tempor invidunt", "bbox": {"l": 133.76801021944917, "t": 477.8810726385873, "r": 477.4709207833008, "b": 487.84368269359516, "coord_origin": "TOPLEFT"}}, {"id": 31, "text": "ut labore et dolore magna aliquyam erat, sed diam voluptua. At vero eos et", "bbox": {"l": 133.76801021944917, "t": 489.8360527045959, "r": 477.4658807832926, "b": 499.7986727596038, "coord_origin": "TOPLEFT"}}, {"id": 32, "text": "accusam et justo duo dolores et ea rebum. Stet clita kasd gubergren, no sea", "bbox": {"l": 133.76801021944917, "t": 501.7910427706045, "r": 477.47586078330903, "b": 511.7536628256125, "coord_origin": "TOPLEFT"}}, {"id": 33, "text": "takimata sanctus est Lorem ipsum dolor sit amet.", "bbox": {"l": 133.76801021944917, "t": 513.7470728366188, "r": 351.4767205766048, "b": 523.7096828916267, "coord_origin": "TOPLEFT"}}], "children": []}, "text": "Lorem ipsum dolor sit amet, consetetur sadipscing elitr, sed diam nonumy eirmod tempor invidunt ut labore et dolore magna aliquyam erat, sed diam voluptua. At vero eos et accusam et justo duo dolores et ea rebum. Stet clita kasd gubergren, no sea takimata sanctus est Lorem ipsum dolor sit amet. Lorem ipsum dolor sit amet, consetetur sadipscing elitr, sed diam nonumy eirmod tempor invidunt ut labore et dolore magna aliquyam erat, sed diam voluptua. At vero eos et accusam et justo duo dolores et ea rebum. Stet clita kasd gubergren, no sea takimata sanctus est Lorem ipsum dolor sit amet. Lorem ipsum dolor sit amet, consetetur sadipscing elitr, sed diam nonumy eirmod tempor invidunt ut labore et dolore magna aliquyam erat, sed diam voluptua. At vero eos et accusam et justo duo dolores et ea rebum. Stet clita kasd gubergren, no sea takimata sanctus est Lorem ipsum dolor sit amet."}, {"label": "text", "id": 4, "page_no": 1, "cluster": {"id": 4, "label": "text", "bbox": {"l": 133.76801021944917, "t": 525.7020529026275, "r": 477.47482078330734, "b": 595.4406532876844, "coord_origin": "TOPLEFT"}, "confidence": 0.9844803214073181, "cells": [{"id": 34, "text": "Duis autem vel eum iriure dolor in hendrerit in vulputate velit esse molestie", "bbox": {"l": 148.71201024396512, "t": 525.7020529026275, "r": 477.4710707833011, "b": 535.6646729576354, "coord_origin": "TOPLEFT"}}, {"id": 35, "text": "consequat, vel illum dolore eu feugiat nulla facilisis at vero eros et accumsan et", "bbox": {"l": 133.76801021944917, "t": 537.6570729686364, "r": 477.47293078330415, "b": 547.6196930236442, "coord_origin": "TOPLEFT"}}, {"id": 36, "text": "iusto odio dignissim qui blandit praesent luptatum zzril delenit augue duis dolore", "bbox": {"l": 133.76801021944917, "t": 549.612063034645, "r": 477.47482078330734, "b": 559.5746730896528, "coord_origin": "TOPLEFT"}}, {"id": 37, "text": "te feugait nulla facilisi. Lorem ipsum dolor sit amet, consectetuer adipiscing elit,", "bbox": {"l": 133.76801021944917, "t": 561.5670431006536, "r": 477.472810783304, "b": 571.5296631556615, "coord_origin": "TOPLEFT"}}, {"id": 38, "text": "sed diam nonummy nibh euismod tincidunt ut laoreet dolore magna aliquam erat", "bbox": {"l": 133.76801021944917, "t": 573.5220631666624, "r": 477.47479078330724, "b": 583.4846832216704, "coord_origin": "TOPLEFT"}}, {"id": 39, "text": "volutpat.", "bbox": {"l": 133.76801021944917, "t": 585.4780532326765, "r": 173.0565202839028, "b": 595.4406532876844, "coord_origin": "TOPLEFT"}}], "children": []}, "text": "Duis autem vel eum iriure dolor in hendrerit in vulputate velit esse molestie consequat, vel illum dolore eu feugiat nulla facilisis at vero eros et accumsan et iusto odio dignissim qui blandit praesent luptatum zzril delenit augue duis dolore te feugait nulla facilisi. Lorem ipsum dolor sit amet, consectetuer adipiscing elit, sed diam nonummy nibh euismod tincidunt ut laoreet dolore magna aliquam erat volutpat."}, {"label": "text", "id": 2, "page_no": 1, "cluster": {"id": 2, "label": "text", "bbox": {"l": 133.76801021944917, "t": 597.4330632986853, "r": 477.47482078330734, "b": 667.1716636837423, "coord_origin": "TOPLEFT"}, "confidence": 0.9872003793716431, "cells": [{"id": 40, "text": "Duis autem vel eum iriure dolor in hendrerit in vulputate velit esse molestie", "bbox": {"l": 148.71201024396512, "t": 597.4330632986853, "r": 477.4710707833011, "b": 607.3956633536932, "coord_origin": "TOPLEFT"}}, {"id": 41, "text": "consequat, vel illum dolore eu feugiat nulla facilisis at vero eros et accumsan et", "bbox": {"l": 133.76801021944917, "t": 609.3880633646941, "r": 477.47293078330415, "b": 619.3506634197018, "coord_origin": "TOPLEFT"}}, {"id": 42, "text": "iusto odio dignissim qui blandit praesent luptatum zzril delenit augue duis dolore", "bbox": {"l": 133.76801021944917, "t": 621.3430634307027, "r": 477.47482078330734, "b": 631.3056634857105, "coord_origin": "TOPLEFT"}}, {"id": 43, "text": "te feugait nulla facilisi. Lorem ipsum dolor sit amet, consectetuer adipiscing elit,", "bbox": {"l": 133.76801021944917, "t": 633.2980634967114, "r": 477.472810783304, "b": 643.2606635517193, "coord_origin": "TOPLEFT"}}, {"id": 44, "text": "sed diam nonummy nibh euismod tincidunt ut laoreet dolore magna aliquam erat", "bbox": {"l": 133.76801021944917, "t": 645.2540535627256, "r": 477.47479078330724, "b": 655.2166536177334, "coord_origin": "TOPLEFT"}}, {"id": 45, "text": "volutpat.", "bbox": {"l": 133.76801021944917, "t": 657.2090636287344, "r": 173.0565202839028, "b": 667.1716636837423, "coord_origin": "TOPLEFT"}}], "children": []}, "text": "Duis autem vel eum iriure dolor in hendrerit in vulputate velit esse molestie consequat, vel illum dolore eu feugiat nulla facilisis at vero eros et accumsan et iusto odio dignissim qui blandit praesent luptatum zzril delenit augue duis dolore te feugait nulla facilisi. Lorem ipsum dolor sit amet, consectetuer adipiscing elit, sed diam nonummy nibh euismod tincidunt ut laoreet dolore magna aliquam erat volutpat."}, {"label": "page_footer", "id": 7, "page_no": 1, "cluster": {"id": 7, "label": "page_footer", "bbox": {"l": 303.13300049729594, "t": 695.1640638383003, "r": 308.1142905054678, "b": 705.1266638933081, "coord_origin": "TOPLEFT"}, "confidence": 0.850279688835144, "cells": [{"id": 46, "text": "1", "bbox": {"l": 303.13300049729594, "t": 695.1640638383003, "r": 308.1142905054678, "b": 705.1266638933081, "coord_origin": "TOPLEFT"}}], "children": []}, "text": "1"}], "body": [{"label": "section_header", "id": 5, "page_no": 1, "cluster": {"id": 5, "label": "section_header", "bbox": {"l": 133.76801021944917, "t": 124.00537068468714, "r": 191.51430031418315, "b": 138.36590076397772, "coord_origin": "TOPLEFT"}, "confidence": 0.95060133934021, "cells": [{"id": 0, "text": "Formula", "bbox": {"l": 133.76801021944917, "t": 124.00537068468714, "r": 191.51430031418315, "b": 138.36590076397772, "coord_origin": "TOPLEFT"}}], "children": []}, "text": "Formula"}, {"label": "text", "id": 1, "page_no": 1, "cluster": {"id": 1, "label": "text", "bbox": {"l": 133.76801021944917, "t": 149.114070823323, "r": 477.4806507833169, "b": 290.58367160443765, "coord_origin": "TOPLEFT"}, "confidence": 0.9878177046775818, "cells": [{"id": 1, "text": "Lorem ipsum dolor sit amet, consetetur sadipscing elitr, sed diam nonumy eir-", "bbox": {"l": 133.76801021944917, "t": 149.114070823323, "r": 477.47687078331063, "b": 159.0766608783307, "coord_origin": "TOPLEFT"}}, {"id": 2, "text": "mod tempor invidunt ut labore et dolore magna aliquyam erat, sed diam volup-", "bbox": {"l": 133.76801021944917, "t": 161.06909088933185, "r": 477.4778107833122, "b": 171.03167094433945, "coord_origin": "TOPLEFT"}}, {"id": 3, "text": "tua. At vero eos et accusam et justo duo dolores et ea rebum. Stet clita kasd", "bbox": {"l": 133.76801021944917, "t": 173.0241009553406, "r": 477.46985078329914, "b": 182.9866910103483, "coord_origin": "TOPLEFT"}}, {"id": 4, "text": "gubergren, no sea takimata sanctus est Lorem ipsum dolor sit amet. Lorem ip-", "bbox": {"l": 133.76801021944917, "t": 184.97906102134914, "r": 477.4718607833024, "b": 194.94165107635683, "coord_origin": "TOPLEFT"}}, {"id": 5, "text": "sum dolor sit amet, consetetur sadipscing elitr, sed diam nonumy eirmod tempor", "bbox": {"l": 133.76801021944917, "t": 196.93511108736357, "r": 477.4699407832993, "b": 206.89770114237137, "coord_origin": "TOPLEFT"}}, {"id": 6, "text": "invidunt ut labore et dolore magna aliquyam erat, sed diam voluptua. At vero", "bbox": {"l": 133.76801021944917, "t": 208.890071153372, "r": 477.4806507833169, "b": 218.8526612083799, "coord_origin": "TOPLEFT"}}, {"id": 7, "text": "eos et accusam et justo duo dolores et ea rebum.", "bbox": {"l": 133.76801021944917, "t": 220.84509121938083, "r": 355.20087058271434, "b": 230.80767127438855, "coord_origin": "TOPLEFT"}}, {"id": 8, "text": "Stet clita kasd gubergren,", "bbox": {"l": 362.6010705948545, "t": 220.84509121938083, "r": 477.4749507833075, "b": 230.80767127438855, "coord_origin": "TOPLEFT"}}, {"id": 9, "text": "no sea takimata sanctus est Lorem ipsum dolor sit amet. Lorem ipsum dolor", "bbox": {"l": 133.76801021944917, "t": 232.8001112853898, "r": 477.4708307833007, "b": 242.76269134039728, "coord_origin": "TOPLEFT"}}, {"id": 10, "text": "sit amet, consetetur sadipscing elitr, sed diam nonumy eirmod tempor invidunt", "bbox": {"l": 133.76801021944917, "t": 244.75506135139813, "r": 477.4709207833008, "b": 254.71765140640582, "coord_origin": "TOPLEFT"}}, {"id": 11, "text": "ut labore et dolore magna aliquyam erat, sed diam voluptua. At vero eos et", "bbox": {"l": 133.76801021944917, "t": 256.71008141740685, "r": 477.4658807832926, "b": 266.6726614724146, "coord_origin": "TOPLEFT"}}, {"id": 12, "text": "accusam et justo duo dolores et ea rebum. Stet clita kasd gubergren, no sea", "bbox": {"l": 133.76801021944917, "t": 268.6660714834211, "r": 477.47586078330903, "b": 278.6286615384289, "coord_origin": "TOPLEFT"}}, {"id": 13, "text": "takimata sanctus est Lorem ipsum dolor sit amet.", "bbox": {"l": 133.76801021944917, "t": 280.62109154942993, "r": 351.4767205766048, "b": 290.58367160443765, "coord_origin": "TOPLEFT"}}], "children": []}, "text": "Lorem ipsum dolor sit amet, consetetur sadipscing elitr, sed diam nonumy eirmod tempor invidunt ut labore et dolore magna aliquyam erat, sed diam voluptua. At vero eos et accusam et justo duo dolores et ea rebum. Stet clita kasd gubergren, no sea takimata sanctus est Lorem ipsum dolor sit amet. Lorem ipsum dolor sit amet, consetetur sadipscing elitr, sed diam nonumy eirmod tempor invidunt ut labore et dolore magna aliquyam erat, sed diam voluptua. At vero eos et accusam et justo duo dolores et ea rebum. Stet clita kasd gubergren, no sea takimata sanctus est Lorem ipsum dolor sit amet. Lorem ipsum dolor sit amet, consetetur sadipscing elitr, sed diam nonumy eirmod tempor invidunt ut labore et dolore magna aliquyam erat, sed diam voluptua. At vero eos et accusam et justo duo dolores et ea rebum. Stet clita kasd gubergren, no sea takimata sanctus est Lorem ipsum dolor sit amet."}, {"label": "text", "id": 3, "page_no": 1, "cluster": {"id": 3, "label": "text", "bbox": {"l": 133.76801021944917, "t": 292.5761116154388, "r": 477.47482078330734, "b": 350.3596819344868, "coord_origin": "TOPLEFT"}, "confidence": 0.9855858683586121, "cells": [{"id": 14, "text": "Duis autem vel eum iriure dolor in hendrerit in vulputate velit esse molestie", "bbox": {"l": 148.71201024396512, "t": 292.5761116154388, "r": 477.4710707833011, "b": 302.5386916704465, "coord_origin": "TOPLEFT"}}, {"id": 15, "text": "consequat, vel illum dolore eu feugiat nulla facilisis at vero eros et accumsan et", "bbox": {"l": 133.76801021944917, "t": 304.5310616814471, "r": 477.47293078330415, "b": 314.4936517364549, "coord_origin": "TOPLEFT"}}, {"id": 16, "text": "iusto odio dignissim qui blandit praesent luptatum zzril delenit augue duis dolore", "bbox": {"l": 133.76801021944917, "t": 316.48608174745607, "r": 477.47482078330734, "b": 326.4486618024637, "coord_origin": "TOPLEFT"}}, {"id": 17, "text": "te feugait nulla facilisi. Lorem ipsum dolor sit amet, consectetuer adipiscing elit,", "bbox": {"l": 133.76801021944917, "t": 328.4411018134648, "r": 477.472810783304, "b": 338.40368186847246, "coord_origin": "TOPLEFT"}}, {"id": 18, "text": "sed diam nonummy nibh euismod tincidunt.", "bbox": {"l": 133.76801021944917, "t": 340.3970618794788, "r": 326.20752053515014, "b": 350.3596819344868, "coord_origin": "TOPLEFT"}}], "children": []}, "text": "Duis autem vel eum iriure dolor in hendrerit in vulputate velit esse molestie consequat, vel illum dolore eu feugiat nulla facilisis at vero eros et accumsan et iusto odio dignissim qui blandit praesent luptatum zzril delenit augue duis dolore te feugait nulla facilisi. Lorem ipsum dolor sit amet, consectetuer adipiscing elit, sed diam nonummy nibh euismod tincidunt."}, {"label": "formula", "id": 6, "page_no": 1, "cluster": {"id": 6, "label": "formula", "bbox": {"l": 280.5539904602546, "t": 362.4346620011579, "r": 330.6965905425146, "b": 374.2696820665041, "coord_origin": "TOPLEFT"}, "confidence": 0.8711639046669006, "cells": [{"id": 19, "text": "a", "bbox": {"l": 280.5539904602546, "t": 364.3070620114962, "r": 285.8142404688841, "b": 374.2696820665041, "coord_origin": "TOPLEFT"}}, {"id": 20, "text": "2", "bbox": {"l": 285.8200104688936, "t": 362.4346620011579, "r": 289.78809047540335, "b": 369.40844203966316, "coord_origin": "TOPLEFT"}}, {"id": 21, "text": "+ 8 = 12", "bbox": {"l": 292.50400047985886, "t": 364.3070620114962, "r": 330.6965905425146, "b": 374.2696820665041, "coord_origin": "TOPLEFT"}}], "children": []}, "text": "a 2 + 8 = 12"}, {"label": "text", "id": 0, "page_no": 1, "cluster": {"id": 0, "label": "text", "bbox": {"l": 133.76801021944917, "t": 382.24005211051195, "r": 477.47589078330907, "b": 523.7096828916267, "coord_origin": "TOPLEFT"}, "confidence": 0.9880034923553467, "cells": [{"id": 22, "text": "Lorem ipsum dolor sit amet, consetetur sadipscing elitr, sed diam nonumy", "bbox": {"l": 148.71201024396512, "t": 382.24005211051195, "r": 477.4698807832991, "b": 392.2026621655199, "coord_origin": "TOPLEFT"}}, {"id": 23, "text": "eirmod tempor invidunt ut labore et dolore magna aliquyam erat, sed diam", "bbox": {"l": 133.76801021944917, "t": 394.19506217652076, "r": 477.47580078330884, "b": 404.15768223152867, "coord_origin": "TOPLEFT"}}, {"id": 24, "text": "voluptua. At vero eos et accusam et justo duo dolores et ea rebum. Stet clita", "bbox": {"l": 133.76801021944917, "t": 406.15005224252945, "r": 477.47580078330884, "b": 416.1126722975373, "coord_origin": "TOPLEFT"}}, {"id": 25, "text": "kasd gubergren, no sea takimata sanctus est Lorem ipsum dolor sit amet. Lorem", "bbox": {"l": 133.76801021944917, "t": 418.10507230853824, "r": 477.47284078330404, "b": 428.06768236354606, "coord_origin": "TOPLEFT"}}, {"id": 26, "text": "ipsum dolor sit amet, consetetur sadipscing elitr, sed diam nonumy eirmod", "bbox": {"l": 133.76801021944917, "t": 430.0600523745468, "r": 477.47589078330907, "b": 440.0226724295547, "coord_origin": "TOPLEFT"}}, {"id": 27, "text": "tempor invidunt ut labore et dolore magna aliquyam erat, sed diam voluptua. At", "bbox": {"l": 133.76801021944917, "t": 442.01605244056105, "r": 477.4717707833023, "b": 451.97866249556887, "coord_origin": "TOPLEFT"}}, {"id": 28, "text": "vero eos et accusam et justo duo dolores et ea rebum. Stet clita kasd gubergren,", "bbox": {"l": 133.76801021944917, "t": 453.9710625065698, "r": 477.47388078330573, "b": 463.93368256157777, "coord_origin": "TOPLEFT"}}, {"id": 29, "text": "no sea takimata sanctus est Lorem ipsum dolor sit amet. Lorem ipsum dolor", "bbox": {"l": 133.76801021944917, "t": 465.9260525725785, "r": 477.4708307833007, "b": 475.88867262758635, "coord_origin": "TOPLEFT"}}, {"id": 30, "text": "sit amet, consetetur sadipscing elitr, sed diam nonumy eirmod tempor invidunt", "bbox": {"l": 133.76801021944917, "t": 477.8810726385873, "r": 477.4709207833008, "b": 487.84368269359516, "coord_origin": "TOPLEFT"}}, {"id": 31, "text": "ut labore et dolore magna aliquyam erat, sed diam voluptua. At vero eos et", "bbox": {"l": 133.76801021944917, "t": 489.8360527045959, "r": 477.4658807832926, "b": 499.7986727596038, "coord_origin": "TOPLEFT"}}, {"id": 32, "text": "accusam et justo duo dolores et ea rebum. Stet clita kasd gubergren, no sea", "bbox": {"l": 133.76801021944917, "t": 501.7910427706045, "r": 477.47586078330903, "b": 511.7536628256125, "coord_origin": "TOPLEFT"}}, {"id": 33, "text": "takimata sanctus est Lorem ipsum dolor sit amet.", "bbox": {"l": 133.76801021944917, "t": 513.7470728366188, "r": 351.4767205766048, "b": 523.7096828916267, "coord_origin": "TOPLEFT"}}], "children": []}, "text": "Lorem ipsum dolor sit amet, consetetur sadipscing elitr, sed diam nonumy eirmod tempor invidunt ut labore et dolore magna aliquyam erat, sed diam voluptua. At vero eos et accusam et justo duo dolores et ea rebum. Stet clita kasd gubergren, no sea takimata sanctus est Lorem ipsum dolor sit amet. Lorem ipsum dolor sit amet, consetetur sadipscing elitr, sed diam nonumy eirmod tempor invidunt ut labore et dolore magna aliquyam erat, sed diam voluptua. At vero eos et accusam et justo duo dolores et ea rebum. Stet clita kasd gubergren, no sea takimata sanctus est Lorem ipsum dolor sit amet. Lorem ipsum dolor sit amet, consetetur sadipscing elitr, sed diam nonumy eirmod tempor invidunt ut labore et dolore magna aliquyam erat, sed diam voluptua. At vero eos et accusam et justo duo dolores et ea rebum. Stet clita kasd gubergren, no sea takimata sanctus est Lorem ipsum dolor sit amet."}, {"label": "text", "id": 4, "page_no": 1, "cluster": {"id": 4, "label": "text", "bbox": {"l": 133.76801021944917, "t": 525.7020529026275, "r": 477.47482078330734, "b": 595.4406532876844, "coord_origin": "TOPLEFT"}, "confidence": 0.9844803214073181, "cells": [{"id": 34, "text": "Duis autem vel eum iriure dolor in hendrerit in vulputate velit esse molestie", "bbox": {"l": 148.71201024396512, "t": 525.7020529026275, "r": 477.4710707833011, "b": 535.6646729576354, "coord_origin": "TOPLEFT"}}, {"id": 35, "text": "consequat, vel illum dolore eu feugiat nulla facilisis at vero eros et accumsan et", "bbox": {"l": 133.76801021944917, "t": 537.6570729686364, "r": 477.47293078330415, "b": 547.6196930236442, "coord_origin": "TOPLEFT"}}, {"id": 36, "text": "iusto odio dignissim qui blandit praesent luptatum zzril delenit augue duis dolore", "bbox": {"l": 133.76801021944917, "t": 549.612063034645, "r": 477.47482078330734, "b": 559.5746730896528, "coord_origin": "TOPLEFT"}}, {"id": 37, "text": "te feugait nulla facilisi. Lorem ipsum dolor sit amet, consectetuer adipiscing elit,", "bbox": {"l": 133.76801021944917, "t": 561.5670431006536, "r": 477.472810783304, "b": 571.5296631556615, "coord_origin": "TOPLEFT"}}, {"id": 38, "text": "sed diam nonummy nibh euismod tincidunt ut laoreet dolore magna aliquam erat", "bbox": {"l": 133.76801021944917, "t": 573.5220631666624, "r": 477.47479078330724, "b": 583.4846832216704, "coord_origin": "TOPLEFT"}}, {"id": 39, "text": "volutpat.", "bbox": {"l": 133.76801021944917, "t": 585.4780532326765, "r": 173.0565202839028, "b": 595.4406532876844, "coord_origin": "TOPLEFT"}}], "children": []}, "text": "Duis autem vel eum iriure dolor in hendrerit in vulputate velit esse molestie consequat, vel illum dolore eu feugiat nulla facilisis at vero eros et accumsan et iusto odio dignissim qui blandit praesent luptatum zzril delenit augue duis dolore te feugait nulla facilisi. Lorem ipsum dolor sit amet, consectetuer adipiscing elit, sed diam nonummy nibh euismod tincidunt ut laoreet dolore magna aliquam erat volutpat."}, {"label": "text", "id": 2, "page_no": 1, "cluster": {"id": 2, "label": "text", "bbox": {"l": 133.76801021944917, "t": 597.4330632986853, "r": 477.47482078330734, "b": 667.1716636837423, "coord_origin": "TOPLEFT"}, "confidence": 0.9872003793716431, "cells": [{"id": 40, "text": "Duis autem vel eum iriure dolor in hendrerit in vulputate velit esse molestie", "bbox": {"l": 148.71201024396512, "t": 597.4330632986853, "r": 477.4710707833011, "b": 607.3956633536932, "coord_origin": "TOPLEFT"}}, {"id": 41, "text": "consequat, vel illum dolore eu feugiat nulla facilisis at vero eros et accumsan et", "bbox": {"l": 133.76801021944917, "t": 609.3880633646941, "r": 477.47293078330415, "b": 619.3506634197018, "coord_origin": "TOPLEFT"}}, {"id": 42, "text": "iusto odio dignissim qui blandit praesent luptatum zzril delenit augue duis dolore", "bbox": {"l": 133.76801021944917, "t": 621.3430634307027, "r": 477.47482078330734, "b": 631.3056634857105, "coord_origin": "TOPLEFT"}}, {"id": 43, "text": "te feugait nulla facilisi. Lorem ipsum dolor sit amet, consectetuer adipiscing elit,", "bbox": {"l": 133.76801021944917, "t": 633.2980634967114, "r": 477.472810783304, "b": 643.2606635517193, "coord_origin": "TOPLEFT"}}, {"id": 44, "text": "sed diam nonummy nibh euismod tincidunt ut laoreet dolore magna aliquam erat", "bbox": {"l": 133.76801021944917, "t": 645.2540535627256, "r": 477.47479078330724, "b": 655.2166536177334, "coord_origin": "TOPLEFT"}}, {"id": 45, "text": "volutpat.", "bbox": {"l": 133.76801021944917, "t": 657.2090636287344, "r": 173.0565202839028, "b": 667.1716636837423, "coord_origin": "TOPLEFT"}}], "children": []}, "text": "Duis autem vel eum iriure dolor in hendrerit in vulputate velit esse molestie consequat, vel illum dolore eu feugiat nulla facilisis at vero eros et accumsan et iusto odio dignissim qui blandit praesent luptatum zzril delenit augue duis dolore te feugait nulla facilisi. Lorem ipsum dolor sit amet, consectetuer adipiscing elit, sed diam nonummy nibh euismod tincidunt ut laoreet dolore magna aliquam erat volutpat."}], "headers": [{"label": "page_footer", "id": 7, "page_no": 1, "cluster": {"id": 7, "label": "page_footer", "bbox": {"l": 303.13300049729594, "t": 695.1640638383003, "r": 308.1142905054678, "b": 705.1266638933081, "coord_origin": "TOPLEFT"}, "confidence": 0.850279688835144, "cells": [{"id": 46, "text": "1", "bbox": {"l": 303.13300049729594, "t": 695.1640638383003, "r": 308.1142905054678, "b": 705.1266638933081, "coord_origin": "TOPLEFT"}}], "children": []}, "text": "1"}]}}] \ No newline at end of file