Commit Graph

  • d431579f75
    docs: Update MAINTAINERS.md Christoph Auer 2024-09-02 12:28:53 +0200
  • 85b7348846
    docs: Mention quackling on README (#58) Christoph Auer 2024-09-02 12:27:29 +0200
  • 8f237be9b4
    Update README.md Christoph Auer 2024-09-02 12:24:18 +0200
  • 6b84adebfa create a single parquet output Michele Dolfi 2024-08-30 16:24:42 +0200
  • 66ed096c40 chore: bump version to 1.8.5 [skip ci] v1.8.5 github-actions[bot] 2024-08-30 12:37:54 +0000
  • 3e789dfbdd feat: export document pages as multimodal output Michele Dolfi 2024-08-28 11:40:55 +0200
  • 48f4d1ba52
    fix: Add unit tests (#51) Peter W. J. Staar 2024-08-30 14:08:20 +0200
  • 408a158338 Pin new docling-parse v1.1.3 Christoph Auer 2024-08-30 13:41:37 +0200
  • 0bf89c7726 Fix lockfile Christoph Auer 2024-08-30 12:41:41 +0200
  • 28aad8f4b4 Merge branch 'dev/add-strict-tests' of github.com:DS4SD/docling into dev/add-strict-tests Christoph Auer 2024-08-30 12:39:47 +0200
  • c8cfd442d1 bumped GLM version Peter Staar 2024-08-30 12:33:46 +0200
  • 256f4d504e chore: bump version to 1.8.4 [skip ci] v1.8.4 github-actions[bot] 2024-08-30 08:47:57 +0000
  • de85e46ced
    fix: propagate row_section in tables (#57) Michele Dolfi 2024-08-30 10:36:00 +0200
  • 525a46fadd fix: propagate row_section in tables Michele Dolfi 2024-08-30 10:19:17 +0200
  • a8a60d52b1
    docs: add instructions for cpu-only installation (#56) Michele Dolfi 2024-08-30 10:20:21 +0200
  • 325cda7697 docs: add instructions for cpu-only installation Michele Dolfi 2024-08-30 10:01:14 +0200
  • 07538c089f commented out the json verification for now Peter Staar 2024-08-30 08:44:26 +0200
  • b14675f636 Merge branch 'dev/add-strict-tests' of github.com:DS4SD/docling into dev/add-strict-tests Christoph Auer 2024-08-29 16:37:18 +0200
  • 4bd5deb57d updated the error messages Peter Staar 2024-08-29 16:20:15 +0200
  • 85304caee5 pin docling-parse 1.1.2 Michele Dolfi 2024-08-29 16:01:30 +0200
  • 237aa1316b Merge branch 'main' of github.com:DS4SD/docling into dev/add-strict-tests Christoph Auer 2024-08-29 15:24:57 +0200
  • d7a447603d skip batch_convert Michele Dolfi 2024-08-29 09:05:51 +0200
  • c10f555749 reduce docs in example, since they are already in the tests Michele Dolfi 2024-08-29 08:58:02 +0200
  • a700411288 package verify utils and add more tests Michele Dolfi 2024-08-28 18:50:32 +0200
  • e44791691f Merge branch 'dev/add-strict-tests' of github.com:DS4SD/docling into dev/add-strict-tests Christoph Auer 2024-08-28 14:58:29 +0200
  • 07ec034f1c Validate conversion status on e2e test Christoph Auer 2024-08-28 14:58:21 +0200
  • 52b25bf030 Merge branch 'dev/add-strict-tests' of github.com:DS4SD/docling into dev/add-strict-tests Michele Dolfi 2024-08-28 14:57:14 +0200
  • 2cc940fbfa Remove unnecessary code Christoph Auer 2024-08-28 14:27:44 +0200
  • c09d2bca47 Merge branch 'dev/add-strict-tests' of github.com:DS4SD/docling into dev/add-strict-tests Christoph Auer 2024-08-28 14:22:32 +0200
  • 2d631439ba Add tests and update top_level_tests using only datamodels Christoph Auer 2024-08-28 14:20:54 +0200
  • 6c9fa58155 run examples after tests Michele Dolfi 2024-08-28 13:54:15 +0200
  • 03fbb51566 fix examples Michele Dolfi 2024-08-28 13:25:24 +0200
  • 9b6b8c4ca0 raise a failure if examples fail Michele Dolfi 2024-08-28 13:14:21 +0200
  • 2ad85fdf97 make sure examples return failures Michele Dolfi 2024-08-28 12:49:43 +0200
  • a39449d47b run all examples in CI Michele Dolfi 2024-08-28 12:43:54 +0200
  • 5c46749e70 chore: bump version to 1.8.3 [skip ci] v1.8.3 github-actions[bot] 2024-08-28 10:37:38 +0000
  • e1c8d69422 Merge branch 'main' into dev/add-strict-tests Peter Staar 2024-08-28 12:33:32 +0200
  • f49ee825c3
    fix: table cells overlap and model warnings (#53) Michele Dolfi 2024-08-28 12:30:42 +0200
  • 375207367b fix: table cells overlap and model warnings Michele Dolfi 2024-08-28 11:36:10 +0200
  • 0f172cce2f added test to verify the cells in the pages (3) Peter Staar 2024-08-28 10:58:41 +0200
  • c6440c8911 added test to verify the cells in the pages (2) Peter Staar 2024-08-28 10:58:19 +0200
  • e6ed6f4793 added test to verify the cells in the pages Peter Staar 2024-08-28 10:39:17 +0200
  • f853d0afa1 reformat code Peter Staar 2024-08-28 09:10:41 +0200
  • 0d4fd90036 added verification of input cells Peter Staar 2024-08-28 09:09:06 +0200
  • 3dbd6781df commented out json verification for now Peter Staar 2024-08-27 17:01:29 +0200
  • 93bdaf063b Fix backend tests Christoph Auer 2024-08-27 16:25:36 +0200
  • f517e63b02 Fix backend tests Christoph Auer 2024-08-27 15:18:51 +0200
  • 40d754f03d
    ci: avoid duplicate runs Michele Dolfi 2024-08-27 16:25:29 +0200
  • b548687a06 commented out the drawing Peter Staar 2024-08-27 16:13:37 +0200
  • 774704ae8c Merge branch 'main' of github.com:DS4SD/docling into dev/add-strict-tests Christoph Auer 2024-08-27 15:18:51 +0200
  • e59ea8e04e Fix backend tests Christoph Auer 2024-08-27 15:18:35 +0200
  • 4980b71185 reformatted code Peter Staar 2024-08-27 13:32:45 +0200
  • d0403aaebf chore: bump version to 1.8.2 [skip ci] v1.8.2 github-actions[bot] 2024-08-27 09:53:15 +0000
  • e46a66a176
    fix: refine conversion result (#52) Panos Vagenas 2024-08-27 11:50:43 +0200
  • 3b730688b6 fix: refine conversion result Panos Vagenas 2024-08-27 10:01:39 +0200
  • 35bd7b9cff replaced deprecated json function with model_dump_json Peter Staar 2024-08-26 20:38:42 +0200
  • 08364dfa56 replaced deprecated json function with model_dump_json Peter Staar 2024-08-26 20:32:23 +0200
  • 24c0b9d4c9 ran pre-commit Peter Staar 2024-08-26 20:22:31 +0200
  • c64489a82c added first test for json and md output Peter Staar 2024-08-26 20:21:18 +0200
  • 64640337a3 added the reference converted documents Peter Staar 2024-08-26 18:01:54 +0200
  • b7debe7250 need to start running all tests successfully Peter Staar 2024-08-26 17:50:39 +0200
  • 2c66075390 updated the toplevel function test Peter Staar 2024-08-26 17:49:38 +0200
  • 12eea8495f renamed the test folder and added the toplevel test Peter Staar 2024-08-26 17:00:30 +0200
  • f5eb49a811 add the pytests Peter Staar 2024-08-26 16:26:17 +0200
  • fe817b11d7
    docs: update interface in README (#50) Michele Dolfi 2024-08-26 15:36:39 +0200
  • df6b3f04ea docs: update interface in README Michele Dolfi 2024-08-26 15:34:43 +0200
  • 7052bee999 chore: bump version to 1.8.1 [skip ci] v1.8.1 github-actions[bot] 2024-08-26 11:55:37 +0000
  • 8cc147bc56
    fix: align output formats (#49) Michele Dolfi 2024-08-26 13:30:26 +0200
  • 9b323feca5 fix: align output formats Michele Dolfi 2024-08-26 11:29:13 +0200
  • 053eae4bdf chore: bump version to 1.8.0 [skip ci] v1.8.0 github-actions[bot] 2024-08-23 14:24:04 +0000
  • a294b7e64a
    feat: Page-level error reporting from PDF backend, introduce PARTIAL_SUCCESS status (#47) Christoph Auer 2024-08-23 16:18:41 +0200
  • 6a8e4f565e Add ErrorItem and evaluate page valid status Christoph Auer 2024-08-23 15:46:03 +0200
  • 3226b20779 chore: bump version to 1.7.1 [skip ci] v1.7.1 github-actions[bot] 2024-08-23 11:56:02 +0000
  • 8808463cec
    fix: Better raise exception when a page fails to parse (#46) Christoph Auer 2024-08-23 13:51:42 +0200
  • 79a8090863 Merge from main Christoph Auer 2024-08-23 13:13:51 +0200
  • 1e983639f7 Raise from page backend if page is not correctly parsed Christoph Auer 2024-08-23 13:07:40 +0200
  • 21f977544c Introduce page-level error checks Christoph Auer 2024-08-23 13:06:20 +0200
  • 7e84533299
    fix: Upgrade docling-parse to 1.1.1, safety checks for failed parse on pages (#45) Christoph Auer 2024-08-23 12:51:02 +0200
  • cae20ac099 Merge branch 'cau/upgrade-docling-parse-1.1.0' of github.com:DS4SD/docling into cau/error-reporting Christoph Auer 2024-08-23 12:40:23 +0200
  • b07881324c Bump to docling-parse 1.1.1 Christoph Auer 2024-08-23 11:26:52 +0200
  • 591a3c7b6b Introduce page-level error checks Christoph Auer 2024-08-23 11:41:18 +0200
  • 4d7ea030da Put safety-checks for failed parse of pages Christoph Auer 2024-08-22 18:56:34 +0200
  • 1930f08d4e chore: bump version to 1.7.0 [skip ci] v1.7.0 github-actions[bot] 2024-08-22 12:00:25 +0000
  • a8c6b29a67
    feat: Upgrade docling-parse PDF backend and interface to use page-by-page parsing (#44) Christoph Auer 2024-08-22 13:49:37 +0200
  • eeacb48549 repin after more packages on pypi Michele Dolfi 2024-08-22 13:43:53 +0200
  • e20fce8332 Upgrade lockfile Christoph Auer 2024-08-22 13:34:52 +0200
  • 87fce059ac Merge from main Christoph Auer 2024-08-22 13:15:57 +0200
  • ebcc1e5524 Propagate document_hash to PDF backends, use docling-parse 1.0.0 Christoph Auer 2024-08-22 13:05:24 +0200
  • f7c50c8b0e chore: bump version to 1.6.3 [skip ci] v1.6.3 github-actions[bot] 2024-08-22 11:02:35 +0000
  • fac5745dc8
    fix: usage of bytesio with docling-parse (#43) Michele Dolfi 2024-08-22 12:59:49 +0200
  • edded50b68 fix: usage of bytesio with docling-parse Michele Dolfi 2024-08-22 12:54:15 +0200
  • 55b538fa1b Merge branch 'main' of github.com:DS4SD/docling into cau/upgrade-docling-parse-py-page Christoph Auer 2024-08-22 12:26:52 +0200
  • 1347c01a9e chore: bump version to 1.6.2 [skip ci] v1.6.2 github-actions[bot] 2024-08-22 07:32:54 +0000
  • 69952682ed
    fix: remove [ocr] extra to fix wheel install (#42) Michele Dolfi 2024-08-22 09:25:19 +0200
  • fad1521caf fix: remove [ocr] extra to fix wheel install Michele Dolfi 2024-08-22 08:56:26 +0200
  • 47c6dab6d2 chore: bump version to 1.6.1 [skip ci] v1.6.1 github-actions[bot] 2024-08-21 17:41:26 +0000
  • f19871a5a1
    fix: Add scipy as dependency (#40) Christoph Auer 2024-08-21 17:21:02 +0200
  • 2ead4cc851 Add scipy as dependency Christoph Auer 2024-08-21 17:14:36 +0200
  • 4a1ceaf65c
    Update docling-ibm-models to v1.1.2 (#39) Christoph Auer 2024-08-21 17:12:38 +0200
  • 65b7f0b4f2 Update docling-ibm-models to v1.1.2 Christoph Auer 2024-08-21 17:01:49 +0200