Commit Graph

  • 9f39e7052c
    Merge 990ecac0bc into 98e2fcff63 mohammed ahmed 2025-07-23 21:07:46 +0300
  • b87a1d9ccb
    Merge 5e1e82ab3b into 98e2fcff63 Shkarupa Alex 2025-07-23 20:39:41 +0300
  • fe1d04ec09
    Merge fe8049c4c1 into 98e2fcff63 ricalot 2025-07-24 00:07:14 +0800
  • fe8049c4c1 fix: Update DOCLING_API_ENDPOINT to use service name instead of localhost ricalot 2025-07-24 00:06:27 +0800
  • c0b381a9a7
    Merge 53e68d3dc6 into 98e2fcff63 Cesar Berrospi Ramis 2025-07-23 15:42:58 +0000
  • 53e68d3dc6 fix(HTML): ensure correct concatenation of child strings in table cells and list items fix/html-p-tag-1927 Cesar Berrospi Ramis 2025-07-23 17:18:38 +0200
  • ba50258f70
    Merge 425f38a5aa into 98e2fcff63 Christoph Auer 2025-07-23 14:03:36 +0000
  • 425f38a5aa Clean up unused code cau/async-pipeline-and-converter Christoph Auer 2025-07-23 16:03:25 +0200
  • de0d9b50a2 Option to enable threadpool with doc_batch_concurrency setting Christoph Auer 2025-07-23 15:52:12 +0200
  • 7b4db1940d Merge branch 'main' of github.com:DS4SD/docling into cau/async-pipeline-and-converter Christoph Auer 2025-07-23 15:07:10 +0200
  • ea6154f8a1
    Merge 69e0123213 into 98e2fcff63 Nikos Livathinos 2025-07-23 15:03:57 +0200
  • 69e0123213 Merge branch 'main' of github.com:DS4SD/docling into nli/layout_heron nli/layout_heron Christoph Auer 2025-07-23 15:03:31 +0200
  • fd0f06bba5 Merge branch 'nli/layout_heron' of github.com:docling-project/docling into nli/layout_heron Ubuntu 2025-07-23 12:33:02 +0000
  • 547ce26702 Update test GT (from linux CPU) Ubuntu 2025-07-23 12:32:30 +0000
  • 6a2e4c125c
    Merge 90da15f611 into 98e2fcff63 Peter W. J. Staar 2025-07-23 14:16:50 +0200
  • 0418d2887f
    Merge f4c1836c96 into 98e2fcff63 Peter W. J. Staar 2025-07-23 14:06:06 +0200
  • 1a5ee2c0ce Update test GT Christoph Auer 2025-07-23 14:05:30 +0200
  • 92deef45f1
    Merge db1daf91f5 into 98e2fcff63 William Easton 2025-07-23 14:02:04 +0200
  • 1889421ce7
    Merge 8d4ac70f61 into 98e2fcff63 neo 2025-07-23 14:02:01 +0200
  • 5a63357654 Deployed 98e2fcf with MkDocs version: 1.6.1 gh-pages 2025-07-23 11:52:49 +0000
  • 94410b6d34
    Merge a1acce83b9 into 98e2fcff63 mohammed ahmed 2025-07-23 11:51:56 +0000
  • 98e2fcff63
    fix: Preserve PARTIAL_SUCCESS status when document timeout hits (#1975) main Copilot 2025-07-23 13:50:40 +0200
  • c7e6d6d8e6 Update docling-models tag for TableFormer Christoph Auer 2025-07-23 13:39:50 +0200
  • 57d77232dc Fix the PARTIAL_SUCCESS case in _determine_status properly Christoph Auer 2025-07-23 12:23:03 +0200
  • 252e5e214f Fix timeout status preservation issue by extending _determine_status method copilot-swe-agent[bot] 2025-07-23 09:44:10 +0000
  • e0a603a33a Complete timeout fix validation with tests and documentation copilot-swe-agent[bot] 2025-07-23 09:22:35 +0000
  • 8a1c4331fb Initial investigation: analyze ReadingOrderModel timeout issue copilot-swe-agent[bot] 2025-07-23 09:18:06 +0000
  • 92b5dd62fa Initial plan copilot-swe-agent[bot] 2025-07-23 09:06:46 +0000
  • 8d50a59d48
    fix: multi-page image support (tiff) (#1928) Copilot 2025-07-23 09:55:40 +0200
  • 14a7b3d086 Use default layout model in model_downloader default args Christoph Auer 2025-07-23 08:51:07 +0200
  • b552fec0c6 Use default layout model in model_downloader default args Christoph Auer 2025-07-23 08:50:22 +0200
  • 67bf4d47ba Merge branch 'main' of github.com:DS4SD/docling into nli/layout_heron Christoph Auer 2025-07-23 08:47:23 +0200
  • 2247dbb86f Proper test for 2 page tiff file (2) Christoph Auer 2025-07-23 08:45:38 +0200
  • b9ebc4a0dc DCO Remediation Commit for copilot-swe-agent[bot] <198982749+Copilot@users.noreply.github.com> Christoph Auer 2025-07-23 08:39:27 +0200
  • bccb644772 Proper test for 2 page tiff file Christoph Auer 2025-07-23 08:39:06 +0200
  • 2aab66288b Revert "Add multi-page TIFF test data and verification tests" Christoph Auer 2025-07-23 08:23:15 +0200
  • d571f36299 Merge branch 'main' of github.com:DS4SD/docling into copilot/fix-1903 Christoph Auer 2025-07-23 08:22:08 +0200
  • ec971bbe68 chore: bump version to 2.42.1 [skip ci] v2.42.1 github-actions[bot] 2025-07-22 16:45:48 +0000
  • 58e9a158a1 feat: Switch default layout model to DOCLING_LAYOUT_HERON. Update the unit test data. Nikos Livathinos 2025-07-22 17:30:16 +0200
  • 67441ca418
    fix: Keep formula clusters also when empty (#1970) Christoph Auer 2025-07-22 17:02:12 +0200
  • 90a7cc4bdd
    docs: enrich existing DoclingDocument (#1969) Michele Dolfi 2025-07-22 16:20:15 +0200
  • 6905d86ffb Keep formula clusters also when empty Christoph Auer 2025-07-22 16:03:35 +0200
  • 19524fd0b5 Merge remote-tracking branch 'origin/main' into docs-enrich-docling-doc Michele Dolfi 2025-07-22 14:52:44 +0200
  • 7cbce4428b add example for enriching an existing doclingdocument Michele Dolfi 2025-07-22 14:49:23 +0200
  • a069b1175b
    refactor(HTML): handle text from styled html (#1960) Cesar Berrospi Ramis 2025-07-22 13:16:31 +0200
  • 94da472c4d
    Merge 4e332500a8 into 5d98bcea1b Michele Dolfi 2025-07-22 13:51:39 +0800
  • dc5dca7d31 tests(HTML): re-enable test_ordered_lists Cesar Berrospi Ramis 2025-07-18 14:14:02 +0200
  • 713d7a3342 A new HTML backend that handles styled html (ignors it) as well as images. Alexander Vaagan 2025-04-26 16:30:09 +0200
  • 5d98bcea1b
    docs: add documentation for confidence scores (#1912) Fabiano Franz 2025-07-21 05:16:17 -0300
  • 1f60014a3b
    Update confidence_scores.md Christoph Auer 2025-07-21 09:30:18 +0200
  • c33cc217cd Merge from main Christoph Auer 2025-07-19 17:28:13 +0200
  • 558ea957a8 Fix: python3.9 compat Christoph Auer 2025-07-19 17:17:01 +0200
  • 7762391b3e DCO Remediation Commit for Christoph Auer <cau@zurich.ibm.com> Christoph Auer 2025-07-19 17:13:36 +0200
  • ac9f8e0761 Fix: don't starve on docs with > max_queue_size pages Christoph Auer 2025-07-19 17:12:51 +0200
  • 009cc24d0d Fix: don't starve on docs with > max_queue_size pages Christoph Auer 2025-07-19 17:11:32 +0200
  • 0579d3a3d2 Fix: don't starve on docs with > max_queue_size pages Christoph Auer 2025-07-19 17:11:32 +0200
  • 234101e540
    Merge d90442488c into 7561be537a Qiefan Jiang 2025-07-18 15:36:10 +0000
  • 7561be537a chore: bump version to 2.42.0 [skip ci] v2.42.0 github-actions[bot] 2025-07-18 15:34:59 +0000
  • b36ad76b2a Stop accumulating docs in test run Christoph Auer 2025-07-18 17:22:41 +0200
  • d66da87d96 Merge branch 'cau/async-pipeline-and-converter' of github.com:docling-project/docling into cau/async-pipeline-and-converter Ubuntu 2025-07-18 15:19:57 +0000
  • 89acdb5db2 Update threaded test Ubuntu 2025-07-18 15:18:21 +0000
  • f6015bf8ae Remove redundant method Christoph Auer 2025-07-18 17:17:24 +0200
  • fa71cde950 Revert "Unload doc backend" Christoph Auer 2025-07-18 16:54:27 +0200
  • 01066f0b6e Unload doc backend Christoph Auer 2025-07-18 16:48:35 +0200
  • 988db91bff Reorder test Christoph Auer 2025-07-18 14:33:13 +0200
  • cca05c45ea
    fix: Safe pipeline init, use device_map in transformers models (#1917) Christoph Auer 2025-07-18 15:14:36 +0200
  • 33a24848a0 Revise pipeline Christoph Auer 2025-07-18 14:33:03 +0200
  • ec0898b501 Make pipeline cache+init thread-safe Christoph Auer 2025-07-18 11:00:48 +0200
  • 1df31bc82f Merge branch 'main' of github.com:DS4SD/docling into cau/thread-safety-fixes-again Christoph Auer 2025-07-18 10:58:50 +0200
  • 9fd01f3399 Add test Christoph Auer 2025-07-17 20:49:37 +0200
  • a1acce83b9
    DCO Remediation Commit for codeflash-ai[bot] <148906541+codeflash-ai[bot]@users.noreply.github.com> mohammed 2025-07-16 20:43:54 +0300
  • ad90f337bc
    DCO Remediation Commit for mohammed <mohammed18200118@gmail.com>\n\nI, mohammed <mohammed18200118@gmail.com>, hereby add my Signed-off-by to this commit: d9824749bb678a74563c45965d6b4912b4340a2f\n\nSigned-off-by: mohammed <mohammed18200118@gmail.com>n mohammed 2025-07-16 20:42:56 +0300
  • 04085ba86d Remove unused args Christoph Auer 2025-07-16 17:50:38 +0200
  • 4397bb2c44 Pin docling-ibm-models Christoph Auer 2025-07-16 17:35:40 +0200
  • 8c905f3e70 Better threaded PDF pipeline Christoph Auer 2025-07-16 17:01:33 +0200
  • 7b84668e63
    keep the same variable name mohammed 2025-07-16 14:30:13 +0300
  • bd8b1c42d4
    clean up mohammed 2025-07-16 14:28:47 +0300
  • e1e3053695
    fix: fix HTML table parser and JATS backend bugs (#1948) Cesar Berrospi Ramis 2025-07-16 10:49:24 +0200
  • f98c7e21dd Cleanups and safety improvements Christoph Auer 2025-07-16 10:46:32 +0200
  • 0be9349884 Refactoring into async pipeline primitives and graph Christoph Auer 2025-07-16 10:12:51 +0200
  • a9ba0cdb5b fix: fix HTML table parser and JATS backend bugs Cesar Berrospi Ramis 2025-07-15 15:54:02 +0200
  • 9d29552194 Increase focus on confidence grades, scores are informational only Fabiano Franz 2025-07-15 17:39:24 -0300
  • ef25d03bc8 UpstreamAwareQueue Christoph Auer 2025-07-15 20:09:05 +0200
  • f56de726f3 Initial async pdf pipeline Christoph Auer 2025-07-15 19:25:48 +0200
  • 990ecac0bc
    DCO Remediation Commit for mohammed <mohammed18200118@gmail.com> mohammed 2025-07-15 15:26:49 +0300
  • d9824749bb
    fix: pandas vet error mohammed 2025-07-15 15:24:52 +0300
  • d6d2dbe2f9
    docs: Fix typos (#1943) stephencox-ict 2025-07-15 19:51:56 +1200
  • 4bcd483d2d
    Fix typos stephencox-ict 2025-07-15 13:53:58 +1200
  • a436be7367
    feat: Add option to control empty clusters in layout postprocessing (#1940) Christoph Auer 2025-07-14 18:32:01 +0200
  • 3e71e6fc6e Add option to control empty clusters in layout postprocessing Christoph Auer 2025-07-14 12:20:37 +0200
  • 130a10e2d9 Add multi-page TIFF test data and verification tests copilot-swe-agent[bot] 2025-07-14 08:23:50 +0000
  • db1daf91f5
    DCO Remediation Commit for William Easton <bill.easton@elastic.co> William Easton 2025-07-12 20:13:06 -0500
  • df5c15195b
    Support hierarchical markdown William Easton 2025-07-12 20:10:27 -0500
  • be5d0f71a3 DCO Remediation Commit for Gustavo Lima <crymerom@gmail.com> Gustavo Lima 2025-07-11 16:42:52 +0100
  • 84e45d7191 docs: fix README getting started to reflect single document conversion Gustavo Lima 2025-07-11 16:36:49 +0100
  • b6765b0c09
    Merge 117add0396 into 95e70962f1 benichou 2025-07-11 12:13:33 +0200
  • 6e0b3dcaf1 Remove pointless test Christoph Auer 2025-07-11 10:27:51 +0200
  • 05f51b30d9 add RGB conversion Christoph Auer 2025-07-11 10:26:43 +0200
  • 6aa85cc933 Merge branch 'main' of github.com:DS4SD/docling into copilot/fix-1903 Christoph Auer 2025-07-11 10:21:21 +0200
  • 95e70962f1
    fix: KeyError: 'fPr' when processing latex fractions in DOCX files (#1926) Copilot 2025-07-11 09:52:14 +0200