Commit Graph

  • 1042b6ce28
    Merge 4e332500a8 into aec29a7315 Michele Dolfi 2025-07-26 07:54:36 +0900
  • 2b3d442c18
    Merge d90442488c into aec29a7315 Qiefan Jiang 2025-07-26 07:53:53 +0900
  • 865d7b2c59
    Merge 5e1e82ab3b into aec29a7315 Shkarupa Alex 2025-07-26 07:51:45 +0900
  • 6ae17aa402
    Merge f4c1836c96 into aec29a7315 Peter W. J. Staar 2025-07-25 16:08:49 +0200
  • 9495270a52
    Merge 5f5a3cd914 into aec29a7315 Michele Dolfi 2025-07-25 13:36:34 +0000
  • 5f5a3cd914 use test file from test folder (still missing) feat-books-ocr Michele Dolfi 2025-07-25 15:36:09 +0200
  • 9da610e95b use PdfDocumentBackend Michele Dolfi 2025-07-25 15:19:53 +0200
  • bf016993a0 Deployed aec29a7 with MkDocs version: 1.6.1 gh-pages 2025-07-25 13:19:49 +0000
  • e5407304a0
    Merge e6d5e4e48f into aec29a7315 Christoph Auer 2025-07-25 13:19:49 +0000
  • c31b3416a1
    Merge fe8049c4c1 into aec29a7315 ricalot 2025-07-25 13:19:47 +0000
  • aec29a7315
    fix(markdown): ensure correct parsing of nested lists (#1995) main Cesar Berrospi Ramis 2025-07-25 15:17:57 +0200
  • 46b904e059 rename inputformat Michele Dolfi 2025-07-25 15:16:41 +0200
  • 79c59cb2b0 restore guess format Michele Dolfi 2025-07-25 15:15:05 +0200
  • bbb735d2de fix typing and unloading Michele Dolfi 2025-07-25 15:13:10 +0200
  • 3e4093db58 use HTMLParser and add options from CLI Michele Dolfi 2025-07-25 15:10:05 +0200
  • 95cf070e0b
    Merge db1daf91f5 into 1985841a19 William Easton 2025-07-25 13:55:12 +0200
  • d3e6f72a36 chore: update dependencies in uv.lock file Cesar Berrospi Ramis 2025-07-25 13:29:53 +0200
  • 82940c47a6 fix(markdown): ensure correct parsing of nested lists Cesar Berrospi Ramis 2025-07-25 13:16:28 +0200
  • 5512174826
    Merge 990ecac0bc into 1985841a19 mohammed ahmed 2025-07-25 13:01:36 +0200
  • e6d5e4e48f Remove ignores for typing/linting cau/async-pipeline-and-converter Christoph Auer 2025-07-25 12:37:55 +0200
  • 02a7deb882 Merge branch 'main' of github.com:DS4SD/docling into cau/async-pipeline-and-converter Christoph Auer 2025-07-25 12:28:31 +0200
  • 1985841a19
    ci: Fixes for test GT (#1992) Christoph Auer 2025-07-25 12:28:06 +0200
  • 7c3f9b7ab1 Fixes for cell indexing Christoph Auer 2025-07-25 11:55:48 +0200
  • 39499db39a Fixes for test GT Christoph Auer 2025-07-25 10:52:33 +0200
  • 1581171438
    Merge 69e0123213 into 945721a15d Nikos Livathinos 2025-07-25 06:45:33 +0000
  • 945721a15d
    fix(HTML): remove an unnecessary print command (#1988) Cesar Berrospi Ramis 2025-07-25 08:45:15 +0200
  • 7bc1c1ac3d add backend for METS with Google Books profile Michele Dolfi 2025-07-24 19:18:29 +0200
  • cb0817de76 fix(HTML): remove an unnecessary print command Cesar Berrospi Ramis 2025-07-24 18:03:54 +0200
  • 744a013a32 Use released docling-ibm-models Christoph Auer 2025-07-24 17:01:04 +0200
  • df257bf90e Fix settings defaults expectations Christoph Auer 2025-07-24 15:08:35 +0200
  • 4040bd6618 Merge branch 'main' of github.com:DS4SD/docling into cau/async-pipeline-and-converter Christoph Auer 2025-07-24 15:07:00 +0200
  • 4c3e4769ce
    Merge a1acce83b9 into 8227841c1b mohammed ahmed 2025-07-24 14:41:37 +0200
  • 8227841c1b chore: bump version to 2.42.2 [skip ci] v2.42.2 github-actions[bot] 2025-07-24 10:21:10 +0000
  • 5132f061a8
    fix(HTML): concatenation of child strings in table cells and list items (#1981) Cesar Berrospi Ramis 2025-07-24 11:19:25 +0200
  • 7b5f86098d
    docs: add chat with dosu (#1984) Michele Dolfi 2025-07-24 11:07:36 +0200
  • 0b83609531
    fix(docx): Adding plain latex equations to table cells (#1986) Rafael Teixeira de Lima 2025-07-24 11:02:24 +0200
  • 3fb645c4c2 Adding test files Rafael Teixeira de Lima 2025-07-24 10:24:50 +0200
  • ca8b7b0eef Adding plain latex equations to table cells Rafael Teixeira de Lima 2025-07-24 09:48:04 +0200
  • db13c68650 add chat with dosu Michele Dolfi 2025-07-24 08:25:36 +0200
  • fe8049c4c1 fix: Update DOCLING_API_ENDPOINT to use service name instead of localhost ricalot 2025-07-24 00:06:27 +0800
  • 53e68d3dc6 fix(HTML): ensure correct concatenation of child strings in table cells and list items Cesar Berrospi Ramis 2025-07-23 17:18:38 +0200
  • 425f38a5aa Clean up unused code Christoph Auer 2025-07-23 16:03:25 +0200
  • de0d9b50a2 Option to enable threadpool with doc_batch_concurrency setting Christoph Auer 2025-07-23 15:52:12 +0200
  • 7b4db1940d Merge branch 'main' of github.com:DS4SD/docling into cau/async-pipeline-and-converter Christoph Auer 2025-07-23 15:07:10 +0200
  • 69e0123213 Merge branch 'main' of github.com:DS4SD/docling into nli/layout_heron nli/layout_heron Christoph Auer 2025-07-23 15:03:31 +0200
  • fd0f06bba5 Merge branch 'nli/layout_heron' of github.com:docling-project/docling into nli/layout_heron Ubuntu 2025-07-23 12:33:02 +0000
  • 547ce26702 Update test GT (from linux CPU) Ubuntu 2025-07-23 12:32:30 +0000
  • 6a2e4c125c
    Merge 90da15f611 into 98e2fcff63 Peter W. J. Staar 2025-07-23 14:16:50 +0200
  • 1a5ee2c0ce Update test GT Christoph Auer 2025-07-23 14:05:30 +0200
  • 1889421ce7
    Merge 8d4ac70f61 into 98e2fcff63 neo 2025-07-23 14:02:01 +0200
  • 98e2fcff63
    fix: Preserve PARTIAL_SUCCESS status when document timeout hits (#1975) Copilot 2025-07-23 13:50:40 +0200
  • c7e6d6d8e6 Update docling-models tag for TableFormer Christoph Auer 2025-07-23 13:39:50 +0200
  • 57d77232dc Fix the PARTIAL_SUCCESS case in _determine_status properly Christoph Auer 2025-07-23 12:23:03 +0200
  • 252e5e214f Fix timeout status preservation issue by extending _determine_status method copilot-swe-agent[bot] 2025-07-23 09:44:10 +0000
  • e0a603a33a Complete timeout fix validation with tests and documentation copilot-swe-agent[bot] 2025-07-23 09:22:35 +0000
  • 8a1c4331fb Initial investigation: analyze ReadingOrderModel timeout issue copilot-swe-agent[bot] 2025-07-23 09:18:06 +0000
  • 92b5dd62fa Initial plan copilot-swe-agent[bot] 2025-07-23 09:06:46 +0000
  • 8d50a59d48
    fix: multi-page image support (tiff) (#1928) Copilot 2025-07-23 09:55:40 +0200
  • 14a7b3d086 Use default layout model in model_downloader default args Christoph Auer 2025-07-23 08:51:07 +0200
  • b552fec0c6 Use default layout model in model_downloader default args Christoph Auer 2025-07-23 08:50:22 +0200
  • 67bf4d47ba Merge branch 'main' of github.com:DS4SD/docling into nli/layout_heron Christoph Auer 2025-07-23 08:47:23 +0200
  • 2247dbb86f Proper test for 2 page tiff file (2) Christoph Auer 2025-07-23 08:45:38 +0200
  • b9ebc4a0dc DCO Remediation Commit for copilot-swe-agent[bot] <198982749+Copilot@users.noreply.github.com> Christoph Auer 2025-07-23 08:39:27 +0200
  • bccb644772 Proper test for 2 page tiff file Christoph Auer 2025-07-23 08:39:06 +0200
  • 2aab66288b Revert "Add multi-page TIFF test data and verification tests" Christoph Auer 2025-07-23 08:23:15 +0200
  • d571f36299 Merge branch 'main' of github.com:DS4SD/docling into copilot/fix-1903 Christoph Auer 2025-07-23 08:22:08 +0200
  • ec971bbe68 chore: bump version to 2.42.1 [skip ci] v2.42.1 github-actions[bot] 2025-07-22 16:45:48 +0000
  • 58e9a158a1 feat: Switch default layout model to DOCLING_LAYOUT_HERON. Update the unit test data. Nikos Livathinos 2025-07-22 17:30:16 +0200
  • 67441ca418
    fix: Keep formula clusters also when empty (#1970) Christoph Auer 2025-07-22 17:02:12 +0200
  • 90a7cc4bdd
    docs: enrich existing DoclingDocument (#1969) Michele Dolfi 2025-07-22 16:20:15 +0200
  • 6905d86ffb Keep formula clusters also when empty Christoph Auer 2025-07-22 16:03:35 +0200
  • 19524fd0b5 Merge remote-tracking branch 'origin/main' into docs-enrich-docling-doc Michele Dolfi 2025-07-22 14:52:44 +0200
  • 7cbce4428b add example for enriching an existing doclingdocument Michele Dolfi 2025-07-22 14:49:23 +0200
  • a069b1175b
    refactor(HTML): handle text from styled html (#1960) Cesar Berrospi Ramis 2025-07-22 13:16:31 +0200
  • dc5dca7d31 tests(HTML): re-enable test_ordered_lists Cesar Berrospi Ramis 2025-07-18 14:14:02 +0200
  • 713d7a3342 A new HTML backend that handles styled html (ignores it) as well as images. Alexander Vaagan 2025-04-26 16:30:09 +0200
  • 5d98bcea1b
    docs: add documentation for confidence scores (#1912) Fabiano Franz 2025-07-21 05:16:17 -0300
  • 1f60014a3b
    Update confidence_scores.md Christoph Auer 2025-07-21 09:30:18 +0200
  • c33cc217cd Merge from main Christoph Auer 2025-07-19 17:28:13 +0200
  • 558ea957a8 Fix: python3.9 compat Christoph Auer 2025-07-19 17:17:01 +0200
  • 7762391b3e DCO Remediation Commit for Christoph Auer <cau@zurich.ibm.com> Christoph Auer 2025-07-19 17:13:36 +0200
  • ac9f8e0761 Fix: don't starve on docs with > max_queue_size pages Christoph Auer 2025-07-19 17:12:51 +0200
  • 009cc24d0d Fix: don't starve on docs with > max_queue_size pages Christoph Auer 2025-07-19 17:11:32 +0200
  • 0579d3a3d2 Fix: don't starve on docs with > max_queue_size pages Christoph Auer 2025-07-19 17:11:32 +0200
  • 7561be537a chore: bump version to 2.42.0 [skip ci] v2.42.0 github-actions[bot] 2025-07-18 15:34:59 +0000
  • b36ad76b2a Stop accumulating docs in test run Christoph Auer 2025-07-18 17:22:41 +0200
  • d66da87d96 Merge branch 'cau/async-pipeline-and-converter' of github.com:docling-project/docling into cau/async-pipeline-and-converter Ubuntu 2025-07-18 15:19:57 +0000
  • 89acdb5db2 Update threaded test Ubuntu 2025-07-18 15:18:21 +0000
  • f6015bf8ae Remove redundant method Christoph Auer 2025-07-18 17:17:24 +0200
  • fa71cde950 Revert "Unload doc backend" Christoph Auer 2025-07-18 16:54:27 +0200
  • 01066f0b6e Unload doc backend Christoph Auer 2025-07-18 16:48:35 +0200
  • 988db91bff Reorder test Christoph Auer 2025-07-18 14:33:13 +0200
  • cca05c45ea
    fix: Safe pipeline init, use device_map in transformers models (#1917) Christoph Auer 2025-07-18 15:14:36 +0200
  • 33a24848a0 Revise pipeline Christoph Auer 2025-07-18 14:33:03 +0200
  • ec0898b501 Make pipeline cache+init thread-safe Christoph Auer 2025-07-18 11:00:48 +0200
  • 1df31bc82f Merge branch 'main' of github.com:DS4SD/docling into cau/thread-safety-fixes-again Christoph Auer 2025-07-18 10:58:50 +0200
  • 9fd01f3399 Add test Christoph Auer 2025-07-17 20:49:37 +0200
  • a1acce83b9
    DCO Remediation Commit for codeflash-ai[bot] <148906541+codeflash-ai[bot]@users.noreply.github.com> mohammed 2025-07-16 20:43:54 +0300
  • ad90f337bc
    DCO Remediation Commit for mohammed <mohammed18200118@gmail.com> I, mohammed <mohammed18200118@gmail.com>, hereby add my Signed-off-by to this commit: d9824749bb678a74563c45965d6b4912b4340a2f Signed-off-by: mohammed <mohammed18200118@gmail.com> mohammed 2025-07-16 20:42:56 +0300
  • 04085ba86d Remove unused args Christoph Auer 2025-07-16 17:50:38 +0200