Commit Graph

  • 1c26769785 feat(SmolDocling): Support MLX acceleration in VLM pipeline (#1199) Maxim Lysak 2025-03-19 15:38:54 +0100
  • 4c0ff55f0a Consmetic changes Christoph Auer 2025-03-19 15:01:09 +0100
  • 39a949df6e Merge branch 'dev/mlx' of github.com:DS4SD/docling into dev/mlx Christoph Auer 2025-03-19 14:56:28 +0100
  • 9cc5abd241 corrections in the documentation Maksym Lysak 2025-03-19 14:53:49 +0100
  • b454aa1551 feat: Add PPTX notes slides (#474) Maciej Wieczorek 2025-03-19 14:52:09 +0100
  • db5cc61494 Updated documentation Maksym Lysak 2025-03-19 14:46:23 +0100
  • e13fa5ade8 Updated example Maksym Lysak 2025-03-19 14:40:21 +0100
  • 40b3f597f3 Updated README Maksym Lysak 2025-03-19 14:31:33 +0100
  • 26f0543c9a Merge branch 'dev/mlx' of github.com:DS4SD/docling into dev/mlx Christoph Auer 2025-03-19 14:27:48 +0100
  • 16664f2cd6 Fixed extract_text_from_backend definition Maksym Lysak 2025-03-19 14:13:05 +0100
  • a9cf823187 make vlm_pipeline python3.9 compatible Maksym Lysak 2025-03-19 13:56:57 +0100
  • 9182d8a622 Updated minimal vlm pipeline example Maksym Lysak 2025-03-19 13:48:44 +0100
  • 0875388ce6 Add CLI choices for VLM pipeline and model Christoph Auer 2025-03-19 13:18:04 +0100
  • bd0c4dfe10 mlx_model unit Maksym Lysak 2025-03-19 11:02:26 +0100
  • e7c29a89d0 Initial implementation to support MLX for VLM pipeline and SmolDocling Maksym Lysak 2025-03-19 10:51:20 +0100
  • 8e2b0b39c1 Add CLI choices for VLM pipeline and model Christoph Auer 2025-03-19 13:18:04 +0100
  • 34c3a395fe Merge branch 'docling-project:main' into main Ulan Yisaev 2025-03-19 13:27:17 +0200
  • 4c5b0f7894 feat: Move presenter notes into furniture Maciej Wieczorek 2025-03-19 11:59:55 +0100
  • f5adfb9724 fix: Determine correct page size in DoclingParseV4Backend (#1196) Christoph Auer 2025-03-19 11:05:42 +0100
  • d5f7798763 test(html): fix regression test after docling-core update (#1197) Cesar Berrospi Ramis 2025-03-19 11:03:46 +0100
  • 0cd9b48372 mlx_model unit Maksym Lysak 2025-03-19 11:02:26 +0100
  • 4d8c1b6935 Initial implementation to support MLX for VLM pipeline and SmolDocling Maksym Lysak 2025-03-19 10:51:20 +0100
  • 378db72f2e feat: Add PPTX notes slides Maciej Wieczorek 2024-11-15 15:53:35 -0500
  • 0b707d0882 fix(msword): Fixing function return in equations handling (#1194) Rafael Teixeira de Lima 2025-03-19 10:34:25 +0100
  • 7e1fabbf09 test(html): fix regression test after docling-core update Cesar Berrospi Ramis 2025-03-19 09:51:57 +0100
  • 56d3a590f2 fix: Determine correct page size in DoclingParseV4Backend Christoph Auer 2025-03-19 09:38:55 +0100
  • a81faacfc9 Add message Rafael Teixeira de Lima 2025-03-19 09:06:10 +0100
  • 4f284b3746 Fixing function return Rafael Teixeira de Lima 2025-03-19 09:02:22 +0100
  • 1d680b0a32 docs: Linux Foundation AI & Data (#1183) Michele Dolfi 2025-03-19 09:05:57 +0100
  • 3a4de79704 update docs index Michele Dolfi 2025-03-18 16:45:51 +0100
  • d61884335f Merge remote-tracking branch 'origin/main' into docs-lfai Michele Dolfi 2025-03-18 16:44:11 +0100
  • 54a78c307d docs: move apify to docs (#1182) Michele Dolfi 2025-03-18 16:43:55 +0100
  • 508385167c point the auxiliary files to the community repo and add lfai in README Michele Dolfi 2025-03-18 16:41:34 +0100
  • bcc44fcc5b move apify to docs Michele Dolfi 2025-03-18 16:06:25 +0100
  • 2f72167ff6 feat: updated vlm pipeline (with latest changes from docling-core) (#1158) Maxim Lysak 2025-03-18 15:44:51 +0100
  • 1a2a9e4eff chore: bump version to 2.27.0 [skip ci] v2.27.0 github-actions[bot] 2025-03-18 13:37:45 +0000
  • d3be3461d4 Merge from main Christoph Auer 2025-03-18 14:35:46 +0100
  • 6eaae3cba0 feat: add factory for ocr engines via plugins (#1010) Michele Dolfi 2025-03-18 13:58:05 +0100
  • ee13436376 Updated readme Maksym Lysak 2025-03-18 13:36:53 +0100
  • 7302fad3fd Update tests Christoph Auer 2025-03-18 12:41:24 +0100
  • fdaa53a618 add factory return and ignore options type Michele Dolfi 2025-03-18 12:37:40 +0100
  • b2021ac8d8 Merge remote-tracking branch 'origin/main' into feat-factory-plugins Michele Dolfi 2025-03-18 12:30:49 +0100
  • 535adff82b Cleaned up Maksym Lysak 2025-03-18 11:23:58 +0100
  • a2c14545d8 removed unnecessary transformation Maksym Lysak 2025-03-18 11:17:03 +0100
  • 5dffd10eae Added support for force_backend_text parameter Maksym Lysak 2025-03-18 10:14:16 +0100
  • 5c62f88175 satisfying mypy and other checks Maksym Lysak 2025-03-17 16:57:43 +0100
  • c97acfc8e0 re-using DocTagsDocument.from_doctags_and_image_pairs Maksym Lysak 2025-03-17 16:08:49 +0100
  • 8e54299eac preparing to migrate to new doctags deserializer Maksym Lysak 2025-03-17 13:29:07 +0100
  • 46dc2e621f Updated VLM pipeline doctags to docling conversion, now properly supports lists Maksym Lysak 2025-03-13 14:39:55 +0100
  • 6be3805fd0 Draft implementation of Doctag backend Maksym Lysak 2025-03-11 14:02:34 +0100
  • 3960b199d6 feat: Add DoclingParseV4 backend, using high-level docling-parse API (#905) Christoph Auer 2025-03-18 10:38:19 +0100
  • 772487f9c9 feat(actor): Docling Actor on Apify infrastructure (#875) Václav Vančura 2025-03-18 10:17:44 +0100
  • 9b4c2e3fdf add allow_external_plugins option Michele Dolfi 2025-03-18 09:07:28 +0100
  • 66fe0049fb Merge remote-tracking branch 'origin/main' into feat-factory-plugins Michele Dolfi 2025-03-17 18:24:14 +0100
  • 75a03c4257 disable GT generation on test_interfaces cau/dpv4-test-updates Christoph Auer 2025-03-17 11:31:18 +0100
  • 9359f86c6a Merge branch 'cau/docling-parse-api' of github.com:DS4SD/docling into cau/dpv4-test-updates Christoph Auer 2025-03-17 11:17:31 +0100
  • 50ac62b5fa test_input_doc use default backend Christoph Auer 2025-03-17 11:13:42 +0100
  • 7bce91893c Unset DPv1 backend on tests (use DPv4 default), re-generate test output Christoph Auer 2025-03-17 11:04:41 +0100
  • 284eabe1f7 removed unnecessary hasattr check Mislav 2025-03-17 22:43:57 +1300
  • eff907811a Merge branch 'main' of github.com:DS4SD/docling into cau/docling-parse-api Christoph Auer 2025-03-17 10:37:13 +0100
  • 627abcd082 formatted script Mislav 2025-03-17 22:08:18 +1300
  • 7e01798417 docs: fix spelling of picture in usage (#1165) serced 2025-03-17 09:33:51 +0100
  • fe45d30942 Fixes for DPv4 backend init, better test coverage Christoph Auer 2025-03-17 09:26:31 +0100
  • e34c0750a7 Reset all tests to use docling-parse v1 for now Christoph Auer 2025-03-14 16:39:16 +0100
  • 412c013d95 Merge from main Christoph Auer 2025-03-14 13:52:36 +0100
  • d654568ad9 Test all backends, fixes Christoph Auer 2025-03-14 13:32:37 +0100
  • 9d95f0211e Merge branch 'docling-project:main' into main Ulan Yisaev 2025-03-14 14:23:42 +0200
  • af18215714 Rename docling backend to v4 Christoph Auer 2025-03-14 12:35:06 +0100
  • fa16b12316 chore: move to docling-project org (#1160) Michele Dolfi 2025-03-14 12:35:29 +0100
  • b77f73beec Text fixes, new test data Christoph Auer 2025-03-14 11:44:09 +0100
  • 49e42202aa docs: fix spelling of picture in usage serced 2025-03-14 11:05:30 +0100
  • 63f9ad993e revert test content Michele Dolfi 2025-03-14 10:02:37 +0100
  • 2dfdd02eb1 Merge branch 'chore-move-org' of github.com:DS4SD/docling into chore-move-org Michele Dolfi 2025-03-14 09:42:28 +0100
  • 8ef990bad3 update github pages Michele Dolfi 2025-03-14 09:41:29 +0100
  • 6a9d041bfa Actor: Updated main Readme and Actor Readme Adam Kliment 2025-03-13 14:07:39 +0100
  • 3680f0f8c3 Update docs/faq/index.md Michele Dolfi 2025-03-14 08:33:53 +0100
  • 7e8288ca01 chore: rename org Michele Dolfi 2025-03-13 21:02:16 +0100
  • f94da44ec5 fix(html): handle nested empty lists (#1154) Cesar Berrospi Ramis 2025-03-13 16:56:58 +0100
  • e00f362405 Update tests, use TextCell.from_ocr property Christoph Auer 2025-03-13 16:04:08 +0100
  • 2b559fdf2c fix(html): handle nested empty lists Cesar Berrospi Ramis 2025-03-11 13:08:17 +0100
  • 0945973b79 fix: use first table row as col headers (#1156) Panos Vagenas 2025-03-13 15:34:18 +0100
  • 6eb718f849 feat: equations to latex in MSWord backend (with inline groups) (#1114) Rafael Teixeira de Lima 2025-03-13 15:12:22 +0100
  • ae6b6adc3b cleanup old commented code Panos Vagenas 2025-03-13 13:01:27 +0100
  • 4f93903044 fix: use first table row as col headers Panos Vagenas 2025-03-13 12:40:58 +0100
  • 53837fe30e Merge branch 'DS4SD:main' into main Václav Vančura 2025-03-13 11:11:43 +0100
  • ebd323a5e8 Actor: Resolving conflicts with main (pass 2) Václav Vančura 2025-03-13 11:02:08 +0100
  • d7b306231e Actor: Resolving conflicts with main Václav Vančura 2025-03-13 10:56:00 +0100
  • 1c9d8e29b0 Actor: Always output a zip Adam Kliment 2025-03-13 09:37:39 +0100
  • 7cd1f06868 Actor: Fixed input getter Adam Kliment 2025-03-12 14:49:46 +0100
  • 1fe80d3c23 Actor: Removing obsolete actor.json keys Václav Vančura 2025-03-10 08:41:48 +0100
  • 72077c109d Actor: Update CHANGELOG and README for Docker and API changes Václav Vančura 2025-03-09 16:07:17 +0100
  • 5f5c0a9d50 Actor: Refactor actor.sh and add docling_processor.py Václav Vančura 2025-03-09 15:51:39 +0100
  • 7a5dc3c438 Actor: Overhaul the implementation using official docling-serve image Václav Vančura 2025-03-09 14:23:57 +0100
  • 9f86971fad Actor: Replace Docling CLI with docling-serve API Václav Vančura 2025-03-08 17:00:53 +0100
  • 11f2960907 Actor: Add section on Actors to README Václav Vančura 2025-02-07 14:14:45 +0100
  • 193101e52c Actor: Fix the Apify call syntax and final result URL message Václav Vančura 2025-02-07 14:03:39 +0100
  • 531c135899 Actor: Update README with output URL details Václav Vančura 2025-02-07 14:02:56 +0100
  • 5bbd1d34eb Actor: Adding dataset schema Václav Vančura 2025-02-07 13:12:37 +0100
  • 3cdb1b31c7 Actor: Adding CHANGELOG.md Václav Vančura 2025-02-07 13:12:26 +0100
  • 3245e1b8b7 Actor: Enhance README.md with output details Václav Vančura 2025-02-07 13:11:55 +0100