Commit Graph

  • 313eefebcc docs: add LangChain docs Panos Vagenas 2025-01-09 13:50:47 +0100
  • e64b5a2f62
    fix: allow earlier requests versions (#716) Michele Dolfi 2025-01-09 13:30:40 +0100
  • 5e95b0bd8d allow earlier requests versions Michele Dolfi 2025-01-09 11:21:09 +0100
  • e56abc6c50 ci: pinning llms-txt action version as per PR feedback Selvam Palanimalai 2025-01-08 12:07:35 -0500
  • 9a94b54f6c chore: bump version to 2.15.0 [skip ci] v2.15.0 github-actions[bot] 2025-01-08 12:06:38 +0000
  • 5cb4cf6f19
    fix: Correct scaling of debug visualizations, tune OCR (#700) Christoph Auer 2025-01-08 12:26:44 +0100
  • 592762fac2 chore: Update docling-core Christoph Auer 2025-01-08 09:34:31 +0100
  • 88e86d4235 feat: add support for google ocr Mr.Haddad 2025-01-08 12:10:48 +0300
  • ead396ab40
    docs: specify docstring types (#702) Michele Dolfi 2025-01-08 09:05:18 +0100
  • 8ec3f176eb fix(msexcel): ignore Mypy checking for _find_images_in_sheet function Jiun An Tsai 2025-01-07 15:37:34 +0800
  • 6701f34c85
    docs: add link to rag with granite (#698) Michele Dolfi 2025-01-07 20:01:41 +0100
  • ebb58f7ad4 docs: specify docstring types Michele Dolfi 2025-01-07 19:58:07 +0100
  • f4db1b8cce ci: added action in docs.yml to generate llms.txt Selvam Palanimalai 2025-01-07 13:32:51 -0500
  • 6bbed9cf8e chore: remove unused imports Christoph Auer 2025-01-07 19:30:37 +0100
  • a200705389 fix: Correct scaling of debug visualizations, tune OCR Christoph Auer 2025-01-07 19:29:07 +0100
  • 141fc14336
    Update mkdocs.yml Michele Dolfi 2025-01-07 17:35:28 +0100
  • e1f455bfdb docs: add link to rag with granite Michele Dolfi 2025-01-07 17:06:40 +0100
  • 2f43585f9e docs: fix links between docs pages Michele Dolfi 2025-01-07 17:03:26 +0100
  • 42856fdf79
    fix: Let BeautifulSoup detect the HTML encoding (#695) Christoph Auer 2025-01-07 15:49:28 +0100
  • 49f96f5959 fix: Let BeautifulSoup detect the HTML encoding Christoph Auer 2025-01-07 14:19:08 +0100
  • 2d24faecd9
    docs: add integrations, revamp docs (#693) Panos Vagenas 2025-01-07 14:15:54 +0100
  • d49650c54f
    fix(mspowerpoint): handle invalid images in PowerPoint slides (#650) Jinfeng Sun 2025-01-07 20:58:10 +0800
  • a32aa73b6e docs: add integrations, revamp docs Panos Vagenas 2025-01-07 12:58:04 +0100
  • 0ee849e8bc
    feat: added http header support for document converter and cli (#642) Luke Harrison 2025-01-07 04:15:14 -0500
  • 569038df42
    docs: Add OpenContracts as an integration (#679) JSIV 2025-01-07 04:14:42 -0500
  • 8ffcd767d9
    use pydantic to parse dict Luke Harrison 2025-01-06 10:08:47 -0500
  • e84bd49419
    Update mkdocs.yml JSIV 2025-01-06 07:54:16 -0600
  • 123b7e7194
    Add OpenContracts as an open source project JSIV 2025-01-05 23:00:55 -0600
  • 296cd8c3b9 changes in pyproject.toml matheus 2024-12-28 17:42:25 -0300
  • e05c19962c changes matheus 2024-12-28 14:32:02 -0300
  • 11e7bda477 add CSV backend support matheus 2024-12-28 14:23:44 -0300
  • 67c6d48179 Adding CSV backend support matheus 2024-12-28 14:15:47 -0300
  • 4e17a51cf6 MO-01 - Adding CSV backend support matheus 2024-12-28 14:14:46 -0300
  • 447802b5d1
    Atualizar o README.md Matheus Moura 2024-12-27 23:04:53 -0300
  • abdb9248f1
    Update README.md Matheus Moura 2024-12-27 23:02:34 -0300
  • 9c360181eb
    Update Docowling Matheus Moura 2024-12-27 23:02:14 -0300
  • 561a5d2ad4
    Update README.md Matheus Moura 2024-12-27 22:46:25 -0300
  • 4866915e7b
    Update README.md Matheus Moura 2024-12-27 22:44:44 -0300
  • ed658b6c5c
    Update README.md Matheus Moura 2024-12-27 22:42:45 -0300
  • d8a95961ef
    Update README.md Matheus Moura 2024-12-27 22:39:58 -0300
  • ec22321bc8
    Update README.md Matheus Moura 2024-12-27 22:06:17 -0300
  • 7148d98764
    Update README.md Matheus Moura 2024-12-27 22:04:03 -0300
  • 8a3365804a
    Add readme image Matheus Moura 2024-12-27 22:03:12 -0300
  • 6ccb246fcf
    Add files via upload Matheus Moura 2024-12-27 22:00:27 -0300
  • 2cf016a46c
    Update LICENSE Matheus Moura 2024-12-27 20:21:25 -0300
  • 8a61f25eef
    Update README.md Matheus Moura 2024-12-27 20:17:50 -0300
  • 9683245c54 fix(mspowerpoint): handle invalid images in PowerPoint slides Tendo33 2024-12-24 17:52:50 +0800
  • f3d9c3bfc9
    fixed formatting and typing issues Luke Harrison 2024-12-21 06:48:35 -0500
  • ea4e92527d
    added http header support for document converter and cli Luke Harrison 2024-12-21 06:13:28 -0500
  • 2b591f9872
    docs: add Weaviate RAG recipe notebook (#451) m-newhauser 2024-12-19 14:57:40 -0600
  • eccea1ea9a docs: add Weaviate RAG example Panos Vagenas 2024-12-19 21:26:16 +0100
  • faa80e3325 docs: add Weaviate RAG example Panos Vagenas 2024-12-19 20:58:50 +0100
  • fc645ea531
    docs: document Haystack & Vectara support (#628) Panos Vagenas 2024-12-19 13:33:02 +0100
  • 7f9464b399 feat: Enable markdown text formatting for docx SimJeg 2024-12-19 11:51:19 +0100
  • a0d4f5da04 docs: document Haystack & Vectara support Panos Vagenas 2024-12-19 09:22:20 +0100
  • 687c469c6c chore: Add example for inspection of picture content Christoph Auer 2024-12-18 16:09:07 +0100
  • 1418fa1488 chore: bump version to 2.14.0 [skip ci] v2.14.0 github-actions[bot] 2024-12-18 07:04:47 +0000
  • fd034802b6
    feat: Create a backend to transform PubMed XML files to DoclingDocument (#557) Lucas Morin 2024-12-17 19:27:09 +0100
  • 2490a09626 feat: Create a backend to transform PubMed XML files to DoclingDocument lucas-morin 2024-12-10 14:17:34 +0100
  • e31f09f71f chore: bump version to 2.13.0 [skip ci] v2.13.0 github-actions[bot] 2024-12-17 17:01:04 +0000
  • 60dc852f16
    feat: Updated Layout processing with forms and key-value areas (#530) Christoph Auer 2024-12-17 17:32:24 +0100
  • 7649ba7a76 Comment cleanup Christoph Auer 2024-12-17 17:13:38 +0100
  • 2c2026d3d2 Merge branch 'main' of github.com:DS4SD/docling into release_v3 Christoph Auer 2024-12-17 16:43:16 +0100
  • dca32bf28e Updated test GT for legacy Christoph Auer 2024-12-17 16:42:45 +0100
  • 00dec7a2f3
    test: generate file from CLI in a temporary directory (#618) Cesar Berrospi Ramis 2024-12-17 16:35:42 +0100
  • 4e087504cc
    feat: create a backend to parse USPTO patents into DoclingDocument (#606) Cesar Berrospi Ramis 2024-12-17 16:35:23 +0100
  • 6d38c7cc75 Annoying fixes for historical python versions Christoph Auer 2024-12-17 16:31:59 +0100
  • 94735ec9c4 chore: add safe initialization of PatentUsptoDocumentBackend Cesar Berrospi Ramis 2024-12-17 13:42:44 +0100
  • 89c84ff749 refactor: group XML backend parsers in a subfolder Cesar Berrospi Ramis 2024-12-17 11:31:11 +0100
  • e8248a8fdf test: generate file from CLI in a temporary directory Cesar Berrospi Ramis 2024-12-17 15:28:37 +0100
  • d29a245b8c Update docling-core pinning Christoph Auer 2024-12-17 14:44:50 +0100
  • 8ee1ba455c refactor: address several input formats with same mime type Cesar Berrospi Ramis 2024-12-13 17:00:29 +0100
  • c957901239 refactor: change the name of the USPTO input format Cesar Berrospi Ramis 2024-12-13 15:06:47 +0100
  • 0a35f45092 feat: add USPTO backend parser Cesar Berrospi Ramis 2024-12-11 17:16:50 +0100
  • 848355d74e feat: add PATENT_USPTO as input format Cesar Berrospi Ramis 2024-12-02 12:48:09 +0100
  • 8243325844 Update test GT Christoph Auer 2024-12-17 14:32:17 +0100
  • 6c8c625ce1 Roll back CLI changes from main Christoph Auer 2024-12-17 14:30:13 +0100
  • 3e599c7bbe
    docs: add Haystack RAG example (#615) Panos Vagenas 2024-12-17 14:24:40 +0100
  • b7f94183f1 Merge branch 'main' of github.com:DS4SD/docling into release_v3 cau/new-layout-processing Christoph Auer 2024-12-17 14:07:58 +0100
  • 8f09fcdf59 Merge branch 'main' of github.com:DS4SD/docling into release_v3 Christoph Auer 2024-12-17 14:07:58 +0100
  • 4d91d389c7 docs: add Haystack RAG example Panos Vagenas 2024-12-17 13:58:44 +0100
  • ec554cb4f2 Adjust confidence in EasyOcr Christoph Auer 2024-12-17 13:45:59 +0100
  • c5d1fbf208 Adjust confidence in EasyOcr Christoph Auer 2024-12-17 13:45:59 +0100
  • 1f5b1d46ab feat: Add Easyocr parameter recog_network (#613) itsainii 2024-12-17 16:47:18 +0800
  • 3b53bd38c8
    feat: Add Easyocr parameter recog_network (#613) itsainii 2024-12-17 16:47:18 +0800
  • 556cfde1e5
    Merge branch 'DS4SD:main' into add_easyocr_parameter itsainii 2024-12-17 15:15:51 +0800
  • 8d6dbd61c1 Add Easyocr recog_network parameter itsainii 2024-12-17 13:55:10 +0800
  • 44f74b4387 Update pipeline_options.py itsainii 2024-12-16 16:06:22 +0800
  • 029d777794 Update easyocr_model.py itsainii 2024-12-16 16:04:55 +0800
  • be2d174fcd
    Merge branch 'DS4SD:main' into main itsainii 2024-12-17 13:40:24 +0800
  • cf2606825a docs: Fix the path to the run_with_accelerator.py example (#608) Nikos Livathinos 2024-12-16 15:03:06 +0100
  • 3bb3bf5715
    docs: Fix the path to the run_with_accelerator.py example (#608) Nikos Livathinos 2024-12-16 15:03:06 +0100
  • 0fd50e53be Fix form and key value area groups Christoph Auer 2024-12-16 15:01:27 +0100
  • bed8fc81bd Fix form and key value area groups Christoph Auer 2024-12-16 15:01:27 +0100
  • efc25225ac Introduce OCR confidence, propagate to orphan in post-processing Christoph Auer 2024-12-16 14:42:01 +0100
  • 18e01f610e Introduce OCR confidence, propagate to orphan in post-processing Christoph Auer 2024-12-16 14:42:01 +0100
  • 142644db72 fix: Fix the path to the run_with_accelerator.py example Nikos Livathinos 2024-12-16 14:35:11 +0100
  • c020f2cba3 Rebase from main Christoph Auer 2024-12-16 11:26:24 +0100
  • 81bf033052 Rebase from main Christoph Auer 2024-12-16 11:26:24 +0100
  • 424c422439
    Update pipeline_options.py itsainii 2024-12-16 16:06:22 +0800