Christoph Auer
5cd0fdd258
chore: more cleanup
...
Signed-off-by: Christoph Auer <cau@zurich.ibm.com>
2025-02-26 12:48:14 +01:00
Christoph Auer
c5873f2496
chore: clean up code and comments
...
Signed-off-by: Christoph Auer <cau@zurich.ibm.com>
2025-02-26 12:47:10 +01:00
Christoph Auer
10f64a948c
Expose control over using flash_attention_2
...
Signed-off-by: Christoph Auer <cau@zurich.ibm.com>
2025-02-25 15:31:32 +01:00
Christoph Auer
341806e54b
Rename example
...
Signed-off-by: Christoph Auer <cau@zurich.ibm.com>
2025-02-25 13:43:05 +01:00
Christoph Auer
1cba96ecfd
Generalize and refactor VLM pipeline and models
...
Signed-off-by: Christoph Auer <cau@zurich.ibm.com>
2025-02-25 13:38:44 +01:00
Maksym Lysak
a7a1f32b10
Added example on how to get original predicted doctags in minimal_smol_docling
...
Signed-off-by: Maksym Lysak <mly@zurich.ibm.com>
2025-02-24 14:39:18 +01:00
Maksym Lysak
853544ba11
Addressing PR comments, added enabled property to SmolDocling and the related VLM pipeline option, plus a few other minor things
...
Signed-off-by: Maksym Lysak <mly@zurich.ibm.com>
2025-02-24 13:46:47 +01:00
Christoph Auer
55fa4eb4e3
Fix repo id
...
Signed-off-by: Christoph Auer <cau@zurich.ibm.com>
2025-02-24 13:20:05 +01:00
Christoph Auer
6f9f4f4aee
Update minimal smoldocling example
...
Signed-off-by: Christoph Auer <cau@zurich.ibm.com>
2025-02-24 13:18:25 +01:00
Maksym Lysak
d7abe1b1cd
Updated example of Smol Docling usage
...
Signed-off-by: Maksym Lysak <mly@zurich.ibm.com>
2025-02-24 13:15:19 +01:00
Maksym Lysak
7c4ab5c716
Moved artifacts_path for SmolDocling into vlm_options instead of global pipeline option
...
Signed-off-by: Maksym Lysak <mly@zurich.ibm.com>
2025-02-24 13:15:19 +01:00
Maksym Lysak
f2751e11f9
Introduced SmolDoclingOptions to configure model parameters (such as query and artifacts path) via client code, see example in minimal_smol_docling. Provisioning for other potential vlm all-in-one models.
...
Signed-off-by: Maksym Lysak <mly@zurich.ibm.com>
2025-02-24 13:15:15 +01:00
Maksym Lysak
0fe12d819a
Updated vlm pipeline assembly and smol docling model code to support updated doctags
...
Signed-off-by: Maksym Lysak <mly@zurich.ibm.com>
2025-02-24 13:12:55 +01:00
Maksym Lysak
9901729d8c
Exposed "force_backend_text" as pipeline parameter
...
Signed-off-by: Maksym Lysak <mly@zurich.ibm.com>
2025-02-24 13:12:51 +01:00
Maksym Lysak
0dc3ac43b1
Added capability for vlm_pipeline to grab text from preconfigured backend
...
Signed-off-by: Maksym Lysak <mly@zurich.ibm.com>
2025-02-24 12:56:57 +01:00
Maksym Lysak
e0929781f4
Added tokens/sec measurement, improved example
...
Signed-off-by: Maksym Lysak <mly@zurich.ibm.com>
2025-02-24 12:56:57 +01:00
Maksym Lysak
2a43c199d5
Cleaned up logs, added pages to vlm_pipeline, basic timing per page measurement in smol_docling models
...
Signed-off-by: Maksym Lysak <mly@zurich.ibm.com>
2025-02-24 12:56:57 +01:00
Maksym Lysak
61bb9dbba2
Properly propagating image data per page, together with predicted tags in VLM pipeline. This enables correct figure extraction and page numbers in provenances
...
Signed-off-by: Maksym Lysak <mly@zurich.ibm.com>
2025-02-24 12:56:56 +01:00
Maksym Lysak
01c46e24b1
Fix for table span compute in vlm_pipeline
...
Signed-off-by: Maksym Lysak <mly@zurich.ibm.com>
2025-02-24 12:56:56 +01:00
Maksym Lysak
1b968e4984
Fixes to preserve page image and demo export to html
...
Signed-off-by: Maksym Lysak <mly@zurich.ibm.com>
2025-02-24 12:56:56 +01:00
Maksym Lysak
3c4c647615
WIP, first working code for inference of SmolDocling, and vlm pipeline assembly code, example included.
...
Signed-off-by: Maksym Lysak <mly@zurich.ibm.com>
2025-02-24 12:56:56 +01:00
Maksym Lysak
03c8d45790
wip smolDocling inference and vlm pipeline
...
Signed-off-by: Maksym Lysak <mly@zurich.ibm.com>
2025-02-24 12:56:52 +01:00
Christoph Auer
dc3a388aa2
Skeleton for SmolDocling model and VLM Pipeline
...
Signed-off-by: Christoph Auer <cau@zurich.ibm.com>
Signed-off-by: Maksym Lysak <mly@zurich.ibm.com>
2025-02-24 11:46:04 +01:00
Christoph Auer
c93e36988f
feat: Implement new reading-order model ( #916 )
...
* Implement new reading-order model, replacing DS GLM model (WIP)
Signed-off-by: Christoph Auer <cau@zurich.ibm.com>
* Update reading-order model branch
Signed-off-by: Christoph Auer <cau@zurich.ibm.com>
* Update lockfile [skip ci]
Signed-off-by: Christoph Auer <cau@zurich.ibm.com>
* Add captions, footnotes and merges [skip ci]
Signed-off-by: Christoph Auer <cau@zurich.ibm.com>
* Updates for reading-order implementation
Signed-off-by: Christoph Auer <cau@zurich.ibm.com>
* Updates for reading-order implementation
Signed-off-by: Christoph Auer <cau@zurich.ibm.com>
* Update tests and lockfile
Signed-off-by: Christoph Auer <cau@zurich.ibm.com>
* Fixes, update tests
Signed-off-by: Christoph Auer <cau@zurich.ibm.com>
* Add normalization, update tests again
Signed-off-by: Christoph Auer <cau@zurich.ibm.com>
* Update tests with code
Signed-off-by: Christoph Auer <cau@zurich.ibm.com>
* Push final lockfile
Signed-off-by: Christoph Auer <cau@zurich.ibm.com>
* sanitize text
Signed-off-by: Michele Dolfi <dol@zurich.ibm.com>
* Include furniture, update tests with furniture
Signed-off-by: Christoph Auer <cau@zurich.ibm.com>
* Fix content_layer assignment
Signed-off-by: Christoph Auer <cau@zurich.ibm.com>
* chore: Delete empty file docling/models/ds_glm_model.py
Signed-off-by: Nikos Livathinos <nli@zurich.ibm.com>
---------
Signed-off-by: Christoph Auer <cau@zurich.ibm.com>
Signed-off-by: Michele Dolfi <dol@zurich.ibm.com>
Signed-off-by: Nikos Livathinos <nli@zurich.ibm.com>
Co-authored-by: Michele Dolfi <dol@zurich.ibm.com>
Co-authored-by: Nikos Livathinos <nli@zurich.ibm.com>
2025-02-20 17:51:17 +01:00
Panos Vagenas
27c04007bc
docs: revamp picture description example ( #1015 )
...
* docs: revamp picture description example
Signed-off-by: Panos Vagenas <pva@zurich.ibm.com>
* Improvements for visualization example (#1017)
* fix colab install, use granite and improve viz of description
Signed-off-by: Michele Dolfi <dol@zurich.ibm.com>
* switch docs to notebook
Signed-off-by: Michele Dolfi <dol@zurich.ibm.com>
* show results with all models
Signed-off-by: Michele Dolfi <dol@zurich.ibm.com>
* show other vlm
Signed-off-by: Michele Dolfi <dol@zurich.ibm.com>
---------
Signed-off-by: Michele Dolfi <dol@zurich.ibm.com>
---------
Signed-off-by: Panos Vagenas <pva@zurich.ibm.com>
Signed-off-by: Michele Dolfi <dol@zurich.ibm.com>
Co-authored-by: Michele Dolfi <dol@zurich.ibm.com>
2025-02-19 11:28:54 +01:00
Ahmed Nassar
77eb77bdc2
feat: Support cuda:n GPU device allocation ( #694 )
...
* Adding multi-gpu support, and cuda device allocation
Signed-off-by: ahn <ahn@zurich.ibm.com>
* Fixes pydantic exception with cuda:n
Signed-off-by: ahn <ahn@zurich.ibm.com>
* Pydantic field validator and comment restored.
Signed-off-by: ahn <ahn@zurich.ibm.com>
* chore: Accept AcceleratorDevice enum type
Signed-off-by: Christoph Auer <cau@zurich.ibm.com>
* Reset some options to default, removed EasyOCR model wrap.
Signed-off-by: ahn <ahn@zurich.ibm.com>
* Fixed rebase issues
Signed-off-by: ahn <ahn@zurich.ibm.com>
* Revert accelerator test options
Signed-off-by: ahn <ahn@zurich.ibm.com>
---------
Signed-off-by: ahn <ahn@zurich.ibm.com>
Signed-off-by: Christoph Auer <cau@zurich.ibm.com>
Co-authored-by: ahn <ahn@sonny.zuvela.ibm.com>
Co-authored-by: ahn <ahn@zurich.ibm.com>
Co-authored-by: Christoph Auer <cau@zurich.ibm.com>
2025-02-17 11:31:13 +01:00
Cesar Berrospi Ramis
428b656793
feat(xml-jats): parse XML JATS documents ( #967 )
...
* chore(xml-jats): separate authors and affiliations
In XML PubMed (JATS) backend, convert authors and affiliations as they
are typically rendered on PDFs.
Signed-off-by: Cesar Berrospi Ramis <75900930+ceberam@users.noreply.github.com>
* fix(xml-jats): replace new line character by a space
Instead of removing the new line character from text, replace it with a space character.
Signed-off-by: Cesar Berrospi Ramis <75900930+ceberam@users.noreply.github.com>
* feat(xml-jats): improve existing parser and extend features
Partially support lists, respect reading order, parse more sections, support equations, better text formatting.
Signed-off-by: Cesar Berrospi Ramis <75900930+ceberam@users.noreply.github.com>
* chore(xml-jats): rename PubMed objects to JATS
Signed-off-by: Cesar Berrospi Ramis <75900930+ceberam@users.noreply.github.com>
---------
Signed-off-by: Cesar Berrospi Ramis <75900930+ceberam@users.noreply.github.com>
2025-02-17 10:43:31 +01:00
Tobias Strebitzer
00d9405b0a
feat: Add support for CSV input with new backend to transform CSV files to DoclingDocument ( #945 )
...
* feat: Implement csv backend and format detection
Signed-off-by: Tobias Strebitzer <tobias.strebitzer@magloft.com>
* test: Implement csv parsing and format tests
Signed-off-by: Tobias Strebitzer <tobias.strebitzer@magloft.com>
* docs: Add example and CSV format documentation
Signed-off-by: Tobias Strebitzer <tobias.strebitzer@magloft.com>
* feat: Add support for various CSV dialects and update documentation
Signed-off-by: Tobias Strebitzer <tobias.strebitzer@magloft.com>
* feat: Add validation for delimiters and tests for inconsistent csv files
Signed-off-by: Tobias Strebitzer <tobias.strebitzer@magloft.com>
---------
Signed-off-by: Tobias Strebitzer <tobias.strebitzer@magloft.com>
2025-02-14 08:55:09 +01:00
Michele Dolfi
2d66e99b69
docs: Examples for picture descriptions ( #951 )
...
* add more examples for picture descriptions
Signed-off-by: Michele Dolfi <dol@zurich.ibm.com>
* fix merge typo
Signed-off-by: Michele Dolfi <dol@zurich.ibm.com>
---------
Signed-off-by: Michele Dolfi <dol@zurich.ibm.com>
2025-02-13 08:33:12 +01:00
Michele Dolfi
2716c7d4ff
feat: Introduce the enable_remote_services option to allow remote connections while processing ( #941 )
...
* feat: Introduce the allow_remote_services option to allow remote connections while processing
Signed-off-by: Michele Dolfi <dol@zurich.ibm.com>
* add option in the example
Signed-off-by: Michele Dolfi <dol@zurich.ibm.com>
* enhance docs
Signed-off-by: Michele Dolfi <dol@zurich.ibm.com>
* rename to enable_remote_services
Signed-off-by: Michele Dolfi <dol@zurich.ibm.com>
---------
Signed-off-by: Michele Dolfi <dol@zurich.ibm.com>
2025-02-12 15:18:01 +01:00
Michele Dolfi
4cc6e3ea5e
feat: Describe pictures using vision models ( #259 )
...
* draft for picture description models
Signed-off-by: Michele Dolfi <dol@zurich.ibm.com>
* vlm description using AutoModelForVision2Seq
Signed-off-by: Michele Dolfi <dol@zurich.ibm.com>
* add generation options
Signed-off-by: Michele Dolfi <dol@zurich.ibm.com>
* update vlm API
Signed-off-by: Michele Dolfi <dol@zurich.ibm.com>
* allow only localhost traffic
Signed-off-by: Michele Dolfi <dol@zurich.ibm.com>
* rename model
Signed-off-by: Michele Dolfi <dol@zurich.ibm.com>
* do not run with vlm api
Signed-off-by: Michele Dolfi <dol@zurich.ibm.com>
* more renaming
Signed-off-by: Michele Dolfi <dol@zurich.ibm.com>
* fix examples path
Signed-off-by: Michele Dolfi <dol@zurich.ibm.com>
* apply CLI download login
Signed-off-by: Michele Dolfi <dol@zurich.ibm.com>
* fix name of cli argument
Signed-off-by: Michele Dolfi <dol@zurich.ibm.com>
* use with_smolvlm in models download
Signed-off-by: Michele Dolfi <dol@zurich.ibm.com>
---------
Signed-off-by: Michele Dolfi <dol@zurich.ibm.com>
2025-02-07 16:30:42 +01:00
Michele Dolfi
9114ada7bc
fix: Test cases for RTL programmatic PDFs and fixes for the formula model ( #903 )
...
fix: Support for RTL programmatic documents
fix(parser): detect and handle rotated pages
fix(parser): fix bug causing duplicated text
fix(formula): improve stopping criteria
chore: update lock file
fix: temporarily constrain beautifulsoup
* switch to code formula model v1.0.1 and new test pdf
Signed-off-by: Matteo-Omenetti <Matteo.Omenetti1@ibm.com>
* switch to code formula model v1.0.1 and new test pdf
Signed-off-by: Matteo-Omenetti <Matteo.Omenetti1@ibm.com>
* cleaned up the data folder in the tests
Signed-off-by: Peter Staar <taa@zurich.ibm.com>
* switch to code formula model v1.0.1 and new test pdf
Signed-off-by: Matteo-Omenetti <Matteo.Omenetti1@ibm.com>
* added three test-files for right-to-left
Signed-off-by: Peter Staar <taa@zurich.ibm.com>
* fix black
Signed-off-by: Matteo-Omenetti <Matteo.Omenetti1@ibm.com>
* added new gt for test_e2e_conversion
Signed-off-by: Matteo-Omenetti <Matteo.Omenetti1@ibm.com>
* added new gt for test_e2e_conversion
Signed-off-by: Matteo-Omenetti <Matteo.Omenetti1@ibm.com>
* Add code to expose text direction of cell
Signed-off-by: Christoph Auer <cau@zurich.ibm.com>
* new test file
Signed-off-by: Matteo-Omenetti <Matteo.Omenetti1@ibm.com>
* update lock
Signed-off-by: Michele Dolfi <dol@zurich.ibm.com>
* fix mypy reports
Signed-off-by: Michele Dolfi <dol@zurich.ibm.com>
* fix example filepaths
Signed-off-by: Michele Dolfi <dol@zurich.ibm.com>
* add test data results
Signed-off-by: Michele Dolfi <dol@zurich.ibm.com>
* pin wheel of latest docling-parse release
Signed-off-by: Michele Dolfi <dol@zurich.ibm.com>
* use latest docling-core
Signed-off-by: Michele Dolfi <dol@zurich.ibm.com>
* remove debugging code
Signed-off-by: Michele Dolfi <dol@zurich.ibm.com>
* fix path to files in example
Signed-off-by: Michele Dolfi <dol@zurich.ibm.com>
* Revert unwanted RTL additions
Signed-off-by: Christoph Auer <cau@zurich.ibm.com>
* Fix test data paths in examples
Signed-off-by: Christoph Auer <cau@zurich.ibm.com>
---------
Signed-off-by: Matteo-Omenetti <Matteo.Omenetti1@ibm.com>
Signed-off-by: Peter Staar <taa@zurich.ibm.com>
Signed-off-by: Christoph Auer <cau@zurich.ibm.com>
Signed-off-by: Michele Dolfi <dol@zurich.ibm.com>
Co-authored-by: Matteo-Omenetti <Matteo.Omenetti1@ibm.com>
Co-authored-by: Peter Staar <taa@zurich.ibm.com>
Co-authored-by: Christoph Auer <cau@zurich.ibm.com>
2025-02-07 08:43:31 +01:00
Nikos Livathinos
6d3fea0196
docs: Introduce example with custom models for RapidOCR ( #874 )
...
* docs: Introduce example with custom models for RapidOCR
Signed-off-by: Nikos Livathinos <nli@zurich.ibm.com>
* chore: Exclude the example with custom RapidOCR models from the examples to run in github actions
Signed-off-by: Nikos Livathinos <nli@zurich.ibm.com>
---------
Signed-off-by: Nikos Livathinos <nli@zurich.ibm.com>
2025-02-04 10:07:00 +01:00
Christoph Auer
f9144f2bb6
docs: Add example for inspection of picture content ( #624 )
...
* chore: Add example for inspection of picture content
Signed-off-by: Christoph Auer <cau@zurich.ibm.com>
* fix: Test case re-generation
Signed-off-by: Christoph Auer <cau@zurich.ibm.com>
* fix: Test case re-generation only on CPU
Signed-off-by: Christoph Auer <cau@zurich.ibm.com>
* fix: Add missing GT files
Signed-off-by: Christoph Auer <cau@zurich.ibm.com>
---------
Signed-off-by: Christoph Auer <cau@zurich.ibm.com>
2025-01-29 10:39:00 +01:00
Cesar Berrospi Ramis
4d41db3f7a
docs(backend XML): do not delete temp file in notebook ( #817 )
...
Signed-off-by: Cesar Berrospi Ramis <75900930+ceberam@users.noreply.github.com>
2025-01-27 18:53:39 +01:00
Farzad Sunavala
8a4ec77576
docs: typo ( #814 )
...
* Update rag_azuresearch.ipynb
Signed-off-by: Farzad Sunavala <40604067+farzad528@users.noreply.github.com>
* typo
Signed-off-by: Farzad Sunavala <40604067+farzad528@users.noreply.github.com>
---------
Signed-off-by: Farzad Sunavala <40604067+farzad528@users.noreply.github.com>
2025-01-27 11:24:26 +01:00
Farzad Sunavala
b885b2fa3c
docs: added markdown headings to enable TOC in github pages ( #808 )
...
* docs: added markdown headings to enable TOC in github pages
Signed-off-by: Farzad Sunavala <40604067+farzad528@users.noreply.github.com>
* minor renames
Signed-off-by: Farzad Sunavala <40604067+farzad528@users.noreply.github.com>
* part 3 heading
Signed-off-by: Farzad Sunavala <40604067+farzad528@users.noreply.github.com>
---------
Signed-off-by: Farzad Sunavala <40604067+farzad528@users.noreply.github.com>
2025-01-27 09:40:35 +01:00
Cesar Berrospi Ramis
c2ae1cc4ca
docs: description of supported formats and backends ( #788 )
...
* chore: remove type-ignore marks for attaching text to non GroupItems
After commit b74208 of docling-core, text items can be attached to any NodeItem
and therefore the ignore[arg-type] type marks can be removed.
Signed-off-by: Cesar Berrospi Ramis <75900930+ceberam@users.noreply.github.com>
* test: remove unnecessary imports
Signed-off-by: Cesar Berrospi Ramis <75900930+ceberam@users.noreply.github.com>
* docs: add documentation on supported formats and backends
Signed-off-by: Cesar Berrospi Ramis <75900930+ceberam@users.noreply.github.com>
* docs: add notebook example with XML backends
Signed-off-by: Cesar Berrospi Ramis <75900930+ceberam@users.noreply.github.com>
---------
Signed-off-by: Cesar Berrospi Ramis <75900930+ceberam@users.noreply.github.com>
2025-01-26 08:10:33 +01:00
Nikos Livathinos
3be2fb581f
feat: Introduce automatic language detection in TesseractOcrCliModel ( #800 )
...
* feat: Introduce automatic language detection in tesseract_ocr_cli model. Extend unit tests.
Signed-off-by: Nikos Livathinos <nli@zurich.ibm.com>
* docs: Add example of how to use "auto" language with tesseract OCR engines
Signed-off-by: Nikos Livathinos <nli@zurich.ibm.com>
* fix: Refactor the TesseractOcrModel and TesseractOcrCliModel to validate if the auto-detected
language is installed in the system and if not fall back to a default option without language.
Signed-off-by: Nikos Livathinos <nli@zurich.ibm.com>
---------
Signed-off-by: Nikos Livathinos <nli@zurich.ibm.com>
2025-01-26 08:07:56 +01:00
Matteo
3213b247ad
feat: Code and equation model for PDF and code blocks in markdown ( #752 )
...
* propagated changes for new CodeItem class
Signed-off-by: Matteo Omenetti <omenetti.matteo@gmail.com>
* Rebased branch on latest main. changes for CodeItem
Signed-off-by: Matteo Omenetti <omenetti.matteo@gmail.com>
* removed unused files
Signed-off-by: Matteo Omenetti <omenetti.matteo@gmail.com>
* chore: update lockfile
Signed-off-by: Christoph Auer <cau@zurich.ibm.com>
* pin latest docling-core
Signed-off-by: Michele Dolfi <dol@zurich.ibm.com>
* update docling-core pinning
Signed-off-by: Michele Dolfi <dol@zurich.ibm.com>
* pin docling-core
Signed-off-by: Michele Dolfi <dol@zurich.ibm.com>
* use new add_code in backends and update typing in MD backend
Signed-off-by: Michele Dolfi <dol@zurich.ibm.com>
* added if statement for backend
Signed-off-by: Matteo Omenetti <omenetti.matteo@gmail.com>
* removed unused import
Signed-off-by: Matteo Omenetti <omenetti.matteo@gmail.com>
* removed print statements
Signed-off-by: Matteo Omenetti <omenetti.matteo@gmail.com>
* gt for new pdf
Signed-off-by: Matteo Omenetti <omenetti.matteo@gmail.com>
* Update docling/pipeline/standard_pdf_pipeline.py
Co-authored-by: Michele Dolfi <97102151+dolfim-ibm@users.noreply.github.com>
Signed-off-by: Matteo <43417658+Matteo-Omenetti@users.noreply.github.com>
* fixed doc comment of __call__ function of code_formula_model
Signed-off-by: Matteo Omenetti <omenetti.matteo@gmail.com>
* fix artifacts_path type
Signed-off-by: Michele Dolfi <dol@zurich.ibm.com>
* move imports
Signed-off-by: Michele Dolfi <dol@zurich.ibm.com>
* move expansion_factor to base class
Signed-off-by: Michele Dolfi <dol@zurich.ibm.com>
---------
Signed-off-by: Matteo Omenetti <omenetti.matteo@gmail.com>
Signed-off-by: Christoph Auer <cau@zurich.ibm.com>
Signed-off-by: Michele Dolfi <dol@zurich.ibm.com>
Signed-off-by: Matteo <43417658+Matteo-Omenetti@users.noreply.github.com>
Co-authored-by: Christoph Auer <cau@zurich.ibm.com>
Co-authored-by: Michele Dolfi <dol@zurich.ibm.com>
Co-authored-by: Michele Dolfi <97102151+dolfim-ibm@users.noreply.github.com>
2025-01-24 16:54:22 +01:00
Farzad Sunavala
c58f75d0f7
docs: fix minor typos ( #801 )
...
Signed-off-by: Farzad Sunavala <40604067+farzad528@users.noreply.github.com>
2025-01-24 16:27:05 +01:00
Farzad Sunavala
9020a934be
docs: add Azure RAG example ( #675 )
...
Signed-off-by: Panos Vagenas <35837085+vagenas@users.noreply.github.com>
Co-authored-by: Farzad Sunavala <fsunavala@microsoft.com>
2025-01-24 13:56:26 +01:00
Iacopo Ghinassi
768608351d
docs: correct Accelerator pipeline options in docs/examples/custom_convert.py ( #733 )
...
* Update custom_convert.py
Added the missing AcceleratorDevice and AcceleratorOptions functions in the imports and changed Device in the code to the correct AcceleratorDevice
Signed-off-by: Iacopo Ghinassi <45108036+Ighina@users.noreply.github.com>
* apply formatting
Signed-off-by: Michele Dolfi <dol@zurich.ibm.com>
---------
Signed-off-by: Iacopo Ghinassi <45108036+Ighina@users.noreply.github.com>
Signed-off-by: Michele Dolfi <dol@zurich.ibm.com>
Co-authored-by: Michele Dolfi <dol@zurich.ibm.com>
2025-01-19 16:55:26 +01:00
Michele Dolfi
57fc28d3d8
refactor: allow the usage of backends in the enrich models and generalize the interface ( #742 )
...
* fix get image with cropbox
Signed-off-by: Michele Dolfi <dol@zurich.ibm.com>
* allow the usage of backends in the enrich models and generalize the interface
Signed-off-by: Michele Dolfi <dol@zurich.ibm.com>
* move logic in BaseTextImageEnrichmentModel
Signed-off-by: Michele Dolfi <dol@zurich.ibm.com>
* renaming
Signed-off-by: Michele Dolfi <dol@zurich.ibm.com>
---------
Signed-off-by: Michele Dolfi <dol@zurich.ibm.com>
2025-01-15 09:52:38 +01:00
Peter W. J. Staar
f7e1cbf629
docs: Example to translate documents ( #739 )
...
* added example to translate documents
Signed-off-by: Peter Staar <taa@zurich.ibm.com>
* updated the mkdocs
Signed-off-by: Peter Staar <taa@zurich.ibm.com>
* fix PR hooks
Signed-off-by: Michele Dolfi <dol@zurich.ibm.com>
---------
Signed-off-by: Peter Staar <taa@zurich.ibm.com>
Signed-off-by: Michele Dolfi <dol@zurich.ibm.com>
Co-authored-by: Michele Dolfi <dol@zurich.ibm.com>
2025-01-15 06:51:15 +01:00
Panos Vagenas
4fa8028bd8
docs: add LangChain docs ( #717 )
...
Signed-off-by: Panos Vagenas <35837085+vagenas@users.noreply.github.com>
2025-01-09 14:12:05 +01:00
Panos Vagenas
2d24faecd9
docs: add integrations, revamp docs ( #693 )
...
Signed-off-by: Panos Vagenas <35837085+vagenas@users.noreply.github.com>
2025-01-07 14:15:54 +01:00
m-newhauser
2b591f9872
docs: add Weaviate RAG recipe notebook ( #451 )
...
Signed-off-by: Panos Vagenas <35837085+vagenas@users.noreply.github.com>
Co-authored-by: Panos Vagenas <35837085+vagenas@users.noreply.github.com>
2024-12-19 21:57:40 +01:00
Panos Vagenas
fc645ea531
docs: document Haystack & Vectara support ( #628 )
...
Signed-off-by: Panos Vagenas <35837085+vagenas@users.noreply.github.com>
2024-12-19 13:33:02 +01:00
Panos Vagenas
3e599c7bbe
docs: add Haystack RAG example ( #615 )
...
Signed-off-by: Panos Vagenas <35837085+vagenas@users.noreply.github.com>
2024-12-17 14:24:40 +01:00