Commit Graph

5 Commits

Author SHA1 Message Date
Michele Dolfi
85b097677e cleanup
Signed-off-by: Michele Dolfi <dol@zurich.ibm.com>
2024-09-03 14:13:28 +02:00
Michele Dolfi
6f9805b08c renaming
Signed-off-by: Michele Dolfi <dol@zurich.ibm.com>
2024-09-03 14:04:31 +02:00
Michele Dolfi
46082e99b6 add loading into HF datasets library
Signed-off-by: Michele Dolfi <dol@zurich.ibm.com>
2024-09-03 14:03:43 +02:00
Michele Dolfi
6b84adebfa create a single parquet output
Signed-off-by: Michele Dolfi <dol@zurich.ibm.com>
2024-08-30 16:24:42 +02:00
Michele Dolfi
3e789dfbdd feat: export document pages as multimodal output
Signed-off-by: Michele Dolfi <dol@zurich.ibm.com>
2024-08-30 14:14:46 +02:00