mirror of
https://github.com/DS4SD/docling.git
synced 2025-12-08 20:58:11 +00:00
Commit Graph
-
0240ae2930
Pass nested clusters through GLM as payload
Christoph Auer
2024-12-03 13:58:27 +01:00 -
4dcc738b6d
Pass nested cluster processing through full pipeline
Christoph Auer
2024-12-03 13:08:45 +01:00 -
34c7c79858
fix: improve handling of disallowed formats (#429)
Christoph Auer
2024-12-03 12:45:32 +01:00 -
0be736227f
fix: improve handling of disallowed formats (#429)
Christoph Auer
2024-12-03 12:45:32 +01:00 -
2254845da3
chore: bump version to 2.8.2 [skip ci]
v2.8.2
github-actions[bot]
2024-12-03 10:47:29 +00:00 -
25a0fa38d1
chore: bump version to 2.8.2 [skip ci]
github-actions[bot]
2024-12-03 10:47:29 +00:00 -
672962a8b2
chore: update numpy lock (#500)
Michele Dolfi
2024-12-03 11:21:31 +01:00 -
9f35e368f6
chore: update numpy lock (#500)
Michele Dolfi
2024-12-03 11:21:31 +01:00 -
c90c41c391
fix: ParserError EOF inside string (#470) (#472)
guglie
2024-12-03 11:21:18 +01:00 -
a7e3f713bb
fix: ParserError EOF inside string (#470) (#472)
guglie
2024-12-03 11:21:18 +01:00 -
5ba3807f31
docs: add styling for faq (#502)
Michele Dolfi
2024-12-03 11:20:49 +01:00 -
a01cedbb69
docs: add styling for faq (#502)
Michele Dolfi
2024-12-03 11:20:49 +01:00 -
051789d017
perf: prevent temp file leftovers, reuse core type (#487)
Panos Vagenas
2024-12-03 10:40:28 +01:00 -
418d8159bd
perf: prevent temp file leftovers, reuse core type (#487)
Panos Vagenas
2024-12-03 10:40:28 +01:00 -
7245cc6080
Implement hierachical cluster layout processing
Christoph Auer
2024-12-03 10:28:36 +01:00 -
d3f84b2457
fix: PermissionError when using tesseract_ocr_cli_model (#496)
Gaspard Petit
2024-12-03 04:22:03 -05:00 -
32e9b4a2cf
fix: PermissionError when using tesseract_ocr_cli_model (#496)
Gaspard Petit
2024-12-03 04:22:03 -05:00 -
e0cf80a919
Upgraded Layout Postprocessing, sending old code back to ERZ
Christoph Auer
2024-12-02 16:21:14 +01:00 -
33cff98d36
docs: typo in faq (#484)
Álvaro Huertas
2024-12-02 10:35:24 +01:00 -
6ca85993f4
docs: typo in faq (#484)
Álvaro Huertas
2024-12-02 10:35:24 +01:00 -
d4872103b8
docs: add automatic api reference (#475)
Michele Dolfi
2024-12-02 09:55:52 +01:00 -
048031d32b
docs: add automatic api reference (#475)
Michele Dolfi
2024-12-02 09:55:52 +01:00 -
8ccb3c6db6
docs: introduce faq section (#468)
Michele Dolfi
2024-11-29 22:34:56 +01:00 -
0e0360a37b
docs: introduce faq section (#468)
Michele Dolfi
2024-11-29 22:34:56 +01:00 -
cc46c938b6
chore: bump version to 2.8.1 [skip ci]
v2.8.1
github-actions[bot]
2024-11-29 13:04:48 +00:00 -
1d81b85443
chore: bump version to 2.8.1 [skip ci]
github-actions[bot]
2024-11-29 13:04:48 +00:00 -
dd8de46267
fix(cli): expose debug options (#467)
Michele Dolfi
2024-11-29 13:25:58 +01:00 -
7bd432496a
fix(cli): expose debug options (#467)
Michele Dolfi
2024-11-29 13:25:58 +01:00 -
af63818df5
fix: remove unused deps (#466)
Michele Dolfi
2024-11-29 13:18:06 +01:00 -
861b6a6499
fix: remove unused deps (#466)
Michele Dolfi
2024-11-29 13:18:06 +01:00 -
84c46fdeb3
docs: extend integration docs & README (#456)
Panos Vagenas
2024-11-28 09:41:21 +01:00 -
9d8d698921
docs: extend integration docs & README (#456)
Panos Vagenas
2024-11-28 09:41:21 +01:00 -
211f4f7570
chore: bump version to 2.8.0 [skip ci]
v2.8.0
github-actions[bot]
2024-11-27 13:29:32 +00:00 -
20a2cd0f53
chore: bump version to 2.8.0 [skip ci]
github-actions[bot]
2024-11-27 13:29:32 +00:00 -
85b29990be
feat(ocr): added support for RapidOCR engine (#415)
Swaymaw
2024-11-27 18:27:41 +05:30 -
767563bf8b
fix: use correct image index in word backend (#442)
Manuel030
2024-11-27 13:45:07 +01:00 -
29807a2d68
fix: Update tests and examples for docling-core 2.5.1 (#449)
Christoph Auer
2024-11-27 13:07:00 +01:00 -
6666d9ec07
chore: bump version to 2.7.1 [skip ci]
v2.7.1
github-actions[bot]
2024-11-26 15:01:33 +00:00 -
d0a1180478
fix: Fixes for wordx (#432)
Maxim Lysak
2024-11-26 14:44:43 +01:00 -
d7072b4b56
fix: force pydantic < 2.10.0 (#407)
Michele Dolfi
2024-11-22 08:23:11 +01:00 -
2a1d3fd221
chore: update the README (#409)
Peter W. J. Staar
2024-11-21 17:28:53 +01:00 -
7a45b92078
docs: add DocETL, Kotaemon, spaCy integrations; minor docs improvements (#408)
Panos Vagenas
2024-11-21 17:23:04 +01:00 -
97d571af97
chore: add downloads in README, security policy and update ci actions (#401)
Michele Dolfi
2024-11-21 13:59:45 +01:00 -
eb64f6d368
chore: bump version to 2.7.0 [skip ci]
v2.7.0
github-actions[bot]
2024-11-20 15:36:51 +00:00 -
7b013abcf3
fix: python3.9 support (#396)
Michele Dolfi
2024-11-20 15:21:40 +01:00 -
6efa96c983
feat: add support for ocrmac OCR engine on macOS (#276)
nuridol
2024-11-20 20:51:19 +09:00 -
32ebf55e33
fix: propagate document limits to converter (#388)
Michele Dolfi
2024-11-20 08:36:51 +01:00 -
2cfaceb787
chore: bump version to 2.6.0 [skip ci]
v2.6.0
github-actions[bot]
2024-11-19 16:07:34 +00:00 -
3f91e7d3f1
feat: added support for exporting DocItem to an image when page image is available (#379)
Shubham Gupta
2024-11-19 16:28:52 +01:00 -
911c3bda27
docs: fixed typo in v2 example v2 (#378)
Gaspard Petit
2024-11-19 10:27:19 -05:00 -
ed785ea122
feat: expose ocr-lang in CLI (#375)
Michele Dolfi
2024-11-19 15:58:49 +01:00 -
926dfd29d5
feat: added excel backend (#334)
Peter W. J. Staar
2024-11-19 12:21:17 +01:00 -
e6f89d520f
chore: update lock of deps (#371)
Michele Dolfi
2024-11-19 10:23:59 +01:00 -
7368013669
reformatted the code
Peter Staar
2024-11-19 06:31:57 +01:00 -
8c42f760a2
merged with main and resolved all conflicts
Peter Staar
2024-11-19 06:26:42 +01:00 -
7a97d7119f
feat: Extracting picture data for raster images found in PPTX (#349)
Maxim Lysak
2024-11-18 15:22:28 +01:00 -
7dbdbdeaf3
ci: fix mergify (#350)
Michele Dolfi
2024-11-15 17:13:01 +01:00 -
364d37ca96
ci(Mergify): configuration update (#339)
Michele Dolfi
2024-11-15 13:18:33 +01:00 -
ca8524ecae
docs: add automatic generation of CLI reference (#325)
Michele Dolfi
2024-11-15 13:18:17 +01:00 -
25fd149c38
docs: add architecture outline (#341)
Panos Vagenas
2024-11-15 12:52:41 +01:00 -
835e077b02
docs: fix parameter in usage.md (#332)
Carl
2024-11-15 09:24:15 +01:00 -
8533039b0c
fix: Fixing images in the input Word files (#330)
Maxim Lysak
2024-11-14 13:33:34 +01:00 -
bf2a85f1d4
chore: fix Qdrant notebook Colab link (#319)
Panos Vagenas
2024-11-14 10:42:02 +01:00 -
f4fc6cfd4a
added TableFormerMode.ACCURATE as default in cli
Peter Staar
2024-11-14 07:45:36 +01:00 -
8b437adcde
fix: reduce logging by keeping option for more verbose (#323)
Michele Dolfi
2024-11-13 10:08:24 +01:00 -
5a44236ac2
chore: bump version to 2.5.2 [skip ci]
v2.5.2
github-actions[bot]
2024-11-13 08:19:09 +00:00 -
c9341bf22e
fix: skip glm model downloads (#322)
Michele Dolfi
2024-11-13 08:45:28 +01:00 -
2c0c439a44
chore: bump version to 2.5.1 [skip ci]
v2.5.1
github-actions[bot]
2024-11-12 14:56:34 +00:00 -
fb8ba861e2
fix: Handling of single-cell tables in DOCX backend (#314)
Maxim Lysak
2024-11-12 15:20:55 +01:00 -
7f5d35ea3c
docs: Hybrid RAG with Qdrant (#312)
Anush
2024-11-12 19:48:14 +05:30 -
93fc1be61a
docs: add Data Prep Kit integration (#316)
Panos Vagenas
2024-11-12 12:21:48 +01:00 -
777237ebc9
chore: bump version to 2.5.0 [skip ci]
v2.5.0
github-actions[bot]
2024-11-12 10:19:55 +00:00 -
5d4a10b121
fix: Configure env prefix for docling settings (#315)
Christoph Auer
2024-11-12 10:57:16 +01:00 -
c6b3763ecb
feat(OCR): Introduce the OcrOptions.force_full_page_ocr parameter that forces a full page OCR scanning (#290)
Nikos Livathinos
2024-11-12 09:46:14 +01:00 -
81c8243a8b
fix: Added handling of grouped elements in pptx backend (#307)
Maxim Lysak
2024-11-11 16:38:21 +01:00 -
53bf2d1790
Added handling of code blocks in html with <pre> tag (#302)
Maxim Lysak
2024-11-11 15:00:11 +01:00 -
1239ade275
docs: add navigation indices (#305)
Panos Vagenas
2024-11-11 14:49:06 +01:00 -
97f214efdd
fix: allow mps usage for easyocr (#286)
Michele Dolfi
2024-11-10 14:26:17 +01:00 -
be8aa17291
chore: bump version to 2.4.2 [skip ci]
v2.4.2
github-actions[bot]
2024-11-08 16:31:47 +00:00 -
0eb065e9b6
fix(EasyOcrModel): Support the use_gpu pipeline parameter in EasyOcrModel. Initialize easyocr (#282)
Nikos Livathinos
2024-11-08 16:48:41 +01:00 -
118f162e64
chore: bump version to 2.4.1 [skip ci]
v2.4.1
github-actions[bot]
2024-11-08 12:37:36 +00:00 -
704d792a79
fix(tesserocr): Raise Exception if tesserocr has not loaded any languages (#279)
Nikos Livathinos
2024-11-08 13:03:09 +01:00 -
9e54a74410
another fix to the tests
Peter Staar
2024-11-08 12:48:53 +01:00 -
311640fb9d
reformatted the code
Peter Staar
2024-11-08 05:41:09 +01:00 -
5c82ff9890
fixed the tests
Peter Staar
2024-11-07 05:15:13 +01:00 -
b154d4f2d7
updated ground-truth
Peter Staar
2024-11-06 10:55:18 +01:00 -
0a5817a36e
updated the html tests (2)
Peter Staar
2024-11-06 05:46:09 +01:00 -
c7b9792d6b
updated the html tests
Peter Staar
2024-11-06 05:44:50 +01:00 -
6c22cba0a7
chore: add issue templates (#251)
Panos Vagenas
2024-11-05 23:18:20 +01:00 -
c3098e3c12
chore: fix typo (#241)
Ikko Eltociear Ashimine
2024-11-06 00:20:04 +09:00 -
a84ec276b0
docs: update badges & credits (#248)
Panos Vagenas
2024-11-05 13:57:06 +01:00 -
90836db90a
fix: Dockerfile example copy command (#234)
Anthony R
2024-11-05 12:48:27 +01:00 -
5ce02c5c59
docs: add coming-soon section (#235)
Panos Vagenas
2024-11-05 08:53:02 +01:00 -
d5e65aedac
docs: add artifacts-path param to CLI (#233)
Panos Vagenas
2024-11-05 08:51:21 +01:00 -
ddd1474c8d
reformatted the code
Peter Staar
2024-11-05 07:25:21 +01:00 -
3257034631
replace new lines and double spaces in list-items with single spaces
Peter Staar
2024-11-05 07:24:31 +01:00 -
f276c0cc90
updated the html backend to add svg, remove empty list-items and use data-content fields
Peter Staar
2024-11-05 06:37:43 +01:00 -
e30a9c25a2
chore: bump version to 2.4.0 [skip ci]
v2.4.0
github-actions[bot]
2024-11-04 15:11:09 +00:00 -
862d78d271
chore: update pyproject.toml metadata (#229)
Panos Vagenas
2024-11-04 15:48:00 +01:00 -
eeee3b4371
docs: add explicit artifacts path example (#224)
Panos Vagenas
2024-11-04 14:27:56 +01:00