mirror of https://github.com/DS4SD/docling.git
synced 2025-12-08 12:48:28 +00:00
Commit Graph
Select branches
Hide Pull Requests
add-json-export-indentation
adr-model-stages
cau/dpv4-test-updates
cau/fix-layout-vlm-pipeline-artifacts-path
cau/layout-vlm-pipeline-page-images
cau/multi-stage-vlm-pipeline
cau/new-layout-processing
cau/pin-docling-parse-pre-3.2
cau/test-dp-word-lines
cau/test-pypdfium2-beta
copilot/fix-document-timeout-bug
copilot/fix-keyerror-in-docling
copilot/fix-page-range-bug
cp_main_20250602
demo
dev-granite-docling-table
dev/add-asr-pipeline
dev/add-granite-docling-extension
dev/add-granite-docling-preview
dev/add-r2l-tests
dev/add-reading-order-model
dev/add-two-stage-vlm
dev/analysis-for-granite-docling
dev/doctag_backend
dev/fix_msword_backend_identify_text_after_image
dev/table-orientation
dev/update-html-parser-with-h1
dev/update-to-latest-docling-parse-again
docs/add-extraction-script
elh/update_2stage_inference
extend-metadata-in-examples
gh-pages
main
mao/doctags
mly/smol-docling-integration
nli/fix_glm_utils
nli/fix_ocr_tests
nli/layout_dfine
nli/layout_heron2
nli/layout_rtdetr_v2
nli/layoutmodel_improvements
nli/tesseract_ocr_models
ocr-enrichment
pretest-core-2-51-0
propagate-core-fixes-20250502
remodel-lists-2
revert-803-refactor_viz
rtdl/docx_latex
rtdl/drawingml_import
vku/uspto_meta
#1
#10
#100
#101
#1010
#1015
#1017
#102
#1021
#1024
#1027
#103
#1038
#1039
#1040
#1041
#1051
#1052
#1053
#1054
#1055
#1057
#1061
#1062
#1077
#1096
#1097
#1098
#11
#110
#1100
#1106
#1107
#111
#1114
#1115
#1118
#1124
#1130
#1140
#1141
#1147
#1150
#1152
#1154
#1156
#1158
#1160
#1165
#1167
#117
#1173
#118
#1182
#1183
#1194
#1196
#1197
#1199
#12
#120
#1201
#121
#1210
#122
#1220
#1222
#1223
#123
#1231
#1238
#1239
#1241
#1244
#1247
#1248
#1261
#1263
#1268
#1270
#1286
#129
#1294
#1295
#13
#131
#1313
#1315
#1316
#1319
#132
#1320
#1326
#1328
#1332
#1334
#1337
#134
#1340
#1346
#135
#1350
#1355
#1359
#1363
#1371
#1375
#1377
#1378
#1379
#138
#1381
#1382
#1383
#1389
#139
#1392
#1399
#14
#140
#1400
#1402
#141
#1411
#1415
#1416
#1419
#1427
#1428
#143
#1430
#1436
#1442
#1449
#145
#1458
#1459
#1463
#1465
#1486
#149
#1490
#1492
#1494
#1496
#15
#150
#1500
#151
#1511
#1512
#152
#1520
#1523
#1524
#1525
#1526
#1527
#1528
#153
#1530
#1536
#1538
#154
#1548
#1549
#155
#1551
#1553
#1556
#1559
#156
#1560
#1561
#1563
#1566
#157
#1570
#1576
#1577
#158
#1582
#1583
#1587
#1589
#159
#1593
#1596
#16
#160
#1600
#1609
#161
#1610
#1615
#1617
#1619
#162
#1636
#164
#1658
#1659
#1660
#1663
#1664
#1665
#1667
#1671
#1673
#1676
#1679
#168
#1683
#1684
#1688
#1689
#169
#1691
#1698
#17
#170
#1700
#1701
#1706
#1707
#171
#1711
#1717
#1718
#1723
#1724
#1725
#1728
#173
#1734
#1735
#1745
#1746
#1747
#175
#1759
#1763
#1769
#177
#1772
#1775
#178
#179
#1791
#1795
#18
#180
#1802
#1804
#1808
#1810
#1812
#1815
#1816
#1819
#182
#1820
#1821
#1824
#1825
#1827
#183
#1836
#1838
#184
#1844
#1850
#1851
#1852
#1856
#1857
#186
#1863
#1866
#1867
#187
#1870
#1874
#1875
#1876
#188
#1884
#189
#1897
#1898
#1898
#19
#190
#1902
#1904
#1905
#1907
#1908
#1910
#1912
#1914
#1917
#1923
#1925
#1926
#1928
#193
#1931
#1934
#1937
#194
#1940
#1943
#1948
#1951
#1952
#196
#1960
#1969
#1970
#1971
#1975
#1981
#1982
#1984
#1986
#1988
#1989
#1992
#1995
#2
#20
#2001
#2002
#2006
#2011
#2017
#2018
#2024
#203
#2031
#2039
#2042
#2048
#2061
#2068
#2069
#2078
#2079
#2083
#2084
#2084
#2088
#2093
#2094
#2095
#2095
#21
#2100
#2105
#2106
#2110
#2111
#2112
#2113
#2114
#2114
#2122
#2123
#2124
#2126
#2131
#2132
#2133
#2138
#214
#2141
#2146
#2154
#2155
#2165
#2166
#2169
#217
#217
#2171
#2178
#218
#218
#2183
#2185
#2187
#219
#2199
#22
#2200
#2208
#2212
#2218
#2219
#2227
#2227
#2231
#2234
#2237
#2238
#224
#2242
#2244
#2251
#2252
#226
#2262
#2264
#2265
#2266
#2272
#228
#2281
#2284
#2284
#2287
#2288
#229
#2291
#2294
#2304
#2309
#2313
#2315
#2322
#2323
#2324
#233
#2339
#234
#2340
#2341
#235
#2357
#2359
#2361
#2365
#2366
#2371
#2372
#2373
#2378
#2378
#2382
#2383
#2388
#2391
#2394
#240
#2401
#2403
#2403
#2404
#2407
#2409
#2409
#241
#2410
#2411
#2413
#2415
#2418
#2420
#2421
#2422
#2423
#2424
#2425
#2426
#2427
#2429
#2430
#2431
#2433
#2436
#2441
#2442
#2445
#2445
#2447
#2452
#2453
#2454
#2458
#2459
#2468
#2473
#2474
#248
#2484
#2486
#2488
#2488
#2489
#2498
#2499
#2501
#2502
#2503
#251
#2511
#2512
#2513
#2517
#2519
#2520
#2521
#2526
#2527
#2530
#2531
#2533
#2541
#2543
#2546
#2548
#2549
#2553
#2563
#2569
#2571
#2573
#2578
#2582
#2585
#2587
#2587
#2588
#2589
#259
#2596
#2599
#26
#2600
#2605
#2613
#2618
#2622
#2622
#2624
#2627
#2636
#2637
#2638
#2639
#2640
#2641
#2644
#2645
#2645
#2648
#2649
#2651
#2653
#2656
#2658
#2659
#2660
#2662
#2664
#2665
#2669
#2671
#2674
#2676
#2676
#2678
#2678
#2682
#2682
#2689
#2692
#2693
#27
#2706
#2707
#2708
#2712
#2716
#2717
#2720
#2721
#2721
#2723
#2723
#2728
#2735
#2738
#2738
#2739
#2740
#2740
#2741
#2741
#275
#276
#279
#28
#282
#286
#29
#290
#3
#302
#305
#307
#31
#310
#312
#314
#315
#316
#319
#32
#320
#322
#323
#325
#33
#330
#332
#334
#339
#34
#340
#341
#349
#35
#350
#36
#37
#371
#374
#375
#378
#379
#38
#384
#388
#39
#392
#393
#396
#4
#40
#401
#407
#408
#409
#415
#416
#42
#429
#43
#430
#432
#44
#442
#449
#45
#451
#456
#457
#46
#466
#467
#468
#47
#472
#474
#475
#482
#484
#487
#49
#490
#492
#495
#496
#497
#5
#50
#500
#501
#502
#504
#51
#511
#512
#513
#514
#517
#52
#528
#53
#530
#531
#532
#533
#534
#537
#54
#544
#549
#550
#551
#552
#555
#556
#557
#558
#56
#569
#57
#58
#59
#593
#6
#604
#606
#608
#613
#615
#616
#618
#624
#628
#63
#630
#631
#633
#642
#65
#650
#655
#656
#662
#675
#679
#68
#69
#691
#693
#694
#695
#697
#698
#7
#70
#700
#701
#702
#708
#71
#716
#717
#718
#719
#72
#733
#735
#739
#742
#75
#752
#759
#769
#772
#777
#783
#786
#788
#79
#793
#8
#80
#800
#801
#803
#804
#805
#808
#81
#811
#814
#815
#816
#817
#818
#819
#82
#820
#821
#824
#825
#826
#827
#83
#830
#831
#832
#837
#839
#84
#841
#842
#843
#850
#852
#853
#854
#855
#856
#857
#86
#862
#868
#869
#872
#873
#874
#875
#876
#878
#88
#880
#881
#883
#896
#897
#90
#901
#903
#905
#91
#910
#912
#916
#919
#92
#929
#93
#932
#935
#940
#941
#945
#948
#949
#95
#951
#958
#96
#965
#966
#967
#98
#99
#999
v0.1.1
v0.2.0
v0.3.0
v0.3.1
v0.4.0
v1.0.0
v1.0.1
v1.0.2
v1.1.0
v1.1.1
v1.1.2
v1.10.0
v1.11.0
v1.12.0
v1.12.1
v1.12.2
v1.13.0
v1.13.1
v1.14.0
v1.15.0
v1.16.0
v1.16.1
v1.17.0
v1.18.0
v1.19.0
v1.19.1
v1.2.0
v1.2.1
v1.20.0
v1.3.0
v1.4.0
v1.5.0
v1.6.0
v1.6.1
v1.6.2
v1.6.3
v1.7.0
v1.7.1
v1.8.0
v1.8.1
v1.8.2
v1.8.3
v1.8.4
v1.8.5
v1.9.0
v2.0.0
v2.1.0
v2.10.0
v2.11.0
v2.12.0
v2.13.0
v2.14.0
v2.15.0
v2.15.1
v2.16.0
v2.17.0
v2.18.0
v2.19.0
v2.2.0
v2.2.1
v2.20.0
v2.21.0
v2.22.0
v2.23.0
v2.23.1
v2.24.0
v2.25.0
v2.25.1
v2.25.2
v2.26.0
v2.27.0
v2.28.0
v2.28.1
v2.28.2
v2.28.3
v2.28.4
v2.29.0
v2.3.0
v2.3.1
v2.30.0
v2.31.0
v2.31.1
v2.31.2
v2.32.0
v2.33.0
v2.34.0
v2.35.0
v2.36.0
v2.36.1
v2.37.0
v2.38.0
v2.38.1
v2.39.0
v2.4.0
v2.4.1
v2.4.2
v2.40.0
v2.41.0
v2.42.0
v2.42.1
v2.42.2
v2.43.0
v2.44.0
v2.45.0
v2.46.0
v2.47.0
v2.47.1
v2.48.0
v2.49.0
v2.5.0
v2.5.1
v2.5.2
v2.50.0
v2.51.0
v2.52.0
v2.53.0
v2.54.0
v2.55.0
v2.55.1
v2.56.0
v2.56.1
v2.57.0
v2.58.0
v2.59.0
v2.6.0
v2.60.0
v2.60.1
v2.61.0
v2.61.1
v2.61.2
v2.62.0
v2.63.0
v2.64.0
v2.7.0
v2.7.1
v2.8.0
v2.8.1
v2.8.2
v2.8.3
v2.9.0
-
bf82f4dc73
Deployed edbabfc with MkDocs version: 1.6.1
gh-pages
2025-12-08 11:47:28 +00:00 -
edbabfcac2
fix: add missing font download in the rapidocr artifacts (#2735)
main
Michele Dolfi
2025-12-08 12:44:53 +01:00 -
609069d12c
fix: Ensure proper image_scale for generated page images in VLM pipelines (#2728)
Christoph Auer
2025-12-05 13:16:11 +01:00 -
d007ba0e6f
fix(html): tackle paragraphs with block-level elements (#2720)
Cesar Berrospi Ramis
2025-12-05 12:52:53 +01:00 -
3df3cf8664
fix: add page as argument to build_prompt
elh/update_2stage_inference
ElHachem02
2025-12-04 13:36:20 +01:00 -
aebe25cf00
fix(html): prevent hierarchy reset in rich table cells (#2716)
Matvei Smirnov
2025-12-03 20:52:23 +03:00 -
0904dbb95a
feat: update inference code to shuffle layout elements and discard initial prompt
ElHachem02
2025-12-03 12:59:31 +01:00 -
92e4f2220a
Fix artifacts_path handling in Layout+VLM pipeline
cau/fix-layout-vlm-pipeline-artifacts-path
Christoph Auer
2025-12-03 12:52:22 +01:00 -
c97715f5fd
fix(docx): parse integrals as n-ary objects without chr element (#2712)
Cesar Berrospi Ramis
2025-12-03 11:25:52 +01:00 -
f80c903c24
chore: bump version to 2.64.0 [skip ci]
v2.64.0
github-actions[bot]
2025-12-02 11:25:22 +00:00 -
6ef4ffd643
fix: InputFormat.IMAGE must have correct pipeline (#2707)
Christoph Auer
2025-12-01 19:44:16 +01:00 -
5bbc94daf8
Add page image injection
cau/layout-vlm-pipeline-page-images
Christoph Auer
2025-12-01 15:20:41 +01:00 -
54cd6d7406
fix: do not consider singleton cells in xlsx as TableItems but rather TextItems (#2589)
glypt
2025-11-27 16:25:32 +01:00 -
c0b57ae389
chore: Cleaning the example of post_process_ocr_with_vlm (#2693)
Maxim Lysak
2025-11-27 12:38:45 +01:00 -
fa21128138
docs: Example on how to apply external OCR as post processing (#2517)
Maxim Lysak
2025-11-27 11:04:40 +01:00 -
0049857c7d
chore: update mlx lock (#2689)
Panos Vagenas
2025-11-27 10:25:07 +01:00 -
134436245a
feat(experimental): Add experimental TableCropsLayoutModel (#2669)
Christoph Auer
2025-11-25 05:14:51 +01:00 -
b75c6461f4
docs: More GPU results and improvements in the example docs (#2674)
Michele Dolfi
2025-11-24 15:26:08 +01:00 -
146b4f0535
docs: fix typo on jobkit page (#2671)
Muhammad Ali Hasan
2025-11-24 02:35:45 -06:00 -
e58055465c
fix(docx): Missing list items after numbered header (#2665)
Michele Dolfi
2025-11-24 08:49:21 +01:00 -
ad97e52851
feat: Factory and plugin-capability for Layout and Table models (#2637)
Christoph Auer
2025-11-21 10:26:06 +01:00 -
dcb57bf528
chore: bump version to 2.63.0 [skip ci]
v2.63.0
github-actions[bot]
2025-11-20 14:42:37 +00:00 -
2087c6bf9f
fix: Respect document_timeout in new threaded StandardPdfPipeline (#2653)
Christoph Auer
2025-11-20 14:57:14 +01:00 -
54e65d9511
chore: update Milvus on examples and references to deprecated method (#2664)
Cesar Berrospi Ramis
2025-11-20 13:22:45 +01:00 -
ce5a099dfd
docs: Add Hector as compatible AI agent platform integration (#2662)
kadirpekel
2025-11-20 13:02:47 +01:00 -
b559813b9b
feat: add save and load for conversion result (#2648)
Peter W. J. Staar
2025-11-20 12:45:26 +01:00 -
6fb9a5f98a
fix: In DocumentConverter.convert_string() make nullable name parameter optional (#2660)
Cristi Burcă
2025-11-20 05:24:27 +00:00 -
463a3fd474
fix: Enable GPU for RapidOCR when available (#2659)
Michele Dolfi
2025-11-19 17:12:00 +01:00 -
b216ad848d
docs: Added documentation to use SuryaOCR via plugin docling-surya (#2533)
Harry Ho
2025-11-19 22:27:24 +08:00 -
6fe6aae91a
Apply ruff formatting to test file
copilot/fix-page-range-bug
copilot-swe-agent[bot]
2025-11-19 13:28:01 +00:00 -
0788e714a9
Add comprehensive tests for page_range bug fix
copilot-swe-agent[bot]
2025-11-19 13:26:25 +00:00 -
58fc6ccf86
Fix page_range stopping at page 32 by using dynamic batch_size
copilot-swe-agent[bot]
2025-11-19 13:25:00 +00:00 -
18f705b235
Initial plan
copilot-swe-agent[bot]
2025-11-19 13:10:27 +00:00 -
03e7c7d924
docs: Fix broken homepage links (#2651)
Robyn Johnson
2025-11-19 01:19:56 -06:00 -
8af228f1e2
docs(examples): processing parquet file of images (#2641)
Michele Dolfi
2025-11-19 06:39:25 +01:00 -
da4c2e9dbe
fix: remove py3.14 requirement for default rapidocr (#2639)
Michele Dolfi
2025-11-18 17:23:43 +01:00 -
d549445e78
docs: Move Installation and Quickstart (Usage) under Getting started (#2644)
Ryan Soliveres
2025-11-19 00:09:41 +08:00 -
ac9fc585bb
docs: add redirection from getting started page (#2640)
Panos Vagenas
2025-11-17 14:13:51 +01:00 -
f5528623a7
docs(examples): remove deprecation warnings with export_to_dataframe (#2638)
Cesar Berrospi Ramis
2025-11-17 12:48:41 +01:00 -
d6ddf9f4cb
chore: bump version to 2.62.0 [skip ci]
v2.62.0
github-actions[bot]
2025-11-17 11:34:08 +00:00 -
3495b73de8
feat: add the Image backend (#2627)
Peter W. J. Staar
2025-11-17 11:37:22 +01:00 -
aa75dd13d3
test: mark timeout test as manual due to model requirement
copilot/fix-document-timeout-bug
copilot-swe-agent[bot]
2025-11-17 09:27:27 +00:00 -
e3aa8cd770
feat: add document_timeout support to StandardPdfPipeline
copilot-swe-agent[bot]
2025-11-17 09:23:28 +00:00 -
f3ed123b51
Initial plan
copilot-swe-agent[bot]
2025-11-17 09:17:41 +00:00 -
ae30373ee7
docs: combine Home and Getting Started pages (#2600)
Robyn Johnson
2025-11-14 06:29:25 -06:00 -
14b436d590
fix: correct the model-repo name (#2624)
Peter W. J. Staar
2025-11-14 13:21:08 +01:00 -
55908d6bb4
chore: pretest docling-core 2.51.0
pretest-core-2-51-0
Panos Vagenas
2025-11-12 16:35:49 +01:00 -
bbb66d8be0
Add documentation for reading order patch
copilot/fix-keyerror-in-docling
copilot-swe-agent[bot]
2025-11-12 13:07:43 +00:00 -
570fe949c9
Add monkey patch to fix KeyError in reading order model
copilot-swe-agent[bot]
2025-11-12 13:03:50 +00:00 -
609988d3e1
Initial plan
copilot-swe-agent[bot]
2025-11-12 12:48:22 +00:00 -
4852d8b4f2
feat(experimental): Layout + VLM model with layout prompt (#2244)
Christoph Auer
2025-11-12 13:42:09 +01:00 -
054c4a634d
fix(docx): parse page headers and footers (#2599)
Cesar Berrospi Ramis
2025-11-10 16:10:12 +01:00 -
463051b852
chore: bump version to 2.61.2 [skip ci]
v2.61.2
github-actions[bot]
2025-11-10 11:44:59 +00:00 -
5c27567c41
fix: default to EasyOCR in Python 3.14 (#2605)
Panos Vagenas
2025-11-10 12:09:00 +01:00 -
06ae8ae29a
chore: replace ds4sd with docling-project (#2596)
Peter W. J. Staar
2025-11-07 11:25:56 +01:00 -
c21327cd74
chore: bump version to 2.61.1 [skip ci]
v2.61.1
github-actions[bot]
2025-11-06 05:19:20 +00:00 -
ef623ffcee
fix(docx): slow table parsing (#2553)
Cesar Berrospi Ramis
2025-11-06 05:25:53 +01:00 -
0ba8d5d9e3
fix(html): slow table parsing (#2582)
Cesar Berrospi Ramis
2025-11-06 05:25:36 +01:00 -
8da3d287ed
docs: make navigation menus collapse and expand (#2573)
Robyn Johnson
2025-11-05 22:25:19 -06:00 -
0ccc0a3245
chore: bump version to 2.61.0 [skip ci]
v2.61.0
github-actions[bot]
2025-11-06 04:25:06 +00:00 -
fa925741b6
fix: temporarily pin NuExtract to working revision (#2588)
Panos Vagenas
2025-11-05 21:23:12 +01:00 -
8940045463
replace match with if
docs/add-extraction-script
Peter Staar
2025-11-05 16:57:16 +01:00 -
1ec6c58b95
adding extraction script
Peter Staar
2025-11-05 15:43:56 +01:00 -
6a04e27352
feat(vlm): track generated tokens and stop reasons for VLM models (#2543)
peets
2025-11-04 19:39:09 +01:00 -
1a5146abc9
fix(ocr): use PSM integer values directly instead of constructor (#2578)
정물결
2025-11-05 03:32:41 +09:00 -
32a5aed5ea
chore: bump version to 2.60.1 [skip ci]
v2.60.1
github-actions[bot]
2025-11-04 11:26:12 +00:00 -
0e1b0bd816
chore: switch print statements to debug logging (#2569)
Panos Vagenas
2025-11-04 11:32:39 +01:00 -
fb737d026e
chore: fix malformed f-string (#2563)
Johannes Damp
2025-11-04 11:01:26 +01:00 -
8360aa5449
fix: extract response from api_image_request in picture description (#2571)
peets
2025-11-04 08:39:15 +01:00 -
3467b0a035
chore: bump version to 2.60.0 [skip ci]
v2.60.0
github-actions[bot]
2025-10-31 14:43:29 +00:00 -
268d027c8f
feat: Use threading in the standard pipeline and move old behavior to legacy (#2452)
Michele Dolfi
2025-10-31 14:42:11 +01:00 -
01577e92d1
docs: Update link to Open WebUI docs (#2549)
Welteam
2025-10-31 12:21:11 +00:00 -
cb100437fa
docs: Update installation options with extras and review FAQ (#2548)
Michele Dolfi
2025-10-31 13:21:01 +01:00 -
741c44fa45
docs: fix typos (#2546)
Yasir Ali
2025-10-31 18:29:34 +09:00 -
a51275d080
fix(pdf): threadsafe for pypdfium2 backend (#2527)
Michele Dolfi
2025-10-30 17:58:39 +01:00 -
d27fe92e01
chore: bump version to 2.59.0 [skip ci]
v2.59.0
github-actions[bot]
2025-10-30 13:05:56 +00:00 -
97aa06bfbc
docs: Add details and examples on optimal GPU setup (#2531)
Michele Dolfi
2025-10-30 13:22:05 +01:00 -
d9c90eb45e
fix: xlsx cell parsing, now returning values instead of formulas (#2520)
glypt
2025-10-29 11:35:51 +01:00 -
b6c892b505
feat(vlm): add num_tokens as attribtue for VlmPrediction (#2489)
peets
2025-10-28 17:18:44 +01:00 -
cdffb47b9a
feat: Support for Python 3.14 (#2530)
Michele Dolfi
2025-10-28 14:32:15 +01:00 -
9a6fdf936b
docs: update opensearch notebook and backend documentation (#2519)
Cesar Berrospi Ramis
2025-10-27 10:02:50 +01:00 -
10c1f06b74
chore: bump version to 2.58.0 [skip ci]
v2.58.0
github-actions[bot]
2025-10-22 11:31:29 +00:00 -
bbe82a68d0
feat(pdf): Support for password-protected PDF documents (#2499)
Michele Dolfi
2025-10-22 12:48:01 +02:00 -
89820d01b5
perf: use docling-parse-v4 as default (#2503)
Michele Dolfi
2025-10-21 17:55:43 +02:00 -
86556d8367
docs: fix typo in mcp.md (#2502)
McGuireMark
2025-10-21 11:31:28 -04:00 -
4227fcc3e1
fix(markdown): set the correct discriminator in md backend options (#2501)
Cesar Berrospi Ramis
2025-10-21 14:30:48 +02:00 -
a30e6a7614
feat(backend): add generic options support and HTML image handling modes (#2011)
Legoshi
2025-10-21 12:52:17 +02:00 -
b66624bfff
fix(xlsx): speed up by detecting the true last non-empty row/column (#2404)
Richard (Huangrui) Chu
2025-10-21 02:08:20 -04:00 -
657ce8b01c
feat(ASR): MLX Whisper Support for Apple Silicon (#2366)
Ken Steele
2025-10-20 23:05:59 -07:00 -
a5af082d82
chore: fix parsing of release body message (#2498)
Michele Dolfi
2025-10-20 13:41:35 +02:00 -
5be856fbc0
chore: add action posting to discord (#2486)
Michele Dolfi
2025-10-17 16:31:57 +02:00 -
ee5aedc955
add ocr as enrichment for pictures in simple pipeline
ocr-enrichment
Michele Dolfi
2025-10-17 16:04:57 +02:00 -
dd03b53117
docs: discord badge with join link (#2473)
Michele Dolfi
2025-10-16 10:13:50 +02:00 -
1762bb8762
chore: update lock (#2468)
Michele Dolfi
2025-10-15 20:35:49 +02:00 -
ae61d640c1
chore: bump version to 2.57.0 [skip ci]
v2.57.0
github-actions[bot]
2025-10-15 09:20:31 +00:00 -
16829939cf
feat(docx): Process drawingml objects in docx (#2453)
Rafael Teixeira de Lima
2025-10-15 10:58:08 +02:00 -
3e6da2c62d
docs: Example on PII obfuscation (#2459)
Peter W. J. Staar
2025-10-14 15:39:16 +02:00 -
cd7f7ba145
fix: Use proper page concatentation in VLM pipeline MD/HTML conversion (#2458)
Christoph Auer
2025-10-14 14:12:26 +02:00 -
3687d865f8
chore: bump version to 2.56.1 [skip ci]
v2.56.1
github-actions[bot]
2025-10-13 16:30:04 +00:00 -
688a7dfd38
fix: avoid downloading easyocr models by default (#2454)
Michele Dolfi
2025-10-13 17:58:06 +02:00