mirror of
https://github.com/DS4SD/docling.git
synced 2025-12-08 12:48:28 +00:00
Commit Graph
add-json-export-indentation
adr-model-stages
cau/dpv4-test-updates
cau/fix-layout-vlm-pipeline-artifacts-path
cau/layout-vlm-pipeline-page-images
cau/multi-stage-vlm-pipeline
cau/new-layout-processing
cau/pin-docling-parse-pre-3.2
cau/test-dp-word-lines
cau/test-pypdfium2-beta
copilot/fix-document-timeout-bug
copilot/fix-keyerror-in-docling
copilot/fix-page-range-bug
cp_main_20250602
demo
dev-granite-docling-table
dev/add-asr-pipeline
dev/add-granite-docling-extension
dev/add-granite-docling-preview
dev/add-r2l-tests
dev/add-reading-order-model
dev/add-two-stage-vlm
dev/analysis-for-granite-docling
dev/doctag_backend
dev/fix_msword_backend_identify_text_after_image
dev/table-orientation
dev/update-html-parser-with-h1
dev/update-to-latest-docling-parse-again
docs/add-extraction-script
elh/update_2stage_inference
extend-metadata-in-examples
gh-pages
main
mao/doctags
mly/smol-docling-integration
nli/fix_glm_utils
nli/fix_ocr_tests
nli/layout_dfine
nli/layout_heron2
nli/layout_rtdetr_v2
nli/layoutmodel_improvements
nli/tesseract_ocr_models
ocr-enrichment
pretest-core-2-51-0
propagate-core-fixes-20250502
remodel-lists-2
revert-803-refactor_viz
rtdl/docx_latex
rtdl/drawingml_import
vku/uspto_meta
v0.1.1
v0.2.0
v0.3.0
v0.3.1
v0.4.0
v1.0.0
v1.0.1
v1.0.2
v1.1.0
v1.1.1
v1.1.2
v1.10.0
v1.11.0
v1.12.0
v1.12.1
v1.12.2
v1.13.0
v1.13.1
v1.14.0
v1.15.0
v1.16.0
v1.16.1
v1.17.0
v1.18.0
v1.19.0
v1.19.1
v1.2.0
v1.2.1
v1.20.0
v1.3.0
v1.4.0
v1.5.0
v1.6.0
v1.6.1
v1.6.2
v1.6.3
v1.7.0
v1.7.1
v1.8.0
v1.8.1
v1.8.2
v1.8.3
v1.8.4
v1.8.5
v1.9.0
v2.0.0
v2.1.0
v2.10.0
v2.11.0
v2.12.0
v2.13.0
v2.14.0
v2.15.0
v2.15.1
v2.16.0
v2.17.0
v2.18.0
v2.19.0
v2.2.0
v2.2.1
v2.20.0
v2.21.0
v2.22.0
v2.23.0
v2.23.1
v2.24.0
v2.25.0
v2.25.1
v2.25.2
v2.26.0
v2.27.0
v2.28.0
v2.28.1
v2.28.2
v2.28.3
v2.28.4
v2.29.0
v2.3.0
v2.3.1
v2.30.0
v2.31.0
v2.31.1
v2.31.2
v2.32.0
v2.33.0
v2.34.0
v2.35.0
v2.36.0
v2.36.1
v2.37.0
v2.38.0
v2.38.1
v2.39.0
v2.4.0
v2.4.1
v2.4.2
v2.40.0
v2.41.0
v2.42.0
v2.42.1
v2.42.2
v2.43.0
v2.44.0
v2.45.0
v2.46.0
v2.47.0
v2.47.1
v2.48.0
v2.49.0
v2.5.0
v2.5.1
v2.5.2
v2.50.0
v2.51.0
v2.52.0
v2.53.0
v2.54.0
v2.55.0
v2.55.1
v2.56.0
v2.56.1
v2.57.0
v2.58.0
v2.59.0
v2.6.0
v2.60.0
v2.60.1
v2.61.0
v2.61.1
v2.61.2
v2.62.0
v2.63.0
v2.64.0
v2.7.0
v2.7.1
v2.8.0
v2.8.1
v2.8.2
v2.8.3
v2.9.0
-
10165dda8a
chore: bump version to 2.56.0 [skip ci]
v2.56.0
github-actions[bot]
2025-10-13 09:19:06 +00:00 -
db985bb159
fix(asr): Implement robust status check in AsrPipeline (#2442)
Animesh
2025-10-13 13:21:31 +05:30 -
90200443bc
docs: Remove deprecated call in custom_convert.py (#2447)
Jeremy Chen
2025-10-13 18:30:02 +11:00 -
2a0f56390a
docs: fixed a few typos (#2441)
Imad Saddik
2025-10-13 08:04:50 +01:00 -
f7244a4333
feat: AutoOCR model selecting the best OCR model available and deprecating the usage of EasyOCR (#2391)
Michele Dolfi
2025-10-10 16:11:39 +02:00 -
cce18b2ff7
fix: deal with chartsheets in workbooks (#2433)
Cesar Berrospi Ramis
2025-10-10 15:06:38 +02:00 -
f11f8c0a81
feat: Add Tesseract PSM options support (#2411)
Bruno Pio
2025-10-10 09:44:30 -03:00 -
ee5501320e
fix: skip temporary docx files (#2413)
Victor Moreli
2025-10-10 04:39:26 -03:00 -
b5f7fef29b
fix: AsrPipeline to handle absolute paths and BytesIO streams correctly (#2407)
pixiake
2025-10-10 15:37:15 +08:00 -
f2854b2e1d
docs: Add MongoDB + VoyageAI (#2382)
Utsav Talwar
2025-10-08 00:06:19 +05:30 -
0610d01afa
fix: enrichment of documents without pages metadata (pptx and xlsx) (#2401)
Michele Dolfi
2025-10-07 18:28:51 +02:00 -
9705f4020c
fix: Proper heading support in rich tables for HTML backend (#2394)
Maxim Lysak
2025-10-07 15:57:32 +02:00 -
8a4b946a1a
docs: add RAG example with MongoDB Atlas Vector Search and VoyageAI embeddings (#2341)
Utsav Talwar
2025-10-03 16:59:43 +05:30 -
22515b546a
chore: bump version to 2.55.1 [skip ci]
v2.55.1
github-actions[bot]
2025-10-03 10:26:26 +00:00 -
68230fe7e5
ci: split workflow to speedup CI runtime (#2313)
Rui Dias Gomes
2025-10-03 10:16:38 +01:00 -
ee73ffae15
fix(markdown): Setext heading support (#2359)
Matvei Smirnov
2025-10-03 11:32:53 +03:00 -
246de77d8c
fix(docs): fixed the color scheme (#2371)
Hakeem Abbas
2025-10-03 13:20:44 +05:00 -
a975a790c9
docs: example using Hashicorp Vault PII transform (#2373)
Michele Dolfi
2025-10-03 09:53:29 +02:00 -
9505202e38
ci: update docling-parse and remove pages.json (#2372)
Michele Dolfi
2025-10-03 09:53:13 +02:00 -
ca2be7ff3a
fix: Empty table handling (#2365)
Christoph Auer
2025-10-02 19:35:16 +02:00 -
e6c3b05e63
docs: Jobkit and connectors (#2357)
Lucas Morin
2025-10-02 13:46:56 +02:00 -
4f295ed051
fix: add table raw content when no table structure model is used (#1815)
Michele Dolfi
2025-10-02 13:46:42 +02:00 -
f0b630e24e
chore: bump version to 2.55.0 [skip ci]
v2.55.0
github-actions[bot]
2025-09-30 14:50:42 +00:00 -
1e9dc43b72
feat: Repetition-based StoppingCriteria for GraniteDocling (#2323)
Christoph Auer
2025-09-30 15:26:09 +02:00 -
68ae7ccf3c
fix: pin wider range of typer (#2309)
Michele Dolfi
2025-09-30 02:42:23 -04:00 -
654c70f990
fix: Update Transformers & VLLM inference code, CLI and VLM specs (#2322)
Christoph Auer
2025-09-29 21:06:54 +02:00 -
c803abed9a
feat: Rich tables support for HTML backend (#2324)
Maxim Lysak
2025-09-29 18:12:16 +02:00 -
325877aee9
docs(styling): update color scheme (#2154)
Hakeem Abbas
2025-09-29 14:44:40 +05:00 -
a873200c9d
docs(vlm): Update SmolDocling to GraniteDocling references (#2315)
Luis
2025-09-25 05:07:39 -04:00 -
9d67bb9ed6
fix: support escaped characters in markdown backend (#2304)
Lucas Morin
2025-09-23 18:00:16 +02:00 -
d599177547
chore: bump version to 2.54.0 [skip ci]
v2.54.0
github-actions[bot]
2025-09-22 15:28:30 +00:00 -
e2482a2ada
feat: Rich tables for MSWord backend (#2291)
Maxim Lysak
2025-09-22 16:41:59 +02:00 -
46efaaefee
feat: add a backend parser for WebVTT files (#2288)
Cesar Berrospi Ramis
2025-09-22 15:24:34 +02:00 -
b5628f1227
fix: correct y-axis scaling in draw_table_cells (#2287)
manuflexor
2025-09-19 13:42:29 +02:00 -
8b7e83a8c7
docs: Update API VLM example with granite-docling (#2294)
Christoph Auer
2025-09-19 12:23:53 +02:00 -
6455579a90
Stub for implementing uspto backend meta-data extraction
vku/uspto_meta
Viktor Kuropiatnyk
2025-09-18 10:51:01 +02:00 -
8322c2ea9b
docs: fix examples rendering (#2281)
Panos Vagenas
2025-09-18 02:50:50 +02:00 -
f1687fb09b
chore: bump version to 2.53.0 [skip ci]
v2.53.0
github-actions[bot]
2025-09-17 13:59:33 +00:00 -
17afb664d0
feat: Add granite-docling model (#2272)
Christoph Auer
2025-09-17 15:15:49 +02:00 -
223d7f9c62
Merge branch 'dev/add-granite-docling-extension' of github.com:DS4SD/docling into dev/add-granite-docling-extension
dev/add-granite-docling-extension
Christoph Auer
2025-09-16 16:34:09 +02:00 -
63bf6b0348
Update final repo_ids for GraniteDocling
Christoph Auer
2025-09-16 16:29:55 +02:00 -
bf9638244f
Update final repo_ids for GraniteDocling
Christoph Auer
2025-09-16 16:12:35 +02:00 -
a3709f4776
Merge branch 'main' of github.com:DS4SD/docling into dev/add-granite-docling-extension
Christoph Auer
2025-09-16 16:12:22 +02:00 -
ff351fd40c
docs: Describe examples (#2262)
Mingxuan Zhao
2025-09-16 10:00:38 -04:00 -
0e95171dd6
feat(RapidOcr): Support generic extra arguments for RapidOcr (#2266)
dmorady1
2025-09-16 07:26:10 +02:00 -
43d3c74bb2
update docs and README
Michele Dolfi
2025-09-15 15:44:42 +02:00 -
c5a59eb979
use granite-docling and add to the model downloader
Michele Dolfi
2025-09-15 15:39:08 +02:00 -
0f8728a8d4
typo
Michele Dolfi
2025-09-15 15:28:04 +02:00 -
6a2cfbdbb8
Merge remote-tracking branch 'origin/main' into dev/add-granite-docling-extension
Michele Dolfi
2025-09-15 15:26:45 +02:00 -
ad2f738231
chore: update lock (#2265)
Michele Dolfi
2025-09-15 11:19:15 +02:00 -
609d902eef
fix: handle empty result from RapidOCR to avoid crash (#2264)
Yuie.
2025-09-15 17:04:33 +09:00 -
10bb0aee2d
chore: bump version to 2.52.0 [skip ci]
v2.52.0
github-actions[bot]
2025-09-11 16:11:20 +00:00 -
0700af212c
fix: Add missing features in ThreadedStandardPdfPipeline (#2252)
Christoph Auer
2025-09-11 16:26:02 +02:00 -
2c9123419f
feat: enrichment steps on all convert pipelines (incl docx, html, etc) (#2251)
Michele Dolfi
2025-09-11 15:09:00 +02:00 -
c6965495a2
fix: address deprecation warnings of dependencies (#2237)
Michele Dolfi
2025-09-10 14:38:34 +02:00 -
f8cc545bab
docs: add an example of RAG with OpenSearch (#2238)
Cesar Berrospi Ramis
2025-09-10 14:37:22 +02:00 -
e5cd7020bd
docs: Add instructions for using Docling with MCP to README (#2219)
Roy Derks
2025-09-10 01:02:28 -07:00 -
1324eb75fc
add modified test results
dev-granite-docling-table
Michele Dolfi
2025-09-10 08:43:29 +02:00 -
a4efd70410
dev: use granite-docling for table structure
Michele Dolfi
2025-09-09 18:16:16 +02:00 -
55f5f3752f
docs: Document VLM support requirement in extraction example (#2231)
Tamás Bitai
2025-09-09 13:45:55 +02:00 -
ae9ec37cf1
doing some experiments with granite-docling
dev/analysis-for-granite-docling
Peter Staar
2025-09-08 06:03:18 +02:00 -
0e2f370f4f
updated the model specs
Peter Staar
2025-09-05 16:58:43 +02:00 -
df60673992
chore: bump version to 2.51.0 [skip ci]
v2.51.0
github-actions[bot]
2025-09-05 13:01:33 +00:00 -
c1dcb0597d
adding granite-docling preview
Peter Staar
2025-09-05 15:00:05 +02:00 -
b49d1ad4f1
feat: updating default parameters to get better performance with docling-parse (#2208)
Peter W. J. Staar
2025-09-05 14:06:21 +02:00 -
a9f41b088e
docs: add information extraction example (#2199)
Panos Vagenas
2025-09-05 11:27:09 +02:00 -
b3d7542061
feat: updated the backend for new docling-parse (#2187)
Peter W. J. Staar
2025-09-05 10:42:31 +02:00 -
2c3f6faf3d
chore: update deprecation note for OcrEngine (#2200)
Alina Ryan
2025-09-05 02:24:14 -04:00 -
effd9de250
updated the ground-truth output
dev/update-to-latest-docling-parse-again
Peter Staar
2025-09-04 05:22:54 +02:00 -
cffa6e05d0
reformatted code
Peter Staar
2025-09-03 16:22:19 +02:00 -
0ec99e0f37
updated docling to start running the tests ...
Peter Staar
2025-09-03 16:09:51 +02:00 -
3419c42f10
chore: bump version to 2.50.0 [skip ci]
v2.50.0
github-actions[bot]
2025-09-03 11:39:08 +00:00 -
e38aa0f7f2
feat: Heron layout model as new default (#1971)
Nikos Livathinos
2025-09-03 12:45:22 +02:00 -
293e81bf9d
fix(html): access to variable not yet declared (#2171)
Cesar Berrospi Ramis
2025-09-02 07:59:55 +02:00 -
d68d8b678e
chore: bump version to 2.49.0 [skip ci]
v2.49.0
github-actions[bot]
2025-09-01 16:39:43 +00:00 -
4d94e38223
fix(pypdfium2): Fix OCR bounding box misalignment caused by mismatched rotation metadata (#2039)
AndrewTsai0406
2025-09-01 23:22:43 +08:00 -
9f4bc5b2f1
feat: [Beta] Extraction with schema (#2138)
Christoph Auer
2025-09-01 16:09:48 +02:00 -
a283ccff25
feat(msexcel): set ContentLayer.INVISIBLE for invisible sheet (#1876)
Qiefan Jiang
2025-09-01 19:53:45 +08:00 -
be26044f14
chore: update docling-core lock (#2169)
Panos Vagenas
2025-09-01 13:46:10 +02:00 -
9f0286bcac
fix: translation example (#2166)
Shikhar Bhardwaj
2025-09-01 14:34:46 +05:30 -
9904d14e6a
fix: extend offline mode for rapidocr fonts (#2155)
geoHeil
2025-09-01 09:15:47 +02:00 -
96cab6b536
docs: enrich landing pages (#2165)
Panos Vagenas
2025-08-29 17:19:05 +02:00 -
946ea1c2cb
chore: Replace the layout_predictor.predict_batch() with layout_predictor.predict() in a loop
nli/layout_heron2
Nikos Livathinos
2025-08-28 15:14:51 +02:00 -
36d44f1225
chore: Add more logs in LayoutModel
Nikos Livathinos
2025-08-28 14:24:47 +02:00 -
baaf2698b4
chore: debug_heron.py: prepend the name in the saved files
Nikos Livathinos
2025-08-28 13:47:50 +02:00 -
d8ca358ae8
chore: Add debugging logs in LayoutModel
Nikos Livathinos
2025-08-28 13:45:33 +02:00 -
78f81e2c59
chore: Print the PagElements input to the ReadingOrder model
Nikos Livathinos
2025-08-28 10:15:27 +02:00 -
7debe3d5ec
chore: debug_heron.py: Save exported json with pretty format
Nikos Livathinos
2025-08-27 18:19:52 +02:00 -
32461ff258
chore: debug_heron.py: Update test file
Nikos Livathinos
2025-08-27 18:04:22 +02:00 -
c54d511c20
chore: debug_heron.py: Disable OCR
Nikos Livathinos
2025-08-27 17:31:33 +02:00 -
6ce3cd5763
chore: debug_heron.py update the test file
Nikos Livathinos
2025-08-27 16:38:40 +02:00 -
784283a50a
chore: Update test data for Heron in Linux
Nikos Livathinos
2025-08-27 14:16:13 +00:00 -
552a606b4e
chore: TMP script to debug heron
Nikos Livathinos
2025-08-27 16:07:49 +02:00 -
13255ad718
Merge from main
cau/multi-stage-vlm-pipeline
Christoph Auer
2025-08-27 15:28:47 +02:00 -
a9dcd43a7c
fix: Ensure that the visualisations happen on copies of the page image
Nikos Livathinos
2025-08-27 14:16:56 +02:00 -
fb3b7b93ae
chore: bump version to 2.48.0 [skip ci]
v2.48.0
github-actions[bot]
2025-08-26 05:29:31 +00:00 -
fa3327e1a6
fix(html): preserve code blocks in list items (#2131)
Cesar Berrospi Ramis
2025-08-26 06:43:48 +02:00 -
c0268416cf
chore: add analytics (#2133)
Michele Dolfi
2025-08-25 18:25:38 +02:00 -
1435fc3b81
Update test GT
Christoph Auer
2025-07-23 14:05:30 +02:00 -
83c45b5648
Update docling-models tag for TableFormer
Christoph Auer
2025-07-23 13:39:50 +02:00