mirror of
https://github.com/DS4SD/docling.git
synced 2025-12-08 12:48:28 +00:00
Commit Graph
Select branches
add-json-export-indentation
adr-model-stages
cau/dpv4-test-updates
cau/fix-layout-vlm-pipeline-artifacts-path
cau/layout-vlm-pipeline-page-images
cau/multi-stage-vlm-pipeline
cau/new-layout-processing
cau/pin-docling-parse-pre-3.2
cau/test-dp-word-lines
cau/test-pypdfium2-beta
copilot/fix-document-timeout-bug
copilot/fix-keyerror-in-docling
copilot/fix-page-range-bug
cp_main_20250602
demo
dev-granite-docling-table
dev/add-asr-pipeline
dev/add-granite-docling-extension
dev/add-granite-docling-preview
dev/add-r2l-tests
dev/add-reading-order-model
dev/add-two-stage-vlm
dev/analysis-for-granite-docling
dev/doctag_backend
dev/fix_msword_backend_identify_text_after_image
dev/table-orientation
dev/update-html-parser-with-h1
dev/update-to-latest-docling-parse-again
docs/add-extraction-script
elh/update_2stage_inference
extend-metadata-in-examples
gh-pages
main
mao/doctags
mly/smol-docling-integration
nli/fix_glm_utils
nli/fix_ocr_tests
nli/layout_dfine
nli/layout_heron2
nli/layout_rtdetr_v2
nli/layoutmodel_improvements
nli/tesseract_ocr_models
ocr-enrichment
pretest-core-2-51-0
propagate-core-fixes-20250502
remodel-lists-2
revert-803-refactor_viz
rtdl/docx_latex
rtdl/drawingml_import
vku/uspto_meta
#1
#10
#100
#101
#1010
#1015
#1017
#102
#1021
#1024
#1027
#103
#1038
#1039
#1040
#1041
#1051
#1052
#1053
#1054
#1055
#1057
#1061
#1062
#1077
#1096
#1097
#1098
#11
#110
#1100
#1106
#1107
#111
#1114
#1115
#1118
#1124
#1130
#1140
#1141
#1147
#1150
#1152
#1154
#1156
#1158
#1160
#1165
#1167
#117
#1173
#118
#1182
#1183
#1194
#1196
#1197
#1199
#12
#120
#1201
#121
#1210
#122
#1220
#1222
#1223
#123
#1231
#1238
#1239
#1241
#1244
#1247
#1248
#1261
#1263
#1268
#1270
#1286
#129
#1294
#1295
#13
#131
#1313
#1315
#1316
#1319
#132
#1320
#1326
#1328
#1332
#1334
#1337
#134
#1340
#1346
#135
#1350
#1355
#1359
#1363
#1371
#1375
#1377
#1378
#1379
#138
#1381
#1382
#1383
#1389
#139
#1392
#1399
#14
#140
#1400
#1402
#141
#1411
#1415
#1416
#1419
#1427
#1428
#143
#1430
#1436
#1442
#1449
#145
#1458
#1459
#1463
#1465
#1486
#149
#1490
#1492
#1494
#1496
#15
#150
#1500
#151
#1511
#1512
#152
#1520
#1523
#1524
#1525
#1526
#1527
#1528
#153
#1530
#1536
#1538
#154
#1548
#1549
#155
#1551
#1553
#1556
#1559
#156
#1560
#1561
#1563
#1566
#157
#1570
#1576
#1577
#158
#1582
#1583
#1587
#1589
#159
#1593
#1596
#16
#160
#1600
#1609
#161
#1610
#1615
#1617
#1619
#162
#1636
#164
#1658
#1659
#1660
#1663
#1664
#1665
#1667
#1671
#1673
#1676
#1679
#168
#1683
#1684
#1688
#1689
#169
#1691
#1698
#17
#170
#1700
#1701
#1706
#1707
#171
#1711
#1717
#1718
#1723
#1724
#1725
#1728
#173
#1734
#1735
#1745
#1746
#1747
#175
#1759
#1763
#1769
#177
#1772
#1775
#178
#179
#1791
#1795
#18
#180
#1802
#1804
#1808
#1810
#1812
#1815
#1816
#1819
#182
#1820
#1821
#1824
#1825
#1827
#183
#1836
#1838
#184
#1844
#1850
#1851
#1852
#1856
#1857
#186
#1863
#1866
#1867
#187
#1870
#1874
#1875
#1876
#188
#1884
#189
#1897
#1898
#1898
#19
#190
#1902
#1904
#1905
#1907
#1908
#1910
#1912
#1914
#1917
#1923
#1925
#1926
#1928
#193
#1931
#1934
#1937
#194
#1940
#1943
#1948
#1951
#1952
#196
#1960
#1969
#1970
#1971
#1975
#1981
#1982
#1984
#1986
#1988
#1989
#1992
#1995
#2
#20
#2001
#2002
#2006
#2011
#2017
#2018
#2024
#203
#2031
#2039
#2042
#2048
#2061
#2068
#2069
#2078
#2079
#2083
#2084
#2084
#2088
#2093
#2094
#2095
#2095
#21
#2100
#2105
#2106
#2110
#2111
#2112
#2113
#2114
#2114
#2122
#2123
#2124
#2126
#2131
#2132
#2133
#2138
#214
#2141
#2146
#2154
#2155
#2165
#2166
#2169
#217
#217
#2171
#2178
#218
#218
#2183
#2185
#2187
#219
#2199
#22
#2200
#2208
#2212
#2218
#2219
#2227
#2227
#2231
#2234
#2237
#2238
#224
#2242
#2244
#2251
#2252
#226
#2262
#2264
#2265
#2266
#2272
#228
#2281
#2284
#2284
#2287
#2288
#229
#2291
#2294
#2304
#2309
#2313
#2315
#2322
#2323
#2324
#233
#2339
#234
#2340
#2341
#235
#2357
#2359
#2361
#2365
#2366
#2371
#2372
#2373
#2378
#2378
#2382
#2383
#2388
#2391
#2394
#240
#2401
#2403
#2403
#2404
#2407
#2409
#2409
#241
#2410
#2411
#2413
#2415
#2418
#2420
#2421
#2422
#2423
#2424
#2425
#2426
#2427
#2429
#2430
#2431
#2433
#2436
#2441
#2442
#2445
#2445
#2447
#2452
#2453
#2454
#2458
#2459
#2468
#2473
#2474
#248
#2484
#2486
#2488
#2488
#2489
#2498
#2499
#2501
#2502
#2503
#251
#2511
#2512
#2513
#2517
#2519
#2520
#2521
#2526
#2527
#2530
#2531
#2533
#2541
#2543
#2546
#2548
#2549
#2553
#2563
#2569
#2571
#2573
#2578
#2582
#2585
#2587
#2587
#2588
#2589
#259
#2596
#2599
#26
#2600
#2605
#2613
#2618
#2622
#2622
#2624
#2627
#2636
#2637
#2638
#2639
#2640
#2641
#2644
#2645
#2645
#2648
#2649
#2651
#2653
#2656
#2658
#2659
#2660
#2662
#2664
#2665
#2669
#2671
#2674
#2676
#2676
#2678
#2678
#2682
#2682
#2689
#2692
#2693
#27
#2706
#2707
#2708
#2712
#2716
#2717
#2720
#2721
#2721
#2723
#2723
#2728
#2735
#2738
#2738
#2739
#2740
#2740
#2741
#2741
#275
#276
#279
#28
#282
#286
#29
#290
#3
#302
#305
#307
#31
#310
#312
#314
#315
#316
#319
#32
#320
#322
#323
#325
#33
#330
#332
#334
#339
#34
#340
#341
#349
#35
#350
#36
#37
#371
#374
#375
#378
#379
#38
#384
#388
#39
#392
#393
#396
#4
#40
#401
#407
#408
#409
#415
#416
#42
#429
#43
#430
#432
#44
#442
#449
#45
#451
#456
#457
#46
#466
#467
#468
#47
#472
#474
#475
#482
#484
#487
#49
#490
#492
#495
#496
#497
#5
#50
#500
#501
#502
#504
#51
#511
#512
#513
#514
#517
#52
#528
#53
#530
#531
#532
#533
#534
#537
#54
#544
#549
#550
#551
#552
#555
#556
#557
#558
#56
#569
#57
#58
#59
#593
#6
#604
#606
#608
#613
#615
#616
#618
#624
#628
#63
#630
#631
#633
#642
#65
#650
#655
#656
#662
#675
#679
#68
#69
#691
#693
#694
#695
#697
#698
#7
#70
#700
#701
#702
#708
#71
#716
#717
#718
#719
#72
#733
#735
#739
#742
#75
#752
#759
#769
#772
#777
#783
#786
#788
#79
#793
#8
#80
#800
#801
#803
#804
#805
#808
#81
#811
#814
#815
#816
#817
#818
#819
#82
#820
#821
#824
#825
#826
#827
#83
#830
#831
#832
#837
#839
#84
#841
#842
#843
#850
#852
#853
#854
#855
#856
#857
#86
#862
#868
#869
#872
#873
#874
#875
#876
#878
#88
#880
#881
#883
#896
#897
#90
#901
#903
#905
#91
#910
#912
#916
#919
#92
#929
#93
#932
#935
#940
#941
#945
#948
#949
#95
#951
#958
#96
#965
#966
#967
#98
#99
#999
v0.1.1
v0.2.0
v0.3.0
v0.3.1
v0.4.0
v1.0.0
v1.0.1
v1.0.2
v1.1.0
v1.1.1
v1.1.2
v1.10.0
v1.11.0
v1.12.0
v1.12.1
v1.12.2
v1.13.0
v1.13.1
v1.14.0
v1.15.0
v1.16.0
v1.16.1
v1.17.0
v1.18.0
v1.19.0
v1.19.1
v1.2.0
v1.2.1
v1.20.0
v1.3.0
v1.4.0
v1.5.0
v1.6.0
v1.6.1
v1.6.2
v1.6.3
v1.7.0
v1.7.1
v1.8.0
v1.8.1
v1.8.2
v1.8.3
v1.8.4
v1.8.5
v1.9.0
v2.0.0
v2.1.0
v2.10.0
v2.11.0
v2.12.0
v2.13.0
v2.14.0
v2.15.0
v2.15.1
v2.16.0
v2.17.0
v2.18.0
v2.19.0
v2.2.0
v2.2.1
v2.20.0
v2.21.0
v2.22.0
v2.23.0
v2.23.1
v2.24.0
v2.25.0
v2.25.1
v2.25.2
v2.26.0
v2.27.0
v2.28.0
v2.28.1
v2.28.2
v2.28.3
v2.28.4
v2.29.0
v2.3.0
v2.3.1
v2.30.0
v2.31.0
v2.31.1
v2.31.2
v2.32.0
v2.33.0
v2.34.0
v2.35.0
v2.36.0
v2.36.1
v2.37.0
v2.38.0
v2.38.1
v2.39.0
v2.4.0
v2.4.1
v2.4.2
v2.40.0
v2.41.0
v2.42.0
v2.42.1
v2.42.2
v2.43.0
v2.44.0
v2.45.0
v2.46.0
v2.47.0
v2.47.1
v2.48.0
v2.49.0
v2.5.0
v2.5.1
v2.5.2
v2.50.0
v2.51.0
v2.52.0
v2.53.0
v2.54.0
v2.55.0
v2.55.1
v2.56.0
v2.56.1
v2.57.0
v2.58.0
v2.59.0
v2.6.0
v2.60.0
v2.60.1
v2.61.0
v2.61.1
v2.61.2
v2.62.0
v2.63.0
v2.64.0
v2.7.0
v2.7.1
v2.8.0
v2.8.1
v2.8.2
v2.8.3
v2.9.0
-
969115b1dd
Use default layout model in model_downloader default args
Christoph Auer
2025-07-23 08:51:07 +02:00 -
e0482723c4
Use default layout model in model_downloader default args
Christoph Auer
2025-07-23 08:50:22 +02:00 -
a982995fb7
feat: Switch default layout model to DOCLING_LAYOUT_HERON. Update the unit test data.
Nikos Livathinos
2025-07-22 17:30:16 +02:00 -
d32d2c97e1
chore: PR approval reminder (#2132)
Michele Dolfi
2025-08-25 15:08:37 +02:00 -
3f60a0fa78
feat: Upgrade to RapidOCR 3.x (#2088)
geoHeil
2025-08-25 13:10:33 +03:00 -
2aef5cf328
chore: bump version to 2.47.1 [skip ci]
v2.47.1
github-actions[bot]
2025-08-23 14:11:33 +00:00 -
488f6cdd2d
fix: vllm extra only for linux x86_64 (#2126)
Michele Dolfi
2025-08-23 13:33:15 +02:00 -
6736e66bb4
style: show converted page count in PaginatedPipeline debug statement (#2124)
Raphael Norman-Tenazas
2025-08-23 06:13:20 -04:00 -
b04e205d1e
chore: bump version to 2.47.0 [skip ci]
v2.47.0
github-actions[bot]
2025-08-22 14:15:39 +00:00 -
cdf079dd06
feat(CLI): Option to download arbitrary HuggingFace model (#2123)
VIktor Kuropiantnyk
2025-08-22 15:23:29 +02:00 -
449bde0a6c
test: update docx reference results (#2122)
Michele Dolfi
2025-08-22 14:26:36 +02:00 -
3c660c0511
feat: batching support for VLMs in transformers backend, add initial VLLM backend (#2094)
Christoph Auer
2025-08-22 13:17:33 +02:00 -
3f03709885
fix: Improve numbered list detection for msword docs (#2100)
Nikhil Verma
2025-08-22 14:08:34 +05:30 -
94fcc46aa9
feat(html): Support formatting tags in HTML texts (#2111)
krrome
2025-08-22 10:37:34 +02:00 -
e76298c40d
docs: DPK pipeline example using docling library (#2112)
Maroun Touma
2025-08-21 04:14:36 -04:00 -
cc66773890
draft for model and stages redesign
adr-model-stages
Michele Dolfi
2025-08-21 10:13:17 +02:00 -
8996d612aa
docs: add Getting Started page (#2113)
Panos Vagenas
2025-08-21 08:44:53 +02:00 -
555506d8e6
chore: bump version to 2.46.0 [skip ci]
v2.46.0
github-actions[bot]
2025-08-20 15:25:07 +00:00 -
76d2cb76b3
chore: update docling-core lock (#2110)
Panos Vagenas
2025-08-20 16:41:48 +02:00 -
684adc17df
Add extra_processor_kwargs
Christoph Auer
2025-08-20 14:19:50 +02:00 -
5f57ff2a45
perf: Clean up resources with docling-parse v4, no parsed_page output by default (#2105)
Christoph Auer
2025-08-20 10:46:31 +02:00 -
c5f2e2fdd6
fix(HTML): parse footer tag as a group in furniture content layer (#2106)
Cesar Berrospi Ramis
2025-08-20 08:42:25 +02:00 -
8820b5558b
perf: speed up function _parse_orientation (#1934)
mohammed ahmed
2025-08-19 11:55:18 +03:00 -
956f82f115
chore: upgrade dependencies in lock file (#2093)
Michele Dolfi
2025-08-19 10:11:44 +02:00 -
6bbb8e6340
Add GoT OCR 2.0
Christoph Auer
2025-08-18 15:57:06 +02:00 -
b5b7e6dd5c
Add GoT OCR 2.0
Christoph Auer
2025-08-18 15:57:06 +02:00 -
d2494da8b8
feat: new code formula model (#2042)
Matteo
2025-08-18 16:01:46 +02:00 -
4a107f4f57
Adjust example instantiation of multi-stage VLM pipeline
Christoph Auer
2025-08-18 14:36:42 +02:00 -
3d07f1c78e
Cleanup hf_transformers_model batching impl
Christoph Auer
2025-08-18 13:37:46 +02:00 -
c3a7d1d999
chore: bump version to 2.45.0 [skip ci]
v2.45.0
github-actions[bot]
2025-08-18 10:25:51 +00:00 -
31087f3fcc
feat: add backend for METS with Google Books profile (#1989)
Michele Dolfi
2025-08-18 11:43:20 +02:00 -
fead482e92
Merge from main, include decode_response
Christoph Auer
2025-08-18 11:29:15 +02:00 -
e372cfe01a
Small fixes
Christoph Auer
2025-08-18 11:12:02 +02:00 -
9687297262
feat(html): Support in-line anchor tags in HTML texts (#1659)
krrome
2025-08-18 09:57:16 +02:00 -
76c1fbd6e8
docs: Add docling Quarkus integration (#2083)
Eric Deandrea
2025-08-18 00:55:51 -04:00 -
f42676aab9
Implement proper batch inference for HuggingFaceTransformersVlmModel
Christoph Auer
2025-08-15 17:56:14 +02:00 -
1aa522792a
Tweak defaults
Christoph Auer
2025-08-15 14:49:34 +02:00 -
16fea9cd8b
Add VLLM backend support, optimize process_images
Christoph Auer
2025-08-15 13:18:02 +02:00 -
18b1a43744
Fix KeyboardInterrupt behaviour
Christoph Auer
2025-08-14 21:11:40 +02:00 -
52b54b21c3
Remove prints
Christoph Auer
2025-08-14 20:48:34 +02:00 -
c4de11bdb3
Add VLM task interpreters
Christoph Auer
2025-08-14 20:48:10 +02:00 -
c8737f71da
Add VLM task interpreters
Christoph Auer
2025-08-14 20:44:23 +02:00 -
78c13e1dad
Add multithreaded VLM pipeline
Christoph Auer
2025-08-13 14:54:23 +02:00 -
cffe1f0ae5
Adding feature to import drawingml objects in doclingdocument
rtdl/drawingml_import
Rafael Teixeira de Lima
2025-08-14 16:25:59 +02:00 -
126944c7ee
Prepare existing codes for use with new multi-stage VLM pipeline
Christoph Auer
2025-08-13 14:02:19 +02:00 -
5f050f94e1
feat(vlm): Ability to preprocess VLM response (#1907)
Shkarupa Alex
2025-08-12 16:20:24 +03:00 -
ccfee05847
chore: bump version to 2.44.0 [skip ci]
v2.44.0
github-actions[bot]
2025-08-12 09:51:35 +00:00 -
b09033cb73
feat: add convert_string to document-converter (#2069)
Peter W. J. Staar
2025-08-12 11:02:38 +02:00 -
e2cca931be
docs: add Langflow integration (#2068)
Panos Vagenas
2025-08-11 17:03:29 +03:00 -
ed56f2de5d
fix(html): Parse rawspan and colspan when they include non numerical values (#2048)
Maroun Touma
2025-08-11 07:53:29 -04:00 -
bfda6d34d8
docs: Add Arconia integration (#2061)
Thomas Vitale
2025-08-08 09:35:47 +02:00 -
c5f49dc2db
chore: upgrade locked dependencies (#2024)
Michele Dolfi
2025-07-31 16:05:27 +02:00 -
0130e3ae96
fix: support new mlx-vlm module (#2001)
TwoLeaves
2025-07-31 22:13:17 +10:00 -
2eb760d060
fix: extend error reporting when verbose logging is enabled (#2017)
Michele Dolfi
2025-07-30 11:23:26 +02:00 -
86f70128aa
fix(HTML): replace non-standard Unicode characters (#2006)
Cesar Berrospi Ramis
2025-07-29 11:05:35 +02:00 -
aae42b37a8
chore: bump version to 2.43.0 [skip ci]
v2.43.0
github-actions[bot]
2025-07-28 09:45:53 +00:00 -
aed772ab33
feat: Threaded PDF pipeline (#1951)
Christoph Auer
2025-07-26 11:49:37 +02:00 -
aec29a7315
fix(markdown): ensure correct parsing of nested lists (#1995)
Cesar Berrospi Ramis
2025-07-25 15:17:57 +02:00 -
1985841a19
ci: Fixes for test GT (#1992)
Christoph Auer
2025-07-25 12:28:06 +02:00 -
945721a15d
fix(HTML): remove an unnecessary print command (#1988)
Cesar Berrospi Ramis
2025-07-25 08:45:15 +02:00 -
8227841c1b
chore: bump version to 2.42.2 [skip ci]
v2.42.2
github-actions[bot]
2025-07-24 10:21:10 +00:00 -
5132f061a8
fix(HTML): concatenation of child strings in table cells and list items (#1981)
Cesar Berrospi Ramis
2025-07-24 11:19:25 +02:00 -
7b5f86098d
docs: add chat with dosu (#1984)
Michele Dolfi
2025-07-24 11:07:36 +02:00 -
0b83609531
fix(docx): Adding plain latex equations to table cells (#1986)
Rafael Teixeira de Lima
2025-07-24 11:02:24 +02:00 -
98e2fcff63
fix: Preserve PARTIAL_SUCCESS status when document timeout hits (#1975)
Copilot
2025-07-23 13:50:40 +02:00 -
8d50a59d48
fix: multi-page image support (tiff) (#1928)
Copilot
2025-07-23 09:55:40 +02:00 -
ec971bbe68
chore: bump version to 2.42.1 [skip ci]
v2.42.1
github-actions[bot]
2025-07-22 16:45:48 +00:00 -
67441ca418
fix: Keep formula clusters also when empty (#1970)
Christoph Auer
2025-07-22 17:02:12 +02:00 -
90a7cc4bdd
docs: enrich existing DoclingDocument (#1969)
Michele Dolfi
2025-07-22 16:20:15 +02:00 -
a069b1175b
refactor(HTML): handle text from styled html (#1960)
Cesar Berrospi Ramis
2025-07-22 13:16:31 +02:00 -
5d98bcea1b
docs: add documentation for confidence scores (#1912)
Fabiano Franz
2025-07-21 05:16:17 -03:00 -
7561be537a
chore: bump version to 2.42.0 [skip ci]
v2.42.0
github-actions[bot]
2025-07-18 15:34:59 +00:00 -
cca05c45ea
fix: Safe pipeline init, use device_map in transformers models (#1917)
Christoph Auer
2025-07-18 15:14:36 +02:00 -
e1e3053695
fix: fix HTML table parser and JATS backend bugs (#1948)
Cesar Berrospi Ramis
2025-07-16 10:49:24 +02:00 -
d6d2dbe2f9
docs: Fix typos (#1943)
stephencox-ict
2025-07-15 19:51:56 +12:00 -
a436be7367
feat: Add option to control empty clusters in layout postprocessing (#1940)
Christoph Auer
2025-07-14 18:32:01 +02:00 -
95e70962f1
fix: KeyError: 'fPr' when processing latex fractions in DOCX files (#1926)
Copilot
2025-07-11 09:52:14 +02:00 -
c5fb353f10
fix: Change granite vision model URL from preview to stable version (#1925)
Copilot
2025-07-11 08:46:03 +02:00 -
6c4bf9d087
chore: bump version to 2.41.0 [skip ci]
v2.41.0
github-actions[bot]
2025-07-10 14:25:05 +00:00 -
f4c1836c96
functional working two-stage, need to implement a good prompt now to leverage bounding boxes
dev/add-two-stage-vlm
Peter Staar
2025-07-10 16:15:54 +02:00 -
b2d5c783ae
working two-stage vlm approach from the cli
Peter Staar
2025-07-10 15:38:15 +02:00 -
cc6193b3b9
test: Update tests to use default PDF backend (DPv4) (#1923)
Christoph Auer
2025-07-10 15:16:56 +02:00 -
fb74d0c5b3
working TwoStageVlmModel
Peter Staar
2025-07-10 15:11:53 +02:00 -
b2336830eb
fixed the circular dependencies
Peter Staar
2025-07-10 10:35:47 +02:00 -
70872e6539
merged with main and refactored the code to fix MyPy
Peter Staar
2025-07-10 09:58:06 +02:00 -
e596143bf8
Merge branch 'main' into dev/add-two-stage-vlm
Peter Staar
2025-07-10 06:52:31 +02:00 -
0f395688b8
refactored the code and added vlm2stage as a cli option
Peter Staar
2025-07-10 06:48:34 +02:00 -
2b8616d6d5
feat: Layout model specification and multiple choices (#1910)
Christoph Auer
2025-07-10 06:37:27 +02:00 -
ec588df971
feat: enable precision control in float serialization (#1914)
Panos Vagenas
2025-07-09 16:39:17 +02:00 -
dcf6fd6a41
fixed the MyPy complaining
Peter Staar
2025-07-09 06:48:03 +02:00 -
931eb55b88
fix(ocr-utils): unit test and fix the rotate_bounding_box function (#1897)
Clément Doumouro
2025-07-08 18:03:29 +02:00 -
c10e2920a4
refactoring redundant code and fixing mypy errors
Peter Staar
2025-07-08 16:37:20 +02:00 -
b5479ab971
working on MyPy
Peter Staar
2025-07-08 15:05:54 +02:00 -
49e9a00c05
merged in layout-model-spec
Peter Staar
2025-07-08 13:29:30 +02:00 -
517230b9c4
Updated naming
Christoph Auer
2025-07-08 13:07:56 +02:00 -
af0461e5b1
Move to pipeline_options.layout_options.model
Christoph Auer
2025-07-08 11:24:06 +02:00 -
f2094f858b
Establish layout_model spec and example instantiations
Christoph Auer
2025-07-08 10:23:18 +02:00 -
810446c8dc
feat: working on a two stage VLM model
Peter Staar
2025-07-08 09:49:39 +02:00 -
4eceefa47c
feat: add TwoStageVlmModel
Peter Staar
2025-07-08 07:38:48 +02:00 -
a07ba863c4
feat: add image-text-to-text models in transformers (#1772)
geoHeil
2025-07-08 05:54:57 +02:00