mirror of
https://github.com/DS4SD/docling.git
synced 2025-12-08 12:48:28 +00:00
Commit Graph
Select branches
add-json-export-indentation
adr-model-stages
cau/dpv4-test-updates
cau/fix-layout-vlm-pipeline-artifacts-path
cau/layout-vlm-pipeline-page-images
cau/multi-stage-vlm-pipeline
cau/new-layout-processing
cau/pin-docling-parse-pre-3.2
cau/test-dp-word-lines
cau/test-pypdfium2-beta
copilot/fix-document-timeout-bug
copilot/fix-keyerror-in-docling
copilot/fix-page-range-bug
cp_main_20250602
demo
dev-granite-docling-table
dev/add-asr-pipeline
dev/add-granite-docling-extension
dev/add-granite-docling-preview
dev/add-r2l-tests
dev/add-reading-order-model
dev/add-two-stage-vlm
dev/analysis-for-granite-docling
dev/doctag_backend
dev/fix_msword_backend_identify_text_after_image
dev/table-orientation
dev/update-html-parser-with-h1
dev/update-to-latest-docling-parse-again
docs/add-extraction-script
elh/update_2stage_inference
extend-metadata-in-examples
gh-pages
main
mao/doctags
mly/smol-docling-integration
nli/fix_glm_utils
nli/fix_ocr_tests
nli/layout_dfine
nli/layout_heron2
nli/layout_rtdetr_v2
nli/layoutmodel_improvements
nli/tesseract_ocr_models
ocr-enrichment
pretest-core-2-51-0
propagate-core-fixes-20250502
remodel-lists-2
revert-803-refactor_viz
rtdl/docx_latex
rtdl/drawingml_import
vku/uspto_meta
#1
#10
#100
#101
#1010
#1015
#1017
#102
#1021
#1024
#1027
#103
#1038
#1039
#1040
#1041
#1051
#1052
#1053
#1054
#1055
#1057
#1061
#1062
#1077
#1096
#1097
#1098
#11
#110
#1100
#1106
#1107
#111
#1114
#1115
#1118
#1124
#1130
#1140
#1141
#1147
#1150
#1152
#1154
#1156
#1158
#1160
#1165
#1167
#117
#1173
#118
#1182
#1183
#1194
#1196
#1197
#1199
#12
#120
#1201
#121
#1210
#122
#1220
#1222
#1223
#123
#1231
#1238
#1239
#1241
#1244
#1247
#1248
#1261
#1263
#1268
#1270
#1286
#129
#1294
#1295
#13
#131
#1313
#1315
#1316
#1319
#132
#1320
#1326
#1328
#1332
#1334
#1337
#134
#1340
#1346
#135
#1350
#1355
#1359
#1363
#1371
#1375
#1377
#1378
#1379
#138
#1381
#1382
#1383
#1389
#139
#1392
#1399
#14
#140
#1400
#1402
#141
#1411
#1415
#1416
#1419
#1427
#1428
#143
#1430
#1436
#1442
#1449
#145
#1458
#1459
#1463
#1465
#1486
#149
#1490
#1492
#1494
#1496
#15
#150
#1500
#151
#1511
#1512
#152
#1520
#1523
#1524
#1525
#1526
#1527
#1528
#153
#1530
#1536
#1538
#154
#1548
#1549
#155
#1551
#1553
#1556
#1559
#156
#1560
#1561
#1563
#1566
#157
#1570
#1576
#1577
#158
#1582
#1583
#1587
#1589
#159
#1593
#1596
#16
#160
#1600
#1609
#161
#1610
#1615
#1617
#1619
#162
#1636
#164
#1658
#1659
#1660
#1663
#1664
#1665
#1667
#1671
#1673
#1676
#1679
#168
#1683
#1684
#1688
#1689
#169
#1691
#1698
#17
#170
#1700
#1701
#1706
#1707
#171
#1711
#1717
#1718
#1723
#1724
#1725
#1728
#173
#1734
#1735
#1745
#1746
#1747
#175
#1759
#1763
#1769
#177
#1772
#1775
#178
#179
#1791
#1795
#18
#180
#1802
#1804
#1808
#1810
#1812
#1815
#1816
#1819
#182
#1820
#1821
#1824
#1825
#1827
#183
#1836
#1838
#184
#1844
#1850
#1851
#1852
#1856
#1857
#186
#1863
#1866
#1867
#187
#1870
#1874
#1875
#1876
#188
#1884
#189
#1897
#1898
#1898
#19
#190
#1902
#1904
#1905
#1907
#1908
#1910
#1912
#1914
#1917
#1923
#1925
#1926
#1928
#193
#1931
#1934
#1937
#194
#1940
#1943
#1948
#1951
#1952
#196
#1960
#1969
#1970
#1971
#1975
#1981
#1982
#1984
#1986
#1988
#1989
#1992
#1995
#2
#20
#2001
#2002
#2006
#2011
#2017
#2018
#2024
#203
#2031
#2039
#2042
#2048
#2061
#2068
#2069
#2078
#2079
#2083
#2084
#2084
#2088
#2093
#2094
#2095
#2095
#21
#2100
#2105
#2106
#2110
#2111
#2112
#2113
#2114
#2114
#2122
#2123
#2124
#2126
#2131
#2132
#2133
#2138
#214
#2141
#2146
#2154
#2155
#2165
#2166
#2169
#217
#217
#2171
#2178
#218
#218
#2183
#2185
#2187
#219
#2199
#22
#2200
#2208
#2212
#2218
#2219
#2227
#2227
#2231
#2234
#2237
#2238
#224
#2242
#2244
#2251
#2252
#226
#2262
#2264
#2265
#2266
#2272
#228
#2281
#2284
#2284
#2287
#2288
#229
#2291
#2294
#2304
#2309
#2313
#2315
#2322
#2323
#2324
#233
#2339
#234
#2340
#2341
#235
#2357
#2359
#2361
#2365
#2366
#2371
#2372
#2373
#2378
#2378
#2382
#2383
#2388
#2391
#2394
#240
#2401
#2403
#2403
#2404
#2407
#2409
#2409
#241
#2410
#2411
#2413
#2415
#2418
#2420
#2421
#2422
#2423
#2424
#2425
#2426
#2427
#2429
#2430
#2431
#2433
#2436
#2441
#2442
#2445
#2445
#2447
#2452
#2453
#2454
#2458
#2459
#2468
#2473
#2474
#248
#2484
#2486
#2488
#2488
#2489
#2498
#2499
#2501
#2502
#2503
#251
#2511
#2512
#2513
#2517
#2519
#2520
#2521
#2526
#2527
#2530
#2531
#2533
#2541
#2543
#2546
#2548
#2549
#2553
#2563
#2569
#2571
#2573
#2578
#2582
#2585
#2587
#2587
#2588
#2589
#259
#2596
#2599
#26
#2600
#2605
#2613
#2618
#2622
#2622
#2624
#2627
#2636
#2637
#2638
#2639
#2640
#2641
#2644
#2645
#2645
#2648
#2649
#2651
#2653
#2656
#2658
#2659
#2660
#2662
#2664
#2665
#2669
#2671
#2674
#2676
#2676
#2678
#2678
#2682
#2682
#2689
#2692
#2693
#27
#2706
#2707
#2708
#2712
#2716
#2717
#2720
#2721
#2721
#2723
#2723
#2728
#2735
#2738
#2738
#2739
#2740
#2740
#2741
#2741
#275
#276
#279
#28
#282
#286
#29
#290
#3
#302
#305
#307
#31
#310
#312
#314
#315
#316
#319
#32
#320
#322
#323
#325
#33
#330
#332
#334
#339
#34
#340
#341
#349
#35
#350
#36
#37
#371
#374
#375
#378
#379
#38
#384
#388
#39
#392
#393
#396
#4
#40
#401
#407
#408
#409
#415
#416
#42
#429
#43
#430
#432
#44
#442
#449
#45
#451
#456
#457
#46
#466
#467
#468
#47
#472
#474
#475
#482
#484
#487
#49
#490
#492
#495
#496
#497
#5
#50
#500
#501
#502
#504
#51
#511
#512
#513
#514
#517
#52
#528
#53
#530
#531
#532
#533
#534
#537
#54
#544
#549
#550
#551
#552
#555
#556
#557
#558
#56
#569
#57
#58
#59
#593
#6
#604
#606
#608
#613
#615
#616
#618
#624
#628
#63
#630
#631
#633
#642
#65
#650
#655
#656
#662
#675
#679
#68
#69
#691
#693
#694
#695
#697
#698
#7
#70
#700
#701
#702
#708
#71
#716
#717
#718
#719
#72
#733
#735
#739
#742
#75
#752
#759
#769
#772
#777
#783
#786
#788
#79
#793
#8
#80
#800
#801
#803
#804
#805
#808
#81
#811
#814
#815
#816
#817
#818
#819
#82
#820
#821
#824
#825
#826
#827
#83
#830
#831
#832
#837
#839
#84
#841
#842
#843
#850
#852
#853
#854
#855
#856
#857
#86
#862
#868
#869
#872
#873
#874
#875
#876
#878
#88
#880
#881
#883
#896
#897
#90
#901
#903
#905
#91
#910
#912
#916
#919
#92
#929
#93
#932
#935
#940
#941
#945
#948
#949
#95
#951
#958
#96
#965
#966
#967
#98
#99
#999
v0.1.1
v0.2.0
v0.3.0
v0.3.1
v0.4.0
v1.0.0
v1.0.1
v1.0.2
v1.1.0
v1.1.1
v1.1.2
v1.10.0
v1.11.0
v1.12.0
v1.12.1
v1.12.2
v1.13.0
v1.13.1
v1.14.0
v1.15.0
v1.16.0
v1.16.1
v1.17.0
v1.18.0
v1.19.0
v1.19.1
v1.2.0
v1.2.1
v1.20.0
v1.3.0
v1.4.0
v1.5.0
v1.6.0
v1.6.1
v1.6.2
v1.6.3
v1.7.0
v1.7.1
v1.8.0
v1.8.1
v1.8.2
v1.8.3
v1.8.4
v1.8.5
v1.9.0
v2.0.0
v2.1.0
v2.10.0
v2.11.0
v2.12.0
v2.13.0
v2.14.0
v2.15.0
v2.15.1
v2.16.0
v2.17.0
v2.18.0
v2.19.0
v2.2.0
v2.2.1
v2.20.0
v2.21.0
v2.22.0
v2.23.0
v2.23.1
v2.24.0
v2.25.0
v2.25.1
v2.25.2
v2.26.0
v2.27.0
v2.28.0
v2.28.1
v2.28.2
v2.28.3
v2.28.4
v2.29.0
v2.3.0
v2.3.1
v2.30.0
v2.31.0
v2.31.1
v2.31.2
v2.32.0
v2.33.0
v2.34.0
v2.35.0
v2.36.0
v2.36.1
v2.37.0
v2.38.0
v2.38.1
v2.39.0
v2.4.0
v2.4.1
v2.4.2
v2.40.0
v2.41.0
v2.42.0
v2.42.1
v2.42.2
v2.43.0
v2.44.0
v2.45.0
v2.46.0
v2.47.0
v2.47.1
v2.48.0
v2.49.0
v2.5.0
v2.5.1
v2.5.2
v2.50.0
v2.51.0
v2.52.0
v2.53.0
v2.54.0
v2.55.0
v2.55.1
v2.56.0
v2.56.1
v2.57.0
v2.58.0
v2.59.0
v2.6.0
v2.60.0
v2.60.1
v2.61.0
v2.61.1
v2.61.2
v2.62.0
v2.63.0
v2.64.0
v2.7.0
v2.7.1
v2.8.0
v2.8.1
v2.8.2
v2.8.3
v2.9.0
-
844babb390
docs: update links in data_prep_kit (#1559)
Oleg Lavrovsky
2025-05-11 20:38:25 +02:00 -
776e7ecf9a
fix(HTML): handle row spans in header rows (#1536)
Cesar Berrospi Ramis
2025-05-09 15:14:32 +02:00 -
6e956dc551
Merge branch 'main' into nli/layoutmodel_improvements
nli/layoutmodel_improvements
Nikos Livathinos
2025-05-09 14:47:44 +02:00 -
3220a592e7
docs: add serialization docs, update chunking docs (#1556)
Panos Vagenas
2025-05-08 21:43:01 +02:00 -
f1658edbad
fix: mime error in document streams (#1523)
DavidLee
2025-05-06 15:30:46 +08:00 -
7c705739f9
fix: usage of hashlib for FIPS (#1512)
Michele Dolfi
2025-05-02 15:03:29 +02:00 -
99d8572f6d
chore: propagate docling-core fixes
propagate-core-fixes-20250502
Panos Vagenas
2025-05-02 14:47:21 +02:00 -
de56523974
chore: format JSON test files to enable comparison (#1511)
Panos Vagenas
2025-05-02 11:52:18 +03:00 -
b147331f2a
chore: restore typing hint for self.script_readers (#1500)
Ihar Hrachyshka
2025-04-30 14:33:27 -04:00 -
4ab7e9ddfb
fix: Guard against attribute errors in TesseractOcrModel __del__ (#1494)
Ben Browning
2025-04-30 11:51:33 -04:00 -
cc453961a9
fix: enable cuda_use_flash_attention2 for PictureDescriptionVlmModel (#1496)
Zach Cox
2025-04-30 02:02:52 -04:00 -
976e92e289
fix: updated the time-recorder label for reading order (#1490)
Peter W. J. Staar
2025-04-29 13:02:53 +02:00 -
d8959c6b19
chore: update dependencies in lock file (#1458)
Michele Dolfi
2025-04-28 08:52:46 +02:00 -
a097ccd8d5
chore: typo fix (#1465)
nkh0472
2025-04-28 14:52:09 +08:00 -
3afbe6c969
docs: update supported formats guide (#1463)
Emmanuel Ferdman
2025-04-28 09:51:54 +03:00 -
94d66a0765
fix: Incorrect scaling of TableModel bboxes when do_cell_matching is False (#1459)
Maxim Lysak
2025-04-25 12:34:12 +02:00 -
c67133dde4
chore: bump version to 2.31.0 [skip ci]
v2.31.0
github-actions[bot]
2025-04-25 08:28:25 +00:00 -
a2fbbba9f7
feat: add tutorial using Milvus and Docling for RAG pipeline (#1449)
Ryan Lin
2025-04-25 03:12:35 -04:00 -
a553a1e5bf
Merge branch 'main' into nli/layoutmodel_improvements
Nikos Livathinos
2025-04-24 10:03:05 +02:00 -
976431ed7f
chore: update locked deps (#1442)
Michele Dolfi
2025-04-23 14:59:31 +02:00 -
ed20124544
fix(html): handle address, details, and summary tags (#1436)
Cesar Berrospi Ramis
2025-04-23 09:30:59 +02:00 -
c2470ed216
docs: Fix wrong output format in example code (#1427)
nkh0472
2025-04-22 18:32:55 +08:00 -
64918a81ac
docs: Add OpenSSF Best Practices badge (#1430)
Michele Dolfi
2025-04-22 11:23:28 +02:00 -
32710d5fac
test: Allow pypdfium2 5.x versions
cau/test-pypdfium2-beta
Christoph Auer
2025-04-22 09:06:25 +02:00 -
995b3b0ab1
docs: Typo fixes in docling_document.md (#1400)
Ben Cox
2025-04-22 07:49:08 +01:00 -
8012a3e4d6
fix: Treat overflowing -v flags as DEBUG (#1419)
Eugene
2025-04-19 13:02:41 +04:00 -
88948b0bba
docs: Updated the [Usage] link in architecture.md (#1416)
Leandro Rosas
2025-04-19 09:20:52 +01:00 -
4ce338f455
fix: Adjust the LayoutModel default paths for the docling-layout-heron
Nikos Livathinos
2025-04-15 23:29:01 +02:00 -
fa7fc9e63d
fix(codecov): fix codecov argument and yaml file (#1399)
Cesar Berrospi Ramis
2025-04-15 18:12:57 +02:00 -
e5f8bb086d
Merge branch 'main' into nli/layoutmodel_improvements
Nikos Livathinos
2025-04-15 16:08:12 +02:00 -
51463e3c1f
feat: Refactor the LayoutModel to use docling-layout-heron. Pinpoint docling-ibm-models to the branch of new layout model
Nikos Livathinos
2025-04-15 16:04:55 +02:00 -
0782086009
Merge branch 'main' into nli/layoutmodel_improvements
Nikos Livathinos
2025-04-15 13:24:09 +02:00 -
550b1ca2f8
chore: propagate docling-core fix (#1389)
Panos Vagenas
2025-04-15 10:51:47 +02:00 -
a7dd59c5cb
docs(ocr): Add docs entry for OnnxTR OCR plugin (#1382)
Felix Dittrich
2025-04-15 09:46:59 +02:00 -
06227e9970
ci: sign pypi packages (#1392)
Michele Dolfi
2025-04-15 08:59:16 +02:00 -
5458a88464
ci: add coverage and ruff (#1383)
Michele Dolfi
2025-04-14 18:01:26 +02:00 -
293c28ca7c
docs(security): more statements about secure development (#1381)
Michele Dolfi
2025-04-14 13:53:26 +02:00 -
01fbfd5652
docs: Add testing in the docs (#1379)
Michele Dolfi
2025-04-14 12:31:48 +02:00 -
d9c3999175
chore: update lock file (#1378)
Michele Dolfi
2025-04-14 10:38:10 +02:00 -
a026b4e84b
docs: Add Notes for Installing in Intel macOS (#1377)
Juil Park
2025-04-14 17:21:13 +09:00 -
c391adb5f0
chore: bump version to 2.30.0 [skip ci]
v2.30.0
github-actions[bot]
2025-04-14 08:20:31 +00:00 -
7e40ad3261
fix(deps): widen typer upper bound (#1375)
Michele Dolfi
2025-04-14 09:23:39 +02:00 -
c0ba88edf1
feat(cli): add option for html with split-page mode (#1355)
Peter W. J. Staar
2025-04-14 08:41:50 +02:00 -
0de70e7991
fix: auto-recognize .xlsx, .docx and .pptx files (#1340)
Tim Kellogg
2025-04-14 01:45:13 -04:00 -
b295da4bfe
chore: Update repository URL in CITATION.cff (#1363)
Simon Leiß
2025-04-14 06:57:04 +02:00 -
415b877984
fix(docx): declare image_data variable when handling pictures (#1359)
Cesar Berrospi Ramis
2025-04-11 13:04:00 +02:00 -
250399948d
fix: Implement PictureDescriptionApiOptions.bitmap_area_threshold (#1248)
Rowan Skewes
2025-04-11 19:14:05 +10:00 -
eef2bdea77
feat(xlsx): create a page for each worksheet in XLSX backend (#1332)
Cesar Berrospi Ramis
2025-04-11 10:29:53 +02:00 -
c605edd8e9
feat: OllamaVlmModel for Granite Vision 3.2 (#1337)
Gabe Goodhart
2025-04-10 10:03:04 -06:00 -
6b696b504a
fix: Properly address page in pipeline _assemble_document when page_range is provided (#1334)
Joan Fabrégat
2025-04-10 16:11:28 +02:00 -
72ab8e1821
chore: bump version to 2.29.0 [skip ci]
v2.29.0
github-actions[bot]
2025-04-10 12:24:09 +00:00 -
355d8dc7a6
chore: Logo parameter in docling CLI, prints cute ascii logo (#1294)
Maxim Lysak
2025-04-09 05:29:48 +02:00 -
14e9c0ce9a
fix(docx): Adding new latex symbols, simplifying how equations are added to text (#1295)
Rafael Teixeira de Lima
2025-04-08 17:11:37 +02:00 -
0499cd1c1e
feat: handle <code> tags as code blocks (#1320)
Fernando Santos
2025-04-08 05:32:06 -03:00 -
2e99e5a54f
docs: add plugins docs (#1319)
Michele Dolfi
2025-04-08 09:44:37 +02:00 -
61de30966f
chore: update lock file (#1315)
Michele Dolfi
2025-04-07 17:47:51 +02:00 -
dc3bf9ceac
fix(pptx): check if picture shape has an image attached (#1316)
Maxim Lysak
2025-04-07 17:36:56 +02:00 -
bfcab3d677
feat(docx): add text formatting and hyperlink support (#630)
Simon Jégou
2025-04-03 15:11:50 +02:00 -
88a9756861
Detecting table orientation
dev/table-orientation
Maksym Lysak
2025-04-03 11:10:57 +02:00 -
71148eb381
docs: add visual grounding example (#1270)
Panos Vagenas
2025-04-02 14:03:19 +02:00 -
d2d68747f9
fix(docx): Improve text parsing (#1268)
Rafael Teixeira de Lima
2025-04-02 12:56:44 +02:00 -
b3d111a3cd
fix: Tesseract OCR CLI can't process images composed with numbers only (#1201)
Guilhem VERMOREL
2025-03-31 10:53:49 +02:00 -
44f2b081ec
chore: bump version to 2.28.4 [skip ci]
v2.28.4
github-actions[bot]
2025-03-29 11:56:42 +00:00 -
7afad7e52d
fix: Fixes tables when using OCR (#1261)
Maxim Lysak
2025-03-29 10:06:00 +01:00 -
124f921077
chore: bump version to 2.28.3 [skip ci]
v2.28.3
github-actions[bot]
2025-03-28 18:30:03 +00:00 -
8bd71e8e33
fix: Word-level pdf cells for tables (#1238)
Maxim Lysak
2025-03-28 16:34:48 +01:00 -
82694b2136
chore: bump version to 2.28.2 [skip ci]
v2.28.2
github-actions[bot]
2025-03-26 16:52:06 +00:00 -
9210812bfa
fix: improve HTML layer detection, various MD fixes (#1241)
Panos Vagenas
2025-03-26 16:07:14 +01:00 -
85c4df887b
fix(html): fix HTML parsed heading level (#1244)
Panos Vagenas
2025-03-26 10:30:23 +01:00 -
9eb1686f93
chore: bump version to 2.28.1 [skip ci]
v2.28.1
github-actions[bot]
2025-03-25 18:20:23 +00:00 -
38b7108a22
chore: update locked deps (#1239)
Panos Vagenas
2025-03-25 15:48:02 +01:00 -
f1f7df49e3
Update test-cases
cau/test-dp-word-lines
Christoph Auer
2025-03-25 13:49:08 +01:00 -
825b226fab
fix(converter): Cache same pipeline class with different options (#1152)
mislavmartinic
2025-03-26 00:18:44 +13:00 -
6df8827231
fix(debug): Missing translation of bbox to to_bounding_box (#1220)
Hoang-Long Do
2025-03-25 18:18:10 +07:00 -
f739d0e4c5
fix(docx): identifying numbered headers (#1231)
Rafael Teixeira de Lima
2025-03-25 11:41:02 +01:00 -
0974ba4e1c
docs(examples): batch conversion doc raises_on_error (#1147)
Clément Doumouro
2025-03-25 11:14:39 +01:00 -
8ebb0bf1a0
chore: properly clean up apt temporary files in Dockerfile (#1223)
Peter Dave Hello
2025-03-25 18:10:09 +08:00 -
7df157204b
chore: bump version to 2.28.0 [skip ci]
v2.28.0
github-actions[bot]
2025-03-19 15:18:10 +00:00 -
1c26769785
feat(SmolDocling): Support MLX acceleration in VLM pipeline (#1199)
Maxim Lysak
2025-03-19 15:38:54 +01:00 -
b454aa1551
feat: Add PPTX notes slides (#474)
Maciej Wieczorek
2025-03-19 14:52:09 +01:00 -
f5adfb9724
fix: Determine correct page size in DoclingParseV4Backend (#1196)
Christoph Auer
2025-03-19 11:05:42 +01:00 -
d5f7798763
test(html): fix regression test after docling-core update (#1197)
Cesar Berrospi Ramis
2025-03-19 11:03:46 +01:00 -
0b707d0882
fix(msword): Fixing function return in equations handling (#1194)
Rafael Teixeira de Lima
2025-03-19 10:34:25 +01:00 -
1d680b0a32
docs: Linux Foundation AI & Data (#1183)
Michele Dolfi
2025-03-19 09:05:57 +01:00 -
54a78c307d
docs: move apify to docs (#1182)
Michele Dolfi
2025-03-18 16:43:55 +01:00 -
2f72167ff6
feat: updated vlm pipeline (with latest changes from docling-core) (#1158)
Maxim Lysak
2025-03-18 15:44:51 +01:00 -
1a2a9e4eff
chore: bump version to 2.27.0 [skip ci]
v2.27.0
github-actions[bot]
2025-03-18 13:37:45 +00:00 -
6eaae3cba0
feat: add factory for ocr engines via plugins (#1010)
Michele Dolfi
2025-03-18 13:58:05 +01:00 -
3960b199d6
feat: Add DoclingParseV4 backend, using high-level docling-parse API (#905)
Christoph Auer
2025-03-18 10:38:19 +01:00 -
772487f9c9
feat(actor): Docling Actor on Apify infrastructure (#875)
Václav Vančura
2025-03-18 10:17:44 +01:00 -
75a03c4257
disable GT generation on test_interfaces
cau/dpv4-test-updates
Christoph Auer
2025-03-17 11:31:18 +01:00 -
9359f86c6a
Merge branch 'cau/docling-parse-api' of github.com:DS4SD/docling into cau/dpv4-test-updates
Christoph Auer
2025-03-17 11:17:31 +01:00 -
50ac62b5fa
test_input_doc use default backend
Christoph Auer
2025-03-17 11:13:42 +01:00 -
7bce91893c
Unset DPv1 backend on tests (use DPv4 default), re-generate test output
Christoph Auer
2025-03-17 11:04:41 +01:00 -
eff907811a
Merge branch 'main' of github.com:DS4SD/docling into cau/docling-parse-api
Christoph Auer
2025-03-17 10:37:13 +01:00 -
7e01798417
docs: fix spelling of picture in usage (#1165)
serced
2025-03-17 09:33:51 +01:00 -
fe45d30942
Fixes for DPv4 backend init, better test coverage
Christoph Auer
2025-03-17 09:26:31 +01:00 -
e34c0750a7
Reset all tests to use docling-parse v1 for now
Christoph Auer
2025-03-14 16:39:16 +01:00 -
412c013d95
Merge from main
Christoph Auer
2025-03-14 13:52:36 +01:00 -
d654568ad9
Test all backends, fixes
Christoph Auer
2025-03-14 13:32:37 +01:00