Archive video ocr challenge

Get video frames and read text [ver. 2.0.0]

ocr video pol

Git repo URL: git://gonito.net/video-ocr / Branch: master
Run git clone --single-branch git://gonito.net/video-ocr -b master to get the challenge data
Browse at https://gonito.net/gitlist/video-ocr.git/master

Leaderboard

# submitter when ver. description test-A CER test-A F1 ×
1 [anonymized] 2023-07-12 17:57 2.0.0 Tesseract with 10x bigger images tesseract 98.51 11.66 21
2 [anonymized] 2023-08-16 07:32 2.0.0 TROCR trained synthetic data with craft ~5000 input images + 3000 previous 65.76 7.55 21
3 [anonymized] 2023-08-07 11:45 2.0.0 TROCR trained synthetic data with craft ~800 input images train 66.86 7.52 21
4 [anonymized] 2023-08-13 12:58 2.0.0 Donut, no training donut 79.53 0.00 21
# tags test-A F1 subtitle test-A R test-A P test-A CER test-A F1
1 tesseract 11.66 8.09 20.84 98.51 11.66
2 7.55 7.52 7.58 65.76 7.55
3 train 7.52 7.52 7.52 66.86 7.52
4 donut 0.00 0.00 0.00 79.53 0.00